cfr is an R package to estimate disease severity and under-reporting in real-time, accounting for delays in epidemic time-series.
cfr provides simple, fast methods to calculate the overall or static case fatality risk (CFR) of an outbreak up to a given time point, as well as how the CFR changes over the course of the outbreak. cfr can help estimate disease under-reporting in real-time, accounting for delays reporting the outcomes of cases.
cfr implements methods outlined in Nishiura et al. (2009). There are plans to add estimates based on other methods.
cfr is developed at the Centre for the Mathematical Modelling of Infectious Diseases at the London School of Hygiene and Tropical Medicine as part of the Epiverse-TRACE initiative.
cfr can be installed from CRAN using
The current development version of cfr can be installed from GitHub using the pak
package.
This example shows how to use cfr to estimate the overall case fatality risks from the 1976 Ebola outbreak (Camacho et al. 2014), while correcting for delays using a Gamma-distributed onset to death duration taken from Barry et al. (2018), with a shape \(k\) of 2.40 and a scale \(\theta\) of 3.33.
# Load package
library(cfr)
# Load the Ebola 1976 data provided with the package
data("ebola1976")
# Calculate the static CFR without correcting for delays
cfr_static(data = ebola1976)
#> severity_mean severity_low severity_high
#> 1 0.955102 0.9210866 0.9773771
# Calculate the static CFR while correcting for delays
cfr_static(
data = ebola1976,
delay_density = function(x) dgamma(x, shape = 2.40, scale = 3.33)
)
#> severity_mean severity_low severity_high
#> 1 0.959 0.842 1
In this example we show how the estimate of overall severity can change as more data on cases and deaths over time becomes available, using the function cfr_rolling()
. Because there is a delay from onset-to-death, a simple “naive” calculation that just divides deaths-to-date by cases-to-date will underestimate severity. The cfr_rolling()
function uses the estimate_severity()
adjustment internally to account for delays, and instead compares deaths-to-date with cases-with-known-outcome-to-date. The adjusted estimate converges to the naive estimate as the outbreak declines and a larger proportion of cases have known outcomes.
# Calculate the CFR without correcting for delays on each day of the outbreak
rolling_cfr_naive <- cfr_rolling(
data = ebola1976
)
# see the first few rows
head(rolling_cfr_naive)
#> date severity_mean severity_low severity_high
#> 1 1976-08-25 0 0 0.975
#> 2 1976-08-26 0 0 0.975
#> 3 1976-08-27 0 0 0.975
#> 4 1976-08-28 0 0 0.975
#> 5 1976-08-29 0 0 0.975
#> 6 1976-08-30 0 0 0.975
# Calculate the rolling daily CFR while correcting for delays
rolling_cfr_corrected <- cfr_rolling(
data = ebola1976,
delay_density = function(x) dgamma(x, shape = 2.40, scale = 3.33)
)
head(rolling_cfr_corrected)
#> date severity_mean severity_low severity_high
#> 1 1976-08-25 NA NA NA
#> 2 1976-08-26 0.001 0.001 0.999
#> 3 1976-08-27 0.001 0.001 0.999
#> 4 1976-08-28 0.001 0.001 0.999
#> 5 1976-08-29 0.001 0.001 0.999
#> 6 1976-08-30 0.001 0.001 0.994
We plot the rolling CFR to visualise how severity changes over time, using the ggplot2 package. The plotting code is hidden here.
# combine the data for plotting
rolling_cfr_naive$method <- "naive"
rolling_cfr_corrected$method <- "corrected"
data_cfr <- rbind(
rolling_cfr_naive,
rolling_cfr_corrected
)
More details on how to use cfr can be found in the online documentation as package vignettes, under “Articles”.
To report a bug please open an issue.
Contributions to cfr are welcomed. Please follow the package contributing guide.
Please note that the cfr project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.
cfr functionality overlaps with that of some other packages, including
cfr is in future expected to benefit from the functionality of the forthcoming epiparameter package, which is also developed by Epiverse-TRACE. epiparameter aims to provide a library of epidemiological parameters to parameterise delay density functions, as well as the convenient <epidist>
class to store, access, and pass these parameters for delay correction.
Barry, Ahmadou, Steve Ahuka-Mundeke, Yahaya Ali Ahmed, Yokouide Allarangar, Julienne Anoko, Brett Nicholas Archer, Aaron Aruna Abedi, et al. 2018. “Outbreak of Ebola virus disease in the Democratic Republic of the Congo, April–May, 2018: an epidemiological study.” The Lancet 392 (10143): 213–21. https://doi.org/10.1016/S0140-6736(18)31387-4.
Camacho, A., A. J. Kucharski, S. Funk, J. Breman, P. Piot, and W. J. Edmunds. 2014. “Potential for Large Outbreaks of Ebola Virus Disease.” Epidemics 9 (December): 70–78. https://doi.org/10.1016/j.epidem.2014.09.003.
Nishiura, Hiroshi, Don Klinkenberg, Mick Roberts, and Johan A. P. Heesterbeek. 2009. “Early Epidemiological Assessment of the Virulence of Emerging Infectious Diseases: A Case Study of an Influenza Pandemic.” PLOS ONE 4 (8): e6852. https://doi.org/10.1371/journal.pone.0006852.