The self‐exciting Hawkes point process model (Hawkes, 1971) has been used to describe and forecast communicable diseases. A variant of the Hawkes model, called the recursive model, was proposed by Schoenberg et al. (2019) and has been shown to fit well to various epidemic disease datasets. Unlike the Hawkes model, the recursive model allows the productivity to vary as the overall rate of incidence of the disease varies. Here, we extend the data‐driven nonparametric expectation‐maximization method of Marsan and Lengliné (2008) in order to fit the recursive model without assuming a particular functional form for the productivity. The nonparametric recursive model is trained to fit to weekly reported cases of mumps in Pennsylvania during the January 1970–September 1990 time frame and then assessed using one week forecasts for the October 1990–December 2001 time period. Both its training and predictive ability are evaluated compared to that of other candidate models, such as Hawkes and SVEILR (susceptible, vaccinated, exposed, infected, lightly infected, recovered) compartmental models.
Point process models, such as Hawkes and recursive models, have recently been shown to offer improved accuracy over more traditional compartmental models for the purposes of modeling and forecasting the spread of disease epidemics. To explicitly test the performance of these two models in a real‐world and ongoing epidemic, we compared the fit of Hawkes and recursive models to outbreak data on Ebola virus disease (EVD) in the Democratic Republic of the Congo in 2018–2020. The models were estimated, and the forecasts were produced, time‐stamped, and stored in real time, so that their prospective value can be assessed and to guard against potential overfitting. The fit of the two models was similar, with both models resulting in much smaller errors in the beginning and waning phases of the epidemic and with slightly smaller error sizes on average for the Hawkes model compared with the recursive model. Our results suggest that both Hawkes and recursive point process models can be used in near real time during the course of an epidemic to help predict future cases and inform management and mitigation strategies.
more » « less- NSF-PAR ID:
- 10360689
- Publisher / Repository:
- Wiley Blackwell (John Wiley & Sons)
- Date Published:
- Journal Name:
- Journal of Forecasting
- Volume:
- 41
- Issue:
- 1
- ISSN:
- 0277-6693
- Page Range / eLocation ID:
- p. 201-210
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Abstract -
This paper proposes a new meta-learning method – named HARMLESS (HAwkes Relational Meta LEarning method for Short Sequences) for learning heterogeneous point process models from short event sequence data along with a relational network. Specifically, we propose a hierarchical Bayesian mixture Hawkes process model, which naturally incorporates the relational information among sequences into point process modeling. Compared with existing methods, our model can capture the underlying mixed-community patterns of the relational network, which simultaneously encourages knowledge sharing among sequences and facilitates adaptive learning for each individual sequence. We further propose an efficient stochastic variational meta expectation maximization algorithm that can scale to large problems. Numerical experiments on both synthetic and real data show that HARMLESS outperforms existing methods in terms of predicting the future events.more » « less
-
This paper proposes a new meta-learning method – named HARMLESS (HAwkes Relational Meta LEarning method for Short Sequences) for learning heterogeneous point process models from short event sequence data along with a relational net- work. Specifically, we propose a hierarchical Bayesian mixture Hawkes process model, which naturally incorporates the relational information among sequences into point process modeling. Compared with existing methods, our model can capture the underlying mixed-community patterns of the relational network, which simultaneously encourages knowledge sharing among sequences and facilitates adaptive learning for each individual sequence. We further propose an efficient stochastic variational meta expectation maximization algorithm that can scale to large problems. Numerical experiments on both synthetic and real data show that HARMLESS outperforms existing methods in terms of predicting the future events.more » « less
-
Abstract Hawkes models are frequently used to describe point processes that are clustered spatial‐temporally, and have been used in numerous applications including the study of earthquakes, invasive species, and contagious diseases. An extension of the Hawkes model is considered where the productivity is variable. In particular, the case is explored where each point may have its own productivity and a simple analytic formula is derived for the maximum likelihood estimators of these productivities. This estimator is compared with an empirical estimator and ways are explored of stabilizing both estimators by lower truncating, smoothing, and rescaling the estimates. Properties of the estimators are explored in simulations, and the methods are applied to seismological and epidemic datasets to show and quantify substantial variation in productivity.
-
Abstract This article reviews some of the statistical issues involved with modeling SARS-CoV02 (Covid-19) in Los Angeles County, California, using Hawkes point process models and SEIR models. The two types of models are compared, and their pros and cons are discussed. We also discuss particular statistical decisions, such as where to place the upper limits on y-axes, and whether to use a Bayesian or frequentist version of the model, how to estimate seroprevalence, and fitting the density of transmission times in the Hawkes model.