Infectious diseases, either emerging or long-lasting, place numerous people at risk and bring heavy public health burdens worldwide. In the process against infectious diseases, predicting the epidemic risk by modeling the disease transmission plays an essential role in assisting with preventing and controlling disease transmission in a more effective way. In this paper, we systematically describe how machine learning can play an essential role in quantitatively characterizing disease transmission patterns and accurately predicting infectious disease risks. First, we introduce the background and motivation of using machine learning for infectious disease risk prediction. Next, we describe the development and components of various machine learning models for infectious disease risk prediction. Specifically, existing models fall into three categories: Statistical prediction, data-driven machine learning, and epidemiology-inspired machine learning. Subsequently, we discuss challenges encountered when dealing with model inputs, designing task-oriented objectives, and conducting performance evaluation. Finally, we conclude with a discussion of open questions and future directions.
Reinforcement learning (RL), owing to its adaptability to various dynamic systems in many real-world scenarios and the capability of maximizing long-term outcomes under different constraints, has been used in infectious disease control to optimize the intervention strategies for controlling infectious disease spread and responding to outbreaks in recent years. The potential of RL for assisting public health sectors in preventing and controlling infectious diseases is gradually emerging and being explored by rapidly increasing publications relevant to COVID-19 and other infectious diseases. However, few surveys exclusively discuss this topic, that is, the development and application of RL approaches for optimizing strategies of non-pharmaceutical and pharmaceutical interventions of public health. Therefore, this paper aims to provide a concise review and discussion of the latest literature on how RL approaches have been used to assist in controlling the spread and outbreaks of infectious diseases, covering several critical topics addressing public health demands: resource allocation, balancing between lives and livelihoods, mixed policy of multiple interventions, and inter-regional
Univariate zero-inflated models are increasingly being used to account for excess zeros in spatio-temporal infectious disease counts. However, the multivariate case is challenging due to the need to account for correlations across space, time and disease in both the count and zero-inflated components of the model. We are interested in comparing the transmission dynamics of several co-circulating infectious diseases across space and time, where some of the diseases can be absent for long periods. We first assume there is a baseline disease that is well-established and always present in the region. The other diseases switch between periods of presence and absence in each area through a series of coupled Markov chains, which account for long periods of disease absence, disease interactions and disease spread from neighboring areas. Since we are mainly interested in comparing the diseases, we assume the cases of the present diseases in an area jointly follow an autoregressive multinomial model. We use the multinomial model to investigate whether there are associations between certain factors, such as temperature, and differences in the transmission intensity of the diseases. Inference
Interaction patterns among individuals play vital roles in spreading infectious diseases. Understanding these patterns and integrating their impact in modeling diffusion dynamics of infectious diseases are important for epidemiological studies. Current network-based diffusion models assume that diseases transmit through interactions where both infected and susceptible individuals are co-located at the same time. However, there are several infectious diseases that can transmit when a susceptible individual visits a location after an infected individual has left. Recently, we introduced a diffusion model called same place different time (SPDT) transmission to capture the indirect transmissions that happen when an infected individual leaves before a susceptible individual's arrival along with direct transmissions. In this paper, we demonstrate how these indirect transmission links significantly enhance the emergence of infectious diseases simulating airborne disease spreading on a synthetic social contact network. We denote individuals having indirect links but no direct links during their infectious periods as hidden spreaders. Our simulation shows that indirect links play similar ro
This paper conducts research on the established model and presents the main conclusions . Firstly, by separately considering the infectivity of each of the two infectious diseases and the infectivity of the population simultaneously infected with the two infectious diseases, the existence of three types of boundary equilibrium points is determined, as well as the existence of the interior equilibrium point when the parameters are under specific conditions. Then, the stability of the equilibrium points is analyzed. It is concluded that under different parameter conditions, the stability of the disease free equilibrium point can exhibit various scenarios, such as a stable node or a saddle-node, etc. For the boundary equilibrium points, the situation is more intricate,and a cusp may occur. The stability of the interior equilibrium point under specific conditions is also presented. Finally,the degeneracy of the equilibrium points is studied through the bifurcation theory.Mainly, the saddle-node bifurcation occurring at the interior equilibrium point is obtained, and when the infection rate of the first infectious disease, the infection rate of the second infectious disease, and the inf
Mitigation measures are essential for controlling the spread of infectious diseases during pandemics and epidemics, but they impose considerable societal, individual, and economic costs. We developed a general optimization framework to balance costs related to infection and to mitigation. Optimizing the trade-off between mitigation and infection cost, we identified three novel, surprising effects: First, assuming a constant reproduction number $R_0$, the optimal response to an infectious disease requires either strict mitigation or none at all, depending on disease severity, but never does one find an intermediate mitigation level to be optimal. Second, under seasonal variations, optimal mitigation is stricter during winter. Interestingly, a single wave of infections still arises in spring with 3 months delay to the seasonal peak of infectivity, replacing the autumn/winter waves known for classical influenza. Third, during steady vaccination campaigns, even optimal mitigation can result in transient infection waves. Finally, we quantify the cost of delayed mitigation onset and show that even short delays can substantially increase total costs -- if the disease is severe. Overall, o
The global public health landscape is perpetually challenged by the looming threat of infectious diseases. Central to addressing this concern is the imperative to prevent and manage disease transmission during pandemics, particularly in unique settings. This study addresses the transmission dynamics of infectious diseases within conference venues, presenting a computational model designed to simulate transmission processes within a condensed timeframe (one day), beginning with sporadic cases. Our model intricately captures the activities of individual attendees within the conference venue, encompassing meetings, rest intervals, and meal breaks. While meetings entail proximity seating, rest and lunch periods allow attendees to interact with diverse individuals. Moreover, the restroom environment poses an additional avenue for potential infection transmission. Employing an individual-based model, we meticulously replicated the transmission dynamics of infectious diseases, with a specific emphasis on close-contact interactions between infected and susceptible individuals. Through comprehensive analysis of model simulations, we elucidated the intricacies of disease transmission dynamic
Supervised machine learning models and public surveillance data has been employed for infectious disease forecasting in many settings. These models leverage various data sources capturing drivers of disease spread, such as climate conditions or human behavior. However, few models have incorporated the organizational structure of different geographic locations for forecasting. Traveling waves of seasonal outbreaks have been reported for dengue, influenza, and other infectious diseases, and many of the drivers of infectious disease dynamics may be shared across different cities, either due to their geographic or socioeconomic proximity. In this study, we developed a machine learning model to predict case counts of four infectious diseases across Brazilian cities one week ahead by incorporating information from related cities. We compared selecting related cities using both geographic distance and GDP per capita. Incorporating information from geographically proximate cities improved predictive performance for two of the four diseases, specifically COVID-19 and Zika. We also discuss the impact on forecasts in the presence of anomalous contagion patterns and the limitations of the prop
Effective public health decisions require early reliable inference of infectious disease properties. In this paper we assess the ability to infer infectious disease attributes from population-level stochastic epidemic trajectories. In particular, we construct stochastic Kermack-McKendrick model trajectories, sample them with and without observational error, and evaluate inversions for the population mean infectiousness as a function of time since infection, the infection duration distribution, and its complementary cumulative distribution, the infection survival distribution. Based on an integro-differential equation formulation we employ a natural regression approach to fit the corresponding integral kernels and show that these disease attributes are recoverable from both multi-trajectory inversions and regularized single trajectory inversions. Moreover, we demonstrate that the infection duration distribution (or alternatively the infection survival distribution) and population mean infectiousness kernel recovered can be used to solve for the individual infectiousness profile, the infectiousness of an individual over the duration of their infection, assuming that individual infect
Understanding dynamics of an infectious disease helps in designing appropriate strategies for containing its spread in a population. Recent mathematical models are aimed at studying dynamics of some specific types of infectious diseases. In this paper we propose a new model for infectious diseases spread having susceptible, infected, and recovered populations and study its dynamics in presence of incubation delays and relapse of the disease. The influence of treatment and vaccination efforts on the spread of infection in presence of time delays are studied. Sufficient conditions for local stability of the equilibria and change of stability are derived in various cases. The problem of global stability is studied for an important special case of the model. Simulations carried out in this study brought out the importance of treatment rate in controlling the disease spread. It is observed that incubation delays have influence on the system even under enhanced vaccination. The present study has clearly brought out the fact that treatment rate in presence of time delays would contain the disease as compared to popular belief that eradication can only be done through vaccination.
Artificial Intelligence (AI) and infectious diseases prediction have recently experienced a common development and advancement. Machine learning (ML) apparition, along with deep learning (DL) emergence, extended many approaches against diseases apparition and their spread. And despite their outstanding results in predicting infectious diseases, conflicts appeared regarding the types of data used and how they can be studied, analyzed, and exploited using various emerging methods. This has led to some ongoing discussions in the field. This research aims not only to provide an overview of what has been accomplished, but also to highlight the difficulties related to the types of data used, and the learning methods applied for each research objective. It categorizes these contributions into three areas: predictions using Public Health Data to prevent the spread of a transmissible disease within a region; predictions using Patients' Medical Data to detect whether a person is infected by a transmissible disease; and predictions using both Public and patient medical data to estimate the extent of disease spread in a population. The paper also critically assesses the potential of AI and out
In randomized controlled trials (RCTs) of infectious disease interventions, it is well recognized that unmeasured individual heterogeneity at baseline can induce selection bias over time, thereby complicating the interpretation of the estimated hazard ratio. The present study examines a simplified setting: RCTs consisting of homogeneous participants, with no individual heterogeneity at baseline. However, even in such an apparently ideal setting, selection bias can emerge over time due to temporal dependence in exposure, a realistic feature of infectious disease transmission. In this study, we mathematically characterize the mechanism underlying this bias and quantitatively evaluate its magnitude. Our results show that this bias should be recognized as an issue in both the design and interpretation of RCTs of infectious disease interventions.
Infectious diseases are a significant threat to human society which was over sighted before the incidence of COVID-19, although according to the report of the World Health Organisation (WHO) about 4.2 million people die annually due to infectious disease. Due to recent COVID-19 pandemic, more than 2 million people died during 2020 and 96.2 million people got affected by this devastating disease. Recent research shows that applying individual interactions and movements data could help managing the pandemic though modelling the spread of infectious diseases on social contact networks. Infectious disease spreading can be explained with the theories and methods of diffusion processes where a dynamic phenomena evolves on networked systems. In the modelling of diffusion process, it is assumed that contagious items spread out in the networked system through the inter-node interactions. This resembles spreading of infectious virus, e.g. spread of COVID-19, within a population through individual social interactions. The evolution behaviours of the diffusion process are strongly influenced by the characteristics of the underlying system and the mechanism of the diffusion process itself. Thus
The main aim to build models capable of simulating the spreading of infectious diseases is to control them. And along this way, the key to find the optimal strategy for disease control is to obtain a large number of simulations of disease transitions under different scenarios. Therefore, the models that can simulate the spreading of diseases under scenarios closer to the reality and are with high efficiency are preferred. In the realistic social networks, the random contact, including contacts between people in the public places and the public transits, becomes the important access for the spreading of infectious diseases. In this paper, a model can efficiently simulate the spreading of infectious diseases under random contacts is proposed. In this approach, the random contact between people is characterized by the random matrix with elements randomly generated and the spread of the diseases is simulated by the Markov process. We report an interesting property of the proposed model: the main indicators of the spreading of the diseases such as the death rate are invariant of the size of the population. Therefore, representative simulations can be conducted on models consist of small
Infectious disease models can be of great use for understanding the underlying mechanisms that influence the spread of diseases and predicting future disease progression. Modeling has been increasingly used to evaluate the potential impact of different control measures and to guide public health policy decisions. In recent years, there has been rapid progress in developing spatio-temporal modeling of infectious diseases and an example of such recent developments is the discrete time individual-level models (ILMs). These models are well developed and provide a common framework for modeling many disease systems, however, they assume the probability of disease transmission between two individuals depends only on their spatial separation and not on their spatial locations. In cases where spatial location itself is important for understanding the spread of emerging infectious diseases and identifying their causes, it would be beneficial to incorporate the effect of spatial location in the model. In this study, we thus generalize the ILMs to a new class of geographically-dependent ILMs (GD-ILMs), to allow for the evaluation of the effect of spatially varying risk factors (e.g., education
The annual occurrence of many infectious diseases remains a constant burden to public health systems. The seasonal patterns in respiratory disease incidence observed in temperate regions have been attributed to the impact of environmental conditions on pathogen survival. A model describing the transmission of an infectious disease by means of a pathogenic state capable of surviving in an environmental reservoir outside of its host organism is presented in this paper. The ratio of pathogen lifespan to the duration of the infectious disease state is found to be a critical parameter in determining disease dynamics. The introduction of a seasonally forced pathogen inactivation rate identifies a time delay between peak pathogen survival and peak disease incidence. The delay is dependent on specific disease parameters and, for influenza, decreases with increasing reproduction number. The observed seasonal oscillations are found to have a period identical to that of the seasonally forced inactivation rate and which is independent of the duration of infection acquired immunity
As global living standards improve and medical technology advances, many infectious diseases have been effectively controlled. However, certain diseases, such as the recent COVID-19 pandemic, continue to pose significant threats to public health. This paper explores the evolution of infectious disease modeling, from early ordinary differential equation-based models like the SIR framework to more complex reaction-diffusion models that incorporate both temporal and spatial dynamics. The study highlights the importance of numerical methods, such as the Runge-Kutta method, implicit-explicit time-discretization techniques, and finite difference methods, in solving these models. By analyzing the development and application of these methods, this research underscores their critical role in predicting disease spread, informing public health strategies, and mitigating the impact of future pandemics.
Major advances in public health have resulted from disease prevention. However, prevention of a new infectious disease by vaccination or pharmaceuticals is made difficult by the slow process of vaccine and drug development. We propose an additional intervention that allows rapid control of emerging infectious diseases, and can also be used to eradicate diseases that rely almost exclusively on human-to-human transmission. The intervention is based on (1) testing every individual for the disease, (2) repeatedly, and (3) isolation of infected individuals. We show here that at a sufficient rate of testing, the reproduction number is reduced below 1.0 and the epidemic will rapidly collapse. The approach does not rely on strong or unrealistic assumptions about test accuracy, isolation compliance, population structure or epidemiological parameters, and its success can be monitored in real time by following the test positivity rate. In addition to the compliance rate and false negatives, the required rate of testing depends on the design of the testing regime, with concurrent testing outperforming random sampling. Provided that results are obtained rapidly, the test frequency required to s
In this work, we present an approach called Disease Informed Neural Networks (DINNs) that can be employed to effectively predict the spread of infectious diseases. This approach builds on a successful physics informed neural network approaches that have been applied to a variety of applications that can be modeled by linear and non-linear ordinary and partial differential equations. Specifically, we build on the application of PINNs to SIR compartmental models and expand it a scaffolded family of mathematical models describing various infectious diseases. We show how the neural networks are capable of learning how diseases spread, forecasting their progression, and finding their unique parameters (e.g. death rate). To demonstrate the robustness and efficacy of DINNs, we apply the approach to eleven highly infectious diseases that have been modeled in increasing levels of complexity. Our computational experiments suggest that DINNs is a reliable candidate for effectively learn about the dynamics of spread and forecast its progression into the future from available real-world data.
Forecasting transmission of infectious diseases, especially for vector-borne diseases, poses unique challenges for researchers. Behaviors of and interactions between viruses, vectors, hosts, and the environment each play a part in determining the transmission of a disease. Public health surveillance systems and other sources provide valuable data that can be used to accurately forecast disease incidence. However, many aspects of common infectious disease surveillance data are imperfect: cases may be reported with a delay or in some cases not at all, data on vectors may not be available, and case data may not be available at high geographical or temporal resolution. In the face of these challenges, researchers must make assumptions to either account for these underlying processes in a mechanistic model or to justify their exclusion altogether in a statistical model. Whether a model is mechanistic or statistical, researchers should evaluate their model using accepted best practices from the emerging field of infectious disease forecasting while adopting conventions from other fields that have been developing forecasting methods for decades. Accounting for assumptions and properly eva