Antimicrobial resistance (AMR) is escalating and outpacing current antibiotic development. Thus, discovering antibiotics effective against emerging pathogens is becoming increasingly critical. However, existing approaches cannot rapidly identify effective molecules against novel pathogens or emerging drug-resistant strains. Here, we introduce ApexOracle, an artificial intelligence (AI) model that both predicts the antibacterial potency of existing compounds and designs de novo molecules active against strains it has never encountered. Departing from models that rely solely on molecular features, ApexOracle incorporates pathogen-specific context through the integration of molecular features captured via a foundational discrete diffusion language model and a dual-embedding framework that combines genomic- and literature-derived strain representations. Across diverse bacterial species and chemical modalities, ApexOracle consistently outperformed state-of-the-art approaches in activity prediction and demonstrated reliable transferability to novel pathogens with little or no antimicrobial data. Its unified representation-generation architecture further enables the in silico creation of
Epidemic spreading over populations networks has been an important subject of research for several decades, and especially during the Covid-19 pandemic. Most epidemic outbreaks are likely to create multiple mutations during their spreading over the population. In this paper, we study the evolution of a pathogen which can mutate continuously during the epidemic spreading. We consider pathogens whose mutating parameter is the mortality mean-time, and study the evolution of this parameter over the spreading process. We use analytical methods to compute the dynamic equation of the epidemic and the conditions for it to spread. We also use numerical simulations to study the pathogen flow in this case, and to understand the mutation phenomena. We show that the natural selection leads to less violent pathogens becoming predominant in the population. We discuss a wide range of network structures and show how different effects are manifested in each case. We also applied our theory in the context of the Covid-19 pandemic, using relevant epidemiological data collected for this outbreak. We provided explanations for the variants spreading processes observed throughout this pandemic.
We analyse a model that describes the propagation of many pathogens within and between many species. A branching process approximation is used to compute the probability of disease outbreaks. Special cases of aquatic environments with two host species and one or two pathogens are considered both analytically and computationally.
A persistent public health challenge is finding immunization schemes that are effective in combating highly mutable pathogens such as HIV and influenza viruses. To address this, we analyze a simplified model of affinity maturation, the Darwinian evolutionary process B cells undergo during immunization. The vaccination protocol dictates selection forces that steer affinity maturation to generate antibodies. We focus on determining the optimal selection forces exerted by a generic time-dependent vaccination protocol to maximize production of broadly neutralizing antibodies (bnAbs) that can protect against a broad spectrum of pathogen strains. The model lends itself to a path integral representation and operator approximations within a mean-field limit, providing guiding principles for optimizing time-dependent vaccine-induced selection forces to enhance bnAb generation. We compare our analytical mean-field results with the outcomes of stochastic simulations and discuss their similarities and differences.
Host-pathogen interactions consist of an attack by the pathogen, frequently a defense by the host and possibly a counter-defense by the pathogen. Here, we present a game-theoretical approach to describing such interactions. We consider a game where the host and pathogen are players and they can choose between the strategies of defense (or counter-defense) and no response. Specifically, they may or may not produce a toxin and an enzyme degrading the toxin, respectively. We consider that the host and pathogen must also incur a cost for toxin or enzyme production. We highlight both the sequential and non-sequential versions of the game and determine the Nash equilibria. Further, we resolve a paradox occurring in that interplay. If the inactivating enzyme is very efficient, producing the toxin becomes useless, leading to the enzyme being no longer required. Then, production of the defense becomes useful again. In game theory, such situations can be described by a generalized matching pennies game. As a novel result, we find under which conditions the defense cycle leads to a steady state or to an oscillation. We obtain, for saturating dose-response kinetics and considering monotonic co
Cooperation and competition between pathogens can alter the amount of individuals affected by a co-infection. Nonetheless, the evolution of the pathogens' behavior has been overlooked. Here, we consider a co-evolutionary model where the simultaneous spreading is described by a two-pathogen susceptible-infected-recovered model in an either synergistic or competitive manner. At the end of each epidemic season, the pathogens species reproduce according to their fitness that, in turn, depends on the payoff accumulated during the spreading season in a hawk-and-dove game. This co-evolutionary model displays a rich set of features. Specifically, the evolution of the pathogens' strategy induces abrupt transitions in the epidemic prevalence. Furthermore, we observe that the long-term dynamics results in a single, surviving pathogen species, and that the cooperative behavior of pathogens can emerge even under unfavorable conditions.
To optimize strategies for curbing the transmission of airborne pathogens, the efficacy of three key controls -- face masks, ventilation, and physical distancing -- must be well understood. In this study we used the Quadrature-based model of Respiratory Aerosol and Droplets to quantify the reduction in exposure to airborne pathogens from various combinations of controls. For each combination of controls, we simulated thousands of scenarios that represent the tremendous variability in factors governing airborne transmission and the efficacy of mitigation strategies. While the efficacy of any individual control was highly variable among scenarios, combining universal mask-wearing with distancing of 1~m or more reduced the median exposure by more than 99\% relative to a close, unmasked conversation, with further reductions if ventilation is also enhanced. The large reductions in exposure to airborne pathogens translated to large reductions in the risk of initial infection in a new host. These findings suggest that layering controls is highly effective for reducing transmission of airborne pathogens and will be critical for curbing outbreaks of novel viruses in the future.
Despite being similar in structure, functioning, and size viral pathogens enjoy very different mostly well-defined ways of life. They occupy their hosts for a few days (influenza), for a few weeks (measles), or even lifelong (HCV), which manifests in acute or chronic infections. The various transmission routes (airborne, via direct contact, etc.), degrees of infectiousness (referring to the load required for transmission), antigenic variation/immune escape and virulence define further pathogenic lifestyles. To survive pathogens must infect new hosts; the success determines their fitness. Infection happens with a certain likelihood during contact of hosts, where contact can also be mediated by vectors. Besides structural aspects of the host-contact network, three parameters/concepts appear to be key: the contact rate and the infectiousness during contact, which encode the mode of transmission, and third the immunity of susceptible hosts. From here, what can be concluded about the evolutionary strategies of viral pathogens? This is the biological question addressed in this paper. The answer extends earlier results (Lange & Ferguson 2009, PLoS Comput Biol 5 (10): e1000536) and mak
Human pathogens transmitted through environmental pathways are subject to stress and pressures outside of the host. These pressures may cause pathogen pathovars to diverge in their environmental persistence and their infectivity on an evolutionary time-scale. On a shorter time-scale, a single-genotype pathogen population may display wide variation in persistence times and exhibit biphasic decay. Using an infectious disease transmission modeling framework, we demonstrate in both cases that fitness-preserving trade-offs have implications for the dynamics of associated epidemics: less infectious, more persistent pathogens cause epidemics to progress more slowly than more infectious, less persistent (labile) pathogens, even when the overall risk is the same. Using identifiability analysis, we show that the usual disease surveillance data does not sufficiently inform these underlying pathogen population dynamics, even with basic environmental monitoring. These results suggest directions for future microbial research and environmental monitoring. In particular, determining the relative infectivity of persistent pathogen subpopulations and the rates of phenotypic conversion will help asce
Despite the availability of effective vaccines, the persistence of SARS-CoV-2 suggests that co-circulation with other pathogens and resulting multi-epidemics may become increasingly frequent. To better forecast and control the risk of such multi-epidemics, it is essential to elucidate the potential interactions of SARS-CoV-2 with other pathogens; these interactions, however, remain poorly defined. Here, we aimed to review the current body of evidence about SARS-CoV-2 interactions. To study pathogen interactions in a systematic way, we first developed a general framework to capture their major components: sign, strength, symmetry, duration, and mechanism. We then reviewed the experimental evidence from animal models about SARS-CoV-2 interactions. Of the 14 studies identified, 11 focused on the outcomes of co-infection with non-attenuated influenza A viruses and generally demonstrated that co-infection increased disease severity compared with either mono-infection. By contrast, the effect of co-infection on the viral load of either virus was variable and inconsistent across studies. Next, we reviewed the epidemiological evidence about SARS-CoV-2 interactions in human populations. Alt
Face masks provide effective, easy-to-use, and low-cost protection against airborne pathogens or infectious agents, including SARS-CoV-2. There is a wide variety of face masks available on the market for various applications, but they are all passive in nature, i.e., simply act as air filters for the nasal passage and/or mouth. In this paper, we present a new "active mask" paradigm, in which the wearable device is equipped with smart sensors and actuators to both detect the presence of airborne pathogens in real time and take appropriate action to mitigate the threat. The proposed approach is based on a closed-loop control system that senses airborne particles of different sizes close to the mask and then makes intelligent decisions to reduce their concentrations. This paper presents a specific implementation of this concept in which the on-board controller determines ambient air quality via a commercial particulate matter sensor, and if necessary activates a piezoelectric actuator that generates a mist spray to load these particles, thus causing them to fall to the ground. The proposed system communicates with the user via a smart phone application that provides various alerts, in
Diseases spread through host populations over the networks of contacts between individuals, and a number of results about this process have been derived in recent years by exploiting connections between epidemic processes and bond percolation on networks. Here we investigate the case of two pathogens in a single population, which has been the subject of recent interest among epidemiologists. We demonstrate that two pathogens competing for the same hosts can both spread through a population only for intermediate values of the bond occupation probability that lie above the classic epidemic threshold and below a second higher value, which we call the coexistence threshold, corresponding to a distinct topological phase transition in networked systems.
As sequencing technologies become more affordable and genomic databases expand continuously, the reuse of publicly available sequencing data emerges as a powerful strategy for studying microbial pathogens. Indeed, raw sequencing reads generated for the study of a given organism often contain reads originating from the associated microbiota. This review explores how such off-target reads can be detected and used for the study of microbial pathogens. We present genomic data mining as a method to identify relevant sequencing runs from petabase-scale databases, highlighting recent methodological advances that allow efficient database querying. We then briefly outline methods designed to retrieve relevant data and associated metadata, and provide an overview of common downstream analysis pipelines. We discuss how such approaches have (i) expanded the known genetic diversity of microbial pathogens, (ii) enriched our understanding of their spatiotemporal distribution, and (iii) highlighted previously unrecognized ecological interactions involving microbial pathogens. However, these analyses often rely on the completeness and accuracy of accompanying metadata, which remain highly variable.
During the recent pandemic, a rise in COVID-19 cases was followed by a decline in influenza. In the absence of cross-immunity, a potential explanation for the observed pattern is behavioral: non-pharmaceutical interventions (NPIs) designed and promoted for one disease also reduce the spread of others. We study short-term and long-term dynamics of two pathogens where NPIs targeting one pathogen indirectly influence the spread of another - a phenomenon we term behavioral spillover. We examine how perceived risk of and response to one disease substantially alters the spread of other pathogens, revealing how waves of different pathogens emerge over time as a result of behavioral interdependencies and human response. Our analysis identifies the parameter space where two diseases simultaneously co-exist, and where shifts in prevalence occur. Our findings are consistent with observations from the COVID-19 pandemic, where NPIs contributed to significant declines in infections such as influenza, pneumonia, and Lyme disease.
Pathogen identification is pivotal in diagnosing, treating, and preventing diseases, crucial for controlling infections and safeguarding public health. Traditional alignment-based methods, though widely used, are computationally intense and reliant on extensive reference databases, often failing to detect novel pathogens due to their low sensitivity and specificity. Similarly, conventional machine learning techniques, while promising, require large annotated datasets and extensive feature engineering and are prone to overfitting. Addressing these challenges, we introduce PathoLM, a cutting-edge pathogen language model optimized for the identification of pathogenicity in bacterial and viral sequences. Leveraging the strengths of pre-trained DNA models such as the Nucleotide Transformer, PathoLM requires minimal data for fine-tuning, thereby enhancing pathogen detection capabilities. It effectively captures a broader genomic context, significantly improving the identification of novel and divergent pathogens. We developed a comprehensive data set comprising approximately 30 species of viruses and bacteria, including ESKAPEE pathogens, seven notably virulent bacterial strains resistan
Antiterminators are essential components of bacterial transcriptional regulation, allowing the control of gene expression in response to fluctuating environmental conditions. Among them, RNA-binding antiterminator proteins play a major role in preventing transcription termination by binding to specific RNA sequences. These RNA-binding antiterminators have been extensively studied for their role in regulating various metabolic pathways. However, their function in modulating the physiology of pathogens requires further investigation. This review focuses on RNA-binding proteins displaying CAT (Co-AntiTerminator) or ANTAR (AmiR and NasR Transcription Antitermination Regulators) domains reported in model bacteria. In particular, their structures, mechanism of action, and target genes will be described. The involvement of the antitermination mechanisms in bacterial pathogenicity is also discussed. This knowledge is crucial for understanding the regulatory mechanisms that control bacterial virulence, and opens up exciting prospects for future research, and potentially new alternative strategies to combat infectious diseases.
DNA, encoding genetic instructions for almost all living organisms, fuels groundbreaking advances in genomics and synthetic biology. Recently, DNA Foundation Models have achieved success in designing synthetic functional DNA sequences, even whole genomes, but their susceptibility to jailbreaking remains underexplored, leading to potential concern of generating harmful sequences such as pathogens or toxin-producing genes. In this paper, we introduce GeneBreaker, the first framework to systematically evaluate jailbreak vulnerabilities of DNA foundation models. GeneBreaker employs (1) an LLM agent with customized bioinformatic tools to design high-homology, non-pathogenic jailbreaking prompts, (2) beam search guided by PathoLM and log-probability heuristics to steer generation toward pathogen-like sequences, and (3) a BLAST-based evaluation pipeline against a curated Human Pathogen Database (JailbreakDNABench) to detect successful jailbreaks. Evaluated on our JailbreakDNABench, GeneBreaker successfully jailbreaks the latest Evo series models across 6 viral categories consistently (up to 60\% Attack Success Rate for Evo2-40B). Further case studies on SARS-CoV-2 spike protein and HIV-1
The goal of this study is to develop a computational model of the progression of changes in mitochondrial phenotype resulting from infection with pathogenic mycobacteria. This ultimately will enable a large-scale virulence screen of mutant bacterial libraries. Mycobacterium tuberculosis (Mtb) is an intracellular pathogen, but only a small number of its genes have been studied for roles in intracellular host cell survival and replication. Mitochondria are the powerhouse of the host cell and play critical roles in cell survival when attacked by certain pathogens. When Mtb bacteria invade host cells, they induce changes in mitochondrial morphology, making mitochondria a novel target for image processing and machine learning to determine virulence associations of genes in Mtb and potentially other related intracellular pathogens. By hypothesizing mitochondria as an instance of a dynamic and interconnected graph, we demonstrate a statistical approach for quantitatively recognizing novel mitochondrial phenotypes induced by invading pathogens.
The emerging field of immunometabolism has underscored the central role of metabolic pathways in orchestrating immune cell function. Far from being passive background processes, metabolic activities actively regulate key immune responses. Fundamental pathways such as glycolysis, the tricarboxylic acid (TCA) cycle, and oxidative phosphorylation critically shape the behavior of immune cells, influencing macrophage polarization, T cell activation, and dendritic cell function. In this review, we synthesize recent advances in immunometabolism, with a focus on the metabolic mechanisms that govern the responses of both innate and adaptive immune cells to bacterial, viral, and fungal pathogens. Drawing on experimental, computational, and integrative methodologies, we highlight how metabolic reprogramming contributes to host defense in response to infection. These findings reveal new opportunities for therapeutic intervention, suggesting that modulation of metabolic pathways could enhance immune function and improve pathogen clearance.
Understanding the spatio-temporal evolution of epidemics with multiple pathogens requires not only new theoretical models but also careful analysis of their practical consequences. Building on the Multiplex Bi-Virus Reaction-Diffusion framework (MBRD) introduced in our companion paper, we investigate how the super-infection model (MBRD-SI) and the co-infection model (MBRD-CI) behave under different epidemiological and network conditions. Through numerical experiments, we study the effects of pathogen virulence, diffusion rates, and cross-diffusion on epidemic hotspot formation and long-term prevalence. Our results highlight the role of multiplex structure in amplifying or suppressing co-circulating infections, and provide quantitative insight into conditions that drive persistent epidemic patterns. Beyond epidemiology, these findings have broader implications for multiplex contagion processes such as information diffusion and malware propagation.