Antimicrobial resistance (AMR) poses a significant public health and economic challenge, increasing treatment costs and reducing antibiotic effectiveness. This study employs machine learning to analyze genomic and epidemiological data from the public databases for molecular typing and microbial genome diversity (PubMLST), incorporating data from UK government-supported AMR surveillance by the Food Standards Agency and Food Standards Scotland. We identify AMR patterns in Campylobacter jejuni and Campylobacter coli isolates collected in the UK from 2001 to 2017. The research integrates whole-genome sequencing (WGS) data, epidemiological metadata, and economic projections to identify key resistance determinants and forecast future resistance trends and healthcare costs. We investigate gyrA mutations for fluoroquinolone resistance and the tet(O) gene for tetracycline resistance, training a Random Forest model validated with bootstrap resampling (1,000 samples, 95% confidence intervals), achieving 74% accuracy in predicting AMR phenotypes. Time-series forecasting models (SARIMA, SIR, and Prophet) predict a rise in campylobacteriosis cases, potentially exceeding 130 cases per 100,000 peo
Background: Antimicrobial resistance (AMR) is a major global public health problem, contributing to an estimated 4.95 million deaths in 2019 and projected to cause up to 10 million deaths annually and 100 trillion dollars in cumulative economic losses by 2050. Its emergence and spread result from complex biological, ecological, and socioeconomic interactions. Mathematical modelling is a key tool to study AMR dynamics, yet the literature remains fragmented and methodologically limited. This review synthesizes recent mathematical modelling studies to identify trends, biases, and research gaps. Methods: A scoping review following PRISMA-ScR guidelines was conducted. PubMed, Web of Science, and Scopus were searched for studies published between 2019 and 2024 that developed mathematical models of AMR. After screening and duplicate removal, 36 studies were included. Data were extracted using a structured framework covering model context, construction and parameters, and outputs and validation. Results: Most studies relied on deterministic ordinary differential equation (ODE) models and focused on bacterial resistance in human hosts, with only one adopting a One Health perspective. Conjug
The evolutionary and ecological dynamics of tumors under immune responses and therapeutic interventions pose major challenges to long-term treatment success. Although treatment may initially achieve short-term disease control, resistant cancer cell subpopulations often arise, leading to relapse with more aggressive and treatment-resistant forms of the disease. Here, we develop and analyze mathematical models describing the interactions among effector cells, chemo-resistant tumor cells, and immuno-resistant tumor cells under distinct immune-evasion strategies. The models incorporate competition and cooperation between resistant and sensitive tumor subpopulations. We identify threshold conditions governing tumor persistence, elimination, and phenotype dominance under varying therapeutic intensities. These findings provide a theoretical framework for designing targeted and combination therapies and offer insights into strategies for mitigating the treatment resistance.
Accurate and timely identification of hospital outbreak clusters is crucial for preventing the spread of infections that have epidemic potential. While assessing pathogen similarity through whole genome sequencing (WGS) is considered the gold standard for outbreak detection, its high cost and lengthy turnaround time preclude routine implementation in clinical laboratories. We explore the utility of two rapid and cost-effective alternatives to WGS, matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass spectrometry and antimicrobial resistance (AR) patterns. We develop a machine learning framework that extracts informative representations from MALDI-TOF spectra and AR patterns for outbreak detection and explore their fusion. Through multi-species analyses, we demonstrate that in some cases MALDI-TOF and AR have the potential to reduce reliance on WGS, enabling more accessible and rapid outbreak surveillance.
Aims. Resistance to targeted therapies remains a major challenge in EGFR-mutant non-small cell lung cancer (NSCLC). Here, we describe a novel metabolic adaptation in osimertinib-resistant cells characterized by elevated acetate levels and activation of an unconventional pyruvate-acetaldehyde-acetate (PAA) shunt. Methods. Integrated transcriptomic, exometabolomic, and functional analyses reveal suppression of canonical metabolic pathways and upregulation of ALDH2 and ALDH7A1, that mediate the NADP+-dependent oxidation of acetaldehyde to acetate, generating NADPH. Results. This shift generates reducing power essential for biosynthesis and redox balance under conditions of oxidative pentose phosphate inhibition. These metabolic changes promote endurance in resistant cells and rewire the interplay between glycolysis, the pentose phosphate pathway, and the tricarboxylic acid cycle, offering a de novo bypass for anaplerosis and bioenergetics. Systematic metabolite profiling revealed distinct transcriptomic and metabolic signatures distinguishing resistant from drug sensitive parental cells. Conclusions. Together, these findings depict a unique, resistance-driven adaptive metabolic shift
Antimicrobial resistance to drugs (AMR), a global threat to human and animal health, is often regarded as resulting from cooperative behaviour. Moreover, microbes generally evolve in volatile environments that, together with demographic fluctuations (birth and death events), drastically alter population size and strain survival. Motivated by the need to better understand the evolution of AMR, we study a population of time-varying size consisting of two competing strains, one drug-resistant and one drug-sensitive, subject to demographic and environmental variability. This is modelled by a binary carrying capacity randomly switching between mild and harsh environmental conditions, and driving the fluctuating volume (total amount of nutrients and antimicrobials at fixed concentration), and thus the size of the community (number of resistant and sensitive cells). We assume that AMR is a shared public good when the concentration of resistant cells exceeds a fixed concentration cooperation threshold, above which the sensitive strain has a growth advantage, whereas resistant cells dominate below it. Using computational means, and devising an analytical treatment (built on suitable quenche
Antimicrobial resistance (AMR) poses a mounting global health crisis, requiring rapid and reliable prediction frameworks that capture its complex evolutionary dynamics. Traditional antimicrobial susceptibility testing (AST), while accurate, remains laborious and time-consuming, limiting its clinical scalability. Existing computational approaches, primarily reliant on single nucleotide polymorphism (SNP)-based analysis, fail to account for evolutionary drivers such as horizontal gene transfer (HGT) and genome-level interactions. This study introduces a novel Evolutionary Mixture of Experts (Evo-MoE) framework that integrates genomic sequence analysis, machine learning, and evolutionary algorithms to model and predict AMR evolution. A Mixture of Experts model, trained on labeled genomic data for multiple antibiotics, serves as the predictive core, estimating the likelihood of resistance for each genome. This model is embedded as a fitness function within a Genetic Algorithm designed to simulate AMR development across generations. Each genome, encoded as an individual in the population, undergoes mutation, crossover, and selection guided by predicted resistance probabilities. The resu
AMRScan is a hybrid bioinformatics toolkit implemented in both R and [Nextflow](https://www.nextflow.io/) for the rapid and reproducible detection of antimicrobial resistance (AMR) genes from next-generation sequencing (NGS) data. The toolkit enables users to identify AMR gene hits in sequencing reads by aligning them against reference databases such as CARD using BLAST. The R implementation provides a concise, script-based approach suitable for single-sample analysis, teaching, and rapid prototyping. In contrast, the Nextflow implementation enables reproducible, scalable workflows for multi-sample batch processing in high-performance computing (HPC) and containerized environments. It leverages modular pipeline design with support for automated database setup, quality control, conversion, BLAST alignment, and results parsing. AMRScan helps bridge the gap between lightweight exploratory analysis and production-ready surveillance pipelines, making it suitable for both research and public health genomics applications.
Antimicrobial Resistance (AMR) is a rapidly escalating global health crisis. While genomic sequencing enables rapid prediction of resistance phenotypes, current computational methods have limitations. Standard machine learning models treat the genome as an unordered collection of features, ignoring the sequential context of Single Nucleotide Polymorphisms (SNPs). State-of-the-art sequence models like Transformers are often too data-hungry and computationally expensive for the moderately-sized datasets that are typical in this domain. To address these challenges, we propose AMR-EnsembleNet, an ensemble framework that synergistically combines sequence-based and feature-based learning. We developed a lightweight, custom 1D Convolutional Neural Network (CNN) to efficiently learn predictive sequence motifs from high-dimensional SNP data. This sequence-aware model was ensembled with an XGBoost model, a powerful gradient boosting system adept at capturing complex, non-local feature interactions. We trained and evaluated our framework on a benchmark dataset of 809 E. coli strains, predicting resistance across four antibiotics with varying class imbalance. Our 1D CNN-XGBoost ensemble consis
Antimicrobial resistance (AMR) poses a global health threat, reducing the effectiveness of antibiotics and complicating clinical decision-making. To address this challenge, we introduce abx_amr_simulator, a Python-based simulation package designed to model antibiotic prescribing and AMR dynamics within a controlled, reinforcement learning (RL)-compatible environment. The simulator allows users to specify patient populations, antibiotic-specific AMR response curves, and reward functions that balance immedi- ate clinical benefit against long-term resistance management. Key features include a modular design for configuring patient attributes, antibiotic resistance dynamics modeled via a leaky-balloon abstraction, and tools to explore partial observability through noise, bias, and delay in observations. The package is compatible with the Gymnasium RL API, enabling users to train and test RL agents under diverse clinical scenarios. From an ML perspective, the package provides a configurable benchmark environment for sequential decision-making under uncertainty, including partial observability induced by noisy, biased, and delayed observations. By providing a customizable and extensible
There is a pressing need to better understand how microbial populations respond to antimicrobial drugs, and to find mechanisms to possibly eradicate antimicrobial-resistant cells. The inactivation of antimicrobials by resistant microbes can often be viewed as a cooperative behavior leading to the coexistence of resistant and sensitive cells in large populations and static environments. This picture is however greatly altered by the fluctuations arising in volatile environments, in which microbial communities commonly evolve. Here, we study the eco-evolutionary dynamics of a population consisting of an antimicrobial resistant strain and microbes sensitive to antimicrobial drugs in a time-fluctuating environment, modeled by a carrying capacity randomly switching between states of abundance and scarcity. We assume that antimicrobial resistance is a shared public good when the number of resistant cells exceeds a certain threshold. Eco-evolutionary dynamics is thus characterised by demographic noise (birth and death events) coupled to environmental fluctuations which can cause population bottlenecks. By combining analytical and computational means, we determine the environmental conditi
Antimicrobial-resistant (AMR) microbes are a growing challenge in healthcare, rendering modern medicines ineffective. AMR arises from antibiotic production and bacterial evolution, but quantifying its transmission remains difficult. With increasing AMR-related data, data-driven methods offer promising insights into its causes and treatments. This paper reviews AMR research from a data analytics and machine learning perspective, summarizing the state-of-the-art and exploring key areas such as surveillance, prediction, drug discovery, stewardship, and driver analysis. It discusses data sources, methods, and challenges, emphasizing standardization and interoperability. Additionally, it surveys statistical and machine learning techniques for AMR analysis, addressing issues like data noise and bias. Strategies for denoising and debiasing are highlighted to enhance fairness and robustness in AMR research. The paper underscores the importance of interdisciplinary collaboration and awareness of data challenges in advancing AMR research, pointing to future directions for innovation and improved methodologies.
Nanofluids are known to have significantly different thermal properties relative to the corresponding conventional fluids. Heat transfer at the solid-fluid interface affects the thermal properties of nanofluids. The current work helps in understanding the role of two nanoscale phenomena, namely ordering of fluid layer around the nanoparticle (nanolayer) and thermal resistance at the interface of solid-fluid in the enhancement of thermal conductivity of Al2O3 - CO2 nanofluid. In this study, molecular dynamics (MD) simulations have been used to study the thermal interfacial resistance by transient non-equilibrium heat technique and nanolayer formed between Al2O3 nanoparticle (np) and surrounded CO2 molecules in the gaseous and supercritical phase. The nanoparticle diameter (dNP) is varied between 2 and 5 nm to investigate the size effect on thermal interfacial resistance (TIR) and thermal conductivity of nanofluid and the results indicate that the TIR for larger diameters is relatively high in both the phases. The study of the effect of surface wettability and temperature on TIR reveals that the resistance decreases with increase in interaction strength and temperature, but is entire
Antimicrobial resistance is becoming a major threat to public health throughout the world. Researchers are attempting to contrast it by developing both new antibiotics and patient-specific treatments. In the second case, whole-genome sequencing has had a huge impact in two ways: first, it is becoming cheaper and faster to perform whole-genome sequencing, and this makes it competitive with respect to standard phenotypic tests; second, it is possible to statistically associate the phenotypic patterns of resistance to specific mutations in the genome. Therefore, it is now possible to develop catalogues of genomic variants associated with resistance to specific antibiotics, in order to improve prediction of resistance and suggest treatments. It is essential to have robust methods for identifying mutations associated to resistance and continuously updating the available catalogues. This work proposes a general method to study minimal inhibitory concentration (MIC) distributions and to identify clusters of strains showing different levels of resistance to antimicrobials. Once the clusters are identified and strains allocated to each of them, it is possible to perform regression method to
Antimicrobial peptides have emerged as promising molecules to combat antimicrobial resistance. However, fragmented datasets, inconsistent annotations, and the lack of standardized benchmarks hinder computational approaches and slow down the discovery of new candidates. To address these challenges, we present the Expanded Standardized Collection for Antimicrobial Peptide Evaluation (ESCAPE), an experimental framework integrating over 80.000 peptides from 27 validated repositories. Our dataset separates antimicrobial peptides from negative sequences and incorporates their functional annotations into a biologically coherent multilabel hierarchy, capturing activities across antibacterial, antifungal, antiviral, and antiparasitic classes. Building on ESCAPE, we propose a transformer-based model that leverages sequence and structural information to predict multiple functional activities of peptides. Our method achieves up to a 2.56% relative average improvement in mean Average Precision over the second-best method adapted for this task, establishing a new state-of-the-art multilabel peptide classification. ESCAPE provides a comprehensive and reproducible evaluation framework to advance A
Background: Antimicrobial resistance (AMR) is a global health threat. While the WHO Global Antimicrobial Resistance and Use Surveillance System (GLASS) provides standardized data, population-level machine learning forecasting of resistance trends remains limited. Translating computational forecasts into policy requires transparent interpretation mechanisms. Methods: Surveillance data (2021-2023) comprising 5,909 observations across 44 countries and five WHO regions were processed. A rigorous temporal split prevented data leakage. Six models (Naive, Linear, Ridge, XGBoost, LightGBM, LSTM) were benchmarked to forecast one-year-ahead resistance rates using features including prior-year resistance and antibiotic consumption. Evaluation metrics (MAE, RMSE, sMAPE) were computed, with 95% bootstrap confidence intervals for MAE. A local Retrieval-Augmented Generation (RAG) system utilizing Gemma 4 was implemented to translate forecast findings into policy guidance grounded in retrieved WHO documents. Results: XGBoost achieved the best performance (test MAE = 6.13% [95% CI: 5.83-6.44]), an 85.3% error reduction versus the naive baseline (MAE = 41.79%). SHAP analysis identified prior-year re
Antimicrobial resistance (AMR) is a global threat and combating its spread is of paramount importance. AMR often results from a cooperative behaviour with shared drug protection. Microbial communities generally evolve in volatile, spatially structured settings. Migration, space, fluctuations, and environmental variability all have a significant impact on the development and proliferation of AMR. While drug resistance is enhanced by migration in static conditions, this changes in time-fluctuating spatially structured environments. Here, we consider a two-dimensional metapopulation consisting of demes in which drug-resistant and sensitive cells evolve in a time-changing environment. This contains a toxin against which protection can be shared (cooperative AMR). Cells migrate between demes and connect them. When the environment and the deme composition vary on the same timescale, strong population bottlenecks cause fluctuation-driven extinction events, countered by migration. We investigate the influence of migration and environmental variability on the AMR eco-evolutionary dynamics by asking at what migration rate fluctuations can help clear resistance and what are the near-optimal e
We introduce and discuss a kinetic framework describing the time evolution of the statistical distributions of a population divided into the compartments of susceptible, infectious, recovered, and resistant in the presence of a microbial infection driven by susceptible infectious interactions. Our main objective is to quantify the impact of excessive and inappropriate antimicrobial use, which accelerates the spread of resistance by enabling a fraction of infectious individuals to transition into the resistant compartment. The model consists of a system of Boltzmann type equations capturing binary interactions between susceptible and infectious individuals, complemented by linear redistribution operators that represent recovery, the development of resistance, and reinfection processes. In the grazing collision limit, we show that this Boltzmann system is well approximated by a system of coupled Fokker Planck equations. This limiting description allows for a more tractable analysis of the dynamics, including the characterization of the long-time behavior of the population densities. Our analysis highlights how interaction terms drive the system toward a stable equilibrium and quantif
Cross-species antimicrobial resistance (AMR) prediction is fundamentally an out-of-distribution (OOD) generalization problem: models trained on one set of bacterial taxa must transfer to phylogenetically distinct genomes that may rely on different resistance mechanisms. Across species, resistance arises from a heterogeneous mixture of localized, horizontally transferred gene cassettes and diffuse species-specific genomic backgrounds, making successful transfer inherently mechanism-dependent. Using a strict species holdout protocol, we first establish an interpretable k-mer baseline with Kover and show that strong within-species performance collapses under true cross-species evaluation. This motivates representation-level approaches that preserve transferable biological signals rather than amplify phylogenetic shortcuts. We investigate genomic foundation model embeddings derived from Evo-1-8k-base and introduce diagnostics for layer selection based on activation scale, isotropy, effective rank, and cross-seed stability under native bfloat16 inference. These analyses identify a stability boundary in deeper layers and reveal that embeddings extracted near this boundary provide more ro
This study demonstrates that generative large language models can be utilized in a more flexible manner for DNA sequence analysis and classification tasks compared to traditional transformer encoder-based models. While recent encoder-based models such as DNABERT and Nucleotide Transformer have shown significant performance in DNA sequence classification, transformer decoder-based generative models have not yet been extensively explored in this field. This study evaluates how effectively generative Large Language Models handle DNA sequences with various labels and analyzes performance changes when additional textual information is provided. Experiments were conducted on antimicrobial resistance genes, and the results show that generative Large Language Models can offer comparable or potentially better predictions, demonstrating flexibility and accuracy when incorporating both sequence and textual information. The code and data used in this work are available at the following GitHub repository: https://github.com/biocomgit/llm4dna.