Microorganisms are ubiquitous in nature, and microbial activities are closely intertwined with the entire life cycle system and human life. Developing novel technologies for the detection, characterization and manipulation of microorganisms promotes their applications in clinical, environmental and industrial areas. Over the last two decades, terahertz (THz) technology has emerged as a new optical tool for microbiology. The great potential originates from the unique advantages of THz waves including the high sensitivity to water and inter-/intra-molecular motions, the non-invasive and label-free detecting scheme, and their low photon energy. THz waves have been utilized as a stimulus to alter microbial functions, or as a sensing approach for quantitative measurement and qualitative differentiation. This review specifically focuses on recent research progress of THz technology applied in the field of microbiology, including two major parts of THz biological effects and the microbial detection applications. In the end of this paper, we summarize the research progress and discuss the challenges currently faced by THz technology in microbiology, along with potential solutions. We also
Advancements in artificial intelligence (AI) have transformed many scientific fields, with microbiology and microbiome research now experiencing significant breakthroughs through machine learning applications. This review provides a comprehensive overview of AI-driven approaches tailored for microbiology and microbiome studies, emphasizing both technical advancements and biological insights. We begin with an introduction to foundational AI techniques, including primary machine learning paradigms and various deep learning architectures, and offer guidance on choosing between traditional machine learning and sophisticated deep learning methods based on specific research goals. The primary section on application scenarios spans diverse research areas, from taxonomic profiling, functional annotation \& prediction, microbe-X interactions, microbial ecology, metabolic modeling, precision nutrition, clinical microbiology, to prevention \& therapeutics. Finally, we discuss challenges in this field and highlight some recent breakthroughs. Together, this review underscores AI's transformative role in microbiology and microbiome research, paving the way for innovative methodologies an
The Antibiotic Resistance Microbiology Dataset (ARMD) is a de-identified resource derived from electronic health records (EHR) that facilitates research in antimicrobial resistance (AMR). ARMD encompasses big data from adult patients collected from over 15 years at two academic-affiliated hospitals, focusing on microbiological cultures, antibiotic susceptibilities, and associated clinical and demographic features. Key attributes include organism identification, susceptibility patterns for 55 antibiotics, implied susceptibility rules, and de-identified patient information. This dataset supports studies on antimicrobial stewardship, causal inference, and clinical decision-making. ARMD is designed to be reusable and interoperable, promoting collaboration and innovation in combating AMR. This paper describes the dataset's acquisition, structure, and utility while detailing its de-identification process.
The study of microorganisms, or microbiology, has demonstrated significant development since its inception and is currently a key field of biological sciences that has a huge impact on modern society and scientific research. Over the centuries, this discipline has undergone significant changes, shaping our understanding of infectious diseases and food safety. Starting from the simplest observations of microscopic organisms such as bacteria, viruses, fungi and protozoa, and ending with modern molecular and genomic research methods. This article describes a brief historical path of microbiology development. The heuristic, morphological, physiological, immunological, and molecular genetic stages are the main periods into which the development of this science is traditionally divided, despite the lack of full-fledged and precise boundaries between them.
Mathematical models are increasingly a part of microbiological research. Here, we share our perspective on how modeling advances the discipline by: (i) enforcing logical consistency, (ii) enabling quantitative prediction, (iii) extracting hidden parameters from data, and (iv) generating intuitive understanding. We map a spectrum of modeling frameworks, from whole-cell simulations to minimal logistic growth equations, and provide interactive examples for some common frameworks. Building on this overview, we outline pragmatic criteria for choosing an appropriate level of description to capture phenomena of interest. Finally, we present a case study in modeling of microbial ecosystems from our own work to illustrate how mechanistic modeling can yield generalizable intuition. This perspective aims to be an introductory roadmap for integrating mathematical modeling into experimental microbiology.
The Earth possesses many environmental extremes that mimic conditions on extraterrestrial worlds. The stratosphere at 30-40 km altitude closely resembles the surface of Mars in terms of pressure, temperature, and radiation levels (UV, proton, and Galactic cosmic rays). While microbial life in the troposphere is well documented, the true upper limit of Earth's biosphere remains unclear. The stratosphere offers a promising environment to explore microbial survival in such extreme conditions. Despite its significance to astrobiology, this region remains largely unexplored due to difficulties in access and avoiding contamination. To address this, we have developed SAMPLE (Stratospheric Altitude Microbiology Probe for Life Existence), a balloon-borne payload designed to collect dust samples from the stratosphere and return them in conditions suitable for lab analysis. The entire system is novel and designed in-house, with weight- and stress-optimized components. The main payload includes three pre-sterilized sampling trays and a controller that determines altitude and governs tray operation. One tray will remain closed during flight (airborne control) and another on the ground (cleanroo
This study addresses from the Optimal Experimental Design perspective the use of the isothermal experimentation procedure to precisely estimate the parameters defining models used in predictive microbiology. Starting from a case study set out in the literature, and taking the Baranyi model as the primary model, and the Ratkowsky square-root model as the secondary, D- and c-optimal designs are provided for isothermal experiments, taking the temperature both as a value fixed by the experimenter and as a variable to be designed. The designs calculated show that those commonly used in practice are not efficient enough to estimate the parameters of the secondary model, leading to greater uncertainty in the predictions made via these models. Finally, an analysis is carried out to determine the effect on the efficiency of the possible reduction in the final experimental time.
The SSPACE Astrobiology Payload (SAP) series, starting with the SAP-1 project is designed to conduct in-situ microbiology experiments in low earth orbit. This payload series aims to understand the behaviour of microbial organisms in space, particularly those critical for human health, and the corresponding effects due to microgravity and solar/galactic radiation. SAP-1 focuses on studying Bacillus clausii and Bacillus coagulans, bacteria beneficial to humans. It aims to provide a space laboratory for astrobiology experiments under microgravity conditions. The hardware developed for these experiments is indigenous and tailored to meet the unique requirements of autonomous microbiology experiments by controlling pressure, temperature, and nutrition flow to bacteria. A rotating platform, which forms the core design, is innovatively utilised to regulate the flow and mixing of nutrients with dormant bacteria. The technology demonstration models developed at SSPACE have yielded promising results, with ongoing efforts to refine, adapt for space conditions, and prepare for integration with nanosatellites or space modules. The anticipated payload will be compact, approximately 1U in size (1
Despite the value of VR (Virtual Reality) for educational purposes, the instructional power of VR in Biology Laboratory education remains under-explored. Laboratory lectures can be challenging due to students' low motivation to learn abstract scientific concepts and low retention rate. Therefore, we designed a VR-based lecture on fermentation and compared its effectiveness with lectures using PowerPoint slides and a desktop application. Grounded in the theory of distributed cognition and motivational theories, our study examined how learning happens in each condition from students' learning outcomes, behaviors, and perceptions. Our result indicates that VR facilitates students' long-term retention to learn by cultivating their longer visual attention and fostering a higher sense of immersion, though students' short-term retention remains the same across all conditions. This study extends current research on VR studies by identifying the characteristics of each teaching artifact and providing design implications for integrating VR technology into higher education.
The purpose of this paper is to re-open from a practical perspective the question of the extent in altitude of the Earth's biosphere. We make a number of different suggestions for how searches for biological material could be conducted in the mesosphere and lower thermosphere, colloquially referred to as the ignoreosphere due to its lack of investigation in the meteorological community compared to other regions. Relatively recent technological advances such as CubeSats in Very Low Earth Orbit or more standard approaches such as the rocket borne MAGIC meteoric smoke particle sampler, are shown as potentially viable for sampling biological material in the ignoreosphere. The issue of contamination is discussed and a potential solution to the problem is proposed by the means of a new detector design which filters for particles based on their size and relative-velocity to the detector.
Microbiology culture reports contain critical information for important clinical and public health applications. However, microbiology reports often have complex, semi-structured, free-text data that present a barrier for secondary use. Here we present the development and validation of an open-source package designed to ingest free-text microbiology reports, determine whether the culture is positive, and return a list of SNOMED-CT mapped bacteria. Our rule-based natural language processing algorithm was developed using microbiology reports from two different electronic health record systems in a large healthcare organization, and then externally validated on the reports of two other institutions with manually-extracted results as a benchmark. Our algorithm achieved F-1 scores >0.95 on all classification tasks across both validation sets. Our concept extraction Python package, MicrobEx, is designed to be reused and adapted to individual institutions as an upstream process for other clinical applications, such as machine learning studies, clinical decision support, and disease surveillance systems.
Microbiology is the science of microbes, particularly bacteria. Many bacteria are motile: they are capable of self-propulsion. Among these, a significant class execute so-called run-and-tumble motion: they follow a fairly straight path for a certain distance, then abruptly change direction before repeating the process. This dynamics has something in common with Brownian motion (it is diffusive at large scales), and also something in contrast. Specifically, motility parameters such as the run speed and tumble rate depend on the local environment and hence can vary in space. When they do so, even if a steady state is reached, this is not generally invariant under time-reversal: the principle of detailed balance, which restores the microscopic time-reversal symmetry of systems in thermal equilibrium, is mesoscopically absent in motile bacteria. This lack of detailed balance (allowed by the flux of chemical energy that drives motility) creates pitfalls for the unwary modeller. Here I review some statistical mechanical models for bacterial motility, presenting them as a paradigm for exploring diffusion without detailed balance. I also discuss the extent to which statistical physics is u
The detection and classification of bacterial colonies in images of agar-plates is important in microbiology, but is hindered by the lack of labeled datasets. Therefore, we propose Colony Grounded SAM2, a zero-shot inference pipeline to detect and segment bacterial colonies in multiple settings without any further training. By utilizing the pre-trained foundation models Grounding DINO and Segment Anything Model 2, fine-tuned to the microbiological domain, we developed a model that is robust to data changes. Results showed a mean Average Precision of 93.1\% and a $Dice@detection$ score of 0.85, showing excellent detection and segmentation capabilities on out-of-distribution datasets. The entire pipeline with model weights are shared open access to aid with annotation- and classification purposes in microbiology.
Matrix-Assisted Laser Desorption/Ionization Time-of-Flight Mass Spectrometry (MALDI-TOF MS) has become a cornerstone technology in clinical microbiology, enabling rapid and accurate microbial identification. However, the development of data-driven diagnostic models remains limited by the lack of sufficiently large, balanced, and standardized spectral datasets. This study investigates the use of deep generative models to synthesize realistic MALDI-TOF MS spectra, aiming to overcome data scarcity and support the development of robust machine learning tools in microbiology. We adapt and evaluate three generative models, Variational Autoencoders (MALDIVAEs), Generative Adversarial Networks (MALDIGANs), and Denoising Diffusion Probabilistic Model (MALDIffusion), for the conditional generation of microbial spectra guided by species labels. Generation is conditioned on species labels, and spectral fidelity and diversity are assessed using diverse metrics. Our experiments show that synthetic data generated by MALDIVAE, MALDIGAN, and MALDIffusion are statistically and diagnostically comparable to real measurements, enabling classifiers trained exclusively on synthetic samples to reach perfo
The process of quantifying mold colonies on Petri dish samples is of critical importance for the assessment of indoor air quality, as high colony counts can indicate potential health risks and deficiencies in ventilation systems. Conventionally the automation of such a labor-intensive process, as well as other tasks in microbiology, relies on the manual annotation of large datasets and the subsequent extensive training of models like YoloV9. To demonstrate that exhaustive annotation is not a prerequisite anymore when tackling a new vision task, we compile a representative dataset of 5000 Petri dish images annotated with bounding boxes, simulating both a traditional data collection approach as well as few-shot and low-shot scenarios with well curated subsets with instance level masks. We benchmark three vision foundation models against traditional baselines on task specific metrics, reflecting realistic real-world requirements. Notably, MaskDINO attains near-parity with an extensively trained YoloV9 model while finetuned only on 150 images, retaining competitive performance with as few as 25 images, still being reliable on $\approx$ 70% of the samples. Our results show that data-eff
Modeling of growth (or decay) curves arises in many fields such as microbiology, epidemiology, marketing, and econometrics. Parametric forms like Logistic and Gompertz are often used for modeling such monotonic patterns. While useful for compact description, the real-life growth curves rarely follow these parametric forms perfectly. Therefore, the curve estimation methods that strike a balance between prior information in the parametric form and fidelity with the observed data are preferred. In hierarchical, longitudinal studies the interest lies in comparing the growth curves of different groups while accounting for the differences between the within-group subjects. This article describes a flexible state space modeling framework that enables semiparametric growth curve modeling for the data generated from hierarchical, longitudinal studies. The methodology, a type of functional mixed effects modeling, is illustrated with a real-life example of bacterial growth in different settings.
Bacterial heterogeneity is pivotal for adaptation to diverse environments, posing significant challenges in microbial diagnostics and therapeutic interventions. Recent advancements in high-resolution optical microscopy have revolutionized our ability to observe and characterize individual bacteria, offering unprecedented insights into their metabolic states and behaviors at the single-cell level. This review discusses the transformative impact of various high-resolution imaging techniques, including fluorescence and label-free imaging, which have enhanced our understanding of bacterial pathophysiology. These methods provide detailed visualizations that are crucial for developing targeted treatments and improving clinical diagnostics. We highlight the integration of these imaging techniques with computational tools, which has facilitated rapid, accurate pathogen identification and real-time monitoring of bacterial responses to treatments. The ongoing development of these optical imaging technologies promises to significantly advance our understanding of microbiology and to catalyze the translation of these insights into practical healthcare solutions.
A birth-death-move process with mutations is a Markov model for a system of marked particles in interaction, that move over time, with births and deaths. In addition the mark of each particle may also change, which constitutes a mutation. Assuming a parametric form for this model, we derive its likelihood expression and prove its local asymptotic normality. The efficiency and asymptotic distribution of the maximum likelihood estimator, with an explicit expression of its covariance matrix, is deduced. The underlying technical assumptions are showed to be satisfied by several natural parametric specifications. As an application, we leverage this model to analyse the joint dynamics of two types of proteins in a living cell, that are involved in the exocytosis process. Our approach enables to quantify the so-called colocalization phenomenon, answering an important question in microbiology.
Based on bibliometric information from Scopus for the period 1996-2019, this document characterizes the evolution of Uruguayan scientific production and establishes the areas in which the country has a revealed comparative advantage (RCA). Methodologically, it is proposed that there is a RCA in an area if this area has a greater share in national scientific production than the share of the area in world scientific production. The evidence presented considers two measurements of scientific production (published articles and citations) and three levels of aggregation in the areas (a minor one with 5 large areas, a more detailed one with 27 disciplines and another even more granular with more than 300 disaggregations). Within Health Sciences there is a RCA in Veterinary, Nursing and Medicine. Within Life Sciences there is a RCA in Agricultural and Biological Sciences, Immunology and Microbiology and Biochemistry, Genetics and Molecular Biology. In Physical Sciences there is only a RCA in Environmental Science and in Social Sciences only in Economics, Econometrics and Finance.
Exoplanet habitability remains a challenging field due to the large distances separating Earth from other stars. Using insights from biology and astrophysics, we studied the habitability of M-dwarf exoplanets by modeling their surface temperature and flare UV and X-ray doses using the Martian atmosphere as a shielding model. Analyzing the Proxima Centauri and TRAPPIST-1 systems, our models suggest that Proxima b and TRAPPIST-1 e are likeliest to have temperatures compatible with surface liquid water, as well as tolerable radiation environments. Results of the modeling were used as a basis for microbiology experiments to assess spore survival of the melanin-rich fungus Aspergillus niger to exoplanet-like radiation (UV-C and X-rays). Results showed that A. niger spores can endure superflare events on M-dwarf planets when shielded by a Mars-like atmosphere or by a thin layer of soil or water. Melanin-deficient spores suspended in a melanin-rich solution showed higher survival rates and germination efficiency when compared to melanin-free solutions. Overall, the models developed in this work establish a framework for microbiological research in habitability studies. Finally, we showed