Microbiome studies increasingly indicate that disease-associated shifts cannot be understood from compositional changes alone. The functional architecture of microbial communities encoded in patterns of association among microbial gene families may reveal how these systems reorganize across biological conditions. Here, we present a network-based framework for characterizing microbiome rewiring across conditions. The approach combines condition-specific network inference, differential network analysis and pathway enrichment to identify interactions that are gained, lost or altered between groups, with a specific focus on sex-dependent differences. We apply the framework to inflammatory bowel disease, type 2 diabetes and atherosclerotic cardiovascular disease, comparing male and female specific microbial gene-family networks within each disease context. Across these settings, differential networks reveal extensive rewiring of microbial functional interactions, suggesting that microbiome alterations are shaped not only by changes in abundance but also by shifts in community organization. Importantly, pathway enrichment of rewired interactions uncovers functional signals that are not a
We develop and analyze a model for a flat microbial droplet growing on the surface of a three-dimensional viscous fluid. The model describes growth-induced stresses at the fluid surface, density variations in the bulk due to nutrient consumption, and the resulting fluid flows that arise. We reformulate this free-boundary problem as a system of integro-differential equations defined solely on the microbial domain. From this formulation, we identify an axisymmetric solution corresponding to a radially expanding disk and analyze its morphological stability. We find that growth forces stabilize the axisymmetric solution while buoyancy forces destabilize it. We connect these findings to experimental observations.
Statistical physics can describe the behavior of microbial populations consisting of many heterogeneous individuals. A direct consequence is the existence of phase transitions, where the behavior of a population changes discontinuously upon a small perturbation. While such phase transitions have often been proposed in biology, connecting observed behavior to the underlying physics has remained challenging. We show how phase transitions naturally arise in microbial population dynamics and highlight their connection with genealogies. We rigorously demonstrate the existence of a first-order phase transition in a model of bacterial plasmid engineering and find a strict lower bound on the number of plasmids that can be stably maintained in a population.
The gut microbiome plays a crucial role in human health, yet the mechanisms underlying host-microbiome interactions remain unclear, limiting its translational potential. Recent microbiome multiomics studies, particularly paired microbiome-metabolome studies (PM2S), provide valuable insights into gut metabolism as a key mediator of these interactions. Our preliminary data reveal strong correlations among certain gut metabolites, suggesting shared metabolic pathways and microbial co-metabolism. However, these findings are confounded by various factors, underscoring the need for a more rigorous statistical approach. Thus, we introduce microbial correlation, a novel metric that quantifies how two metabolites are co-regulated by the same gut microbes while accounting for confounders. Statistically, it is based on a partially linear model that isolates microbial-driven associations, and a consistent estimator is established based on semi-parametric theory. To improve efficiency, we develop a calibrated estimator with a parametric rate, maximizing the use of large external metagenomic datasets without paired metabolomic profiles. This calibrated estimator also enables efficient p-value ca
Bacteria frequently colonize natural microcavities such as gut crypts, plant apoplasts, and soil pores. Recent studies have shown that the physical structure of these spaces plays a crucial role in shaping the stability and resilience of microbial populations (Karita et al., PNAS 2022, Postek et al. PNAS 2024). Here, we demonstrate that protected microhabitats can emerge dynamically, even in the absence of physical barriers. Interactions with surface features -- such as roughness or friction -- lead microbial populations to self-organize into effectively segregated subpopulations. Our numerical and analytical models reveal that this self-organization persists even when strains have different growth rates, allowing slower-growing strains to avoid competitive exclusion. These findings suggest that emergent spatial structuring can serve as a fundamental mechanism for maintaining microbial diversity, despite selection pressures, competition, and genetic drift.
Iron (Fe) reduction is one of Earth's most ancient microbial metabolisms, but after atmosphere-ocean oxygenation, this anaerobic process was relegated to niche anoxic environments below the water and soil surface. However, new technologies to monitor redox processes at the microscale relevant to microbial cells have recently revealed that the oxygen (O2) concentrations controlling the distribution of aerobic and anaerobic metabolisms are more heterogeneous than previously believed. To explore how O2 levels regulate microbial Fe reduction, we cultivated a facultative Fe-reducing bacterium using a cutting-edge microfluidic reactor integrated with transparent planar O2 sensors. Contrary to expectations, microbial growth induced Fe(III)-oxide (ferrihydrite) reduction under fully oxygenated conditions without forming O2-depleted microsites. Batch incubations highlighted the importance of the process at a larger scale, fundamentally changing our understanding of Fe cycling from the conceptualization of metal and nutrient mobility in the subsurface to our interpretation of Fe mineralogy in the rock record.
The human body consists of microbiomes associated with the development and prevention of several diseases. These microbial organisms form several complex interactions that are informative to the scientific community for explaining disease progression and prevention. Contrary to the traditional view of the microbiome as a singular, assortative network, we introduce a novel statistical approach using a weighted stochastic infinite block model to analyze the complex community structures within microbial co-occurrence microbial interaction networks. Our model defines connections between microbial taxa using a novel semi-parametric rank-based correlation method on their transformed relative abundances within a fully connected network framework. Employing a Bayesian nonparametric approach, the proposed model effectively clusters taxa into distinct communities while estimating the number of communities. The posterior summary of the taxa community membership is obtained based on the posterior probability matrix, which could naturally solve the label switching problem. Through simulation studies and real-world application to microbiome data from postmenopausal patients with recurrent urinar
The rise of complex multicellular ecosystems Neoproterozoic time was preceded by a microbial Proterozoic biosphere, where productivity may have been largely restricted to microbial mats made up of bacteria including oxygenic photosynthetic Cyanobacteria, anoxygenic phototrophs, and heterotrophs. In modern environments, analogous microbial mats can be found in restricted environments such as carbonate tidal flats and terrestrial hot springs. Here, we report metagenomic sequence data from an analog in the hot springs of Waikite Valley, Aotearoa New Zealand, where carbon-rich, slightly-alkaline geothermal waters support diverse phototrophic microbial mats. The Waikite Valley hot spring in the Taupo Volcanic Zone of Aotearoa New Zealand was sampled in duplicate at 8 points along a temperature gradient transect of the outflow, from ~62 C (near the source) to ~37 C (~100 meters downstream). ~686 Gb of shotgun metagenomic sequence data was generated by Illumina Novaseq. Each sample was assembled using SPAdes, followed by binning of metagenome-assembled genomes (MAGs) by MetaBAT. These data are useful for the genomic analysis of novel phototrophic bacteria, as well as for ecological compar
Over the past years, substantial numbers of microbial species' genomes have been deposited outside of conventional INSDC databases. The GlobDB aggregates 14 independent genomic catalogues to provide a comprehensive database of species-dereplicated microbial genomes, with consistent taxonomy, annotations, and additional analysis resources. The GlobDB is available at https://globdb.org/.
Microbial communities assemble through a complex set of interactions between microbes and their environment, and the resulting metabolic impact on the host ecosystem can be profound. Microbial activity is known to impact human health, plant growth, water quality, and soil carbon storage which has lead to the development of many approaches and products meant to manipulate the microbiome. In order to understand, predict, and improve microbial community engineering, genome-scale modeling techniques have been developed to translate genomic data into inferred microbial dynamics. However, these techniques rely heavily on simulation to draw conclusions which may vary with unknown parameters or initial conditions, rather than more robust qualitative analysis. To better understand microbial community dynamics using genome-scale modeling, we provide a tool to investigate the network of interactions between microbes and environmental metabolites over time. Using our previously developed algorithm for simulating microbial communities from genome-scale metabolic models (GSMs), we infer the set of microbe-metabolite interactions within a microbial community in a particular environment. Because t
For more than 3.5 billion years, life experienced dramatic environmental extremes on Earth. These include shifts from oxygen-less to over-oxygenated atmospheres and cycling between hothouse conditions and global glaciations. Meanwhile, an ecological revolution took place. The planet evolved from one dominated by microbial life to one containing the plants and animals that are most familiar today. The activities of many key cellular inventions evolved early in the history of life, collectively defining the nature of our biosphere and underpinning human survival. There is a critical need for a new disciplinary synthesis to reveal how microbes and their molecular systems survived ever changing global conditions over deep time. This review critically examines our current understanding of early microbial life and describes the foundations of an emerging area in microbiology and evolutionary synthetic biology to reconstruct the earliest microbial innovations.
Inferring microbial interaction networks from abundance patterns is an important approach to advance our understanding of microbial communities in general and the human microbiome in particular. Here we suggest discriminating two levels of information contained in microbial abundance data: (1) the quantitative abundance values and (2) the pattern of presences and absences of microbial organisms. The latter allows for a binary view on microbiome data and a novel interpretation of microbial data as attractors, or more precisely as fixed points, of a Boolean network. Starting from these attractors, our aim is to infer an interaction network between the species present in the microbiome samples. To accomplish this task, we introduce a novel inference method that combines the previously published ESABO (Entropy Shifts of Abundance vectors under Boolean Operations) method with an evolutionary algorithm. The key idea of our approach is that the inferred network should reproduce the original set of (observed) binary abundance patterns as attractors. We study the accuracy and runtime properties of this evolutionary method, as well as its behavior under incomplete knowledge of the attractor
Soil microbial communities are known to be robust against perturbations such as nutrition inputs, which appears as an obstacle for the soil improvement. On the other hand, its adaptable aspect has been also reported. Here we propose simple measures for these seemingly contradicting features of soil microbial communities, robustness and plasticity, based on the distribution of the populations. The first measure is the similarity in the population balance, i.e. the shape of the distribution function, which is found to show resilience against the nutrition inputs. The other is the similarity in the composition of the species measured by the rank order of the population, which shows an adaptable response during the population balance is recovering. These results clearly show that the soil microbial system is robust (or, homeostatic) in its population balance, while the composition of the species is rather plastic and adaptable.
With a fleet of exploratory space missions on the horizon, the study of target specific biospheres is crucial for accurately determining the probability of the existence of microbial life on various planetary bodies and prioritising targets accordingly. Although previous studies have compared the potential habitability of objects in our solar system by bulk characteristics, it is less common that precise qualitative methods are developed for ranking candidates hospitable to microbial life on a local environment basis. In this review we create a planetary environmental database and use it to motivate a list of primary habitability candidates and essential criteria for microbial survival. We then propose a new method, the Microbial Habitability Index (MHI) which uses a metric of microbial survival factor values in target environments compared with appropriate Earth analogues to assess their potential for life. We arrive at a selection of eight primary candidates and from this set conclude that Europa, Mars, and Enceladus have the highest potential for facilitating microbial survival.
Microbial communities play important roles in the function and maintenance of various biosystems, ranging from human body to the environment. Current methods for analysis of microbial communities are typically based on taxonomic phylogenetic alignment using 16S rRNA metagenomic or Whole Genome Sequencing data. In typical characterizations of microbial communities, studies deal with billions of micobial sequences, aligning them to a phylogenetic tree. We introduce a new approach for the efficient analysis of microbial communities. Our new reference-free analysis tech- nique is based on n-gram sequence analysis of 16S rRNA data and reduces the processing data size dramatically (by 105 fold), without requiring taxonomic alignment. The proposed approach is applied to characterize phenotypic microbial community differ- ences in different settings. Specifically, we applied this approach in classification of microbial com- munities across different body sites, characterization of oral microbiomes associated with healthy and diseased individuals, and classification of microbial communities longitudinally during the develop- ment of infants. Different dimensionality reduction methods are in
Microbes are often discussed in terms of dichotomies such as copiotrophic/oligotrophic and fast/slow-growing microbes, defined using the characterisation of microbial growth in isolated cultures. The dichotomies are usually qualitative and/or study-specific, sometimes precluding clear-cut results interpretation. We are able to interpret microbial dichotomies as life history strategies by combining ecology theory with Monod curves, a classical laboratory tool of bacterial physiology. Monod curves relate the specific growth rate of a microbe with the concentration of a limiting nutrient, and provide quantities that directly correspond to key ecological parameters in McArthur and Wilsons r/K selection theory, Tilmans resource competition and community structure theory and Grimes triangle of life strategies. The resulting model allows us to reconcile the copiotrophic/oligotrophic and fast/slow-growing dichotomies as different subsamples of a life history strategy triangle that also includes r/K strategists. We analyzed some ecological context by considering the known viable carbon sources for heterotrophic microbes in the framework of community structure theory. This partly explains th
The interactions among the constituent members of a microbial community play a major role in determining the overall behavior of the community and the abundance levels of its members. These interactions can be modeled using a network whose nodes represent microbial taxa and edges represent pairwise interactions. A microbial network is a weighted graph that is constructed from a sample-taxa count matrix, and can be used to model co-occurrences and/or interactions of the constituent members of a microbial community. The nodes in this graph represent microbial taxa and the edges represent pairwise associations amongst these taxa. A microbial network is typically constructed from a sample-taxa count matrix that is obtained by sequencing multiple biological samples and identifying taxa counts. From large-scale microbiome studies, it is evident that microbial community compositions and interactions are impacted by environmental and/or host factors. Thus, it is not unreasonable to expect that a sample-taxa matrix generated as part of a large study involving multiple environmental or clinical parameters can be associated with more than one microbial network. However, to our knowledge, micr
An active area of research interest is the inference of ecological models of complex microbial communities. Inferring such ecological models entails understanding the interactions between microbes and how they affect each other's growth. This dissertation employs a statistical perspective to contribute further to the knowledge currently addressing this problem. Part I explains how high-throughput droplet-based microfluidics technology can be used to screen for microbial interactions. An explicit, statistical framework is motivated and developed that can guide the analysis of data from such experiments. Part II explains how it might be possible to predict, based on the experimental setup, how much data will be produced to infer given microbial interactions. Running the experiment once without incubating the droplets turns out to be necessary to make such predictions. Part III demonstrates the feasibility of inferring microbial interactions from the data produced by these experiments. Relevant ideas from the microbiological and ecological literature are recast into an explicit, statistical framework.
The human habitat is a host where microbial species evolve, function, and continue to evolve. Elucidating how microbial communities respond to human habitats is a fundamental and critical task, as establishing baselines of human microbiome is essential in understanding its role in human disease and health. However, current studies usually overlook a complex and interconnected landscape of human microbiome and limit the ability in particular body habitats with learning models of specific criterion. Therefore, these methods could not capture the real-world underlying microbial patterns effectively. To obtain a comprehensive view, we propose a novel ensemble clustering framework to mine the structure of microbial community pattern on large-scale metagenomic data. Particularly, we first build a microbial similarity network via integrating 1920 metagenomic samples from three body habitats of healthy adults. Then a novel symmetric Nonnegative Matrix Factorization (NMF) based ensemble model is proposed and applied onto the network to detect clustering pattern. Extensive experiments are conducted to evaluate the effectiveness of our model on deriving microbial community with respect to bod
The dynamics of microbial communities is incredibly complex, determined by competition for metabolic substrates and cross-feeding of byproducts. Species in the community grow by harvesting energy from chemical reactions that transform substrates to products. In many anoxic environments, these reactions are close to thermodynamic equilibrium and growth is slow. To understand the community structure in these energy-limited environments, we developed a microbial community consumer-resource model incorporating energetic and thermodynamic constraints on an interconnected metabolic network. The central ingredient of the model is product inhibition, meaning that microbial growth may be limited not only by depletion of metabolic substrates but also by accumulation of products. We demonstrate that these additional constraints on microbial growth cause a convergence in the structure and function of the community metabolic network -- independent of species composition and biochemical details -- providing a possible explanation for convergence of community function despite taxonomic variation observed in many natural and industrial environments. Furthermore, we discovered that the structure of