Metabolic models condense biochemical knowledge about organisms in a structured and standardised way. As large-scale network reconstructions are readily available for many organisms, genome-scale models are being widely used among modellers and engineers. However, these large models can be difficult to analyse and visualise and occasionally generate predictions that are hard to interpret or even biologically unrealistic. Of the thousands of enzymatic reactions in a typical bacterial metabolism, only a few hundred form the metabolic pathways essential to produce energy carriers and biosynthetic precursors. These pathways carry relatively high flux, are central to maintaining and reproducing the cell, and provide precursors and energy to engineered metabolic pathways. Focusing on these central metabolic subsystems, we present iCH360, a manually curated medium-scale model of energy and biosynthesis metabolism for the well-studied bacterium Escherichia coli K-12 MG1655. The model is a sub-network of the most recent genome-scale reconstruction, iML1515, and comes with an updated layer of database annotations and with a range of metabolic maps for visualisation. We enriched the stoichiom
The article examines the theoretical, methodological, and technical foundations of research on audiovisual corpora within the field of digital humanities. It outlines the main transversal issues underlying the processes of constructing, exploiting, and interpreting such corpora, which are conceived as specific forms of textual data in the broad sense - that is, as sets of semiotic traces (written, visual, sound, or multimodal) that make it possible to document, analyze, and transmit domains of knowledge. The analysis is organized around five complementary themes. The first concerns the status and structure of textual data lato sensu: any data, regardless of its medium, participates in a meaningful representation of a domain and therefore requires a unified theoretical and methodological framework based on a transdisciplinary semiotic approach. The second theme addresses the documentary value of data and corpora, understood as the relevance of materials for documenting a research object in relation to the goals and perspectives of the projects in which they are used. This value depends both on provenance and reasoned selection, and on the pragmatic context of their use. The third th
The origin of microbial cells required the emergence of metabolism, an autocatalytic network of roughly 400 enzymatically catalyzed chemical reactions that synthesize the building blocks of life: amino acids, nucleotides and cofactors. Proposals for metabolic origin are theoretical in nature [1-9], empirical studies addressing the origin and early evolution of the 400-reaction chemical network itself are lacking. Here we identify intermediate states in the primordial assembly of metabolism from its inorganic origins, using structure-refined clusters for metabolic enzymes of prokaryotic genomes. We show that metabolism in the last universal common ancestor (LUCA) was enzymatically incomplete, undergoing final assembly independently in the lineages leading to bacteria and archaea, with metal catalysts that predated both enzymes and cofactors providing essential functions. Over half of modern core metabolism corresponds to laboratory reactions catalyzed by native transition metals--Fe(0), Co(0), Ni(0) and their alloys--under conditions of serpentinizing hydrothermal vents. As the hitherto elusive source of primordial aqueous phosphorylation, we show that phosphite, a constituent of se
Metabolism displays striking and robust regularities in the forms of modularity and hierarchy, whose composition may be compactly described. This renders metabolic architecture comprehensible as a system, and suggests the order in which layers of that system emerged. Metabolism also serves as the foundation in other hierarchies, at least up to cellular integration including bioenergetics and molecular replication, and trophic ecology. The recapitulation of patterns first seen in metabolism, in these higher levels, suggests metabolism as a source of causation or constraint on many forms of organization in the biosphere. We identify as modules widely reused subsets of chemicals, reactions, or functions, each with a conserved internal structure. At the small molecule substrate level, module boundaries are generally associated with the most complex reaction mechanisms and the most conserved enzymes. Cofactors form a structurally and functionally distinctive control layer over the small-molecule substrate. Complex cofactors are often used at module boundaries of the substrate level, while simpler ones participate in widely used reactions. Cofactor functions thus act as "keys" that incor
Local perturbations of individual metabolic reactions may result in different levels of lethality, depending on their roles in metabolism and the size of subsequent cascades induced by their failure. Moreover, essentiality of individual metabolic reactions may show large variations within and across species. Here we quantify their essentialities in hundreds of species by computing the growth rate after removal of individual and pairs of reactions by flux balance analysis. We find that about 10% of reactions are essential, i.e., growth stops without them, and most of the remaining reactions are redundant in the metabolic network of each species. This large-scale and cross-species study allows us to determine ad hoc ages of each reaction and species. We find that when a reaction is older and contained in younger species, the reaction is more likely to be essential. Such correlations of essentiality with the ages of reactions and species may be attributable to the evolution of cellular metabolism, in which alternative pathways are recruited to ensure the stability of important reactions to various degrees across species.
Molecular chirality is critical to biochemical function, but it is unknown when chiral selectivity first became important in the evolutionary transition from geochemistry to biochemistry during the emergence of life. Here, we identify key transitions in the selection of chiral molecules in metabolic evolution, showing how achiral molecules (lacking chiral centers) may have given rise to specific and abundant chiral molecules in the elaboration of metabolic networks from geochemically available precursor molecules. Simulated expansions of biosphere-scale metabolism suggest new hypotheses about the evolution of chiral molecules within biochemistry, including a prominent role for both achiral and chiral compounds as nucleation sites of early metabolic network growth, an increasing enrichment of molecules with more chiral centers as these networks expand, and conservation of broken chiral symmetries along reaction pathways as a general organizing principle. We also find an unexpected enrichment in large, non-polymeric achiral molecules. Leveraging metabolic data of 40,023 genomes and metagenomes, we analyzed the statistics of chiral and achiral molecules in the large-scale organization
Genome-scale metabolic models have become a fundamental tool for examining metabolic principles. However, metabolism is not solely characterized by the underlying biochemical reactions and catalyzing enzymes, but also affected by regulatory events. Since the pioneering work of Covert and co-workers as well as Shlomi and co-workers it is debated, how regulation and metabolism synergistically characterize a coherent cellular state. The first approaches started from metabolic models which were extended by the regulation of the encoding genes of the catalyzing enzymes. By now, bioinformatics databases in principle allow addressing the challenge of integrating regulation and metabolism on a system-wide level. Collecting information from several databases we provide a network representation of the integrated gene regulatory and metabolic system for Escherichia coli, including major cellular processes, from metabolic processes via protein modification to a variety of regulatory events. Besides transcriptional regulation, we also take into account regulation of translation, enzyme activities and reactions. Our network model provides novel topological characterizations of system components
The metabolic network plays a crucial role in regulating bacterial metabolism and growth, but it is subject to inherent molecular stochasticity. Previous studies have utilized flux balance analysis and the maximum entropy method to predict metabolic fluxes and growth rates, while the underlying principles governing bacterial metabolism and growth, especially the criticality hypothesis, remain unclear. In this study, we employ a maximum entropy approach to investigate the universality in various constraint-based metabolic networks of Escherichia coli. Our findings reveal the existence of universal scaling relations across different nutritional environments and metabolic network models, similar to the universality observed in physics. By analyzing single-cell data, we confirm that metabolism of Escherichia coli operates close to the state with maximum Fisher information, which serves as a signature of criticality. This critical state provides functional advantages such as high sensitivity and long-range correlation. Moreover, we demonstrate that a metabolic system operating at criticality takes a compromise solution between growth and adaptation, thereby serving as a survival strateg
Web archives are a historically valuable source of information. In some respects, web archives are the only record of the evolution of human society in the last two decades. They preserve a mix of personal and collective memories, the importance of which tends to grow as they age. However, the value of web archives depends on their users being able to search and access the information they require in efficient and effective ways. Without the possibility of exploring and exploiting the archived contents, web archives are useless. Web archive access functionalities range from basic browsing to advanced search and analytical services, accessed through user-friendly interfaces. Full-text and URL search have become the predominant and preferred forms of information discovery in web archives, fulfilling user needs and supporting search APIs that feed complex applications. Both full-text and URL search are based on the technology developed for modern web search engines, since the Web is the main resource targeted by both systems. However, while web search engines enable searching over the most recent web snapshot, web archives enable searching over multiple snapshots from the past. This m
Metabolism plays a crucial role in sleep regulation, yet its effects are challenging to track in real time. This study introduces a machine learning-based framework to analyze sleep patterns and identify how metabolic changes influence sleep at specific time points. We first established that sleep periods in Drosophila melanogaster function independently, with no causal relationship between different sleep episodes. Using gradient boosting models and explainable artificial intelligence techniques, we quantified the influence of time-dependent sleep features. Causal inference and autocorrelation analyses further confirmed that sleep states at different times are statistically independent, providing a robust foundation for exploring metabolic effects on sleep. Applying this framework to flies with altered monocarboxylate transporter 2 expression, we found that changes in ketone transport modified sleep stability and disrupted transitions between day and night sleep. In an Alzheimers disease model, metabolic interventions such as beta hydroxybutyrate supplementation and intermittent fasting selectively influenced the timing of day to night transitions rather than uniformly altering sl
The different active roles of neurons and astrocytes during neuronal activation are associated with the metabolic processes necessary to supply the energy needed for their respective tasks at rest and during neuronal activation. Metabolism, in turn, relies on the delivery of metabolites and removal of toxic byproducts through diffusion processes and the cerebral blood flow. A comprehensive mathematical model of brain metabolism should account not only for the biochemical processes and the interaction of neurons and astrocytes, but also the diffusion of metabolites. In the present article, we present a computational methodology based on a multidomain model of the brain tissue and a homogenization argument for the diffusion processes. In our spatially distributed compartment model, communication between compartments occur both through local transport fluxes, as is the case within local astrocyte-neuron complexes, and through diffusion of some substances in some of the compartments. The model assumes that diffusion takes place in the extracellular space (ECS) and in the astrocyte compartment. In the astrocyte compartment, the diffusion across the syncytium network is implemented as a
What makes living things special is how they manage matter, energy, and entropy. A general theory of organismal metabolism should therefore be quantified in these three currencies while capturing the unique way they flow between individuals and their environments. We argue that such a theory has quietly arrived -- 'Dynamic Energy Budget' (DEB) theory -- which conceptualises organisms as a series of macrochemical reactions that use energy to transform food into structured biomass and bioproducts while producing entropy. We show that such conceptualisation is deeply rooted in thermodynamic principles and that, with the help of a small set of biological assumptions, it underpins the emergence of fundamental ecophysiological phenomena, most notably the three-quarter power scaling of metabolism. Building on the subcellular nature of the theory, we unveil the eco-evolutionary relevance of coarse-graining biomass into qualitatively distinct, stoichiometricially fixed pools with implicitly regulated dynamics based on surface area-volume relations. We also show how generalised enzymes called 'synthesising units' and an information-based state variable called 'maturity' capture transitions b
Cancer cells are often seen to prefer glycolytic metabolism over oxidative phosphorylation even in the presence of oxygen-a phenomenon termed the Warburg effect. Despite significant strides in the decades since its discovery, a clear basis is yet to be established for the Warburg effect and why cancer cells show such a preference for aerobic glycolysis. In this review, we draw on what is known about similar metabolic shifts both in normal mammalian physiology and overflow metabolism in microbes to shed new light on whether aerobic glycolysis in cancer represents some form of optimisation of cellular metabolism. From microbes to cancer, we find that metabolic shifts favouring glycolysis are sometimes driven by the need for faster growth, but the growth rate is by no means a universal goal of optimal metabolism. Instead, optimisation goals at the cellular level are often multi-faceted and any given metabolic state must be considered in the context of both its energetic costs and benefits over a range of environmental contexts. For this purpose, we identify the conceptual framework of resource allocation as a potential testbed for the investigation of the cost-benefit balance of cellu
Cancer cells have the plasticity to adjust their metabolic phenotypes for survival and metastasis. During metastasis, a developmental program known as the epithelial-mesenchymal transition (EMT) plays a critical role. There is extensive cross-talk between metabolism and EMT, but how this leads to coordinated physiological changes is still uncertain. The elusive connection between metabolism and EMT compromises the efficacy of metabolic therapies targeting metastasis. In this review, we aim for clarifying causation between metabolism and EMT based on recent experimental studies and propose integrated theoretical-experimental efforts to better understand the coupled decision-making of metabolism and EMT.
Representation of cities as organisms with metabolic processes is a useful analogy for urban design, development and sustainability. Urban metabolism can be modeled by representing urban systems as networks. The various networks included in a city's metabolism are interdependent in complex ways. Thus, understanding the interaction among these networks is essential to understanding how a healthy urban metabolism is sustained and how injuries to the metabolic system can "heal". It is particularly important to understand how disruptions to one system in an urban area affect the functioning of other systems. Using distribution-level data from a real U.S. city on the electricity distribution system and road geometry, we apply connected network modeling to two critical inter-connected urban infrastructure sectors: energy and transportation. We quantify the robustness of these interdependent networks by evaluating the connectivity disruptions that may occur due to natural or synthetic disruptive events, using both unweighted and weighted metrics.
A classic problem in metabolism is that fast-proliferating cells use seemingly wasteful fermentation for energy biogenesis in the presence of sufficient oxygen. This counterintuitive phenomenon, known as overflow metabolism or the Warburg effect, is universal across various organisms. Despite extensive research, its origin and function remain unclear. Here, we show that overflow metabolism can be understood through growth optimization combined with cell heterogeneity. A model of optimal protein allocation, coupled with heterogeneity in enzyme catalytic rates among cells, quantitatively explains why and how cells choose between respiration and fermentation under different nutrient conditions. Our model quantitatively illustrates the growth rate dependence of fermentation flux and enzyme allocation under various perturbations and is fully validated by experimental results in Escherichia coli. Our work provides a quantitative explanation for the Crabtree effect in yeast and the Warburg effect in cancer cells and can be broadly used to address heterogeneity-related challenges in metabolism.
We outline a modeling and optimization strategy for investigating dynamic metabolic engineering interventions. Our framework is particularly useful at the early stages of research and development, often constrained by limited knowledge and experimental data. Elucidating a priori optimal trajectories of manipulatable intracellular fluxes can guide the design of suitable control schemes, e.g., cyber(ge)netic or in-cell approaches, and the selection of appropriate actuators, e.g., at the transcriptional or post-translational levels. Model-based dynamic optimization is proposed to predict optimal trajectories of target manipulatable intracellular fluxes. A challenge emerges as existing models are often oversimplified, lacking insights into metabolism, or excessively complex, making them difficult to build and implement. Here, we use surrogates derived from steady-state solutions of constraint-based metabolic models to link manipulatable intracellular fluxes to the process exchange rates of structurally simple hybrid dynamic models. The latter can be conveniently used in optimal control problems of metabolism. As a proof of concept, we apply our method to a reduced metabolic network of
Although the Internet Archive's Wayback Machine is the largest and most well-known web archive, there have been a number of public web archives that have emerged in the last several years. With varying resources, audiences and collection development policies, these archives have varying levels of overlap with each other. While individual archives can be measured in terms of number of URIs, number of copies per URI, and intersection with other archives, to date there has been no answer to the question "How much of the Web is archived?" We study the question by approximating the Web using sample URIs from DMOZ, Delicious, Bitly, and search engine indexes; and, counting the number of copies of the sample URIs exist in various public web archives. Each sample set provides its own bias. The results from our sample sets indicate that range from 35%-90% of the Web has at least one archived copy, 17%-49% has between 2-5 copies, 1%-8% has 6-10 copies, and 8%-63% has more than 10 copies in public web archives. The number of URI copies varies as a function of time, but no more than 31.3% of URIs are archived more than once per month.
Metabolic networks are complex systems that comprise hundreds of chemical reactions which synthesize biomass molecules from chemicals in an organism's environment. The metabolic network of any one organism is encoded by a metabolic genotype, defined by a set of enzyme-coding genes whose products catalyze the network's reactions. Each metabolic genotype has a metabolic phenotype, such as the ability to synthesize biomass on a spectrum of different sources of chemical elements and energy. We here focus on sulfur metabolism, which is attractive to study the evolution of metabolic networks, because it involves many fewer reactions than carbon metabolism. Specifically, we study properties of the space of all possible metabolic genotypes, and analyze properties of random metabolic genotypes that are viable on different numbers of sulfur sources. We show that metabolic genotypes with the same phenotype form large connected genotype networks that extend far through metabolic genotype space. How far they reach through this space is a linear function of the number of super-essential reactions in such networks, the number of reactions that occur in all networks with the same phenotype. We sho
We extend a previously theory for the interspecific allometric scaling developed in a $d+1$-dimensional space of metabolic states. The time, which is characteristic of all biological processes, is included as an extra dimension to $d$ biological lengths. The different metabolic rates, such as basal (BMR) and maximum (MMR), are described by supposing that the biological lengths and time are related by different transport processes of energy and mass. We consider that the metabolic rates of animals are controlled by three main transport processes: convection, diffusion and anomalous diffusion. Different transport mechanisms are related to different metabolic states, with its own values for allometric exponents. In $d=3$, we obtain that the exponent $b$ of BMR is $b=0.71$, and that the aerobic sustained MMR upper value of the exponent is $b=0.86$ (best empirical values for mammals: $b=0.69(2)$ and $b=0.87(3)$). The 3/4-law appears as an upper limit of BMR. The MMR scaling in different conditions, other exponents related to BMR and MMR, and the metabolism of unicellular organisms are also discussed.