Breast cancer incidence rises with age and peaks across the menopausal transition, yet why some postmenopausal lobules persist, and why that persistence predicts cancer risk, remains unresolved. Incomplete age-related lobular involution is one of the strongest tissue-level predictors of subsequent breast cancer, but it is still commonly viewed as passive failure of hormonally driven regression. This Review proposes a different framework: persistent lobules are maintained by an active reserve niche that outlasts its reproductive function. By integrating breast epidemiology, mammary stromal biology, cellular senescence, immune surveillance, and comparative reserve systems in skeletal muscle, hematopoiesis, and postmenopausal endometrium, we argue that menopause is a biological control point at which tissue fate diverges. Efficient clearance of senescent cells permits lobular regression to complete, whereas impaired immune surveillance may allow inflammatory paracrine signaling, macrophage reprogramming, and immune evasion to create a self-sustaining senescent-immune niche lock. This framework explains why persistent lobules are biologically active, shifts attention from epithelial qu
In this work, the tools of general relativity are used to analytically derive the collisional stopping power, and a linkage between higher-dimensional field theory and transport phenomena is proposed. We start from a Kaluza-Klein inspired, five-dimensional diffeomorphism-invariant action and, upon compactification, obtain a four-dimensional effective theory in which the matter fields are treated as brane-localized. The medium response to the projected electron is encoded in symmetric tensor fields coupled covariantly to both electromagnetic and fermionic parts via Lagrangian-derived interactions. When $R_c \sim \Lambda_{\text{EM}}^{-1}$, $\Lambda_{\text{EM}} \gg m_e$ and $g_4^2 = \frac{3\pi^2 m_e v}{4\gamma^3 R_c^2 e^2 \Lambda_{\text{EM}}}$ are satisfied, the leading term of the Bethe-Møller formula is shown to be recovered in the large $R$ limit. The construction presented here may serve as an alternative approach that uses compactification geometry and medium excitations to determine observable couplings and stopping power. The model intrinsically supports phenomena linked to anisotropy and nonlinear response, as well as gravitational or extra-dimensional effects in laboratory-scale systems via the study of st
This paper introduces the RAG-RLRC-LaySum framework, designed to make complex biomedical research understandable to laymen through advanced Natural Language Processing (NLP) techniques. Our Retrieval Augmented Generation (RAG) solution, enhanced by a reranking method, utilizes multiple knowledge sources to ensure the precision and pertinence of lay summaries. Additionally, our Reinforcement Learning for Readability Control (RLRC) strategy improves readability, making scientific content comprehensible to non-specialists. Evaluations using the publicly accessible PLOS and eLife datasets show that our methods surpass the Plain Gemini model, demonstrating a 20% increase in readability scores, a 15% improvement in ROUGE-2 relevance scores, and a 10% enhancement in factual accuracy. The RAG-RLRC-LaySum framework effectively democratizes scientific knowledge, enhancing public engagement with biomedical discoveries.
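As a point of reference for the ROUGE-2 scores mentioned above, the following is a minimal sketch of a bigram-overlap score in Python. It is a simplified illustration, not the evaluation code used in the paper: it assumes lowercase whitespace tokenization and omits the stemming and other preprocessing that standard ROUGE tooling applies. The function names `bigrams` and `rouge2` are hypothetical.

```python
from collections import Counter


def bigrams(text):
    """Count adjacent word pairs, using lowercase whitespace tokenization (an assumption)."""
    tokens = text.lower().split()
    return Counter(zip(tokens, tokens[1:]))


def rouge2(candidate, reference):
    """Return (precision, recall, F1) of clipped bigram overlap between two texts."""
    cand, ref = bigrams(candidate), bigrams(reference)
    # Counter intersection keeps the minimum count per bigram (clipped overlap).
    overlap = sum((cand & ref).values())
    precision = overlap / sum(cand.values()) if cand else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    generated = "the drug blocks the protein"
    expert = "the drug blocks a key protein"
    print(rouge2(generated, expert))
```

A higher score indicates that more word pairs from the reference lay summary also appear in the generated one; it says nothing by itself about readability or factual accuracy, which the abstract reports separately.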
The Suborbital Imaging Spectrograph for Transition region Irradiance from Nearby Exoplanet host stars (SISTINE) is a rocket-borne ultraviolet (UV) imaging spectrograph designed to probe the radiation environment of nearby stars. SISTINE operates over a bandpass of 98 -- 127 and 130 -- 158 nm, capturing a broad suite of emission lines tracing the full 10$^4$ -- 10$^5$ K formation temperature range critical for reconstructing the full UV radiation field incident on planets orbiting solar-type stars. SISTINE serves as a platform for key technology developments for future ultraviolet observatories. SISTINE operates at moderate resolving power ($R\sim$1500) while providing spectral imaging over an angular extent of $\sim$6', with $\sim$2" resolution at the slit center. The instrument is composed of an f/14 Cassegrain telescope that feeds a 2.1x magnifying spectrograph, utilizing a blazed holographically ruled diffraction grating and a powered fold mirror. Spectra are captured on a large format microchannel plate (MCP) detector consisting of two 113 x 42 mm segments each read out by a cross delay-line anode. Several novel technologies are employed in SISTINE to advance their technical ma
Small molecule drug design hinges on obtaining co-crystallized ligand-protein structures. Despite AlphaFold2's strides in protein native structure prediction, its focus on apo structures overlooks ligands and associated holo structures. Moreover, designing selective drugs often benefits from the targeting of diverse metastable conformations. Therefore, direct application of AlphaFold2 models in virtual screening and drug discovery remains tentative. Here, we demonstrate an AlphaFold2 based framework combined with all-atom enhanced sampling molecular dynamics and induced fit docking, named AF2RAVE-Glide, to conduct computational model based small molecule binding of metastable protein kinase conformations, initiated from protein sequences. We demonstrate the AF2RAVE-Glide workflow on three different protein kinases and their type I and II inhibitors, with special emphasis on binding of known type II kinase inhibitors which target the metastable classical DFG-out state. These states are not easy to sample from AlphaFold2. Here we demonstrate how with AF2RAVE these metastable conformations can be sampled for different kinases with high enough accuracy to enable subsequent docking of k
Liquid-liquid phase separation (LLPS) involving intrinsically disordered protein regions (IDRs) is a major physical mechanism for biological membraneless compartmentalization. The multifaceted electrostatic effects in these biomolecular condensates are exemplified here by experimental and theoretical investigations of the different salt- and ATP-dependent LLPSs of an IDR of messenger RNA-regulating protein Caprin1 and its phosphorylated variant pY-Caprin1, exhibiting, e.g., reentrant behaviors in some instances but not others. Experimental data are rationalized by physical modeling using analytical theory, molecular dynamics, and polymer field-theoretic simulations, indicating that interchain ion bridges enhance LLPS of polyelectrolytes such as Caprin1 and the high valency of ATP-magnesium is a significant factor for its colocalization with the condensed phases, as similar trends are observed for other IDRs. The electrostatic nature of these features complements ATP's involvement in $π$-related interactions and as an amphiphilic hydrotrope, underscoring a general role of biomolecular condensates in modulating ion concentrations and its functional ramifications.
Previous approaches for automatic lay summarisation are exclusively reliant on the source article that, given it is written for a technical audience (e.g., researchers), is unlikely to explicitly define all technical concepts or state all of the background information that is relevant for a lay audience. We address this issue by augmenting eLife, an existing biomedical lay summarisation dataset, with article-specific knowledge graphs, each containing detailed information on relevant biomedical concepts. Using both automatic and human evaluations, we systematically investigate the effectiveness of three different approaches for incorporating knowledge graphs within lay summarisation models, with each method targeting a distinct area of the encoder-decoder model architecture. Our results confirm that integrating graph-based domain knowledge can significantly benefit lay summarisation by substantially increasing the readability of generated text and improving the explanation of technical concepts.
The relation between neural activity and behaviorally relevant variables is at the heart of neuroscience research. When strong, this relation is termed a neural representation. There is increasing evidence, however, for partial dissociations between activity in an area and relevant external variables. While many explanations have been proposed, a theoretical framework for the relationship between external and internal variables is lacking. Here, we utilize recurrent neural networks (RNNs) to explore the question of when and how neural dynamics and the network's output are related from a geometrical point of view. We find that training RNNs can lead to two dynamical regimes: dynamics can either be aligned with the directions that generate output variables, or oblique to them. We show that the choice of readout weight magnitude before training can serve as a control knob between the regimes, similar to recent findings in feedforward networks. These regimes are functionally distinct. Oblique networks are more heterogeneous and suppress noise in their output directions. They are furthermore more robust to perturbations along the output directions. Crucially, the oblique regime is speci
A classic problem in metabolism is that fast-proliferating cells use seemingly wasteful fermentation for energy biogenesis in the presence of sufficient oxygen. This counterintuitive phenomenon, known as overflow metabolism or the Warburg effect, is universal across various organisms. Despite extensive research, its origin and function remain unclear. Here, we show that overflow metabolism can be understood through growth optimization combined with cell heterogeneity. A model of optimal protein allocation, coupled with heterogeneity in enzyme catalytic rates among cells, quantitatively explains why and how cells choose between respiration and fermentation under different nutrient conditions. Our model quantitatively illustrates the growth rate dependence of fermentation flux and enzyme allocation under various perturbations and is fully validated by experimental results in Escherichia coli. Our work provides a quantitative explanation for the Crabtree effect in yeast and the Warburg effect in cancer cells and can be broadly used to address heterogeneity-related challenges in metabolism.
Our goal is to identify brain regions involved in comprehending computer programs. We use functional magnetic resonance imaging (fMRI) to investigate two candidate systems of brain regions which may support this -- the Multiple Demand (MD) system, known to respond to a range of cognitively demanding tasks, and the Language system (LS), known to primarily respond to language stimuli. We devise experiment conditions to isolate the act of code comprehension, and employ a state-of-the-art method to locate brain systems of interest. We administer these experiments in Python (24 participants) and Scratch Jr. (19 participants) - which provides a visual interface to programming, thus eliminating the effect of text in code comprehension. From this robust experiment setup, we find that the Language system is not consistently involved in code comprehension, while the MD is. Further, we find no other brain regions beyond those in the MD to be responsive to code. We also find that variable names, the control flow used in the program, and the types of operations performed do not affect brain responses. We discuss the implications of our findings on the software engineering and CS education commu
Computational models are powerful tools for understanding human cognition and behavior. They let us express our theories clearly and precisely, and offer predictions that can be subtle and often counter-intuitive. However, this same richness and ability to surprise means our scientific intuitions and traditional tools are ill-suited to designing experiments to test and compare these models. To avoid these pitfalls and realize the full potential of computational modeling, we require tools to design experiments that provide clear answers about what models explain human behavior and the auxiliary assumptions those models must make. Bayesian optimal experimental design (BOED) formalizes the search for optimal experimental designs by identifying experiments that are expected to yield informative data. In this work, we provide a tutorial on leveraging recent advances in BOED and machine learning to find optimal experiments for any kind of model that we can simulate data from, and show how by-products of this procedure allow for quick and straightforward evaluation of models and their parameters against real experimental data. As a case study, we consider theories of how people balance ex
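The notion of an experiment "expected to yield informative data" can be made concrete in a small tractable case. The sketch below is an illustration under strong assumptions, not the simulation-based procedure of the tutorial: it assumes a binary outcome, a finite set of competing models, and known success probabilities for each model under each design, so the expected information gain (the mutual information between model identity and outcome) can be computed exactly. The design names and probabilities are hypothetical.

```python
import math


def bernoulli_entropy(p):
    """Shannon entropy (in nats) of a binary outcome with success probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log(p) + (1.0 - p) * math.log(1.0 - p))


def expected_information_gain(success_probs, prior):
    """Mutual information between model identity and a binary outcome.

    success_probs[m] is the success probability model m predicts under one design;
    prior[m] is the prior probability of model m.
    """
    marginal = sum(w * p for w, p in zip(prior, success_probs))
    conditional = sum(w * bernoulli_entropy(p) for w, p in zip(prior, success_probs))
    return bernoulli_entropy(marginal) - conditional


# Hypothetical designs: predicted success probabilities under two candidate models.
designs = {
    "design_A": (0.45, 0.55),  # models nearly agree, so the outcome is uninformative
    "design_B": (0.20, 0.80),  # models disagree, so the outcome discriminates them
}
prior = (0.5, 0.5)

best_design = max(designs, key=lambda d: expected_information_gain(designs[d], prior))

if __name__ == "__main__":
    for name, probs in designs.items():
        print(name, expected_information_gain(probs, prior))
    print("most informative:", best_design)
```

When model likelihoods are not available in closed form, this quantity has to be estimated from simulated data instead, which is the setting the tutorial addresses.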
To curb the initial spread of SARS-CoV-2, many countries relied on nation-wide implementation of non-pharmaceutical intervention measures, resulting in substantial socio-economic impacts. Potentially, subnational implementations might have had a smaller societal impact but a comparable epidemiological impact. Here, using the first COVID-19 wave in the Netherlands as a case in point, we address this issue by developing a high-resolution analysis framework that uses a demographically-stratified population and a spatially-explicit, dynamic, individual contact-pattern based epidemiology, calibrated to hospital admissions data and mobility trends extracted from mobile phone signals and Google. We demonstrate how a subnational approach could achieve a similar level of epidemiological control in terms of hospital admissions, while some parts of the country could stay open for a longer period. Our framework is exportable to other countries and settings, and may be used to develop policies on a subnational approach as a better strategic choice for controlling future epidemics.
Human society and the natural environment form a complex giant ecosystem, where human activities not only lead to the change of environmental states, but also react to them. By using the collective-risk social dilemma game, some studies have already revealed that individual contributions and the risk of future losses are inextricably linked. These works, however, often use an idealistic assumption that the risk is constant and not affected by individual behaviors. Here we develop a coevolutionary game approach that captures the coupled dynamics of cooperation and risk. In particular, the level of contributions in a population affects the state of risk, while the risk in turn influences individuals' behavioral decision-making. Importantly, we explore two representative feedback forms describing the possible effect of strategy on risk, namely, linear and exponential feedbacks. We find that cooperation can be maintained in the population either by settling at a certain fraction or by forming an evolutionary oscillation with risk, independently of the feedback type. However, such an evolutionary outcome depends on the initial state. Taken together, a two-way coupling between collective actions and risk is e
Lay summarisation aims to jointly summarise and simplify a given text, thus making its content more comprehensible to non-experts. Automatic approaches for lay summarisation can provide significant value in broadening access to scientific literature, enabling a greater degree of both interdisciplinary knowledge sharing and public understanding when it comes to research findings. However, current corpora for this task are limited in their size and scope, hindering the development of broadly applicable data-driven approaches. Aiming to rectify these issues, we present two novel lay summarisation datasets, PLOS (large-scale) and eLife (medium-scale), each of which contains biomedical journal articles alongside expert-written lay summaries. We provide a thorough characterisation of our lay summaries, highlighting differing levels of readability and abstractiveness between datasets that can be leveraged to support the needs of different applications. Finally, we benchmark our datasets using mainstream summarisation approaches and perform a manual evaluation with domain experts, demonstrating their utility and casting light on the key challenges of this task.
In several large-scale replication projects, statistically non-significant results in both the original and the replication study have been interpreted as a "replication success". Here we discuss the logical problems with this approach: Non-significance in both studies does not ensure that the studies provide evidence for the absence of an effect and "replication success" can virtually always be achieved if the sample sizes are small enough. In addition, the relevant error rates are not controlled. We show how methods, such as equivalence testing and Bayes factors, can be used to adequately quantify the evidence for the absence of an effect and how they can be applied in the replication setting. Using data from the Reproducibility Project: Cancer Biology, the Experimental Philosophy Replicability Project, and the Reproducibility Project: Psychology we illustrate that many original and replication studies with "null results" are in fact inconclusive. We conclude that it is important to also replicate studies with statistically non-significant results, but that they should be designed, analyzed, and interpreted appropriately.
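The distinction drawn above between "not significant" and "evidence of absence" can be illustrated with a short equivalence-testing sketch. The code below is a minimal illustration, not the authors' analysis: it assumes a normally distributed effect estimate with known standard error and a symmetric equivalence margin, and the numerical values are hypothetical. It implements the two one-sided tests (TOST) procedure, whose p-value is the larger of the two one-sided p-values.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()


def two_sided_p_value(estimate, std_error):
    """Conventional p-value for H0: true effect = 0 (normal approximation)."""
    z = abs(estimate / std_error)
    return 2.0 * (1.0 - _STD_NORMAL.cdf(z))


def tost_p_value(estimate, std_error, margin):
    """TOST p-value for H0: |true effect| >= margin (normal approximation).

    A small value supports equivalence, i.e. an effect within (-margin, +margin).
    """
    if std_error <= 0 or margin <= 0:
        raise ValueError("std_error and margin must be positive")
    # One-sided test against the lower bound: H0 effect <= -margin.
    p_lower = 1.0 - _STD_NORMAL.cdf((estimate + margin) / std_error)
    # One-sided test against the upper bound: H0 effect >= +margin.
    p_upper = _STD_NORMAL.cdf((estimate - margin) / std_error)
    return max(p_lower, p_upper)


if __name__ == "__main__":
    margin = 0.3  # hypothetical equivalence margin
    for label, estimate, std_error in [
        ("precise study", 0.05, 0.1),
        ("imprecise study", 0.05, 0.5),
    ]:
        print(label,
              "two-sided p =", round(two_sided_p_value(estimate, std_error), 3),
              "TOST p =", round(tost_p_value(estimate, std_error, margin), 3))
```

In this hypothetical example both studies are non-significant in the conventional test, but only the precise one yields a small TOST p-value; the imprecise study is inconclusive, which is the situation a small sample size produces.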
Computational models starting from large ensembles of evolutionarily related protein sequences capture a representation of protein families and learn constraints associated with protein structure and function. They thus open the possibility of generating novel sequences belonging to protein families. Protein language models trained on multiple sequence alignments, such as MSA Transformer, are highly attractive candidates for this purpose. We propose and test an iterative method that directly employs the masked language modeling objective to generate sequences using MSA Transformer. We demonstrate that the resulting sequences score as well as natural sequences for homology, coevolution and structure-based measures. For large protein families, our synthetic sequences have similar or better properties compared to sequences generated by Potts models, including experimentally-validated ones. Moreover, for small protein families, our generation method based on MSA Transformer outperforms Potts models. Our method also more accurately reproduces the higher-order statistics and the distribution of sequences in sequence space of natural data than Potts models. MSA Transformer is thus a strong can
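The general shape of an iterative masked-generation loop can be sketched as follows. This is an illustrative toy, not the authors' implementation: it does not call MSA Transformer, and the masking fraction, iteration count, and greedy filling rule are assumptions. The hypothetical `consensus_predictor` stands in for a masked language model by predicting the most common residue in each alignment column.

```python
import random
from collections import Counter

MASK = "#"  # placeholder symbol for a masked position (an assumption)


def consensus_predictor(context):
    """Toy stand-in for a masked language model.

    Predicts, for any position, the most common residue in that column of the
    context alignment. A real model would condition on the masked sequence.
    """
    column_consensus = [Counter(col).most_common(1)[0][0] for col in zip(*context)]

    def predict(masked_seq, position):
        return column_consensus[position]

    return predict


def iterative_masked_generation(seed_seq, predict, mask_fraction=0.1,
                                n_iterations=20, rng=None):
    """Repeatedly mask a random subset of positions and refill them with predictions."""
    rng = rng or random.Random(0)
    seq = list(seed_seq)
    n_mask = max(1, round(mask_fraction * len(seq)))
    for _ in range(n_iterations):
        positions = rng.sample(range(len(seq)), n_mask)
        for pos in positions:
            seq[pos] = MASK
        masked = "".join(seq)
        # Fill every masked position with the predictor's top choice (greedy filling).
        for pos in positions:
            seq[pos] = predict(masked, pos)
    return "".join(seq)


if __name__ == "__main__":
    alignment = ["ACDE", "ACDF", "ACGE"]  # hypothetical aligned sequences
    predictor = consensus_predictor(alignment)
    print(iterative_masked_generation("ACGF", predictor, mask_fraction=0.25))
```

With a context-sensitive model in place of the toy predictor, each pass changes only a few positions, so the sequence drifts gradually away from the seed.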
It is a well-established notion that animals can detect the Earth's magnetic field, while the biophysical origin of such magnetoreception is still elusive. Recently, a magnetic receptor, Drosophila CG8198 (MagR), with a rod-like protein complex was reported [Qin \emph{et al}., Nat. Mater. \textbf{15}, 217 (2016)] to act like a compass needle to guide the magnetic orientation of animals. This view, however, has been challenged [Meister, Elife \textbf{5}, e17210 (2016)] on the grounds that thermal fluctuations overwhelm the Zeeman coupling of the protein's magnetic moment to the rather weak geomagnetic field ($\sim 25$--$65$ $\mu$T). In this work, we show that the spin-mechanical interaction at the atomic scale gives rise to a high blocking temperature, which allows a good alignment of the protein's magnetic moment with the Earth's magnetic field at room temperature. Our results provide a promising route to resolving the debate on the thermal behavior of MagR, and may stimulate broad interest in spin-mechanical couplings down to the atomistic level.
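The scale of the thermal-fluctuation objection can be checked with a few lines of arithmetic. The sketch below is an order-of-magnitude illustration only: it assumes 300 K for "room temperature" and compares the Zeeman energy $\mu B$ with the thermal energy $k_B T$, asking how large a magnetic moment would be needed for the two to be equal at field strengths in the quoted range. The function name is hypothetical.

```python
# Order-of-magnitude check of the thermal-fluctuation argument.
# Assumption (not from the abstract): T = 300 K as "room temperature".

K_B = 1.380649e-23       # Boltzmann constant in J/K (exact SI value)
MU_B = 9.2740100783e-24  # Bohr magneton in J/T (CODATA 2018)


def moment_for_thermal_parity(field_tesla, temperature_kelvin=300.0):
    """Magnetic moment, in Bohr magnetons, at which mu * B equals k_B * T."""
    thermal_energy = K_B * temperature_kelvin
    return thermal_energy / field_tesla / MU_B


if __name__ == "__main__":
    for field_microtesla in (25.0, 50.0, 65.0):
        n = moment_for_thermal_parity(field_microtesla * 1e-6)
        print(f"B = {field_microtesla:4.0f} uT -> about {n:.1e} Bohr magnetons")
```

Across the quoted field range the required moment is on the order of $10^7$ Bohr magnetons, which shows why a simple Zeeman-versus-thermal comparison is unfavorable for a single protein complex and why an additional mechanism, such as the blocking temperature discussed above, is invoked.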
This is a Commentary in $Physics~Today$ on the novel review process developed by the biology journal $eLife$, with the suggestion that it be adopted by physics journals.
Organelle size control is a fundamental question in biology that demonstrates the fascinating ability of cells to maintain homeostasis within their highly variable environments. Theoretical models describing cellular dynamics have the potential to help elucidate the principles underlying size control. Here, we perform a detailed study of the active disassembly model proposed in [Fai et al, Length regulation of multiple flagella that self-assemble from a shared pool of components, eLife, 8, (2019): e42599]. We construct a hybrid system which is shown to be well-behaved throughout the domain. We rule out the possibility of oscillations arising in the model and prove global asymptotic stability in the case of two flagella by the construction of a suitable Lyapunov function. Finally, we generalize the model to the case of arbitrary flagellar number in order to study olfactory sensory neurons, which have up to twenty cilia per cell. We show that our theoretical results may be extended to this case and explore the implications of this universal mechanism of size control.
A recent experiment on zebrafish blastoderm morphogenesis showed that the viscosity (η) of a non-confluent embryonic tissue grows sharply until a critical cell packing fraction (φS). The increase in η up to φS is similar to the behavior observed in several glass-forming materials, which suggests that the cell dynamics is sluggish or glass-like. Surprisingly, η is constant above φS. To determine the mechanism of this unusual dependence of η on φ, we performed extensive simulations using an agent-based model of a dense non-confluent two-dimensional tissue. We show that polydispersity in the cell size, and the propensity of the cells to deform, result in the saturation of the available free area per cell beyond a critical packing fraction. Saturation in the free space not only explains the viscosity plateau above φS but also provides a relationship between equilibrium geometrical packing and the dramatic increase in the relaxation dynamics.