Understanding the interactions between cells and the extracellular matrix (ECM) during collective cell invasion is crucial for advancements in tissue engineering, cancer therapies, and regenerative medicine. This study focuses on the roles of contact guidance and ECM remodelling in directing cell behaviour, with a particular emphasis on exploring how differences in cell phenotype impact collective cell invasion. We present a computationally tractable two-dimensional hybrid model of collective cell migration within the ECM, where cells are modelled as individual entities and collagen fibres as a continuous tensorial field. Our model incorporates random motility, contact guidance, cell-cell adhesion, volume filling, and the dynamic remodelling of collagen fibres through cellular secretion and degradation. Through a comprehensive parameter sweep, we provide valuable insights into how differences in the cell phenotype, in terms of the ability of the cell to migrate, secrete, degrade, and respond to contact guidance cues from the ECM, impacts the characteristics of collective cell invasion.
Cell type annotation is a key task in analyzing the heterogeneity of single-cell RNA sequencing data. Although recent foundation models automate this process, they typically annotate cells independently, without considering batch-level cellular context or providing explanatory reasoning. In contrast, human experts often annotate distinct cell types for different cell clusters based on their domain knowledge. To mimic this workflow, we introduce the CellPuzzles task, where the objective is to assign unique cell types to a batch of cells. This benchmark spans diverse tissues, diseases, and donor conditions, and requires reasoning across the batch-level cellular context to ensure label uniqueness. We find that off-the-shelf large language models (LLMs) struggle on CellPuzzles, with the best baseline (OpenAI's o1) achieving only 19.0% batch-level accuracy. To fill this gap, we propose Cell-o1, a 7B LLM trained via supervised fine-tuning on distilled reasoning traces, followed by reinforcement learning with batch-level rewards. Cell-o1 achieves state-of-the-art performance, outperforming o1 by over 73% and generalizing well across contexts. Further analysis of training dynamics and reas
Although a good comprehension of how cancer cells collectively migrate by following molecular rules which influence the state of cell-cell adhesion contacts has been generated, the impact of collective migration on cellular rearrangement from subcellular to supracellular level remains less understood. Thus, considering collective cell migration (CCM) of cancer mesenchymal cells on one side and healthy epithelial cells on the other during the fusion of two cell aggregates could result in a powerful tool in order to address the contribution of structural changes at subcellular level which influence the cellular rearrangements and help to understand this important, but still controversial topic. While healthy epithelial cells undergo volumetric cell rearrangement driven by the tissue surface tension, which results in a collision of opposite directed velocity front near the contact point between two cell aggregates, mesenchymal cells follow quite different scenario. These cells are capable of reducing the surface tension and undergo surface cell rearrangement. The main goal of this contribution is to discuss the origin of surface activity of cancer cells by accounting for the crosstalk
The crawling motility of many eukaryotic cells is driven by filamentous actin (F-actin), and regulated by a network of signaling proteins and lipids (including small GTPases). The tangle of positive and negative feedback loops gives rise to various experimentally observed dynamic patterns (``actin waves''). Here we consider a recent prototypical model for actin waves in which F-actin exerts negative feedback onto a GTPase. Guided by recent numerical PDE bifurcation analysis in Hughes (2025) and Hughes et al (2026), we explore cell shapes and motility associated with polar, oscillatory, and traveling waves solutions of a mass-conserved partial differential equation (PDE) model. We use Morpheus (cellular Potts) simulations to investigate the implications of such regimes of behavior on the shapes and motion of cells, and on transitions between modes of behavior. The model demonstrates various cell states, including resting (spatially uniform GTPase), polar cells (static ``zones'' of GTPase), and traveling waves along the cell edge. In some parameter regimes, such states can coexist, so that cells can transition from one behavior to another in response to noisy stimuli.
Cell-cell communication is essential for tissue development, regeneration and function, and its disruption can lead to diseases and developmental abnormalities. The revolution of single-cell genomics technologies offers unprecedented insights into cellular identities, opening new avenues to resolve the intricate cellular interactions present in tissue niches. CellPhoneDB is a bioinformatics toolkit designed to infer cell-cell communication by combining a curated repository of bona fide ligand-receptor interactions with a set of computational and statistical methods to integrate them with single-cell genomics data. Importantly, CellPhoneDB captures the multimeric nature of molecular complexes, thus representing cell-cell communication biology faithfully. Here we present CellPhoneDB v5, an updated version of the tool, which offers several new features. Firstly, the repository has been expanded by one-third with the addition of new interactions. These encompass interactions mediated by non-protein ligands such as endocrine hormones and GPCR ligands. Secondly, it includes a differentially expression-based methodology for more tailored interaction queries. Thirdly, it incorporates novel
Regulation of cell proliferation is a crucial aspect of tissue development and homeostasis and plays a major role in morphogenesis, wound healing, and tumor invasion. A phenomenon of such regulation is contact inhibition, which describes the dramatic slowing of proliferation, cell migration and individual cell growth when multiple cells are in contact with each other. While many physiological, molecular and genetic factors are known, the mechanism of contact inhibition is still not fully understood. In particular, the relevance of cellular signaling due to interfacial contact for contact inhibition is still debated. Cellular automata (CA) have been employed in the past as numerically efficient mathematical models to study the dynamics of cell ensembles, but they are not suitable to explore the origins of contact inhibition as such agent-based models assume fixed cell sizes. We develop a minimal, data-driven model to simulate the dynamics of planar cell cultures by extending a probabilistic CA to incorporate size changes of individual cells during growth and cell division. We successfully apply this model to previous in-vitro experiments on contact inhibition in epithelial tissue: A
Cancer stem cells are controlled by developmental networks that are often topologically indistinguishable from normal, healthy stem cells. The question is why cancer stem cells can be both phenotypically distinct and have morphological effects so different from normal stem cells. The difference between cancer stem cells and normal stem cells lies not in differences their network architecture, but rather in the spatial-temporal locality of their activation in the genome and the resulting expression in the body. The metastatic potential cancer stem cells is not based primarily on their network divergence from normal stem cells, but on non-network based genetic changes that enable the evolution of gene-based phenotypic properties of the cell that permit its escape and travel to other parts of the body. Stem cell network theory allows the precise prediction of stem cell behavioral dynamics and a mathematical description of stem cell proliferation for both normal and cancer stem cells. It indicates that the best therapeutic approach is to tackle the highest order stem cells first, otherwise spontaneous remission of so called cured cancers will always be a danger. Stem cell networks poin
Cell-cell adhesion is one the most fundamental mechanisms regulating collective cell migration during tissue development, homeostasis and repair, allowing cell populations to self-organize and eventually form and maintain complex tissue shapes. Cells interact with each other via the formation of protrusions or filopodia and they adhere to other cells through binding of cell surface proteins. The resulting adhesive forces are then related to cell size and shape and, often, continuum models represent them by nonlocal attractive interactions. In this paper, we present a new continuum model of cell-cell adhesion which can be derived from a general nonlocal model in the limit of short-range interactions. This new model is local, resembling a system of thin-film type equations, with the various model parameters playing the role of surface tensions between different cell populations. Numerical simulations in one and two dimensions reveal that the local model maintains the diversity of cell sorting patterns observed both in experiments and in previously used nonlocal models. In addition, it also has the advantage of having explicit stationary solutions, which provides a direct link between
Myxococcus xanthus cells self-organize into aligned groups, clusters, at various stages of their lifecycle. Formation of these clusters is crucial for the complex dynamic multi-cellular behavior of these bacteria. However, the mechanism underlying the cell alignment and clustering is not fully understood. Motivated by studies of clustering in self-propelled rods, we hypothesized that M. xanthus cells can align and form clusters through pure mechanical interactions among cells and between cells and substrate. We test this hypothesis using an agent-based simulation framework in which each agent is based on the biophysical model of an individual M. xanthus cell. We show that model agents, under realistic cell flexibility values, can align and form cell clusters but only when periodic reversals of cell directions are suppressed. However, by extending our model to introduce the observed ability of cells to deposit and follow slime trails, we show that effective trail-following leads to clusters in reversing cells. Furthermore, we conclude that mechanical cell alignment combined with slime-trail-following is sufficient to explain the distinct clustering behaviors observed for wild-type a
The transition from single-cell to multicellular behavior is important in early development but rarely studied. The starvation-induced aggregation of the social amoeba Dictyostelium discoideum into a multicellular slug is known to result from single-cell chemotaxis towards emitted pulses of cyclic adenosine monophosphate (cAMP). However, how exactly do transient short-range chemical gradients lead to coherent collective movement at a macroscopic scale? Here, we use a multiscale model verified by quantitative microscopy to describe wide-ranging behaviors from chemotaxis and excitability of individual cells to aggregation of thousands of cells. To better understand the mechanism of long-range cell-cell communication and hence aggregation, we analyze cell-cell correlations, showing evidence for self-organization at the onset of aggregation (as opposed to following a leader cell). Surprisingly, cell collectives, despite their finite size, show features of criticality known from phase transitions in physical systems. Application of external cAMP perturbations in our simulations near the sensitive critical point allows steering cells into early aggregation and towards certain locations b
We define a model for random (abstract) cell complexes (CCs), similiar to the well-known Erdős-Rényi model for graphs and its extensions for simplicial complexes. To build a random cell complex, we first draw from an Erdős-Rényi graph, and consecutively augment the graph with cells for each dimension with a specified probability. As the number of possible cells increases combinatorially -- e.g., 2-cells can be represented as cycles, or permutations -- we derive an approximate sampling algorithm for this model limited to two-dimensional abstract cell complexes. As a basis for this algorithm, we first introduce a spanning-tree-based method that samples simple cycles and allows the efficient approximation of various properties, most notably the probability of occurence of a given cycle. This approximation is of independent interest as it enables the approximation of a wide variety of cycle-related graph statistics using importance sampling. We use this to approximate the number of cycles of a given length on a graph, allowing us to calculate the sampling probability to arrive at a desired expected number of sampled 2-cells. The probability approximation also trivially leads to a sampl
We construct a model of cell reprogramming (the conversion of fully differentiated cells to a state of pluripotency, known as induced pluripotent stem cells, or iPSCs) which builds on key elements of cell biology viz. cell cycles and cell lineages. Although reprogramming has been demonstrated experimentally, much of the underlying processes governing cell fate decisions remain unknown. This work aims to bridge this gap by modelling cell types as a set of hierarchically related dynamical attractors representing cell cycles. Stages of the cell cycle are characterised by the configuration of gene expression levels, and reprogramming corresponds to triggering transitions between such configurations. Two mechanisms were found for reprogramming in a two level hierarchy: cycle specific perturbations and a noise-induced switching. The former corresponds to a \emph{directed} perturbation that induces a transition into a cycle state of a different cell type in the potency hierarchy (mainly a stem cell) whilst the latter is a priori undirected and could be induced, e.g., by a (stochastic) change in the cellular environment. These reprogramming protocols were found to be effective in large reg
Rapid Single Flux Quantum (RSFQ) circuits are the most evolved superconductor logic family. However, the need to clock each cell and the deep pipeline causes a complex clock network with a large skew. This results in lower throughput and high latency in RSFQ. This work introduces an asynchronous RSFQ cell library that incorporates the α-cell, enabling bidirectional signal paths in RSFQ circuits. The α-cell mitigates the need for a large clock network by allowing reverse signal flow, minimizing routing, and enabling compact circuit designs. We demonstrate the library's reliability and efficiency by analog simulations and using in-house optimization tools. The asynchronous reset RSFQ (AR-SFQ) will enable efficient implementation of scalable, high-performance computing frameworks, such as state machines, neuromorphic computing, and higher fan-in circuits.
We generated a computational approach to analyze the biomechanics of epithelial cell aggregates, either island or stripes or entire monolayers, that combines both vertex and contact-inhibition-of-locomotion models to include both cell-cell and cell-substrate adhesion. Examination of the distribution of cell protrusions (adhesion to the substrate) in the model predicted high order profiles of cell organization that agree with those previously seen experimentally. Cells acquired an asymmetric distribution of basal protrusions, traction forces and apical aspect ratios that decreased when moving from the edge to the island center. Our in silico analysis also showed that tension on cell-cell junctions and apical stress is not homogeneous across the island. Instead, these parameters are higher at the island center and scales up with island size, which we confirmed experimentally using laser ablation assays and immunofluorescence. Without formally being a 3-dimensional model, our approach has the minimal elements necessary to reproduce the distribution of cellular forces and mechanical crosstalk as well as distribution of principal stress in cells within epithelial cell aggregates. By mak
Quian Quiroga et al. [Nature 435, 1102 (2005)] have recently discovered neurons that appear to have the characteristics of grandmother (GM) cells. Here we quantitatively assess the compatibility of their data with the GM-cell hypothesis. We show that, contrary to the general impression, a GM-cell representation can be information-theoretically efficient, but that it must be accompanied by cells giving a distributed coding of the input. We present a general method to deduce the sparsity distribution of the whole neuronal population from a sample, and use it to show there are two populations of cells: a distributed-code population of less than about 5% of the cells, and a much more sparsely responding population of putative GM cells. With an allowance for the number of undetected silent cells, we find that the putative GM cells can code for 10^5 or more categories, sufficient for them to be classic GM cells, or to be GM-like cells coding for memories. We quantify the strong biases against detection of GM cells, and show consistency of our results with previous measurements that find only distributed coding. We discuss the consequences for the architecture of neural systems and synapt
Statistical and mathematical modeling are crucial to describe, interpret, compare and predict the behavior of complex biological systems including the organization of hematopoietic stem and progenitor cells in the bone marrow environment. The current prominence of high-resolution and live-cell imaging data provides an unprecedented opportunity to study the spatiotemporal dynamics of these cells within their stem cell niche and learn more about aberrant, but also unperturbed, normal hematopoiesis. However, this requires careful quantitative statistical analysis of the spatial and temporal behavior of cells and the interaction with their microenvironment. Moreover, such quantification is a prerequisite for the construction of hypothesis-driven mathematical models that can provide mechanistic explanations by generating spatiotemporal dynamics that can be directly compared to experimental observations. Here, we provide a brief overview of statistical methods in analyzing spatial distribution of cells, cell motility, cell shapes and cellular genealogies. We also describe cell- based modeling formalisms that allow researchers to simulate emergent behavior in a multicellular system based
Organisms across all domains of life regulate the size of their cells. However, the means by which this is done is poorly understood. We study two abstracted "molecular" models for size regulation: inhibitor dilution and initiator accumulation. We apply the models to two settings: bacteria like Escherichia coli, that grow fully before they set a division plane and divide into two equally sized cells, and cells that form a bud early in the cell division cycle, confine new growth to that bud, and divide at the connection between that bud and the mother cell, like the budding yeast Saccharomyces cerevisiae. In budding cells, delaying cell division until buds reach the same size as their mother leads to very weak size control, with average cell size and standard deviation of cell size increasing over time and saturating up to 100-fold higher than those values for cells that divide when the bud is still substantially smaller than its mother. In budding yeast, both inhibitor dilution or initiator accumulation models are consistent with the observation that the daughters of diploid cells add a constant volume before they divide. This adder behavior has also been observed in bacteria. We f
Multiplexed immuno-fluorescence tissue imaging, allowing simultaneous detection of molecular properties of cells, is an essential tool for characterizing the complex cellular mechanisms in translational research and clinical practice. New image analysis approaches are needed because tissue section stained with a mixture of protein, DNA and RNA biomarkers are introducing various complexities, including spurious edges due to fluorescent staining artifacts between touching or overlapping cells. We have developed the RRScell method harnessing the stochastic random-reaction-seed (RRS) algorithm and deep neural learning U-net to extract single-cell resolution profiling-map of gene expression over a million cells tissue section accurately and automatically. Furthermore, with the use of manifold learning technique UMAP for cell phenotype cluster analysis, the AI-driven RRScell has equipped with a marker-based image cytometry analysis tool (markerUMAP) in quantifying spatial distribution of cell phenotypes from tissue images with a mixture of biomarkers. The results achieved in this study suggest that RRScell provides a robust enough way for extracting cytometric single cell morphology as w
Single-cell technologies are revolutionizing the entire field of biology. The large volumes of data generated by single-cell technologies are high-dimensional, sparse, heterogeneous, and have complicated dependency structures, making analyses using conventional machine learning approaches challenging and impractical. In tackling these challenges, deep learning often demonstrates superior performance compared to traditional machine learning methods. In this work, we give a comprehensive survey on deep learning in single-cell analysis. We first introduce background on single-cell technologies and their development, as well as fundamental concepts of deep learning including the most popular deep architectures. We present an overview of the single-cell analytic pipeline pursued in research applications while noting divergences due to data sources or specific applications. We then review seven popular tasks spanning through different stages of the single-cell analysis pipeline, including multimodal integration, imputation, clustering, spatial domain identification, cell-type deconvolution, cell segmentation, and cell-type annotation. Under each task, we describe the most recent developm
Bacteria tightly regulate and coordinate the various events in their cell cycles to duplicate themselves accurately and to control their cell sizes. Growth of Escherichia coli, in particular, follows a relation known as Schaechter 's growth law. This law says that the average cell volume scales exponentially with growth rate, with a scaling exponent equal to the time from initiation of a round of DNA replication to the cell division at which the corresponding sister chromosomes segregate. Here, we sought to test the robustness of the growth law to systematic perturbations in cell dimensions achieved by varying the expression levels of mreB and ftsZ. We found that decreasing the mreB level resulted in increased cell width, with little change in cell length, whereas decreasing the ftsZ level resulted in increased cell length. Furthermore, the time from replication termination to cell division increased with the perturbed dimension in both cases. Moreover, the growth law remained valid over a range of growth conditions and dimension perturbations. The growth law can be quantitatively interpreted as a consequence of a tight coupling of cell division to replication initiation. Thus, its