In cancer research, the term epigenetics was used in the 1970s in its modern sense encompassing non-genetic events modifying the chromatin state, mainly to oppose the emerging oncogene paradigm. However, starting from the establishment of this prominent concept, the importance of these epigenetic phenomena in cancer rarely led to questioning the causal role of genetic alterations. Only in the last 10 years, the accumulation of problematic data, better experimental technologies, and some ambitious models pushed the idea that epigenetics could be at least as important as genetics in early oncogenesis. Until this year, a direct demonstration of epigenetic oncogenesis was still lacking. Now Parreno, Cavalli and colleagues, using a refined experimental model in the fruit fly Drosophila melanogaster, enforced the initiation of tumours solely by imposing a transient loss of Polycomb repression, leading to a purely epigenetic oncogenesis phenomenon. Despite a few caveats that we discuss, this pioneering work represents a major breakpoint in cancer research that leads us to consider the theoretical and conceptual implications on oncogenesis and to search for links between this artificial ex
Artificially intelligent (AI) co-scientists must be able to sift through research literature cost-efficiently while applying nuanced scientific reasoning. We evaluate Small Language Models (SLMs, <= 8B parameters) for classifying medical research papers. Using literature on the oncogenic potential of HMTV/MMTV-like viruses in breast cancer as a case study, we assess model performance with both zero-shot and in-context learning (ICL; few-shot prompting) strategies against frontier proprietary Large Language Models (LLMs). Llama 3 and Qwen2.5 outperform GPT-5 (API, low/high effort), Gemini 3 Pro Preview, and Meerkat in zero-shot settings, though trailing Gemini 2.5 Pro. ICL leads to improved performance on a case-by-case basis, allowing Llama 3 and Qwen2.5 to match Gemini 2.5 Pro in binary classification. Systematic lexical-ablation experiments show that SLM decisions are often grounded in valid scientific cues but can be influenced by spurious textual artifacts, underscoring need for interpretability in high-stakes pipelines. Our results reveal both promise and limitations of modern SLMs for scientific triage; pairing SLMs with simple but principled prompting strategies can appro
More than 30 years ago, we published a paper entitled as abnormal chromatin configuration and oncogenesis, which proposed the first hypothesis that links oncogenesis to abnormal three-dimensional (3D) genome structure. Recently, many studies have demonstrated that the 3D genome structure plays a major role in oncogenesis, which strongly supports our hypothesis. In this paper, further thoughts about our hypothesis is presented.
Diseased conditions are a consequence of some abnormality that are associated with clinical conditions in numerous cells and tissues affecting various organs. The common role of EBV (Epstein-Barr virus) in causing infectious mononucleosis (IM) affecting B-cells and epithelial cells and the development of EBV-associated cancers has been an area of active research. Investigating such significant interactions may help discover new therapeutic targets for certain EBV-associated lymphoproliferative (Burkitt's Lymphoma and Hodgkin's Lymphoma) and non-lymphoproliferative diseases (Gastric cancer and Nasopharyngeal cancer). Based on the DisGeNET (v7.0) data set, we constructed a disease-gene network bipartite graph to identify genes that are involved in various carcinomas namely, gastric cancer (GC), nasopharyngeal cancer (NPC), Hodgkin's lymphoma (HL) and Burkitt's lymphoma (BL). Using the community detection algorithm (Louvain method), we identified communities followed by functional enrichment using over-representation analysis methodology. In this study, we identified the modular communities to explore the relation of this common causative pathogen (EBV) with different carcinomas such
Understanding how glioblastoma (GBM) emerges from initially healthy glial tissue requires models that integrate bioelectrical, metabolic, and multicellular dynamics. This work introduces an ASAL-inspired agent-based framework that simulates bioelectric state transitions in glial cells as a function of mitochondrial efficiency (Meff), ion-channel conductances, gap-junction coupling, and ROS dynamics. Using a 64x64 multicellular grid over 60,000 simulation steps, we show that reducing Meff below a critical threshold (~0.6) drives sustained depolarization, ATP collapse, and elevated ROS, reproducing key electrophysiological signatures associated with GBM. We further apply evolutionary optimization (genetic algorithms and MAP-Elites) to explore resilience, parameter sensitivity, and the emergence of tumor-like attractors. Early evolutionary runs converge toward depolarized, ROS-dominated regimes characterized by weakened electrical coupling and altered ionic transport. These results highlight mitochondrial dysfunction and disrupted bioelectric signaling as sufficient drivers of malignant-like transitions and provide a computational basis for probing the bioelectrical origins of oncogen
Identifying driver genes is crucial for understanding oncogenesis and developing targeted cancer therapies. Driver discovery methods using protein or pathway networks rely on traditional network science measures, focusing on nodes, edges, or community metrics. These methods can overlook the high-dimensional interactions that cancer genes have within cancer networks. This study presents a novel method using Persistent Homology to analyze the role of driver genes in higher-order structures within Cancer Consensus Networks derived from main cellular pathways. We integrate mutation data from six cancer types and three biological functions: DNA Repair, Chromatin Organization, and Programmed Cell Death. We systematically evaluated the impact of gene removal on topological voids ($β_2$ structures) within the Cancer Consensus Networks. Our results reveal that only known driver genes and cancer-associated genes influence these structures, while passenger genes do not. Although centrality measures alone proved insufficient to fully characterize impact genes, combining higher-order topological analysis with traditional network metrics can improve the precision of distinguishing between driver
Tumour heterogeneity is increasingly recognized as a major obstacle to therapeutic success across neuro-oncology. Gliomas are characterised by distinct combinations of genetic and epigenetic alterations, resulting in complex interactions across multiple molecular pathways. Predicting disease evolution and prescribing individually optimal treatment requires statistical models complex enough to capture the intricate (epi)genetic structure underpinning oncogenesis. Here, we formalize this task as the inference of distinct patterns of connectivity within hierarchical latent representations of genetic networks. Evaluating multi-institutional clinical, genetic, and outcome data from 4023 glioma patients over 14 years, across 12 countries, we employ Bayesian generative stochastic block modelling to reveal a hierarchical network structure of tumour genetics spanning molecularly confirmed glioblastoma, IDH- wildtype; oligodendroglioma, IDH-mutant and 1p/19q codeleted; and astrocytoma, IDH- mutant. Our findings illuminate the complex dependence between features across the genetic landscape of brain tumours, and show that generative network models reveal distinct signatures of survival with b
Genomic alterations lead to cancer complexity and form a major hurdle for a comprehensive understanding of the molecular mechanisms underlying oncogenesis. In this review, we describe the recent advances in studying cancer-associated genes from a systems biological point of view. The integration of known cancer genes onto protein and signaling networks reveals the characteristics of cancer genes within networks. This approach shows that cancer genes often function as network hub proteins which are involved in many cellular processes and form focal nodes in the information exchange between many signaling pathways. Literature mining allows constructing gene-gene networks, in which new cancer genes can be identified. The gene expression profiles of cancer cells are used for reconstructing gene regulatory networks. By doing so, the genes, which are involved in the regulation of cancer progression, can be picked up from these networks after which their functions can be further confirmed in the laboratory.
Tumor cells with two nuclei (binucleated cells, BiNC) or more nuclei (multinucleated cells, MuNC) indicate an increased amount of cellular genetic material which is thought to facilitate oncogenesis, tumor progression and treatment resistance. In canine cutaneous mast cell tumors (ccMCT), binucleation and multinucleation are parameters used in cytologic and histologic grading schemes (respectively) which correlate with poor patient outcome. For this study, we created the first open source data-set with 19,983 annotations of BiNC and 1,416 annotations of MuNC in 32 histological whole slide images of ccMCT. Labels were created by a pathologist and an algorithmic-aided labeling approach with expert review of each generated candidate. A state-of-the-art deep learning-based model yielded an $F_1$ score of 0.675 for BiNC and 0.623 for MuNC on 11 test whole slide images. In regions of interest ($2.37 mm^2$) extracted from these test images, 6 pathologists had an object detection performance between 0.270 - 0.526 for BiNC and 0.316 - 0.622 for MuNC, while our model archived an $F_1$ score of 0.667 for BiNC and 0.685 for MuNC. This open dataset can facilitate development of automated image
Within systems biology there is an increasing interest in the stochastic behavior of genetic and biochemical reaction networks. An appropriate stochastic description is provided by the chemical master equation, which represents a continuous time Markov chain (CTMC). In this paper we consider the stochastic properties of a biochemical circuit, known to control eukaryotic cell cycle and possibly involved in oncogenesis, recently proposed in the literature within a deterministic framework. Due to the inherent stochasticity of biochemical processes and the small number of molecules involved, the stochastic approach should be more correct in describing the real system: we study the agreement between the two approaches by exploring the system parameter space. We address the problem by proposing a simplified version of the model that allows analytical treatment, and by performing numerical simulations for the full model. We observed optimal agreement between the stochastic and the deterministic description of the circuit in a large range of parameters, but some substantial differences arise in at least two cases: 1) when the deterministic system is in the proximity of a transition from a
Background: Radiotherapy outcomes are usually predicted using the Linear Quadratic model. However, this model does not integrate complex features of tumor growth, in particular cell cycle regulation. Methods: In this paper, we propose a multiscale model of cancer growth based on the genetic and molecular features of the evolution of colorectal cancer. The model includes key genes, cellular kinetics, tissue dynamics, macroscopic tumor evolution and radiosensitivity dependence on the cell cycle phase. We investigate the role of gene-dependent cell cycle regulation in the response of tumors to therapeutic irradiation protocols. Results: Simulation results emphasize the importance of tumor tissue features and the need to consider regulating factors such as hypoxia, as well as tumor geometry and tissue dynamics, in predicting and improving radiotherapeutic efficacy. Conclusion: This model provides insight into the coupling of complex biological processes, which leads to a better understanding of oncogenesis. This will hopefully lead to improved irradiation therapy.
The glutamate metabotropic receptor 1 (GRM1) drives oncogenesis when aberrantly activated in melanoma and several other cancers. Metabolomics reveals that patient-derived xenografts with GRM1-positive melanoma tumors exhibit elevated plasma glutamate levels associated with metastatic melanoma in vivo. Stable isotope tracing and GCMS analysis determined that cells expressing GRM1 fuel a substantial fraction of glutamate from glycolytic carbon. Stimulation of GRM1 by glutamate leads to activation of mitogenic signaling pathways, which in turn increases the production of glutamate, fueling autocrine feedback. Implementing a rational drug-targeting strategy, we critically evaluate metabolic bottlenecks in vitro and in vivo. Combined inhibition of glutamate secretion and biosynthesis is an effective rational drug targeting strategy suppressing tumor growth and restricting tumor bioavailability of glutamate.
Massive stars in their final stages of collapse radiate most of their binding energy in the form of MeV neutrinos. The recoil atoms that they produce in elastic scattering off nuclei in organic tissue create radiation damage which is highly effective in the production of irreparable DNA harm, leading to cellular mutation, neoplasia and oncogenesis. Using a conventional model of the galaxy and of the collapse mechanism, the periodicity of nearby stellar collapses and the radiation dose are calculated. The possible contribution of this process to the paleontological record of mass extinctions is examined.
The cell cycle is tightly regulated by cyclins and their catalytic moieties, the cyclin-dependent kinases (CDKs). Cyclin D1, in association with CDK4/6, acts as a mitogenic sensor and integrates extracellular mitogenic signals and cell cycle progression. When deregulated (overexpressed, accumulated, inappropriately located), cyclin D1 becomes an oncogene and is recognized as a driver of solid tumors and hemopathies. Recent studies on the oncogenic roles of cyclin D1 reported non-canonical functions dependent on the partners of cyclin D1 and its location within tumor cells or tissues. Support for these new functions was provided by various mouse models of oncogenesis. Finally, proteomic and transcriptomic data identified complex cyclin D1 networks. This review focuses on these aspects of cyclin D1 pathophysiology, which may be crucial for targeted therapy.
The competition between cancer cells and immune system cells in inhomogeneous conditions is described at cell scale within the framework of the thermostatted kinetic theory. Cell learning is reproduced by increased cell activity during favorable interactions. The cell activity fluctuations are controlled by a thermostat. The direction of cell velocity is changed according to stochastic rules mimicking a dense fluid. We develop a kinetic Monte Carlo algorithm inspired from the direct simulation Monte Carlo (DSMC) method initially used for dilute gases. The evolution of an initially localized tumor is analyzed. Qualitatively different behaviors are observed as the field regulating activity fluctuations decreases. For high field values, i.e. efficient thermalization, cancer is controlled. For small field values, cancer rapidly and monotonously escapes from immunosurveillance. For the critical field value separating these two domains, the 3E's of immunotherapy are reproduced, with an apparent initial elimination of cancer, a long quasi-equilibrium period followed by large fluctuations, and the final escape of cancer, even for a favored production of immune system cells. For field value
In 1994, I conjectured that insulin-degrading enzyme (IDE) acts as an inhibitor of malignant transformation by degrading insulin and thus preventing this major growth-stimulatory hormone from binding and thereby inactivating the retinoblastoma tumor suppressor protein (RB). Ten years later, I discovered that a carboxyterminal RB amino acid sequence resembles the catalytic center of IDE. This structural homology raised the possibility that insulin degradation is a basic mechanism for tumor suppression shared by RB and IDE. Subsequently, a first immunohistochemical study on the differential expression of human IDE in normal tissues, primary tumors and their corresponding lymph node metastases further corroborated the initial conjecture on IDE being an antineoplastic molecule. In this report, it is shown that IDE harbors ankyrin repeat-like amino acid sequences through which it might bind and, as a result, antagonize the pro-inflammatory factor NF-kappaB as well as cyclin-dependent kinases (CDKs). As equally revealed here, IDE also contains 2 RXL cyclin-binding motifs which could contribute to its presumed inhibition of CDKs. These new findings suggest that IDE is potentially able to
The proton-magnetic reaction is commonly used in MRI machines with a strong magnetic field of over 1 T, while this study hypothesized that the electron magnetic reaction of hydrogen could affect the hydrogen bonds of double-stranded DNA (dsDNA) at a low magnetic field below 0.01 T. The goal is to develop a hydrogen bonding magnetic reaction-based gene regulation (HBMR-GR) system. The polarities of DNA base pairs are derived from the relative electrostatic charge between purines and pyrimidines, which become positively and negatively charged, respectively. The Pyu dsDNAs with pyrimidine(s)-purine(s) sequences, ds3T3A, ds3C3G, and ds3C3A, showed stronger DNA hybridization potential, increased infrared absorption at 3400-3200 cm-1, and a unique DNA conformation in HPLC analysis compared to the corresponding Puy dsDNAs. To target the three-dimensional structure of dsDNA based on the DNA base pair polarities, one can use cyclic electromagnetic DNA simulation (CEDS) with approximately 25% efficiency for randomly oriented dsDNAs. CEDS was found to induce sequence-specific hybridization of target oligo-dsDNAs in 0.005M NaCl solution and sequence-specific conformation of oligo-dsDNAs in 0.1
We propose a statistical framework to integrate radiological magnetic resonance imaging (MRI) and genomic data to identify the underlying radiogenomic associations in lower grade gliomas (LGG). We devise a novel imaging phenotype by dividing the tumor region into concentric spherical layers that mimics the tumor evolution process. MRI data within each layer is represented by voxel--intensity-based probability density functions which capture the complete information about tumor heterogeneity. Under a Riemannian-geometric framework these densities are mapped to a vector of principal component scores which act as imaging phenotypes. Subsequently, we build Bayesian variable selection models for each layer with the imaging phenotypes as the response and the genomic markers as predictors. Our novel hierarchical prior formulation incorporates the interior-to-exterior structure of the layers, and the correlation between the genomic markers. We employ a computationally-efficient Expectation--Maximization-based strategy for estimation. Simulation studies demonstrate the superior performance of our approach compared to other approaches. With a focus on the cancer driver genes in LGG, we discu