搜索 — ResearchTracker

Therapeutic mRNA design requires coordinating multiple interacting sequence features across the full transcript, where codon usage, untranslated regions (UTRs), and their coupling jointly determine stability, translation efficiency, and protein expression. Here, we present mRNA generation via unrolled trajectories and informed latent updates (mRNAutilus), a framework for simultaneous codon optimization and de novo UTR design directly from sequence. mRNAutilus combines a masked discrete diffusion model trained on millions of full-length mRNAs with Monte Carlo Tree Guidance to generate Pareto-efficient sequences under multiple functional objectives, using lightweight regressors over model embeddings to predict half-life, translation efficiency, and protein abundance. Unlike recent methods that design coding sequences and UTRs separately or rely on post hoc assembly and screening, mRNAutilus generates complete transcripts in a single process optimized across properties. Across diverse targets, zero-shot mRNAs encoding P. pyralis luciferase achieve over 400-fold higher expression than wild-type and outperform commercial and machine learning-designed baselines, including zero-shot gener

Helix-mRNA: A Hybrid Foundation Model For Full Sequence mRNA Therapeutics

arXiv2025-02-19作者：Matthew Wood, Mathieu Klop, Maxime Allard

mRNA-based vaccines have become a major focus in the pharmaceutical industry. The coding sequence as well as the Untranslated Regions (UTRs) of an mRNA can strongly influence translation efficiency, stability, degradation, and other factors that collectively determine a vaccine's effectiveness. However, optimizing mRNA sequences for those properties remains a complex challenge. Existing deep learning models often focus solely on coding region optimization, overlooking the UTRs. We present Helix-mRNA, a structured state-space-based and attention hybrid model to address these challenges. In addition to a first pre-training, a second pre-training stage allows us to specialise the model with high-quality data. We employ single nucleotide tokenization of mRNA sequences with codon separation, ensuring prior biological and structural information from the original mRNA sequence is not lost. Our model, Helix-mRNA, outperforms existing methods in analysing both UTRs and coding region properties. It can process sequences 6x longer than current approaches while using only 10% of the parameters of existing foundation models. Its predictive capabilities extend to all mRNA regions. We open-source

搜索结果：mRNA

mRNAutilus: Multi-Objective-Guided Discrete Generation of mRNA with Optimized Therapeutic Properties

Helix-mRNA: A Hybrid Foundation Model For Full Sequence mRNA Therapeutics

mRNA2vec: mRNA Embedding with Language Model in the 5'UTR-CDS for mRNA Design

mRNA-protein assembly reduces fluctuations in a system with bursty transcription

Distilling Genomic Models for Efficient mRNA Representation Learning via Embedding Matching

Bacterial stress granule protects mRNA through ribonucleases exclusion

Co-Translational mRNA Decay in Plants: Recent advances and future directions

Beware of so-called 'good' correlations: a statistical reality check on individual mRNA-protein predictions

A New Deep-learning-Based Approach For mRNA Optimization: High Fidelity, Computation Efficiency, and Multiple Optimization Factors

HyperHELM: Hyperbolic Hierarchy Encoding for mRNA Language Modeling

Curriculum-Augmented GFlowNets For mRNA Sequence Generation

Equi-mRNA: Protein Translation Equivariant Encoding for mRNA Language Models

An Evolutionary Approach for Designing Stable and Highly Expressible Low-Immunogenicity Therapeutic mRNA Sequences

Language-Inspired Modeling Reveals Redundant Encoding of N4-acetylcytidine(ac4C) Modifications in mRNA

Towards secondary structure prediction of longer mRNA sequences using a quantum-centric optimization scheme

Leveraging Knowledge Networks: Rethinking Technological Value Distribution in mRNA Vaccine Innovations

The Race of mRNA therapy: Evidence from Patent Landscape

HELM: Hierarchical Encoding for mRNA Language Modeling

mRNA Folding Algorithms for Structure and Codon Optimization

Protein-Conditioned Multi-Objective Reinforcement Learning for Full-Length mRNA Design