Search - ResearchTracker

Peptide-based vaccines, enabled by bioinformatics and machine learning (ML), have emerged as one of the most promising approaches for rapid, safe, and cost-effective vaccine design against infectious diseases. Unlike conventional approaches that depend heavily on whole-pathogen cultures or recombinant protein expression, peptide vaccines can be designed in silico and synthesized quickly. Rational and targeted in silico approaches for the discovery of peptide-based vaccine candidates include B-cell and T-cell epitope prediction, immunogenicity, antigenicity, allergenicity, autoimmunity, population coverage, sequence conservation, molecular docking, molecular dynamics simulation, in silico cloning, and immunological simulation analyses. The combination of these comprehensive computational methods can effectively generate high-quality vaccine candidates for subsequent validation via in vitro and in vivo experiments. This review contextualizes the historical trajectory of peptide-based vaccinology, from early linear epitope discoveries in the 1960s to multi-epitope constructs and clinically tested candidates such as UB-612 and PepGNP-Covid19. It examines critical challenges in immunoinformatics, including performance gaps in epitope prediction tools, complexities in human leucocyte antigen (HLA) mapping, and the need for extensive manual intervention in pipelines. Artificial intelligence-driven approaches, spanning deep learning and interpretable ML, are positioned to transform epitope prediction, reduce human error, and standardize reproducibility. These advances have the potential to support global outbreak response targets such as the Coalition for Epidemic Preparedness Innovations (CEPI) 100 Days Mission and the World Health Organization (WHO) Research and Development (R&D) Blueprint. However, their performance remains constrained by data quality, dataset imbalance, limited benchmark standardization, and persistent underrepresentation of many HLA alleles and population groups.

Key Points

Peptide-based vaccines, accelerated by bioinformatics and machine learning, offer a potentially rapid, relatively safe, and cost-effective alternative to traditional vaccine design, enabling in silico development and swift synthetic manufacturing.

Computational methods such as B-cell and T-cell epitope prediction, immunogenicity analysis, and molecular simulations allow for rational and targeted vaccine candidate discovery, enhancing quality and efficiency.

The field has evolved from early linear epitope discoveries in the 1960s to sophisticated multi-epitope constructs and clinically tested candidates like UB-612 and PepGNP-Covid19.

Major challenges in immunoinformatics include performance limitations in epitope prediction tools, complexities in HLA mapping, and the necessity for manual intervention in data pipelines.

Artificial intelligence-driven models, including deep learning and interpretable machine learning, promise to overcome these challenges by improving prediction accuracy, reducing errors, and supporting global epidemic response efforts such as CEPI's 100 Days Mission and the WHO R&D Blueprint.
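As a rough illustration of one of the in silico filters the abstract names (sequence conservation), the sketch below scores candidate peptides by the fraction of strain sequences that contain them exactly. The function names, the toy sequences, and the 0.9 threshold are assumptions made for this example; the review does not prescribe a specific implementation, and real pipelines typically work from aligned sequences and tolerate mismatches.

```python
def conservation(peptide, sequences):
    """Fraction of sequences that contain `peptide` as an exact substring."""
    if not sequences:
        return 0.0
    return sum(peptide in seq for seq in sequences) / len(sequences)


def filter_conserved(peptides, sequences, threshold=0.9):
    """Keep peptides whose conservation meets the (assumed) threshold."""
    return [p for p in peptides if conservation(p, sequences) >= threshold]


# Toy protein sequences standing in for different strains (made up).
strains = ["MKTLLVAGAVKDE", "MKTLLVAGSVKDE", "MKTLLVAGAVKDQ", "MRTLLVAGAVKDE"]
candidates = ["TLLVAG", "AGAVKD", "MKTLLV"]

kept = filter_conserved(candidates, strains, threshold=0.9)
print(kept)
```

Exact substring matching is the simplest possible conservation measure; it is used here only to keep the example self-contained.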

RiSpy: a feature selection-based fingerprinting framework for accurate identification of genome-edited rice lines.

PubMed, 2026-05-04. Authors: Zolfaghari A, Fraiture MA, Vanneste K

The European Union (EU) enforces strict regulations on the traceability and labeling of genetically modified organisms (GMOs), including genome-edited (GE) lines produced through new genomic techniques (NGTs). Identifying GE organisms created by single nucleotide variations (SNVs) is, however, challenging, as a single SNV alone cannot unambiguously define a GE line. Recently, we introduced the concept of generating a genetic fingerprint to distinguish a specific GE rice line. This proof-of-concept approach integrated whole-genome sequencing (WGS)-based characterization using Illumina technology, the public 3K Rice Genomes (3KRG) database, and statistical feature-selection tools to select and combine key genetic elements, including GE on-target site(s) and cultivar-specific 2-SNV barcodes, into a unique genetic fingerprint. In the present study, we expand this concept into a generalized data-driven framework allowing identification of multiple rice lines. Supported by newly developed bioinformatics and statistical feature-selection-based pipelines, this optimized strategy enables the generation of genetic fingerprints irrespective of a rice cultivar's inclusion in publicly available databases like 3KRG. In addition, this refined strategy can leverage WGS data generated from both Illumina and Oxford Nanopore Technologies (ONT) platforms for fingerprint generation and GE line identification. Using two distinct in-house GE rice lines from different cultivars, along with various publicly available WGS datasets, we demonstrated the robustness, scalability, and specificity of this approach for reliable GE rice line identification. Our findings provide a methodological foundation for data-driven traceability of GE rice lines, reinforcing regulatory compliance, supporting intellectual property (IP) protection, and contributing to the responsible implementation of EU GMO/NGT legislation.
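The idea behind a cultivar-specific 2-SNV barcode can be illustrated with a minimal sketch. It assumes a hypothetical genotype table (cultivar name mapped to alleles at named sites) and searches exhaustively for pairs of sites whose combined alleles occur only in the target line. This is not the RiSpy pipeline, whose feature-selection procedure is not detailed in the abstract; the function name, data layout, and site labels are invented for illustration.

```python
from itertools import combinations


def find_two_snv_barcodes(genotypes, target):
    """Return site pairs whose allele combination is unique to `target`.

    genotypes: dict of cultivar name -> dict of site -> allele
    (a hypothetical layout chosen for this example).
    """
    sites = sorted(genotypes[target])
    others = [name for name in genotypes if name != target]
    barcodes = []
    for a, b in combinations(sites, 2):
        target_pair = (genotypes[target][a], genotypes[target][b])
        # Keep the pair only if no other cultivar shares both alleles.
        if all((genotypes[o][a], genotypes[o][b]) != target_pair for o in others):
            barcodes.append((a, b))
    return barcodes


# Toy panel: no single site separates "lineX" from every other cultivar.
panel = {
    "lineX": {"chr1:100": "A", "chr2:200": "T", "chr3:300": "G"},
    "cv1":   {"chr1:100": "A", "chr2:200": "C", "chr3:300": "G"},
    "cv2":   {"chr1:100": "G", "chr2:200": "T", "chr3:300": "G"},
    "cv3":   {"chr1:100": "G", "chr2:200": "C", "chr3:300": "G"},
}

barcodes = find_two_snv_barcodes(panel, "lineX")
print(barcodes)
```

In the toy panel each individual allele of lineX is shared with at least one other cultivar, which mirrors the abstract's point that a single SNV cannot unambiguously define a line, while one pair of sites does. An exhaustive pair search scales quadratically with the number of sites, so a genome-wide analysis would need a prior feature-selection step to shortlist candidates.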

Search results: Briefings in bioinformatics

The critical role of artificial intelligence and bioinformatics in accelerating peptide-based vaccine discovery for tackling global infectious diseases.

RiSpy: a feature selection-based fingerprinting framework for accurate identification of genome-edited rice lines.

Cell line-specific gene network enrichment analysis for interpreting continuous phenotypes.

SOPA and SIMPA: normalized single-sample integrated multiomics pathway analysis of tumor heterogeneity in solid cancers.

AbTune: layer-wise selective fine-tuning of protein language models for antibodies.

A chromatin-structure-guided framework for predictive and interpretable regulatory genomics.

T-cell receptor alpha to beta chains binding prediction.

MRDsteer: quality-aware AI-driven closed-loop optimization enhances ctDNA-based minimal residual disease detection.

A scoping review of approaches to evaluating workflow management systems for bioinformatics users.

Quantum bioinformatics: a systematic review of methods, trends, and challenges.

Annotation-free phenotype prediction using knowledge-augmented clustering from single-cell RNA sequencing data.

Ligand-agnostic off-target site prediction for early toxicity screening by leveraging point cloud-based protein cavity analysis.

An overview of self-supervised deep learning applications to molecular data.

scHashFormer: a hash-driven graph transformer for scalable scRNA-seq clustering.

Investigating the anticancer activity of eravacycline in pancreatic cancer via target-based deep learning and experimental validation.

Graph-based drug-target interaction modeling: from representation learning to output-driven drug discovery.

AGCLD: an adaptive graph contrastive learning method with denoising for spatial domain identification.

Hi-C informed kernel association test for integrating 3D genome structure into variant-set analysis.

A graph retrieval-augmented generation pipeline for systematic drug target discovery: validation and application to ocular neovascularization.

Advancing bioinformatics with language models: components, applications, and perspectives.