搜索 — ResearchTracker

Single-molecule assays like NOMe-seq, dSMF, and Nanopore are superior to DNase-seq and ATAC-seq as they do not destroy DNA. Thus, they enable quantification of all three, that is, protein-free, Transcription Factor-bound, and histone-complex-bound states. But a user-friendly tool to visualize and quantify such states is lacking. Here, we present SMTrackR, an R/Bioconductor package to visualize protein-DNA binding states on individual sequenced DNA molecules. SMTrackR queries the single-molecule footprint database we built and hosted at Galaxy Server. It comprises BigBed files generated from NOMe-seq, dSMF, and Nanopore (SMAC-seq) datasets. SMTrackR exploits UCSC REST API to query a BigBed file and plot footprint heatmap categorized in different binding states, as well as report their occupancies. Additionally, this package generates a Gviz-enabled script to visualize these single molecules on gene tracks. The SMTrackR tool is implemented in the statistical programming language R and is available as a Bioconductor package, SMTrackR (https://bioconductor.org/packages/3.23/bioc/html/SMTrackR.html). The GitHub repository at https://github.com/satyanarayan-rao/SMTrackR has latest updates. The installation time is less than five minutes given the dependent packages are installed. The tool is also available as a web version https://smtrackrest.iitr.ac.in/. A function is provided to use local BigBed file for users who wish to use unpublished data. A fully automated pipeline to generate such BigBed files is available at https://github.com/satyanarayan-rao/SMF_for_SMThub, and https://github.com/satyanarayan-rao/dSMF_for_SMThub.

Ancient hybridization and phylogenetic discordance: Exploring evolutionary complexity in Asteraceae.

PubMed2026-06-01作者：Ellestad PA, Moore-Pollard ER, Siniscalchi CM

Conflicting phylogenetic signals are common in plant phylogenomics and often reflect evolutionary histories shaped by processes like hybridization, incomplete lineage sorting, and whole-genome duplication (WGD). We aimed to identify and assess these complex processes in the hyper-diverse family Asteraceae to offer insight into the underlying causes of phylogenetic discordance. We used new and existing Hyb-Seq and transcriptome data to explore phylogenetic discordance by testing for nuclear/plastid incongruences, WGD, and reticulation. We present a tutorial detailing the execution of complex bioinformatic analyses to increase transparency, facilitate reproducibility, and support advancements in the field of plant evolution (https://github.com/erika-r-moore/Ellestad_etal_2025_APPS_Hybridizations). We uncovered extensive discordance among nuclear gene trees and deep reticulation events, particularly among South American lineages. Signals of WGD were found across the family but were often difficult to interpret, likely due to variation in data completeness, the complexity of the events, and their ancient origins. Our study and tutorial, along with a growing body of phylogenomic research, emphasize the role of reticulation and WGD in the evolution of large, diverse clades, while also underscoring the challenges. We anticipate continued advancements in theoretical approaches that will further enhance empirical studies in reticulate evolution. Las señales filogenéticas conflictivas son comunes en la filogenómica de plantas, y a menudo, reflejan historias evolutivas moldeadas por procesos como la hibridación, la clasificación incompleta de linajes y la duplicación del genoma completo. Nuestro objetivo fue identificar y evaluar estos procesos complejos en la diversa familia Asteraceae, para ofrecer una perspectiva sobre las causas subyacentes de la discordancia filogenética. Utilizamos datos nuevos y existentes de Hyb‐Seq y transcriptomas para explorar la discordancia filogenética mediante pruebas de incongruencia entre los genomas nucleares y plastidiales, duplicación completa del genoma (WGD) y reticulación. Presentamos un tutorial que detalla la ejecución de análisis bioinformáticos complejos para aumentar la transparencia, facilitar la reproducibilidad y apoyar los avances en el campo de la evolución de las plantas (https://github.com/erika-r-moore/Ellestad_etal_2025_APPS_Hybridizations). Descubrimos una discordancia extensa entre los árboles genéticos nucleares y eventos profundos de reticulación, particularmente entre linajes sudamericanos. Se detectaron señales de WGD en toda la familia, aunque a menudo resultaron difíciles de interpretar, probablemente debido a la variación en la integridad de los datos, la complejidad de los eventos y su origen antiguo. Nuestro estudio y tutorial, junto con un cuerpo creciente de investigaciones filogenómicas, destacan el papel de la reticulación y de los WGD en la evolución de clados grandes y diversos, al mismo tiempo que subrayan los desafíos asociados. Anticipamos avances continuos en enfoques teóricos que potenciarán aún más los estudios empíricos sobre la evolución reticulada.

Applications in plant sciences

Inferring Dynamic Information from Protein Structures by Gaussian Integrals and Deep Learning.

PubMed2026-06-24作者：Vilicich F, Bottino N, Su Z

Protein dynamics are central to function, but experiments and molecular dynamics (MD) simulations remain costly, low-throughput, and difficult to compare across protocols. Scalable structure-based methods are needed to infer dynamics from static protein structures. We present a deep learning framework that predicts protein dynamics from 30-dimensional Gaussian integral (GI) descriptors of Cα backbone topology. Using 1,374 ATLAS protein chains with MD-derived RMSF, GI stratified proteins into fold-relevant clusters enriched for secondary structure, sequence homology, and ECOD families. An attention-based 1D-CNN classified flexible versus non-flexible proteins with test AUC = 0.772 and separated slow-mode- from fast-mode-dominated dynamics with AUC = 0.91. Regression models recovered mean RMSF (Pearson r = 0.72; R² = 0.46) and slow-mode RMSF more accurately (Pearson r = 0.83; R² = 0.62), supporting rapid inference of flexibility and collective-motion bias. Code and data are available on GitHub at: https://github.com/fvilicich/gaussian_integral/blob/main/gaussian_integral_classification.ipynb. Supplementary data are available at Bioinformatics online.

搜索结果：GitHub

SMTrackR: an R/Bioconductor package for mapping protein binding at individual DNA molecules.

Ancient hybridization and phylogenetic discordance: Exploring evolutionary complexity in Asteraceae.

Inferring Dynamic Information from Protein Structures by Gaussian Integrals and Deep Learning.

Cell type-specific contextualisation of the human phenome: towards the systematic treatment of all rare diseases.

Accurate full segmentation of organs-at-risk in head and neck cancer based on multimodal point cloud fusion.

Wqsreg: a Stata command for weighted quantile sum regression.

Informed-Exploration Reinforcement Learning for Automated Virtual Coronary Intervention Planning.

RoCA: Robust Contrastive One-Class Time Series Anomaly Detection With Contaminated Data.

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

SegJointGene: joint cell segmentation and spatial gene prioritization by information entropy guided convolutional neural networks.

Toward an Open Analysis Ecosystem for Plasmodium Genomic Epidemiology.

MSTune: A Data-Driven Approach to Parameter Tuning Using Grid Search and Differential Evolution for Gas Chromatography-Mass Spectrometry-Based Compound Identification.

A fully inductive inference protocol for population GNNs in single-subject brain disorder diagnosis.

Raising the Bar in Graph OOD Generalization: Invariant Learning beyond Explicit Environment Modeling.

A Novel Framework for Gene Regulatory Network Inference Integrating Bidirectional Mamba and Dual Contrastive Learning.

Hotgenes: an R package for reducing bottlenecks in bulk omics data exploration and collaboration.

Rethinking the detail-preserved completion of complex tubular structures based on point cloud: A dataset and a benchmark.

HybSuite: An integrated pipeline for hybrid capture phylogenomics from reads to trees.

Expanding the OMOP common data model to support extracorporeal life support research.

ES-DETR: Real-time detection transformer with encover and soft-dropout.