SoyMD: a platform combining multi-omics data with various tools for soybean research and breeding.

Nucleic Acids Res

Guangdong Key Laboratory of Plant Adaptation and Molecular Design, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou 510405, China.

Published: January 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Advanced multi-omics technologies offer much information that can uncover the regulatory mechanisms from genotype to phenotype. In soybean, numerous multi-omics databases have been published. Although they cover multiple omics, there are still limitations when it comes to the types and scales of omics datasets and analysis methods utilized. This study aims to address these limitations by collecting and integrating a comprehensive set of multi-omics datasets. This includes 38 genomes, transcriptomes from 435 tissue samples, 125 phenotypes from 6686 accessions, epigenome data involving histone modification, transcription factor binding, chromosomal accessibility and chromosomal interaction, as well as genetic variation data from 24 501 soybean accessions. Then, common analysis pipelines and statistical methods were applied to mine information from these multi-omics datasets, resulting in the successful establishment of a user-friendly multi-omics database called SoyMD (https://yanglab.hzau.edu.cn/SoyMD/#/). SoyMD provides researchers with efficient query options and analysis tools, allowing them to swiftly access relevant omics information and conduct comprehensive multi-omics data analyses. Another notable feature of SoyMD is its capability to facilitate the analysis of candidate genes, as demonstrated in the case study on seed oil content. This highlights the immense potential of SoyMD in soybean genetic breeding and functional genomics research.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10767819PMC
http://dx.doi.org/10.1093/nar/gkad786DOI Listing

Publication Analysis

Top Keywords

multi-omics data
8
multi-omics datasets
8
multi-omics
7
soymd
5
soymd platform
4
platform combining
4
combining multi-omics
4
data
4
data tools
4
soybean
4

Similar Publications

Background: Soil salinization represents a critical global challenge to agricultural productivity, profoundly impacting crop yields and threatening food security. Plant salt-responsive is complex and dynamic, making it challenging to fully elucidate salt tolerance mechanism and leading to gaps in our understanding of how plants adapt to and mitigate salt stress.

Results: Here, we conduct high-resolution time-series transcriptomic and metabolomic profiling of the extremely salt-tolerant maize inbred line, HLZY, and the salt-sensitive elite line, JI853.

View Article and Find Full Text PDF

Low-cost and high-throughput RNA sequencing data for barley RILs achieved GP performance comparable to or better than traditional SNP array datasets when combined with parental whole-genome sequencing SNP data. The field of genomic selection (GS) is advancing rapidly on many fronts including the utilization of multi-omics datasets with the goal of increasing prediction ability and becoming an integral part of an increasing number of breeding programs ensuring future food security. In this study, we used RNA sequencing (RNA-Seq) data to perform genomic prediction (GP) on three related barley RIL populations.

View Article and Find Full Text PDF

Hybrid AI in synthetic biology: next era in agriculture.

Trends Plant Sci

September 2025

Crop and Soils Sciences, University of Georgia, Athens, GA 30602, USA; Institute of Plant Breeding and Genetics and Genomics, University of Georgia, Athens, GA 30602, USA.

Synthetic biology holds great potential to transform agriculture, yet its progress is constrained by the complexity of multigenomic, multitrait, and multi-environment data. Desirable traits often arise from complex gene networks acting across diverse conditions, making them difficult to predict and optimize manually. In the past decade, artificial intelligence (AI) has supported this process, but its large data needs and poor integration limit its role to pattern recognition rather than explanatory trait design.

View Article and Find Full Text PDF

Advances in nanopore direct RNA sequencing and its impact on biological research.

Biotechnol Adv

September 2025

Key Laboratory of Microbiological Metrology, Measurement & Bio-product Quality Security, State Administration for Market Regulation, China Jiliang University, Hangzhou 310018, China. Electronic address:

Nanopore direct RNA sequencing (DRS) is a transformative technology that enables full-length, single-molecule sequencing of native RNA, capturing transcript isoforms and preserving epitranscriptomic modifications without cDNA conversion. This review outlines key advances in DRS, including optimized protocols for mRNA, rRNA, tRNA, circRNA, and viral RNA, as well as analytical tools for isoform quantification, poly(A) tail measurement, fusion transcript identification, and base modification profiling. We highlight how DRS has redefined transcriptomic studies across diverse systems-from uncovering novel transcripts and alternative splicing events in cancer, plants, and parasites to enabling the direct detection of m6A, m5C, pseudouridine, and RNA editing events.

View Article and Find Full Text PDF

InterVelo: A Mutually Enhancing Model for Estimating Pseudotime and RNA Velocity in Multi-Omic Single-Cell Data.

Bioinformatics

September 2025

Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.

Motivation: RNA velocity has become a powerful tool for uncovering transcriptional dynamics in snapshot single-cell data. However, current RNA velocity approaches often assume constant transcriptional rates and treat genes independently with gene-specific times, which may introduce biases and deviate from biological realities. Here, we present InterVelo, a novel deep learning framework that simultaneously learns cellular pseudotime and RNA velocity.

View Article and Find Full Text PDF