98%
921
2 minutes
20
Accurate identification of genetic variants from family child-mother-father trio sequencing data is important in genomics. However, state-of-the-art approaches treat variant calling from trios as three independent tasks, which limits their calling accuracy for Nanopore long-read sequencing data. For better trio variant calling, we introduce Clair3-Trio, the first variant caller tailored for family trio data from Nanopore long-reads. Clair3-Trio employs a Trio-to-Trio deep neural network model, which allows it to input the trio sequencing information and output all of the trio's predicted variants within a single model to improve variant calling. We also present MCVLoss, a novel loss function tailor-made for variant calling in trios, leveraging the explicit encoding of the Mendelian inheritance. Clair3-Trio showed comprehensive improvement in experiments. It predicted far fewer Mendelian inheritance violation variations than current state-of-the-art methods. We also demonstrated that our Trio-to-Trio model is more accurate than competing architectures. Clair3-Trio is accessible as a free, open-source project at https://github.com/HKU-BAL/Clair3-Trio.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9487642 | PMC |
http://dx.doi.org/10.1093/bib/bbac301 | DOI Listing |
Mol Genet Genomics
September 2025
Human Phenome Institute, MOE Key Laboratory of Contemporary Anthropology, Zhangjiang Fudan International Innovation Center, Fudan University, 825 Zhangheng Road, Shanghai, 201203, China.
Accurate variant calling is essential for next-generation sequencing (NGS)-based diagnosis of rare diseases, yet most benchmarking studies have focused on standard cell lines or trio-based samples, with limited relevance to sporadic cases. Here, we systematically compared the performance of DeepVariant and GATK HaplotypeCaller in two Chinese cohorts of patients with sporadic epilepsy (EP) and autism spectrum disorder (ASD). DeepVariant exhibited higher precision and sensitivity in detecting single nucleotide variants (SNVs), while GATK showed a distinct advantage in identifying rare variants, which are often key to understanding the genetic basis of rare diseases.
View Article and Find Full Text PDFOral pre-exposure prophylaxis (PrEP) denotes an effective strategy to reduce the risk of HIV infection. However, many individuals encounter difficulties adhering to the once-daily regimen, which highlights the need for a broader portfolio of PrEP options. The novel HIV capsid inhibitor lenacapavir (LEN), when injected every six month, has shown potential in the recently completed clinical trials.
View Article and Find Full Text PDFClin Pharmacol Ther
September 2025
Molecular Brain Science Department, Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Tanenbaum Centre for Pharmacogenetics, Toronto, Ontario, Canada.
Pharmacogenomics enables the personalization of drug therapy by linking genetic variations to differences in drug metabolism, efficacy, and risk of adverse reactions. Genetic polymorphisms within cytochrome P450 (CYP) genes significantly affect enzyme activity, influencing drug plasma levels, responses, and safety. Central to this process is accurate genotype-to-phenotype translation, especially for the CYP enzyme family, which metabolizes 70-80% of clinically used drugs.
View Article and Find Full Text PDFSci Total Environ
September 2025
Graduate School of Agriculture, Meijo University, 1-501 Shiogamaguchi, Nagoya 468-8502, Japan. Electronic address:
Halogenated polycyclic aromatic hydrocarbons (HPAHs) including chlorinated (ClPAHs) and brominated (BrPAHs) variants, are emerging contaminants that are considered the next-generation candidates of persistent organic pollutants. Since there was a significant gap exists in understanding of partitioning dynamics of HPAHs between the particulate phase (PP) and dissolved phase (DP) considering many congeners, this study analyzed 75 congeners of parent PAHs and HPAHs (p/HPAHs) in the samples collected from 27 sites from 20 water bodies in Sri Lanka. The results revealed that the mean of the total concentrations of PAHs, ClPAHs, and BrPAHs in the aqueous phase (PP + DP) were 55.
View Article and Find Full Text PDFGigascience
January 2025
Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02215, USA.
Background: While benchmarks on short-read variant calling suggest a low error rate below 0.5%, they are only applicable to predefined confident regions. For a human sample without such regions, the error rate could be 10 times higher.
View Article and Find Full Text PDF