98%
921
2 minutes
20
Accurate indel calling plays an important role in precision medicine. A benchmarking indel set is essential for thoroughly evaluating the indel calling performance of bioinformatics pipelines. A reference sample with a set of known-positive variants was developed in the FDA-led Sequencing Quality Control Phase 2 (SEQC2) project, but the known indels in the known-positive set were limited. This project sought to provide an enriched set of known indels that would be more translationally relevant by focusing on additional cancer related regions. A thorough manual review process completed by 42 reviewers, two advisors, and a judging panel of three researchers significantly enriched the known indel set by an additional 516 indels. The extended benchmarking indel set has a large range of variant allele frequencies (VAFs), with 87% of them having a VAF below 20% in reference Sample A. The reference Sample A and the indel set can be used for comprehensive benchmarking of indel calling across a wider range of VAF values in the lower range. Indel length was also variable, but the majority were under 10 base pairs (bps). Most of the indels were within coding regions, with the remainder in the gene regulatory regions. Although high confidence can be derived from the robust study design and meticulous human review, this extensive indel set has not undergone orthogonal validation. The extended benchmarking indel set, along with the indels in the previously published known-positive set, was the truth set used to benchmark indel calling pipelines in a community challenge hosted on the precisionFDA platform. This benchmarking indel set and reference samples can be utilized for a comprehensive evaluation of indel calling pipelines. Additionally, the insights and solutions obtained during the manual review process can aid in improving the performance of these pipelines.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10963753 | PMC |
http://dx.doi.org/10.1038/s41598-024-57439-7 | DOI Listing |
J Therm Biol
September 2025
Hainan Key Laboratory of Tropical Animal Reproduction & Breeding and Epidemic Disease Research, School of Tropical Agriculture and Forestry, Hainan University, Haikou, 570228, China. Electronic address:
In light of the challenges posed by global climate change, the environmental adaptability of organisms is becoming increasingly important. The Wuzhishan (WZS) pig, tolerant to high heat and humidity, is an ideal model for genomic study. By characterizing its genome and assessing its genetic diversity and runs of homozygosity (ROH), we can gain insights into its current conservation status and genomic architecture.
View Article and Find Full Text PDFNat Methods
August 2025
PacBio, Menlo Park, CA, USA.
Recent advances in genome sequencing have improved variant calling in complex regions of the human genome. However, it is difficult to quantify variant calling performance because existing standards often focus on specificity, neglecting completeness in difficult-to-analyze regions. To create a more comprehensive truth set, we used Mendelian inheritance in a large pedigree (CEPH-1463) to filter variants across PacBio high-fidelity (HiFi), Illumina and Oxford Nanopore Technologies platforms.
View Article and Find Full Text PDFFood Chem (Oxf)
December 2025
IGA Technology Services S.R.l., via Jacopo Linussio 51, I-33100 Udine, Italy.
In this paper, we have evaluated a targeted high-throughput massive parallel sequencing approach for detecting single nucleotide mutations or small genomic changes generated by new genomic techniques (NGT). We used unique molecular identifiers (UMIs) for the quantification of the mutant alleles and duplex sequencing to confirm a mutation on both strands to avoid polymerase chain reaction (PCR) artefacts or sequencing miss-calls. We tested the approach in blinded analyses on a set of mixed NGT-modified tomato lines and identified each single nucleotide mutation or small insert/deletion (InDel) down to a 0.
View Article and Find Full Text PDFAnn Hum Genet
July 2025
Institute of Molecular Genetics and Genetic Engineering, University of Belgrade, Belgrade, Serbia.
Background: Neoadjuvant chemoradiotherapy (nCRT) is essential for treating locally advanced rectal cancer (LARC), however response to nCRT varies, and reliable predictors are lacking.
Methods: This study used whole exome sequencing analysis to investigate genetic differences between tumors highly responsive and non-responsive to nCRT. Five patients with good response and two patients without response to nCRT were used as a discovery set.
Int J Gynecol Cancer
September 2025
Sun Yat-sen University Cancer Center, Guangdong Provincial Clinical Research Center for Cancer, State Key Laboratory of Oncology in South China, Department of Gynecological Oncology, Guangzhou, China. Electronic address:
Objective: This study aimed to develop a high-performance prognostic model to predict poly(ADP-ribose) polymerase inhibitor (PARPi) treatment outcomes in patients with ovarian cancer.
Methods: This was a retrospective cohort study. Inclusion criteria were high-grade serous or endometroid carcinoma, clear cell carcinoma with platinum-sensitive disease (>6 months without progression from the end of platinum) or platinum-responsive disease eligible for front-line PARPi therapy.