98%
921
2 minutes
20
Background: Breed identification plays an important role in conserving indigenous breeds, managing genetic resources, and developing effective breeding strategies. However, researches on breed identification in livestock mainly focused on purebreds, and they yielded lower predict accuracy in hybrid. In this study, we presented a Multi-Layer Perceptron (MLP) model with multi-output regression framework specifically designed for genomic breed composition prediction of purebred and hybrid in pigs.
Results: We utilized a total of 8,199 pigs from breeding farms in eight provinces in China, comprising Yorkshire, Landrace, Duroc and hybrids of Yorkshire × Landrace. All the animals were genotyped with 1K, 50K and 100K SNP chips. Comparing with random forest (RF), support vector regression (SVR) and Admixture, our results from five replicates of fivefold cross validation demonstrated that MLP achieved a breed identification accuracy of 100% for both hybrid and purebreds in 50K and 100K SNP chips, SVR performed comparable with MLP, they both outperformed RF and Admixture. In the independent testing, MLP yielded accuracy of 100% for all three pure breeds and hybrid across all SNP chips and panel, while SVR yielded 0.026%-0.121% lower accuracy than MLP. Compared with classification-based framework, the new strategy of multi-output regression framework in this study was helpful to improve the predict accuracy. MLP, RF and SVR, achieved consistent improvements across all six SNP chips/panel, especially in hybrid identification. Our results showed the determination threshold for purebred had different effects, SVR, RF and Admixture were very sensitive to threshold values, their optimal threshold fluctuated in different scenarios, while MLP kept optimal threshold 0.75 in all cases. The threshold of 0.65-0.75 is ideal for accurate breed identification. Among different density of SNP chips, the 1K SNP chip was most cost-effective as yielding 100% accuracy with enlarging training set. Hybrid individuals in the training set were useful for both purebred and hybrid identification.
Conclusions: Our new MLP strategy demonstrated its high accuracy and robust applicability across low-, medium-, and high-density SNP chips. Multi-output regression framework could universally enhance prediction accuracy for ML methods. Our new strategy is also helpful for breed identification in other livestock.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12351870 | PMC |
http://dx.doi.org/10.1186/s40104-025-01249-y | DOI Listing |
J Anim Sci
August 2025
Department of Animal Science, Michigan State University, 474 S Shaw Ln, East Lansing, MI 48824, USA.
Genomic selection has been used in animal breeding for c. 15 years and continues to be an important tool in predicting genetic merit in livestock populations. The dairy cattle industry was the first to adopt genomic selection, initially based on some 50K SNP arrays for thousands of animals.
View Article and Find Full Text PDFJ Anim Sci Biotechnol
August 2025
State Key Laboratory of Animal Biotech Breeding, Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture and Rural Affairs, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China. xding@cau
Background: Breed identification plays an important role in conserving indigenous breeds, managing genetic resources, and developing effective breeding strategies. However, researches on breed identification in livestock mainly focused on purebreds, and they yielded lower predict accuracy in hybrid. In this study, we presented a Multi-Layer Perceptron (MLP) model with multi-output regression framework specifically designed for genomic breed composition prediction of purebred and hybrid in pigs.
View Article and Find Full Text PDFPlants (Basel)
July 2025
Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Quebec City, QC G1V 0A6, Canada.
Efficient and consistent DNA extraction is crucial for genotyping but often hindered by the limitations of traditional manual processes, which are labour-intensive, error-prone, and costly. We introduce a semi-automated, robotic-assisted DNA extraction (RoboCTAB) tailored for large-scale plant genotyping, leveraging advanced yet affordable liquid-handling robotic systems. The protocol/workflow integrates a CTAB extraction protocol specifically adapted for a robotic liquid-handling system, making it compatible with high-throughput genotyping techniques such as SNP genotyping and sequencing.
View Article and Find Full Text PDFClin Chim Acta
August 2025
The Precision Medical Center, The Second Hospital of Nanjing, Nanjing University of Chinese Medicine, Nanjing 210003, China. Electronic address:
Viral pandemics pose severe threats to human health and societal stability, exemplified by the COVID-19 outbreak in 2019. Conventional viral detection methods such as Polymerase chain reaction (PCR) typically require trained personnel, expensive equipment, and 2-4 h for processing. Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/CRISPR-associated protein (Cas) and Argonaute (Ago) system-based detection methods achieve attomolar sensitivity or single-copy detection limits with single-base specificity within 1 h, without requiring complex or costly instruments.
View Article and Find Full Text PDFTheor Appl Genet
July 2025
Kansas State University, Manhattan, KS, USA.
Adult-plant resistance to yellow rust, caused by Puccinia striiformis f. sp. tritici, is a durable type of resistance in wheat (Triticum aestivum L.
View Article and Find Full Text PDF