98%
921
2 minutes
20
Integrative taxonomy is central to modern taxonomy and systematic biology, including behavior, niche preference, distribution, morphological analysis, and DNA barcoding. However, decades of use demonstrate that these methods can face challenges when used in isolation, for instance, potential misidentifications due to phenotypic plasticity for morphological methods, and incorrect identifications because of introgression, incomplete lineage sorting, and horizontal gene transfer for DNA barcoding. Although researchers have advocated the use of integrative taxonomy, few detailed algorithms have been proposed. Here, we develop a convolutional neural network method (morphology-molecule network [MMNet]) that integrates morphological and molecular data for species identification. The newly proposed method (MMNet) worked better than four currently available alternative methods when tested with 10 independent data sets representing varying genetic diversity from different taxa. High accuracies were achieved for all groups, including beetles (98.1% of 123 species), butterflies (98.8% of 24 species), fishes (96.3% of 214 species), and moths (96.4% of 150 total species). Further, MMNet demonstrated a high degree of accuracy ($>$98%) in four data sets including closely related species from the same genus. The average accuracy of two modest subgenomic (single nucleotide polymorphism) data sets, comprising eight putative subspecies respectively, is 90%. Additional tests show that the success rate of species identification under this method most strongly depends on the amount of training data, and is robust to sequence length and image size. Analyses on the contribution of different data types (image vs. gene) indicate that both morphological and genetic data are important to the model, and that genetic data contribute slightly more. The approaches developed here serve as a foundation for the future integration of multimodal information for integrative taxonomy, such as image, audio, video, 3D scanning, and biosensor data, to characterize organisms more comprehensively as a basis for improved investigation, monitoring, and conservation of biodiversity. [Convolutional neural network; deep learning; integrative taxonomy; single nucleotide polymorphism; species identification.].
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1093/sysbio/syab076 | DOI Listing |
Environ Monit Assess
September 2025
Institute of Earth Sciences, Southern Federal University, Rostov-On-Don, Russia.
Sustainable urban development requires actionable insights into the thermal consequences of land transformation. This study examines the impact of land use and land cover (LULC) changes on land surface temperature (LST) in Ho Chi Minh city, Vietnam, between 1998 and 2024. Using Google Earth Engine (GEE), three machine learning algorithms-random forest (RF), support vector machine (SVM), and classification and regression tree (CART)-were applied for LULC classification.
View Article and Find Full Text PDFEye (Lond)
September 2025
Genetics Laboratory, Metropolitan South Clinical Laboratory, Bellvitge University Hospital, Institut d'Investigació Biomèdica de Bellvitge (IDIBELL), L'Hospitalet de Llobregat, Barcelona, Spain.
Background: Inherited retinal dystrophies (IRDs) are a genetically heterogeneous group of conditions, with approximately 40% of cases remaining unresolved after initial genetic testing. This study aimed to assess the impact of a personalised genomic approach integrating whole-exome sequencing (WES) reanalysis, whole-genome sequencing (WGS), customised gene panels and functional assays to improve diagnostic yield in unresolved cases.
Subjects/methods: We retrospectively reviewed a cohort of 597 individuals with IRDs, including 525 probands and 72 affected relatives.
Med Eng Phys
October 2025
College of Basic Medical Science, Shanxi University of Chinese Medicine, Jinzhong, 030619, Shanxi, China.
Pulse diagnosis holds a pivotal role in traditional Chinese medicine (TCM) diagnostics, with pulse characteristics serving as one of the critical bases for its assessment. Accurate classification of these pulse pattern is paramount for the objectification of TCM. This study proposes an enhanced SMOTE approach to achieve data augmentation, followed by multi-domain feature extraction.
View Article and Find Full Text PDFProc Biol Sci
September 2025
Division of Integrative Anatomical Sciences, University of Southern California Keck School of Medicine, Los Angeles, CA, USA.
Red blood cell (RBC) size constrains the rate of diffusion of gases between (i) the environment and the capillary beds of the gas exchanger and (ii) the blood and organs. In birds, small RBCs with a high surface area to volume ratio permit a high O diffusion capacity and facilitate sustained, vigorous exercise. Unfortunately, our knowledge of archosaur cardiovascular evolution is incomplete without fossilized RBCs and blood vessels.
View Article and Find Full Text PDFEur J Pharm Biopharm
September 2025
Center of Drug Metabolism and Pharmacokinetics, China Pharmaceutical University, Nanjing 210009, China. Electronic address:
Prodrugs with enzymatic activation requirements, such as the weakly basic biopharmaceutical classification system (BCS) class IV compound abiraterone acetate (ABA), face considerable bioequivalence (BE) risks owing to their pH-dependent solubility, food effects, and variable intestinal hydrolysis. This study established clinically relevant dissolution specifications for ABA using biorelevant dissolution and physiologically based biopharmaceutics modelling (PBBM). Two dissolution methods, two-stage (gastrointestinal transfer simulation) and single-phase (biorelevant media), were evaluated under fasted and fed conditions.
View Article and Find Full Text PDF