98%
921
2 minutes
20
Background: Discovery of microRNAs (miRNAs) relies on predictive models for characteristic features from miRNA precursors (pre-miRNAs). The short length of miRNA genes and the lack of pronounced sequence features complicate this task. To accommodate the peculiarities of plant and animal miRNAs systems, tools for both systems have evolved differently. However, these tools are biased towards the species for which they were primarily developed and, consequently, their predictive performance on data sets from other species of the same kingdom might be lower. While these biases are intrinsic to the species, their characterization can lead to computational approaches capable of diminishing their negative effect on the accuracy of pre-miRNAs predictive models. We investigate in this study how 45 predictive models induced for data sets from 45 species, distributed in eight subphyla/classes, perform when applied to a species different from the species used in its induction.
Results: Our computational experiments show that the separability of pre-miRNAs and pseudo pre-miRNAs instances is species-dependent and no feature set performs well for all species, even within the same subphylum/class. Mitigating this species dependency, we show that an ensemble of classifiers reduced the classification errors for all 45 species. As the ensemble members were obtained using meaningful, and yet computationally viable feature sets, the ensembles also have a lower computational cost than individual classifiers that rely on energy stability parameters, which are of prohibitive computational cost in large scale applications.
Conclusion: In this study, the combination of multiple pre-miRNAs feature sets and multiple learning biases enhanced the predictive accuracy of pre-miRNAs classifiers of 45 species. This is certainly a promising approach to be incorporated in miRNA discovery tools towards more accurate and less species-dependent tools. The material to reproduce the results from this paper can be downloaded from http://dx.doi.org/10.5281/zenodo.49754 .
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4884428 | PMC |
http://dx.doi.org/10.1186/s12859-016-1036-3 | DOI Listing |
Eur J Radiol
September 2025
Department of Radiology, Affiliated Hospital of Hebei University, Baoding 071000, China. Electronic address:
Purpose: The present study aimed to develop a noninvasive predictive framework that integrates clinical data, conventional radiomics, habitat imaging, and deep learning for the preoperative stratification of MGMT gene promoter methylation in glioma.
Materials And Methods: This retrospective study included 410 patients from the University of California, San Francisco, USA, and 102 patients from our hospital. Seven models were constructed using preoperative contrast-enhanced T1-weighted MRI with gadobenate dimeglumine as the contrast agent.
Biomol Biomed
September 2025
Department of Cardiology, Renmin Hospital of Wuhan University, Wuhan, China.
Coronary heart disease (CHD) is a leading cause of morbidity and mortality; patients with type 2 diabetes mellitus (T2DM) are at particularly high risk, highlighting the need for reliable biomarkers for early detection and risk stratification. We investigated whether combining the stress hyperglycemia ratio (SHR) and systemic inflammation response index (SIRI) improves CHD detection in T2DM. In this retrospective cohort of 943 T2DM patients undergoing coronary angiography, associations of SHR and SIRI with CHD were evaluated using multivariable logistic regression and restricted cubic splines; robustness was examined with subgroup and sensitivity analyses.
View Article and Find Full Text PDFJ Org Chem
September 2025
State Key Laboratory of Fine Chemicals, School of Chemical Engineering, Ocean and Life Sciences, Dalian University of Technology, Panjin 124221, P. R. China.
The Buchwald-Hartwig (B-H) reaction graph, a novel graph for deep learning models, is designed to simulate the interactions among multiple chemical components in the B-H reaction by representing each reactant as an individual node within a custom-designed reaction graph, thereby capturing both single-molecule and intermolecular relationship features. Trained on a high-throughput B-H reaction data set, B-H Reaction Graph Neural Network (BH-RGNN) achieves near-state-of-the-art performance with an score of 0.971 while maintaining low computational costs.
View Article and Find Full Text PDFJ Med Internet Res
September 2025
School of Advertising, Marketing and Public Relations, Faculty of Business and Law, Queensland University of Technology, Brisbane, Australia.
Background: Labor shortages in health care pose significant challenges to sustaining high-quality care for people with intellectual disabilities. Social robots show promise in supporting both people with intellectual disabilities and their health care professionals; yet, few are fully developed and embedded in productive care environments. Implementation of such technologies is inherently complex, requiring careful examination of facilitators and barriers influencing sustained use.
View Article and Find Full Text PDFJ Chem Inf Model
September 2025
Department of Chemistry, Delaware State University, Dover, Delaware 19901, United States.
The calculation of the highest occupied molecular orbital-lowest unoccupied molecular orbital (HOMO-LUMO) gap for chemical molecules is computationally intensive using quantum mechanics (QM) methods, while experimental determination is often costly and time-consuming. Machine Learning (ML) offers a cost-effective and rapid alternative, enabling efficient predictions of HOMO-LUMO gap values across large data sets without the need for extensive QM computations or experiments. ML models facilitate the screening of diverse molecules, providing valuable insights into complex chemical spaces and integrating seamlessly into high-throughput workflows to prioritize candidates for experimental validation.
View Article and Find Full Text PDF