Improving Enhancer Identification with a Multi-Classifier Stacked Ensemble Model.

J Mol Biol

Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, South Korea; Advances Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, South Korea. Electronic address:

Published: December 2023


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Enhancers are DNA regions that are responsible for controlling the expression of genes. Enhancers are usually found upstream or downstream of a gene, or even inside a gene's intron region, but are normally located at a distant location from the genes they control. By integrating experimental and computational approaches, it is possible to uncover enhancers within DNA sequences, which possess regulatory properties. Experimental techniques such as ChIP-seq and ATAC-seq can identify genomic regions that are associated with transcription factors or accessible to regulatory proteins. On the other hand, computational techniques can predict enhancers based on sequence features and epigenetic modifications. In our study, we have developed a multi-classifier stacked ensemble (MCSE-enhancer) model that can accurately identify enhancers. We utilized feature descriptors from various physiochemical properties as input for our six baseline classifiers and built a stacked classifier, which outperformed previous enhancer classification techniques in terms of accuracy, specificity, sensitivity, and Mathew's correlation coefficient. Our model achieved an accuracy of 81.5%, representing a 2-3% improvement over existing models.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jmb.2023.168314DOI Listing

Publication Analysis

Top Keywords

multi-classifier stacked
8
stacked ensemble
8
enhancers dna
8
enhancers
5
improving enhancer
4
enhancer identification
4
identification multi-classifier
4
ensemble model
4
model enhancers
4
dna regions
4

Similar Publications

Steady-state visually evoked potential is one of the active explorations in the brain-computer interface research. Electroencephalogram based brain computer interface studies have been widely applied to perceive solutions for real-world problems in the healthcare domain. The classification of externally bestowed visual stimuli of different frequencies on a human was experimented to identify the need of paralytic people.

View Article and Find Full Text PDF
Article Synopsis
  • Major depressive disorder (MDD) is complex and requires precise diagnosis, and resting-state functional MRI (rsfMRI) is increasingly used for this purpose.
  • A new machine learning method called MFMC has been developed that combines multiple features and classifiers to improve MDD discrimination accuracy to 96.9%, relying on data from a large and diverse group of subjects.
  • The method identified 13 key feature values linked to specific brain regions that significantly aid in classification, potentially serving as useful biomarkers for diagnosing and predicting MDD in clinical settings.
View Article and Find Full Text PDF

Improving Enhancer Identification with a Multi-Classifier Stacked Ensemble Model.

J Mol Biol

December 2023

Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, South Korea; Advances Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, South Korea. Electronic address:

Enhancers are DNA regions that are responsible for controlling the expression of genes. Enhancers are usually found upstream or downstream of a gene, or even inside a gene's intron region, but are normally located at a distant location from the genes they control. By integrating experimental and computational approaches, it is possible to uncover enhancers within DNA sequences, which possess regulatory properties.

View Article and Find Full Text PDF

In this paper, a new supervised classification paradigm, called classifier subset selection for stacked generalization (CSS stacking), is presented to deal with speech emotion recognition. The new approach consists of an improvement of a bi-level multi-classifier system known as stacking generalization by means of an integration of an estimation of distribution algorithm (EDA) in the first layer to select the optimal subset from the standard base classifiers. The good performance of the proposed new paradigm was demonstrated over different configurations and datasets.

View Article and Find Full Text PDF