Many problems of modern genetics and functional genomics require the assessment of functional effects of sequence variants, including gene expression changes. Machine learning is considered to be a promising approach for solving this task, but its practical applications remain a challenge due to the insufficient volume and diversity of training data. A promising source of valuable data is a saturation mutagenesis massively parallel reporter assay, which quantitatively measures changes in transcription activity caused by sequence variants. Here, we explore the computational predictions of the effects of individual single-nucleotide variants on gene transcription measured in the massively parallel reporter assays, based on the data from the recent "Regulation Saturation" Critical Assessment of Genome Interpretation challenge. We show that the estimated prediction quality strongly depends on the structure of the training and validation data. Particularly, training on the sequence segments located next to the validation data results in the "information leakage" caused by the local context. This information leakage allows reproducing the prediction quality of the best CAGI challenge submissions with a fairly simple machine learning approach, and even obtaining notably better-than-random predictions using irrelevant genomic regions. Validation scenarios preventing such information leakage dramatically reduce the measured prediction quality. The performance at independent regulatory regions entirely excluded from the training set appears to be much lower than needed for practical applications, and even the performance estimation will become reliable only in the future with richer data from multiple reporters. The source code and data are available at https://bitbucket.org/autosomeru_cagi2018/cagi2018_regsat and https://genomeinterpretation.org/content/expression-variants.
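The contrast between leak-prone and leak-free validation described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the region names (`regA`, `regB`, `regC`), the toy coordinates, and all function names are hypothetical, and variants are reduced to `(region, position)` pairs. The sketch only shows that a position-agnostic random split leaves test variants directly adjacent to training variants, whereas holding out a whole regulatory region leaves no shared local context.

```python
import random
from collections import defaultdict


def random_split(variants, test_fraction=0.2, seed=0):
    """Shuffle variants and split them, ignoring genomic position (leak-prone)."""
    rng = random.Random(seed)
    shuffled = list(variants)
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def leave_region_out(variants, held_out_region):
    """Hold out one entire region so that train and test share no local context."""
    train = [v for v in variants if v[0] != held_out_region]
    test = [v for v in variants if v[0] == held_out_region]
    return train, test


def min_neighbour_distance(train, test):
    """Smallest distance (bp) between a test variant and a training variant
    in the same region; None if train and test share no region."""
    train_positions = defaultdict(list)
    for region, pos in train:
        train_positions[region].append(pos)
    best = None
    for region, pos in test:
        for train_pos in train_positions.get(region, []):
            distance = abs(pos - train_pos)
            if best is None or distance < best:
                best = distance
    return best


# Hypothetical toy data: three regions, one variant per position 0..99.
variants = [(region, pos) for region in ("regA", "regB", "regC") for pos in range(100)]

leaky_train, leaky_test = random_split(variants)
clean_train, clean_test = leave_region_out(variants, "regC")

print(min_neighbour_distance(leaky_train, leaky_test))  # adjacent positions are shared
print(min_neighbour_distance(clean_train, clean_test))  # no shared region
```

In this toy setting the random split always places some test variant next to a training variant, which is the kind of local-context overlap the abstract identifies as information leakage. The leave-region-out split removes that overlap at the cost of a much harder prediction task.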
Download full-text PDF | Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6834773 | PMC |
http://dx.doi.org/10.3389/fgene.2019.01078 | DOI Listing |
Clin Appl Thromb Hemost
September 2025
Pediatric Hematology Laboratory, Division of Hematology/Oncology, Department of Pediatrics, The Seventh Affiliated Hospital of Sun Yat-Sen University, Shenzhen, Guangdong, China.
Hemophilia, an X-linked monogenic disorder, arises from mutations in the F8 or F9 genes, which encode clotting factor VIII (FVIII) or clotting factor IX (FIX), respectively. As a prominent hereditary coagulation disorder, hemophilia is clinically manifested by spontaneous hemorrhagic episodes. Severe cases may progress to complications such as stroke and arthropathy, significantly compromising patients' quality of life.
Eur Geriatr Med
September 2025
School of Public Health Sciences, University of Waterloo, Waterloo, Canada.
Purpose: Sleep disturbance is prevalent in long-term care facilities (LTCFs), yet there is limited understanding of individual factors predicting changes in sleep within these populations. Our objective was to determine predictors of sleep disturbance in LTCFs and investigate variation in prevalence across facilities in two Canadian provinces: New Brunswick and Saskatchewan.
Method: This retrospective longitudinal cohort study used interRAI comprehensive health assessment data from 2016 to 2021, encompassing 21,394 older adults aged ≥ 65 years across 228 LTCFs.
Acta Neurochir (Wien)
September 2025
Department of Neurosurgery, Medical University of Gdańsk, Gdańsk, Poland.
Purpose: Moyamoya disease (MMD) is a chronic cerebrovascular disorder characterized by progressive arterial stenosis and fragile collateral formation, elevating stroke risk. Revascularization is the standard treatment, yet up to 27% of patients experience ischemic events within a year due to bypass insufficiency. While digital subtraction angiography (DSA) remains the gold standard for assessing bypass function, it is invasive and time-consuming.
Graefes Arch Clin Exp Ophthalmol
September 2025
Department of Ophthalmology, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College Hospital, No. 1 Shuaifuyuan Wangfujing Dongcheng District, China, 100730, Beijing.
Purpose: To evaluate the predictive value of the preoperative orientation and offset of angle alpha (chord alpha) and angle kappa (chord mu) for visual outcomes in patients who underwent trifocal intraocular lens (IOL) implantation.
Methods: Patient records of eyes that underwent AT LISA tri 839MP implantation were retrospectively collected and grouped according to the preoperative offset and orientations of chord alpha and chord mu. The two-dimensional location of each angle was described by the interaction of the orientation and offset.
J Agric Food Chem
September 2025
Guangdong Provincial Key Laboratory of Food Quality and Safety/Nation-Local Joint Engineering Research Center for Machining and Safety of Livestock and Poultry Products, South China Agricultural University, Guangzhou 510642, China.
Adulterated yohimbine (YHB) in food poses a risk to public health, making it imperative to develop fast and sensitive detection methods. In this study, computational-chemistry-based prediction was employed to design YHB haptens for generating the high-affinity monoclonal antibody Yohi-4A7, which exhibited an optimal half-inhibitory concentration (IC50) of 1.69 ng/mL against YHB.