Citations

20

Article Abstract

In addition to their common use in studying gene expression, RNA-seq data accumulated over the last 10 years are a yet-unexploited resource of SNPs in numerous individuals from different populations. SNP detection by RNA-seq is particularly interesting for livestock species since whole genome sequencing is expensive and exome sequencing tools are unavailable. These SNPs detected in expressed regions can be used to characterize variants affecting protein functions, and to study cis-regulated genes by analyzing allele-specific expression (ASE) in the tissue of interest. However, gene expression can be highly variable, and filters for SNP detection using the popular GATK toolkit are not yet standardized, making SNP detection and genotype calling by RNA-seq a challenging endeavor. We compared SNP calling results using GATK suggested filters on two chicken populations for which both RNA-seq and DNA-seq data were available for the same samples of the same tissue. We showed, in expressed regions, an RNA-seq precision of 91% (SNPs detected by RNA-seq and shared by DNA-seq) and we characterized the remaining 9% of SNPs. We then studied the genotype (GT) obtained by RNA-seq and the impact of two factors (GT call-rate and read number per GT) on the concordance of GT with DNA-seq; we proposed thresholds for them leading to a 95% concordance. Applying these thresholds to 767 multi-tissue RNA-seq of 382 birds of 11 chicken populations, we found 9.5 M SNPs in total, of which ∼550,000 SNPs per tissue and population had a reliable GT (call rate ≥ 50%) and, among them, ∼340,000 had a MAF ≥ 10%.
We showed that such RNA-seq data from one tissue can be used to (i) detect SNPs with a strong predicted impact on proteins, despite their scarcity in each population (16,307 SIFT deleterious missenses and 590 stop-gained), (ii) study, on a large scale, cis-regulation of gene expression, with ∼81% of protein-coding and 68% of long non-coding genes (TPM ≥ 1) that can be analyzed for ASE, and with ∼29% of them that were cis-regulated, and (iii) analyze population genetics using such SNPs located in expressed regions. This work shows that RNA-seq data can be used with good confidence to detect SNPs and associated GT within various populations and to use them for different analyses, as in GTEx studies.
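The filtering and evaluation logic described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' GATK-based pipeline: the function names are hypothetical, genotypes are simplified to alternate-allele counts (0, 1, 2, or `None` when missing), and the `min_reads` default of 5 is an assumed placeholder because the abstract does not state the read-number threshold. Only the call rate ≥ 50% and MAF ≥ 10% cut-offs come from the text.

```python
from typing import Hashable, List, Optional, Sequence, Set

# A genotype is the number of alternate alleles (0, 1 or 2), or None if missing.
Genotype = Optional[int]


def mask_low_depth(genotypes: Sequence[Genotype], depths: Sequence[int],
                   min_reads: int) -> List[Genotype]:
    """Set to missing every genotype supported by fewer than min_reads reads."""
    return [gt if gt is not None and dp >= min_reads else None
            for gt, dp in zip(genotypes, depths)]


def call_rate(genotypes: Sequence[Genotype]) -> float:
    """Fraction of samples with a non-missing genotype."""
    if not genotypes:
        return 0.0
    return sum(gt is not None for gt in genotypes) / len(genotypes)


def minor_allele_frequency(genotypes: Sequence[Genotype]) -> float:
    """Minor allele frequency computed over called genotypes only."""
    called = [gt for gt in genotypes if gt is not None]
    if not called:
        return 0.0
    alt_freq = sum(called) / (2 * len(called))
    return min(alt_freq, 1 - alt_freq)


def keep_snp(genotypes: Sequence[Genotype], depths: Sequence[int],
             min_reads: int = 5,          # assumed value, not from the abstract
             min_call_rate: float = 0.5,  # call rate >= 50% (from the abstract)
             min_maf: float = 0.10) -> bool:  # MAF >= 10% (from the abstract)
    """Apply the depth mask, then the call-rate and MAF thresholds."""
    masked = mask_low_depth(genotypes, depths, min_reads)
    return (call_rate(masked) >= min_call_rate
            and minor_allele_frequency(masked) >= min_maf)


def precision(rna_snps: Set[Hashable], dna_snps: Set[Hashable]) -> float:
    """Share of RNA-seq SNPs that are also detected by DNA-seq."""
    if not rna_snps:
        return 0.0
    return len(rna_snps & dna_snps) / len(rna_snps)


def genotype_concordance(rna_gts: Sequence[Genotype],
                         dna_gts: Sequence[Genotype]) -> float:
    """Share of identical genotypes among samples called in both assays."""
    pairs = [(r, d) for r, d in zip(rna_gts, dna_gts)
             if r is not None and d is not None]
    if not pairs:
        return 0.0
    return sum(r == d for r, d in pairs) / len(pairs)
```

In this sketch, SNPs are identified by any hashable key such as a `(chromosome, position)` tuple, and the genotype concordance ignores samples missing in either assay, which is one reasonable convention among several.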

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8273700
DOI: http://dx.doi.org/10.3389/fgene.2021.655707

Publication Analysis

Top Keywords

rna-seq data: 16
snp detection: 16
gene expression: 12
expressed regions: 12
rna-seq: 11
snps: 9
detection genotype: 8
genotype calling: 8
allele-specific expression: 8
livestock species: 8

Similar Publications

Purpose: Autoimmune thyroiditis (AIT) is the most common organ-specific autoimmune disease, and its pathogenesis is closely related to the inflammatory microenvironment driven by immune cell penetration. The role of the newly proposed concept of PANoptosis in immune-related diseases is gradually being revealed. However, there is currently a lack of reports on PANoptosis in AIT.

Background And Aim: Granulosa cells (GCs) are crucial mediators of follicular development and oocyte competence in goats, with their gene expression profiles serving as potential biomarkers of fertility. However, the lack of a standardized, quantifiable method to assess GC quality using transcriptomic data has limited the translation of such findings into reproductive applications. This study aimed to develop a hybrid deep learning model integrating one-dimensional convolutional neural networks (1DCNNs) and gated recurrent units (GRUs) to classify GCs as fertility-supporting (FS) or non-fertility-supporting (NFS) using single-cell RNA sequencing (scRNA-seq) data.

Analysis of physiological characteristics and gene co-expression networks in roots under low-temperature stress.

Front Plant Sci

August 2025

Branch of Animal Husbandry and Veterinary of Heilongjiang Academy of Agricultural Sciences, Qiqihar, Heilongjiang, China.

is the most widely cultivated high-protein forage crop globally. However, its cultivation in high-latitude and cold regions of China is significantly hindered by low-temperature stress, particularly impacting the root system, the primary functional tissue crucial for winter survival. The physiological and molecular mechanisms underlying the root system's adaptation and tolerance to low temperatures remain poorly understood.

Background: Most RNA-seq datasets harbor genes with extreme expression levels in some samples. Such extreme outliers are usually treated as technical errors and are removed from the data before further statistical analysis. Here we focus on the patterns of such outlier gene expression to investigate whether they provide insights into the underlying biology.

Neurotoxic Effects of 4-Hydroxy-4'-Isopropoxydiphenylsulfone Exposure on Zebrafish Embryos.

Environ Pollut

September 2025

Zhejiang Collaborative Innovation Center for Full-Process Monitoring and Green Governance of Emerging Contaminants, Key Laboratory of Pollution Exposure and Health Intervention of Zhejiang Province, Interdisciplinary Research Academy, Zhejiang Shuren University, Hangzhou, 310015, China.

The central nervous system (CNS) is particularly vulnerable to endocrine-disrupting chemicals, especially bisphenol analogues. Bisphenol A (BPA), a widely studied compound, has been associated with various neurological disorders, leading to restrictions on its use and the subsequent adoption of alternative chemicals such as 4-hydroxy-4'-isopropoxydiphenylsulfone (BPSIP). However, concerns regarding the potential neurotoxicity of BPSIP have emerged.