AP-SKAT: highly-efficient genome-wide rare variant association test.

BMC Genomics

Department of Integrative Genomics, Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi, Japan.

Published: September 2016


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Genome-wide association studies have revealed associations between single-nucleotide polymorphisms (SNPs) and phenotypes such as disease symptoms and drug tolerance. To address the small sample size for rare variants, association studies tend to group gene or pathway level variants and evaluate the effect on the set of variants. One of such strategies, known as the sequential kernel association test (SKAT), is a widely used collapsing method. However, the reported p-values from SKAT tend to be biased because the asymptotic property of the statistic is used to calculate the p-value. Although this bias can be corrected by applying permutation procedures for the test statistics, the computational cost of obtaining p-values with high resolution is prohibitive.

Results: To address this problem, we devise an adaptive SKAT procedure termed AP-SKAT that efficiently classifies significant SNP sets and ranks them according to the permuted p-values. Our procedure adaptively stops the permutation test when the significance level is outside some confidence interval of the estimated p-value for a binomial distribution. To evaluate the performance, we first compare the power and sample size calculation and the type I error rates estimate of SKAT, SKAT-O, and the proposed procedure using genotype data in the SKAT R package and from 1000 Genome Project. Through computational experiments using whole genome sequencing and SNP array data, we show that our proposed procedure is highly efficient and has comparable accuracy to the standard procedure.

Conclusions: For several types of genetic data, the developed procedure could achieve competitive power and sample size under small and large sample size conditions with controlling considerable type I error rates, and estimate p-values of significant SNP sets that are consistent with those estimated by the standard permutation test within a realistic time. This demonstrates that the procedure is sufficiently powerful for recent whole genome sequencing and SNP array data with increasing numbers of phenotypes. Additionally, this procedure can be used in other association tests by employing alternative methods to calculate the statistics.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5031335PMC
http://dx.doi.org/10.1186/s12864-016-3094-3DOI Listing

Publication Analysis

Top Keywords

sample size
16
association test
8
association studies
8
snp sets
8
permutation test
8
power sample
8
type error
8
error rates
8
rates estimate
8
proposed procedure
8

Similar Publications

Background: Authentic leadership in nursing is associated with positive nurse outcomes globally. However, the last published systematic review, in 2018, showed no evidence from the United States and little evidence of effect on patient or health system outcomes.

Objectives: To systematically review, appraise, and synthesize evidence focused on the effect of authentic leadership on nurse, patient, and system outcomes in acute care hospitals in the U.

View Article and Find Full Text PDF

Background: Autism spectrum disorder (ASD) is a complex neurodevelopmental disorder lacking objective biomarkers for early diagnosis. DNA methylation is a promising epigenetic marker, and machine learning offers a data-driven classification approach. However, few studies have examined whole-blood, genome-wide DNA methylation profiles for ASD diagnosis in school-aged children.

View Article and Find Full Text PDF

Background: Anal squamous cell cancer incidence has risen 2.2% each year over the past decade. Current screening includes anal cytology and high-resolution anoscopy but is burdened with sampling error and patient discomfort.

View Article and Find Full Text PDF