Effective discovery of rare variants by pooled target capture sequencing: A comparative analysis with individually indexed target capture sequencing

Seungjin Ryu, Jeehae Han, Trina M. Norden-Krichmar, Nicholas J. Schork, Yousin Suh

Research output: Contribution to journal › Article

1 Citation (Scopus)

Abstract

Identification of all genetic variants associated with complex traits is one of the most important goals in modern human genetics. Genome-wide association studies (GWAS) have been successfully applied to identify common variants, which thus far explain only a small portion of heritability. Interest in rare variants has been growing as an answer to this missing heritability. While next-generation sequencing allows detection of rare variants, its cost is still prohibitively high for sequencing the large number of human DNA samples required for rare variant association studies. In this study, we evaluated the sensitivity and specificity of sequencing pooled DNA samples of multiple individuals (Pool-seq) as a cost-effective and robust approach for rare variant discovery. We compared Pool-seq with individual-seq of indexed target capture of up to 960 genes in ∼1000 individuals, followed by independent genotyping validation studies. We found that Pool-seq was as effective and accurate as individual-seq in detecting rare variants and estimating their minor allele frequencies (MAFs). Our results suggest that Pool-seq can be used as an efficient and cost-effective method for discovery of rare variants in population-based sequencing studies in individual laboratories.

Original language: English (US)
Pages (from-to): 24-31
Number of pages: 8
Journal: Mutation Research - Fundamental and Molecular Mechanisms of Mutagenesis
Volume: 809
DOI: 10.1016/j.mrfmmm.2018.03.007
State: Published - May 1 2018

Fingerprint

Costs and Cost Analysis
Validation Studies
Genome-Wide Association Study
Medical Genetics
DNA Sequence Analysis
Gene Frequency
Sensitivity and Specificity
DNA
Population
Genes

Keywords

  • Genetic variant
  • Individually indexed target capture sequencing
  • Pooled target capture sequencing
  • Rare variant

ASJC Scopus subject areas

  • Molecular Biology
  • Genetics
  • Health, Toxicology and Mutagenesis

Cite this

Effective discovery of rare variants by pooled target capture sequencing: A comparative analysis with individually indexed target capture sequencing. / Ryu, Seungjin; Han, Jeehae; Norden-Krichmar, Trina M.; Schork, Nicholas J.; Suh, Yousin.

In: Mutation Research - Fundamental and Molecular Mechanisms of Mutagenesis, Vol. 809, 01.05.2018, p. 24-31.

@article{02986dd02f1b49f38f31ae69caddd958,
title = "Effective discovery of rare variants by pooled target capture sequencing: A comparative analysis with individually indexed target capture sequencing",
abstract = "Identification of all genetic variants associated with complex traits is one of the most important goals in modern human genetics. Genome-wide association studies (GWAS) have been successfully applied to identify common variants, which thus far explain only a small portion of heritability. Interest in rare variants has been growing as an answer to this missing heritability. While next-generation sequencing allows detection of rare variants, its cost is still prohibitively high for sequencing the large number of human DNA samples required for rare variant association studies. In this study, we evaluated the sensitivity and specificity of sequencing pooled DNA samples of multiple individuals (Pool-seq) as a cost-effective and robust approach for rare variant discovery. We compared Pool-seq with individual-seq of indexed target capture of up to 960 genes in ∼1000 individuals, followed by independent genotyping validation studies. We found that Pool-seq was as effective and accurate as individual-seq in detecting rare variants and estimating their minor allele frequencies (MAFs). Our results suggest that Pool-seq can be used as an efficient and cost-effective method for discovery of rare variants in population-based sequencing studies in individual laboratories.",
keywords = "Genetic variant, Individually indexed target capture sequencing, Pooled target capture sequencing, Rare variant",
author = "Seungjin Ryu and Jeehae Han and Norden-Krichmar, {Trina M.} and Schork, {Nicholas J.} and Yousin Suh",
year = "2018",
month = "5",
day = "1",
doi = "10.1016/j.mrfmmm.2018.03.007",
language = "English (US)",
volume = "809",
pages = "24--31",
journal = "Mutation Research - Fundamental and Molecular Mechanisms of Mutagenesis",
issn = "0027-5107",
publisher = "Elsevier",

}

TY - JOUR

T1 - Effective discovery of rare variants by pooled target capture sequencing

T2 - A comparative analysis with individually indexed target capture sequencing

AU - Ryu, Seungjin

AU - Han, Jeehae

AU - Norden-Krichmar, Trina M.

AU - Schork, Nicholas J.

AU - Suh, Yousin

PY - 2018/5/1

Y1 - 2018/5/1

N2 - Identification of all genetic variants associated with complex traits is one of the most important goals in modern human genetics. Genome-wide association studies (GWAS) have been successfully applied to identify common variants, which thus far explain only a small portion of heritability. Interest in rare variants has been growing as an answer to this missing heritability. While next-generation sequencing allows detection of rare variants, its cost is still prohibitively high for sequencing the large number of human DNA samples required for rare variant association studies. In this study, we evaluated the sensitivity and specificity of sequencing pooled DNA samples of multiple individuals (Pool-seq) as a cost-effective and robust approach for rare variant discovery. We compared Pool-seq with individual-seq of indexed target capture of up to 960 genes in ∼1000 individuals, followed by independent genotyping validation studies. We found that Pool-seq was as effective and accurate as individual-seq in detecting rare variants and estimating their minor allele frequencies (MAFs). Our results suggest that Pool-seq can be used as an efficient and cost-effective method for discovery of rare variants in population-based sequencing studies in individual laboratories.

KW - Genetic variant

KW - Individually indexed target capture sequencing

KW - Pooled target capture sequencing

KW - Rare variant

UR - http://www.scopus.com/inward/record.url?scp=85045459677&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85045459677&partnerID=8YFLogxK

U2 - 10.1016/j.mrfmmm.2018.03.007

DO - 10.1016/j.mrfmmm.2018.03.007

M3 - Article

C2 - 29677560

AN - SCOPUS:85045459677

VL - 809

SP - 24

EP - 31

JO - Mutation Research

JF - Mutation Research

SN - 0027-5107

ER -