Meta-Analysis of Genome-Wide Association Studies with Correlated Individuals: Application to the Hispanic Community Health Study/Study of Latinos (HCHS/SOL)

Tamar Sofer, John R. Shaffer, Mariaelisa Graff, Qibin Qi, Adrienne M. Stilp, Stephanie M. Gogarten, Kari E. North, Carmen R. Isasi, Cathy C. Laurie, Adam A. Szpiro

Research output: Contribution to journal › Article

8 Citations (Scopus)

Abstract

Investigators often meta-analyze multiple genome-wide association studies (GWASs) to increase the power to detect associations of single nucleotide polymorphisms (SNPs) with a trait. Meta-analysis is also performed within a single cohort that is stratified by, e.g., sex or ancestry group. Having correlated individuals among the strata may complicate meta-analyses, limit power, and inflate Type 1 error. For example, in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL), sources of correlation include genetic relatedness, shared household, and shared community. We propose a novel mixed-effect model for meta-analysis, “MetaCor,” which accounts for correlation between stratum-specific effect estimates. Simulations show that MetaCor controls inflation better than alternatives such as ignoring the correlation between the strata or analyzing all strata together in a “pooled” GWAS, especially with different minor allele frequencies (MAFs) between strata. We illustrate the benefits of MetaCor on two GWASs in the HCHS/SOL. Analysis of dental caries (tooth decay) stratified by ancestry group detected a genome-wide significant SNP (rs7791001, P-value = 3.66 × 10⁻⁸, compared to 4.67 × 10⁻⁷ in pooled), with different MAFs between strata. Stratified analysis of body mass index (BMI) by ancestry group and sex reduced overall inflation from λ_GC = 1.050 (pooled) to λ_GC = 1.028 (MetaCor). Furthermore, even after removing close relatives to obtain nearly uncorrelated strata, a naïve stratified analysis resulted in λ_GC = 1.058 compared to λ_GC = 1.027 for MetaCor.
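To make the two quantities in the abstract concrete, the following is a minimal sketch and not the MetaCor model itself. It shows a generic generalized least squares (GLS) fixed-effect combination of stratum-specific effect estimates whose covariance matrix is assumed to be known, plus the usual genomic-control inflation factor λ_GC computed from P-values. The function names `combine_correlated` and `lambda_gc` and the example numbers are hypothetical and do not come from the paper.

```python
import numpy as np
from scipy import stats


def combine_correlated(betas, cov):
    """GLS fixed-effect combination of stratum-specific estimates.

    betas: effect estimates, one per stratum.
    cov:   their covariance matrix (assumed known here); off-diagonal
           entries carry the correlation between strata.
    Returns (combined estimate, standard error, two-sided P-value).
    """
    betas = np.asarray(betas, dtype=float)
    cov = np.asarray(cov, dtype=float)
    ones = np.ones_like(betas)
    weights = np.linalg.solve(cov, ones)      # cov^{-1} 1
    var = 1.0 / (ones @ weights)              # 1 / (1' cov^{-1} 1)
    beta = var * (weights @ betas)
    se = np.sqrt(var)
    p = 2.0 * stats.norm.sf(abs(beta / se))
    return beta, se, p


def lambda_gc(pvalues):
    """Genomic-control inflation factor.

    Median observed 1-df chi-square statistic divided by the median
    of the 1-df chi-square distribution under the null.
    """
    chisq = stats.chi2.isf(np.asarray(pvalues, dtype=float), df=1)
    return np.median(chisq) / stats.chi2.ppf(0.5, df=1)


# Hypothetical two-stratum example: same estimates and variances,
# with and without a positive covariance between the strata.
betas = [0.1, 0.3]
independent = [[0.01, 0.00], [0.00, 0.04]]
correlated = [[0.01, 0.01], [0.01, 0.04]]

print(combine_correlated(betas, independent))
print(combine_correlated(betas, correlated))
```

With a diagonal covariance matrix the sketch reduces to ordinary inverse-variance weighting. In this toy example, setting the positive covariance to zero yields a smaller standard error than the one obtained with the covariance included, which is one way that ignoring correlation between strata can overstate significance.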

Original language: English (US)
Pages (from-to): 492-501
Number of pages: 10
Journal: Genetic Epidemiology
Volume: 40
Issue number: 6
DOI: 10.1002/gepi.21981
State: Published - Sep 1 2016

Keywords

  • effect heterogeneity
  • inflation
  • mixed models
  • stratified analysis

ASJC Scopus subject areas

  • Epidemiology
  • Medicine (all)
  • Genetics (clinical)

Cite this

Meta-Analysis of Genome-Wide Association Studies with Correlated Individuals: Application to the Hispanic Community Health Study/Study of Latinos (HCHS/SOL). / Sofer, Tamar; Shaffer, John R.; Graff, Mariaelisa; Qi, Qibin; Stilp, Adrienne M.; Gogarten, Stephanie M.; North, Kari E.; Isasi, Carmen R.; Laurie, Cathy C.; Szpiro, Adam A.

In: Genetic Epidemiology, Vol. 40, No. 6, 01.09.2016, p. 492-501.

@article{0e552e0d206c4f919aac05557039ad2f,
title = "Meta-Analysis of Genome-Wide Association Studies with Correlated Individuals: Application to the Hispanic Community Health Study/Study of Latinos (HCHS/SOL)",
keywords = "effect heterogeneity, inflation, mixed models, stratified analysis",
author = "Tamar Sofer and Shaffer, {John R.} and Mariaelisa Graff and Qibin Qi and Stilp, {Adrienne M.} and Gogarten, {Stephanie M.} and North, {Kari E.} and Isasi, {Carmen R.} and Laurie, {Cathy C.} and Szpiro, {Adam A.}",
year = "2016",
month = "9",
day = "1",
doi = "10.1002/gepi.21981",
language = "English (US)",
volume = "40",
pages = "492--501",
journal = "Genetic Epidemiology",
issn = "0741-0395",
publisher = "Wiley-Liss Inc.",
number = "6",

}

TY - JOUR

T1 - Meta-Analysis of Genome-Wide Association Studies with Correlated Individuals

T2 - Application to the Hispanic Community Health Study/Study of Latinos (HCHS/SOL)

AU - Sofer, Tamar

AU - Shaffer, John R.

AU - Graff, Mariaelisa

AU - Qi, Qibin

AU - Stilp, Adrienne M.

AU - Gogarten, Stephanie M.

AU - North, Kari E.

AU - Isasi, Carmen R.

AU - Laurie, Cathy C.

AU - Szpiro, Adam A.

PY - 2016/9/1

Y1 - 2016/9/1

KW - effect heterogeneity

KW - inflation

KW - mixed models

KW - stratified analysis

UR - http://www.scopus.com/inward/record.url?scp=84981165197&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84981165197&partnerID=8YFLogxK

U2 - 10.1002/gepi.21981

DO - 10.1002/gepi.21981

M3 - Article

C2 - 27256683

AN - SCOPUS:84981165197

VL - 40

SP - 492

EP - 501

JO - Genetic Epidemiology

JF - Genetic Epidemiology

SN - 0741-0395

IS - 6

ER -