TY - JOUR
T1 - Social and scientific motivations to move beyond groups in allele frequencies
T2 - The TOPMed experience
AU - Nelson, Sarah C.
AU - Gogarten, Stephanie M.
AU - Fullerton, Stephanie M.
AU - Isasi, Carmen R.
AU - Mitchell, Braxton D.
AU - North, Kari E.
AU - Rich, Stephen S.
AU - Taylor, Matthew R.G.
AU - Zöllner, Sebastian
AU - Sofer, Tamar
N1 - Funding Information:
Molecular data for the Trans-Omics for Precision Medicine (TOPMed) program was supported by the National Heart, Lung, and Blood Institute (NHLBI). Core support including centralized genomic read mapping and genotype calling, along with variant quality metrics and filtering, were provided by the TOPMed Informatics Research Center (3R01HL-117626-02S1; contract HHSN268201800002I). Until October 2021, core support including phenotype harmonization, data management, sample-identity QC, and general program coordination were provided by the TOPMed Data Coordinating Center (R01HL-120393; U01HL-120393; contract HHSN268201800001I). We gratefully acknowledge the studies and participants who provided biological samples and data for TOPMed. K.E.N. is supported by R01HD057194, R01DK122503, R01HG010297, R01HL142302, R01HL143885, R01HG009974, and R01DK101855. T.S. is supported by R21AG070644, R21HL145425, and R03OD030598. The authors declare no competing interests.
Funding Information:
Molecular data for the Trans-Omics for Precision Medicine (TOPMed) program was supported by the National Heart, Lung, and Blood Institute (NHLBI). Core support including centralized genomic read mapping and genotype calling, along with variant quality metrics and filtering, were provided by the TOPMed Informatics Research Center ( 3R01HL-117626-02S1 ; contract HHSN268201800002I ). Until October 2021, core support including phenotype harmonization, data management, sample-identity QC, and general program coordination were provided by the TOPMed Data Coordinating Center ( R01HL-120393 ; U01HL-120393 ; contract HHSN268201800001I ). We gratefully acknowledge the studies and participants who provided biological samples and data for TOPMed . K.E.N. is supported by R01HD057194 , R01DK122503 , R01HG010297 , R01HL142302 , R01HL143885 , R01HG009974 , and R01DK101855 . T.S. is supported by R21AG070644 , R21HL145425 , and R03OD030598 .
Publisher Copyright:
© 2022 American Society of Human Genetics
PY - 2022/9/1
Y1 - 2022/9/1
N2 - For the genomics community, allele frequencies within defined groups (or “strata”) are useful across multiple research and clinical contexts. Benefits include allowing researchers to identify populations for replication or “look up” studies, enabling researchers to compare population-specific frequencies to validate findings, and facilitating assessment of variant pathogenicity in clinical contexts. However, there are potential concerns with stratified allele frequencies. These include potential re-identification (determining whether or not an individual participated in a given research study based on allele frequencies and individual-level genetic data), harm from associating stigmatizing variants with specific groups, potential reification of race as a biological rather than a socio-political category, and whether presenting stratified frequencies—and the downstream applications that this presentation enables—is consistent with participants’ informed consents. The NHLBI Trans-Omics for Precision Medicine (TOPMed) program considered the scientific and social implications of different approaches for adding stratified frequencies to the TOPMed BRAVO (Browse All Variants Online) variant server. We recommend a novel approach of presenting ancestry-specific allele frequencies using a statistical method based upon local genetic ancestry inference. Notably, this approach does not require grouping individuals by either predominant global ancestry or race/ethnicity and, therefore, mitigates re-identification and other concerns as the mixture distribution of ancestral allele frequencies varies across the genome. Here we describe our considerations and approach, which can assist other genomics research programs facing similar issues of how to define and present stratified frequencies in publicly available variant databases.
AB - For the genomics community, allele frequencies within defined groups (or “strata”) are useful across multiple research and clinical contexts. Benefits include allowing researchers to identify populations for replication or “look up” studies, enabling researchers to compare population-specific frequencies to validate findings, and facilitating assessment of variant pathogenicity in clinical contexts. However, there are potential concerns with stratified allele frequencies. These include potential re-identification (determining whether or not an individual participated in a given research study based on allele frequencies and individual-level genetic data), harm from associating stigmatizing variants with specific groups, potential reification of race as a biological rather than a socio-political category, and whether presenting stratified frequencies—and the downstream applications that this presentation enables—is consistent with participants’ informed consents. The NHLBI Trans-Omics for Precision Medicine (TOPMed) program considered the scientific and social implications of different approaches for adding stratified frequencies to the TOPMed BRAVO (Browse All Variants Online) variant server. We recommend a novel approach of presenting ancestry-specific allele frequencies using a statistical method based upon local genetic ancestry inference. Notably, this approach does not require grouping individuals by either predominant global ancestry or race/ethnicity and, therefore, mitigates re-identification and other concerns as the mixture distribution of ancestral allele frequencies varies across the genome. Here we describe our considerations and approach, which can assist other genomics research programs facing similar issues of how to define and present stratified frequencies in publicly available variant databases.
KW - allele frequencies
KW - anti-racism
KW - genetic ancestry
KW - stratification
UR - http://www.scopus.com/inward/record.url?scp=85137169573&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85137169573&partnerID=8YFLogxK
U2 - 10.1016/j.ajhg.2022.07.008
DO - 10.1016/j.ajhg.2022.07.008
M3 - Review article
C2 - 36055210
AN - SCOPUS:85137169573
SN - 0002-9297
VL - 109
SP - 1582
EP - 1590
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
IS - 9
ER -