These data include a total of 18,916 subjects from the Health and Retirement Study (HRS) genotyped on Illumina HumanOmni2.5 arrays. The data files also include genotypes imputed using the 1000 Genomes and Haplotype Reference Consortium (HRC) reference panels.

Respondents who consented to the saliva collection in 2006 (Phase 1), 2008 (Phase 2), 2010 (Phase 3), or 2012 (Phase 4) have been genotyped using Illumina Omni genotyping platforms. The Phase 1 and 2 participants were genotyped together, and were imputed together previously (see dbGaP accession number phs000428.v1.p1). The Phase 3 participants were subsequently genotyped, and were imputed together with Phases 1-2 (dbGaP accession number phs000428.v2.p2). An additional 3,303 Phase 4 participants were genotyped in 2015, and were imputed together with Phases 1-3, yielding a total of 18,923 unique HRS participants: 15,620 from Phases 1-3, and 3,303 from Phase 4.  After QC, there were a total of 18,916 unique HRS participants included in this dataset.
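The participant counts above can be checked with a few lines of arithmetic. This is only an illustrative sketch: the variable names are made up for the example, and the number of participants dropped at QC is inferred from the two stated totals rather than reported directly in the description.

```python
# Counts taken from the dataset description above.
phases_1_3 = 15_620      # unique participants from Phases 1-3
phase_4 = 3_303          # additional Phase 4 participants genotyped in 2015
total_after_qc = 18_916  # unique participants remaining after QC

# Total unique participants imputed together, before QC.
total_before_qc = phases_1_3 + phase_4

# Inferred (not stated in the source): participants dropped during QC.
dropped_at_qc = total_before_qc - total_after_qc

print(f"Before QC: {total_before_qc:,}")
print(f"After QC:  {total_after_qc:,}")
print(f"Dropped:   {dropped_at_qc}")
```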

Additional information can be found on the HRS website.

Sample Summary per Data Type

Sample Set | Accession | Data Type | Number of Samples
HRS-All Phases | snd10027 | GWAS, 1000G Imputation, HRC Imputation | 19,004
HRS-Phase 4 | snd10028 | GWAS | 3,475

Available Filesets

Fileset | Accession | Latest Release | Description
HRS GWAS | fsa000020 | NG00119.v1 | GWAS Illumina HumanOmni2.5
HRS Imputation | fsa000021 | NG00119.v1 | 1000G Imputation data, HRC Imputation data

View the File Manifest for a full list of files released in this dataset.

The HRS is a nationally representative sample with oversamples of African-American and Hispanic populations. The target population for the original HRS cohort includes all adults in the contiguous United States born during the years 1931–1941 who reside in households. HRS was subsequently augmented with additional cohorts in 1993 and 1998 to represent the entire population aged 51 and older in 1998 (b. 1947 and earlier). Since then, the steady-state design has called for refreshing the sample every six years with a new six-year birth cohort of 51- to 56-year-olds. This was done in 2004 with the Early Baby Boomers (EBB) (b. 1948–53) and in 2010 with the Mid Boomers (MBB) (b. 1954–59).

Sample Set | Accession | Number of Subjects
HRS-All Phases | snd10027 | 18,916
HRS-Phase 4 | snd10028 | 3,303

Consent Level | Number of Subjects

Visit the Data Use Limitations page for definitions of the consent levels above.

Acknowledgment statement for any data distributed by NIAGADS:

Data for this study were prepared, archived, and distributed by the National Institute on Aging Alzheimer’s Disease Data Storage Site (NIAGADS) at the University of Pennsylvania (U24-AG041689), funded by the National Institute on Aging.

Use the study-specific acknowledgment statements below (as applicable):

For investigators using any data from this dataset:

Please cite/reference the use of NIAGADS data by including the accession NG00119.

For investigators using LASI-DAD (sa000019) data:

In text: "The Longitudinal Aging Study in India, Diagnostic Assessment of Dementia data is sponsored by the National Institute on Aging (grant number R01AG051125, RF1AG055273, U01AG065958) and is conducted by the University of Southern California."

In references: "The Longitudinal Aging Study in India, Diagnostic Assessment of Dementia Study. Produced and distributed by the University of Southern California with funding from the National Institute on Aging (grant number R01AG051125, RF1AG055273, U01AG065958), Los Angeles, CA."

For investigators using HRS (sa000021) data:

HRS is supported by the National Institute on Aging (NIA U01AG009740). The genotyping was partially funded by separate awards from NIA (RC2 AG036495 and RC4 AG039029). Our genotyping was conducted by the NIH Center for Inherited Disease Research (CIDR) at Johns Hopkins University. Genotyping quality control and final preparation were performed by the Genetics Coordinating Center at the University of Washington (Phases 1-3) and the University of Michigan (Phase 4).