Data Access Requests - DSS NIAGADS

Investigator:
Adanve, Bertrand
Institution:
Genetic Intelligence, Inc
Project Title:
AI-based platform to identify causal, genetically-defined therapeutic targets for Alzheimer's disease
Date of Approval:
April 7, 2021
Request status:
Closed
Research use statements:
Technical Research Use Statement:
Genetic Intelligence (GI) will analyze de-identified whole genome sequences (WGS) from Alzheimer’s disease (AD) patients and healthy individuals using its AI-based platform to discover novel causal genes and variants for AD. GI has previously obtained an NSF Phase I SBIR grant (#1819331) to develop and validate its computational platform using Amyotrophic Lateral Sclerosis (ALS) WGS (obtained from our partner, the New York Genome Center) as a proof-of-principle. GI successfully validated the platform by rediscovering known ALS genes SOD1 and C9orf72, as well as discovering new ALS genes including STMN2, which was independently discovered by two labs this year using experimental approaches. We are currently validating two additional ALS targets experimentally. We plan to focus on applying our platform to uncover novel genes and variants in AD using a study design similar to the one used for ALS. Objective 1: Obtain de-identified WGS data (e.g., ADSP and ADNI) from NIAGADS and preprocess them for input into GI’s computational pipeline. Objective 2: Run the AD WGS through our in-house genetic background dissector tool, Cato, which uses several machine learning systems to stratify genomes before analysis to avoid spurious results arising from differences in case and control backgrounds. Objective 3: Rediscover known AD genes. Chromosomes containing known AD genes (e.g., APOE, PSEN1 and PSEN2) will be input into our causal gene discovery platform, Bergspitze, to confirm whether it can rediscover the known AD genes. The output from Bergspitze will be input into Franklin, GI’s interpretation module that provides a coherent etiology model for the disease with awareness of alternative etiologies advanced in the literature. Objective 4: Discovery of new AD genes. Run the full genomes from all cluster groups and ancestry cohorts in Bergspitze and Franklin to identify and prioritize new AD genes and variants. There are no plans to collaborate with other institutions.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is the leading cause of dementia, affecting over 44 million people worldwide, and there is currently no effective prevention or cure. Enormous efforts and funding have been put into the discovery of the root cause of AD so as to find an effective treatment, but even with advancing genetic sequencing and analysis technology, no smoking gun has been found yet. Part of the issue has been a scientific focus on analyzing the human exome, the ~1% of the human genome that codes for proteins, due to ease of analysis and lower noise in the data. However, this methodology excludes the remaining ~99% of the human genome, which harbors critical regulatory features that affect many of the processes in the body. Genetic Intelligence (GI) aims to solve this problem using its novel whole genome analysis platform that blends advanced genetic principles with state-of-the-art machine learning to identify causal disease targets. This information can then be used to create new drug candidates that are not only effective and precise, but also affordable.
Investigator:
Beam, Christopher
Institution:
University of Southern California
Project Title:
Genetic and epigenetic correlates of behavioral risk and protective factors in aging
Date of Approval:
October 31, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Epigenetic changes play a role in the biological pathways associated with psychosocial and cognitive outcomes. Little research, however, has been devoted to understanding whether differences in methylation for one psychosocial phenotype also account for differential methylation of other phenotypes. The objective of the proposed research is to estimate the associations between epigenetic modifications (i.e., DNA methylation) and behavioral and psychosocial risk factors (e.g., depressive symptomatology, loneliness, low physical activity) previously shown to predict cognitive decline in older adulthood. Our principal analyses include epigenome-wide association studies (EWASs) of the principal phenotypes of interest to quantify differential methylation at specific CpG sites for various phenotypes of Health and Retirement Study participants who were enrolled in the 2016 Venous Blood Study. We will subsequently estimate correlations between the beta weights generated from these EWASs to quantify whether differences in methylation underlying psychosocial and cognitive phenotypes covary. Follow-up analyses will include gene-enrichment analyses to identify pathways that contain genes that are overexpressed for two phenotypes (e.g., loneliness and cognition). In addition, we will construct DNA methylation (DNAm) age acceleration measures using principal components methodologies that improve the reliability of DNAm age variables. The phenotypic characteristics proposed for the present study are broad and include self-report measures of current and previous smoking, frequency of social interactions, loneliness, depressive symptomatology, self-reported physical activity, subjective memory rating, and cognitive ability measures (e.g., immediate and delayed word recall, backward counting, serial 7s, verbal fluency, vocabulary, quantitative reasoning, and verbal reasoning).
Non-Technical Research Use Statement:
The general purpose of the study is to identify epigenetic correlates of behavioral risk factors and cognitive decline in older adults. For example, our planned analyses will investigate epigenome-wide associations with loneliness, depressive symptomatology, self-reported physical activity, and cognition. Further, we will identify overlapping CpG sites between behavioral risk factors and cognition in older adulthood, and use gene ontology to identify the differentially expressed genes that are enriched in biological processes relevant to disease risk and aging.
Investigator:
Beecham, Gary
Institution:
Wake Forest School of Medicine
Project Title:
U01AG079850_Genetics of neuropsychiatric symptoms in AD
Date of Approval:
January 7, 2025
Request status:
Expired
Research use statements:
Technical Research Use Statement:
Neuropsychiatric Symptoms (NPS) (e.g., aggression, psychosis, anxiety, apathy, depression, agitation, sleep disturbances, repetitive behaviors) occur in 85% of AD patients, and are associated with greatly increased suffering of patients and families. Despite this, our understanding of the etiology of NPS in AD is inadequate, with treatments for NPS often being ineffective and associated with serious adverse effects. This knowledge gap is particularly egregious in underserved racial and ethnic groups. The aim of the current project (U01AG079850) is to collate, harmonize, and analyze the AD-associated NPS data collected on ADSP/ADSP-FUS samples. We plan to (1) expand the racially and ethnically diverse datasets of the ADSP-FUS and related efforts to include harmonized NPS data, creating the largest and most diverse genomic resource on NPS in AD to date and allowing researchers to assess a wide range of additional critical hypotheses through these resources; (2) utilize these harmonized data to identify and describe genetic determinants, pathways, and polygenic effects underlying specific NPS in AD; (3) explore the shared genetic architecture across AD-associated NPS and with primary psychiatric disorders; and (4) disentangle the role of ancestry in NPS genetic risk. We anticipate that this work will lead to a better understanding of the genetic basis of NPS in AD, which is vital to infer the mechanistic pathways underlying these highly disabling symptoms and to develop more effective pharmacological targets. To collate NPS data on all ADSP-FUS cohorts, we collaborate closely with the ADSP-Phenotype Harmonization Consortium. Creation of refined harmonized NPS phenotypes will be conducted by Dr. Ted Huey’s group at Brown University. Genomic data analyses will be conducted by Dr. Reitz’s group at Columbia University and Dr. Beecham’s group at the Wake Forest School of Medicine.
Non-Technical Research Use Statement:
Although neuropsychiatric symptoms (e.g. aggression, psychosis, anxiety, apathy, depression, and sleep disturbances) occur in ~85% of Alzheimer disease patients and are associated with accelerated decline, increased cost, out-of-home placement, and greatly increased suffering of patients and families, our understanding of their etiology is still inadequate, with treatments often being ineffective and even associated with serious adverse effects (including increased mortality). This knowledge gap is particularly egregious in underserved racial and ethnic groups such as Hispanics and African-Americans. We propose to expand the racially and ethnically diverse ADSP-FUS and related resources to include harmonized neuropsychiatric symptom data allowing researchers to assess a variety of additional critical hypotheses, and to utilize these harmonized data to identify ancestry-specific genetic determinants, molecular pathways, and polygenic effects underlying neuropsychiatric symptoms in Alzheimer disease.
Investigator:
Beer, Simone
Institution:
Forschungszentrum Juelich
Project Title:
Using explainable AI to formulate candidate gene risk scores for AD
Date of Approval:
March 30, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
In complex diseases like Alzheimer’s disease (AD), disease risk is conveyed by genetic variants that each have a small effect on disease susceptibility but may add up to a relevant impact in combination. Such complex synergistic effects can be detected by means of machine learning (ML) models such as decision tree ensembles, which are excellent candidates for detecting genetic variant interactions, even beyond pairwise interactions, in candidate gene approaches. Explainable artificial intelligence can then explain the models and deliver information about the effects and importance of the single variants. In a first application of this technique, we formulated a risk score for brain amyloid burden based on aquaporin-4 (AQP4) single nucleotide polymorphisms (SNPs) using data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). We now plan to replicate and extend our method with a larger dataset and to include further candidate genes, e.g. circadian, sleep-related or blood-brain-barrier-related genes. The data from the NIAGADS data repository will significantly expand the previously used database. Of special interest are the data from the Alzheimer’s Disease Sequencing Project (ADSP). Besides a general increase in data, the racial/ethnic diversity of ADSP is also of high interest. Genetic variants might occur at different frequencies in different populations, so that combinations of variants that are rare in one population might be more frequent in another. Access to these data will allow us not only to replicate our previous findings, but also to extend our methodology to other candidate genes. Information on SNP status, together with further information such as age, gender, APOE4 status, diagnosis and endophenotypes of AD, as available, will be used to train and explain an ML model. Information gathered from the explanations will be used to assign importance as well as protective or deleterious effects to the SNPs, eventually allowing us to formulate a risk score for (endo)phenotypes of AD or diagnosis.
Non-Technical Research Use Statement:
In diseases like Alzheimer’s, genetic variations that each have a small effect may together affect the disease risk. These variations can add up to have a significant impact. We can use machine learning, like decision tree ensembles, and explainable AI to find these combined effects. We've already used this method to create a risk score for brain amyloid burden in Alzheimer's. Now, we want to expand this work using a larger dataset and looking at more genes, e.g. genes related to sleep. We want to use a new dataset that includes more data and diverse genetic information. Different populations may have different genetic variations, so having more diverse data will help us to understand these differences. We'll use this data, along with other information like age and gender, to train a machine learning model. By understanding how the model makes its decisions, we can figure out which variants of a gene are most important and how their combination affects Alzheimer's risk. This will help us to create gene-specific risk scores for Alzheimer's and related conditions.
Investigator:
Belloy, Michael
Institution:
Washington University in St Louis
Project Title:
Elucidating sex-specific risk for Alzheimer's disease through state-of-the-art genetics and multi-omics
Date of Approval:
March 31, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
• Objectives: In this project, we seek to holistically investigate the genetic and molecular drivers of sex dimorphism in Alzheimer’s disease across ancestries. • Study design: This study integrates large-scale population genetics with multi-omics and endophenotype analyses. We are integrating all data available from ADGC and ADSP, together with other data from AMP-AD and biobanks such as UKB, FinnGen, and MVP, to conduct large-scale multi-ancestry GWAS, rare-variant gene aggregation analyses, QTL studies, PWAS, TWAS, etc. We also focus particularly on X chromosome association studies. The study design also interrogates interactions with ancestry, hormone exposures, and APOE*4, as well as comparisons to non-stratified GWAS/XWAS of Alzheimer’s disease. Further, we will employ genetic correlation analyses, Mendelian randomization, colocalization, and pleiotropy analyses to interrogate overlap with other complex traits and to better understand the mechanisms underlying sex dimorphism in Alzheimer’s disease. • Analysis plan, including the phenotypic characteristics that will be evaluated in association with genetic variants: Our phenotypes will include Alzheimer’s disease risk, conversion risk, various endophenotypes (including amyloid/tau biomarkers, brain imaging metrics, etc.) as well as molecular traits. As noted above, we will conduct large-scale multi-ancestry GWAS, XWAS, rare-variant gene aggregation analyses, QTL studies, PWAS, TWAS, etc. Specific aims include interrogating these questions and analyses on (1) the autosomes and (2) the X chromosome, and (3) leveraging sex-stratified QTL studies to drive discovery of risk genes.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) manifests itself differently in men and women, but the genetic and molecular factors that drive this remain elusive. AD is the most common cause of dementia and to this day remains largely untreatable. It is thus crucial to study the genetics of AD in a sex-specific manner, as this will help the field gain important insights into disease pathophysiology, identify novel sex-specific risk factors relevant to personalized genetic medicine, and uncover potential new AD drug targets that may benefit both sexes. This project uses large-scale genomics and multi-omics to elucidate novel sex-agnostic and sex-specific AD risk genes. We will interrogate sex dimorphism for AD risk on the autosomes and the sex chromosomes. We will similarly interrogate sex dimorphism in the genetic regulation of gene expression and protein levels, which we will integrate with genetic risk for Alzheimer’s disease to further discover risk genes. Throughout, we will also interrogate how sex-specific risk for AD interacts with hormone exposures, ancestry, and the APOE*4 risk allele.
Investigator:
Belsky, Daniel
Institution:
Columbia University
Project Title:
Life-course sociogenomic analysis of social inequalities in aging
Date of Approval:
June 3, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Overview. In this NIAGADS application, we seek access to DNA methylation (DNAm) data for the US Health and Retirement Study (HRS) to (1) investigate how environmental and behavioral factors in middle and later life, including physical and social environmental conditions, work and retirement, family structure, and life events, affect the pace of biological aging; and (2) develop, validate, and refine DNAm-based indices of the pace of biological aging. These activities will establish the sensitivity of metrics of biological aging to social determinants of health and furnish validated metrics of biological aging that can be deployed to evaluate the impacts of interventions targeting the biology of aging (R01AG061378), social determinants of health that we hypothesize indirectly impact the biology of aging (R01AG087158, R01AG073402), and early-life exposures that we hypothesize hasten the pace of aging across the lifespan (R01AG066887). Analysis Plan. The program of analysis for this project will include a range of activities. We will process and normalize DNAm data following methods established in our prior work. The first step will be to merge DNAm data obtained from NIAGADS with phenotypic data obtained from the HRS. We will then (1) compute indices of the pace and progress of biological aging defined in other studies, many of which are not yet available as data products from the HRS; (2) test how these indices of biological aging are related to social and physical environmental factors (e.g. socioeconomic status and social mobility, family structure, air pollution, neighborhood social environment) and investigate the extent to which they mediate environmental impacts on incidence of chronic disease, disability, and mortality as outlined in our previous work; (3) test how these indices relate to phenotypes of aging-related decline in functional capacities and organ system integrity defined in Balachandran et al. 2024; (4) conduct discovery analysis to determine DNAm correlates and signatures of pace of aging phenotypes defined in Balachandran et al. 2024 along with related phenotypes we are currently in the process of developing.
Non-Technical Research Use Statement:
A central question in life-course social science is how early-life social disadvantage becomes biologically embedded, leading to social inequalities. One hypothesis is that social disadvantage actually hastens aging. While everyone ages chronologically at the same rate, biological changes with aging may proceed faster for some than others. These changes are thought to be a root cause of disease/disability and an intervention target to extend healthy lifespan. A knowledge gap is whether social disadvantage hastens aging-related biological changes. If so, it would open an opportunity to join forces between biomedical research developing interventions to slow aging and social science research to address social inequalities. Our work and that of others indicates that the pace and progress of biological aging can be measured from DNA methylation. In this project, we will investigate how social determinants of health affect trajectories of biological aging with the goal of informing programs and policies to dismantle health disparities and build healthy longevity for all.
Investigator:
Benjamin, Daniel
Institution:
NBER and UCLA
Project Title:
Multi-Ancestry Meta-Analysis of Alzheimer’s Disease
Date of Approval:
June 11, 2021
Request status:
Closed
Research use statements:
Technical Research Use Statement:
Genome-Wide Association Studies (GWAS) are powerful techniques for linking genetic variation to complex phenotypes but have limited utility in small samples. Because the overwhelming majority of genotyped cohorts consist of individuals of European ancestry, conducting well-powered GWAS in diverse populations will be a challenge for years to come. This project overcomes this barrier by developing a meta-analysis framework for GWAS conducted in populations of different ancestries. In particular, multi-ancestry meta-analysis (MAMA) implements a generalized method of moments estimator based on differences in local linkage disequilibrium (LD) structure across the relevant populations. In doing so, MAMA allows for genetic signal in one population to be shared across other populations, substantially boosting statistical power in small samples and allowing for novel genetic associations to be detected. The primary goal of our proposed project is to use MAMA to jointly analyze GWAS summary statistics for Alzheimer’s disease corresponding to several different ancestries. Preliminary applications of MAMA to other phenotypes have yielded many additional genome-wide significant loci for each ancestry. MAMA summary statistics can be interpreted similarly to the original GWAS summary statistics, but with greater statistical power. We hope to incorporate NIAGADS data from the study Kunkle et al. (2020) into our analysis pipeline. We will jointly analyze these data with summary statistics from Kunkle et al. (2018) and from Zhou et al. (2018), studies based on European-ancestry and East-Asian-ancestry samples, respectively. We anticipate that doing so will yield many novel discoveries about the genetics of Alzheimer’s disease in each of these populations.
Non-Technical Research Use Statement:
Understanding the link between genetic variation and the incidence of diseases like Alzheimer’s remains an urgent task in the genetics community. One outstanding challenge involves incorporating people with non-European ancestries in such studies. This is an important effort for two reasons. First, generalizing findings from homogeneous populations is difficult and hinders a broader understanding of the relevant disease. Second, such an inability to generalize findings will likely perpetuate existing health inequalities. It will likely take many years before enough people with non-European ancestries have been genotyped to match the precision currently available in European-ancestry samples. Rather than waiting for such resources to materialize, our project develops a statistical framework that allows existing cross-ancestry data to be meta-analyzed to improve our ability to detect genetic associations in understudied populations. The NIAGADS dataset, combined with our methodology, will allow a broader understanding of Alzheimer’s to be developed.
Investigator:
Benjamin, Daniel
Institution:
NBER and UCLA
Project Title:
How health-relevant outcomes are influenced by genetics.
Date of Approval:
May 16, 2024
Request status:
Expired
Research use statements:
Technical Research Use Statement:
We will use the HRS data to pursue two complementary strategies. One is the discovery of particular genetic polymorphisms associated with social-science outcomes. Because the effect of an individual genetic polymorphism on the outcome is likely to be very small, the HRS sample is too small, taken alone, to be used to discover new associations. Hence, we will pursue this strategy with HRS data in conjunction with other datasets that we have organized in the Social Science Genetic Association Consortium (SSGAC; www.thessgac.org). Our second strategy focuses on exploiting the uniquely rich social-science data in the HRS. We will conduct analyses that will shed light on the genetic architecture of a range of social-science outcomes. We will apply statistical methods that use the information contained in the dense SNP data taken as a whole and are thus well-powered in a sample of the size of the HRS. Our specific aims are: 1. To incorporate data from the HRS into ongoing meta-GWAS efforts from the SSGAC for a range of social-science outcomes, such as educational attainment and personality. 2. To continue to include HRS in future releases of the Polygenic Index (PGI) Repository. PGIs (aka polygenic scores) are summaries of a person's genetic predisposition to a particular trait. HRS was included in the first release of the Repository, for which we created PGIs for 47 phenotypes in 11 datasets, which were returned to the datasets to be shared with users according to the datasets’ own data sharing procedures. We will regularly update the existing PGIs and add new phenotypes as larger GWAS and better methodologies become available. Details on the Repository can be found in Becker et al. (2021, Resource profile and user guide of the Polygenic Index Repository. Nat. Hum. Behav.). 3. To use the HRS genotype data to conduct polygenic prediction analyses for a range of social-science traits. Besides the direct interest in assessing the degree of predictive power in PGIs, we will examine how these PGIs interact with environmental factors to influence life outcomes. 4. To estimate heritability and genetic correlations for social-science traits in an older population.
Non-Technical Research Use Statement:
We will use HRS data to explore the genetic architecture of social-science outcomes. To do so, we will either use HRS data together with other datasets to identify specific genetic variants associated with these outcomes, or analyze the aggregate effect of all genetic variants in HRS alone using heritability analyses and polygenic indexes (PGIs). PGIs are summaries of a person's known genetic predisposition to a particular trait. We will use PGIs to examine the pathways underlying the relationship between genetic variants and outcomes of interest, including analyses of how genes and environment interact. We will also include HRS in future releases of the PGI Repository, an initiative that makes PGIs for a wide range of traits available in a number of datasets that may be useful to social scientists (https://www.thessgac.org/pgi-repository). HRS was included in the first release of the Repository, and we wish to continue to update the HRS PGIs and add PGIs for new phenotypes as more data or better methodologies become available.
Investigator:
Black, Mary Helen
Institution:
JOHNSON/JOHNSON/PHARM/RES/ DEVELOPMENT
Project Title:
Target identification and validation in Alzheimer’s Disease with Whole-Genome and Whole-Exome Sequence Data
Date of Approval:
April 18, 2022
Request status:
Closed
Research use statements:
Technical Research Use Statement:
Alzheimer’s disease (AD) is a common, progressive, neurodegenerative disorder with a strong genetic component, with heritability estimates ranging from 58–79% for late-onset AD and over 90% for early-onset AD. Genetic association studies are important to highlight key biological mechanisms contributing to the etiology of AD and provide key insights into potential pathways that can ultimately be targeted for future therapeutic development. The objective of this study is to perform a retrospective analysis of genetic data collected from large-scale population-based and case-control cohorts including the UK Biobank, the Alzheimer’s Disease Sequencing Project (ADSP), and FinnGen and integrate them with publicly available multi-omics datasets including, but not limited to, Genotype-Tissue Expression (GTEx), Microglia Genomic Atlas (MiGA), and neuroimaging data to identify novel and existing evidence for genetic determinants of AD. No attempt will be made to identify subjects. Aim 1: Identify novel and replicate existing gene associations for AD. We will perform case-control and family-based genetic analyses with AD diagnosis as the outcome of interest. Covariates include age, sex, and principal components. ADSP, UKB, and FinnGen will be analyzed separately and combined with meta-analysis. Biobank cases will be defined using ICD-9/ICD-10 codes, and proxy cases and controls will be carefully defined using questionnaire data on parental history of AD. Both true and proxy cases will be considered to maximize the number of AD cases. Aim 2: Prioritize novel gene associations identified in Aim 1. We will perform genetic fine-mapping and leverage tissue and cell-type specific datasets (e.g. GTEx and MiGA) to prioritize targets for further functional and analytical interrogation. Statistical methods used for target prioritization include colocalization, statistical fine-mapping, and Mendelian randomization. Furthermore, multi-omics-based network approaches will be used to identify disease-related molecular modules and tissue-specific regulatory circuits.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is a common, progressive, neurodegenerative disorder with a strong genetic component with heritability estimates ranging from 58–79% for late-onset AD and over 90% for early onset AD. To date, there is only one treatment option intended to mediate the disease progression of AD, while all others treat symptoms associated with AD. Genetic association studies are important to highlight key biological mechanisms contributing to the etiology of AD and provide key insights into potential pathways that can ultimately be targeted for future therapeutic development. The objective of this study is to perform a retrospective analysis of genetic data collected from large-scale population-based and case-control cohorts including the UK Biobank, the Alzheimer’s Disease Sequencing Project (ADSP), and FinnGen and integrate them with publicly available multi-omics datasets including, but not limited to, Genotype-Tissue Expression (GTEx), Microglia Genomic Atlas (MiGA), and neuroimaging data to identify novel and existing evidence for genetic determinants of AD.
Investigator:
Blanck, George
Institution:
UNIVERSITY OF SOUTH FLORIDA
Project Title:
Alzheimer's disease (AD) and immune receptor recombinations
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
We would like to access the exome files for Alzheimer’s Disease (AD) studies to better ascertain the impact of certain immune receptor (IR) recombinations on disease progression in Alzheimer’s patients. First, we will mine the exomes for all IR recombinations present in the AD exome files using our previously published algorithms and software [4-6, 9]. These studies have been successful in identifying IR recombination reads that were strongly associated with distinct survival rates in various cancers. In particular, we have been able to utilize a similar methodology to identify specific T-cell recombinations associated with cancer features by using the recombinations that are obtained from blood sample exomes. For example, recently published papers from our group have revealed consistent features of T-cell receptor recombinations, obtained from cancer patient blood samples, which were associated with features of cancer progression [4, 8]. Therefore, we hypothesize that we can detect specific IR recombination features from blood-resident T- or B-cells that are relevant to AD clinical features. Thus, we will first identify IR recombinations found within the AD blood exomes, and then match features of these IRs, such as the chemical aspects of the IR antigen binding sites [1, 4, 5], with clinical characteristics, e.g., age of onset, diagnostic status, and cognitive measures. We thus expect to identify certain IR recombinations associated with distinct prognoses. It should also be noted that there have been previous studies linking HLA-DR alleles to late-onset AD [10]. This is of particular interest because of our previous work linking combinations of T-cell receptor V or J usage and HLA alleles to distinct cancer survival rates [2, 3, 7, 8]. Thus, we will also programmatically obtain the HLA alleles from the exome files and determine whether any combinations of T-cell receptor V or J usage and HLA alleles are associated with particular features of AD development.
References available by email: gblanck@usf.edu
Non-Technical Research Use Statement:
The purpose of this project is to learn whether there are any features of the immune system that are unique to Alzheimer's patients. If so, such unique features might help understand disease progression better and might provide targets for therapies.
Investigator:
Blue, Elizabeth
Institution:
University of Washington
Project Title:
Genetic modifiers of Alzheimer's disease
Date of Approval:
July 15, 2025
Request status:
Expired
Research use statements:
Technical Research Use Statement:
The objective of the proposed research is to identify new genes involved in Alzheimer's disease (AD) by identifying alleles contributing to increased risk for or protection against the disease, providing insight as to why individuals with risk factors develop or escape from AD, and ultimately identifying potential avenues for therapeutic approaches and prevention of the disease. Our study design will use phenotypic (e.g., AD diagnosis, age-at-onset, APOE genotype) and genomic data (e.g., WGS, array, imputed genotypes) from NIAGADS studies to investigate genotype-phenotype associations. Strategies include association testing and haplotype- and family-based approaches, including estimates of relatedness and population genetics analyses as needed to perform the association testing (e.g., control for population structure). NIAGADS data will not be used to investigate individual identity. Consent type and other Data Use Limitations (DUL) for each study will be respected in all analyses. Data from an individual with disease-specific consent will not be used in analyses outside of that restriction, including indirect uses such as imputation reference panels or variant summary statistics. When an individual’s DUL prohibits investigation of population genetics, population history or related issues, their data will be excluded from studies that address those issues. We intend to publish or otherwise broadly share any findings from this study with the scientific community. As such, genomic summary results from datasets with a “sensitive” designation will only be shared through publications to support the study’s conclusions and through NIH-funded data repositories which maintain restricted access (e.g., NIAGADS). Data from NIAGADS may be combined with non-NIAGADS data from the same or other studies (obtained from dbGaP or other sources) to improve the power for novel genetic discoveries, while respecting the consent of all participants.
We expect that this activity creates no additional risks to participants. Data will be shared only among Internal Collaborators at the University of Washington. We do not plan to collaborate with External Collaborators at other institutions.
Non-Technical Research Use Statement:
We propose to identify new genes involved in Alzheimer's disease (AD) by identifying alleles contributing to increased risk for or protection against the disease, providing insight as to why individuals with risk factors develop or escape from AD, and ultimately identifying potential avenues for therapeutic approaches and prevention of the disease. We will combine phenotype and genotype data using association testing and haplotype- and family-based approaches to identify associations and refine those signals with fine-mapping tools and external data.
Investigator:
Boerwinkle, Eric
Institution:
University of Texas Health Science Center at Houston
Project Title:
Therapeutic target discovery in ADSP data via comprehensive whole-genome analysis incorporating ethnic diversity and systems approaches
Date of Approval:
December 18, 2024
Request status:
Expired
Research use statements:
Technical Research Use Statement:
Dr. Boerwinkle’s group is involved in all aspects of the ADSP including study design, data gathering and data analysis. Permissions are necessary for the array genotype, DNA sequence, and all available phenotype data. This availability and ensuing data access will be used for data processing and data analysis to identify novel AD risk-raising and protective loci.
Non-Technical Research Use Statement:
Dr. Boerwinkle’s group is involved in all aspects of the ADSP including study design, data gathering and data analysis. Permissions are necessary for the array genotype, DNA sequence, and all available phenotype data. This availability and ensuing data access will be used for data processing and data analysis to identify novel AD risk-raising and protective loci.
Investigator:
Bozdag, Serdar
Institution:
University of North Texas
Project Title:
Utilizing Machine learning and AI for early detection, and identification of Alzheimer's Disease and Related Dementias
Date of Approval:
September 4, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objectives of the proposed research: Alzheimer’s disease and its related dementias (ADRD) are a growing public health crisis with no known cure. ADRD diagnosis remains challenging due to its inherent heterogeneity, variability of early symptoms, overlap and possibility of combined etiologies. The overarching objective of this project is to develop novel interpretable deep learning methods to integrate multimodal neuroimaging, genomic and clinical datasets collected from demographically diverse cohorts for early detection of ADRD. Study design: Our central hypothesis is that an integrative deep learning model that operates on both MRI data and NGS data will be an ideal solution for isolating ADRD biomarkers that can lead to early diagnosis. We aim to integrate multi-modal datasets, including genomic, clinical, demographic, and neuroimaging data from diverse populations in studies such as ADNI, ADSP, and HEBLA, and develop a multimodal fusion-based approach for optimized and accurate classification of various presentations of ADRD. Further, we will design and develop an interpretability framework that would help explain the decisions made by deep learning models, leading to knowledge discovery for neuroscientists and transparent analysis for clinicians. Analysis plan: We plan to utilize genetic variants as features in our deep learning model. To this end, we will perform GWAS to find SNPs that associate with disease diagnosis and endophenotypes such as neuroimaging, biospecimen measures, and cognitive performance. We will utilize genetic data to perform a network propagation study to discover AD-associated genes.
Non-Technical Research Use Statement:
Alzheimer’s disease and related dementias (ADRD) are a growing health crisis, with current global management costs over $350 billion and cases expected to grow to 14 million globally by 2050. Emerging evidence of the advantages of early detection of the disorder, together with the limitations of current clinical practice, suggests that more quantitative methods are needed to identify and diagnose ADRD early. In this project, we aim to develop interpretable deep learning models to integrate genetic, imaging, and clinical datasets for diagnosis and early prediction of ADRD.
Investigator:
Bras, Jose
Institution:
Regeneron Pharmaceuticals
Project Title:
Rare Variants in Alzheimer’s Disease and Other Dementias
Date of Approval:
August 2, 2022
Request status:
Closed
Research use statements:
Technical Research Use Statement:
Objectives: To identify genetic variants that are overrepresented in sporadic dementias when compared with controls. To identify genetic variants that are found uniquely in apparently sporadic dementia cases. To determine if rare variants identified in our cohorts of neurodegenerative disease are present in the ADSP controls. Study Design: We have performed exome-sequencing in over 3,000 samples from a variety of neurodegenerative dementias. These data were generated using Illumina sequencing and called using GATK’s Best Practices v3. In these samples, we have identified genetic variants that either have much lower frequency in controls and in publicly available databases of genetic data (gnomAD), or are absent from these cohorts. The proposed study design is largely a case-control study in ADSP data to replicate our findings, as well as a simple lookup for rarer variants in cases and controls where the sample size is insufficient to perform meaningful associations. We will perform single-variant and gene-based associations using standard methods (Fisher's exact test/logistic regression and SKAT-O) using gender, age and principal components as covariates. These tests are either implemented in PLINK or can be performed in R. To allow us to dissect the associations between genetic variants and phenotype we will require access to gender, age at onset, age at death (where available), Braak staging and CERAD scores for all cases in ADSP. Funding: Funding for the study is currently from Van Andel Research Institute’s internal funds.
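As an illustration of the single-variant case-control comparison mentioned above, the following is a minimal sketch of a two-sided Fisher's exact test on a 2x2 carrier table. It is not the investigators' pipeline (the statement names PLINK and R); the function name and the example counts are hypothetical, and covariates such as age or principal components are not modelled here.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    Assumed layout (illustrative only):
      a = cases carrying the variant,    b = cases not carrying it
      c = controls carrying the variant, d = controls not carrying it
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x carriers among cases, margins fixed
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one
    # (small relative tolerance guards against floating-point ties)
    return sum(
        p for p in (prob(x) for x in range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9)
    )


# Hypothetical counts: 3 carriers among 3 cases, 0 carriers among 3 controls
print(fisher_exact_two_sided(3, 0, 0, 3))
```

For rare variants with very small carrier counts, an exact test of this kind avoids the large-sample approximations that logistic regression relies on.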
Non-Technical Research Use Statement:
The main objective of the study is to identify genetic variants that cause or predispose to neurodegeneration. To accomplish this, we will analyze data previously generated for a variety of these conditions and use data from ADSP to replicate findings and improve our statistical power to detect these associations with disease. The identification of genetic variants, even if rare, that have a strong impact on dementia phenotypes will be of significant importance in advancing our understanding of disease biology. These variants will also be candidate targets for future diagnostic or therapeutic approaches for these diseases.
Investigator:
Brickman, Adam
Institution:
Columbia University
Project Title:
Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
August 16, 2023
Request status:
Expired
Research use statements:
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD, with the ultimate goal of generating harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative, the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC”, is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Broce, Iris
Institution:
UCSD
Project Title:
Multi-modal risk scores in cognitive aging and dementia
Date of Approval:
November 22, 2024
Request status:
Expired
Research use statements:
Technical Research Use Statement:
Non-invasive and low-cost tools for early detection of individuals at high risk for developing Alzheimer’s Disease (AD) are of crucial importance. While genome-wide association studies (GWAS) exist, most focus on European populations, limiting their application to diverse groups. Given the Eurocentric biases in GWAS, polygenic risk scores (PRSs) are better at predicting AD risk for European ancestry than for others. This research addresses this gap by developing a multimodal hazard score (MHS) that incorporates ethnically and genetically diverse populations. Our team established a successful polygenic hazard score (PHS) that predicts AD onset in European cohorts. Higher PHS predicted greater cognitive decline in cognitively normal (CN) individuals, greater entorhinal cortex volume loss, and conversion from CN to mild cognitive impairment (MCI) to AD. The PHS was also associated with neuropathological AD phenotypes and varied systematically with AD biomarkers such as amyloid and total tau PET. This suggests its potential as a robust genetic risk indicator to be included in the MHS. Leveraging data from the largest AD cohorts to date, including ADSP, we will fit Cox proportional hazards models to develop the MHS, combining age, PHS, brain atrophy, biofluid-based data, and clinical outcomes to predict neurocognitive decline trajectory. This score will be able to predict AD progression and identify individuals at high risk of transitioning from cognitive normalcy to MCI and eventually AD in a generalizable population. Additionally, we will perform power calculations to estimate required clinical trial sample sizes after hypothetical enrichment using the MHS. Finally, we aim to replicate these findings in other diverse cohorts to ensure broader applicability. We believe this MHS surpasses single-modal and European data-driven models in predicting AD for diverse populations.
This could pave the way for earlier intervention, more efficient research, and ultimately, improved outcomes for individuals at risk of AD.
Non-Technical Research Use Statement:
Alzheimer’s Disease (AD) disproportionately affects diverse populations, highlighting the need for non-invasive and low-cost tools for early detection. This research aims to develop a multimodal hazard score (MHS) in ethnically and genetically diverse populations. The MHS combines various risk factors, including an individual's genetic profile, neuroimaging, and specific blood markers (biomarkers). This comprehensive approach offers the potential to identify individuals at high risk of transitioning from cognitively normal (CN) to mild cognitive impairment (MCI), ultimately leading to AD. To develop the MHS, we will leverage data from the largest AD cohorts to date, including ADSP. We will then evaluate the MHS on its ability to identify individuals with greater risk of converting from CN to MCI and AD in a generalizable population, and perform power calculations to estimate required clinical trial sample sizes. This technique could pave the way for earlier intervention, more cost-efficient research, reduced participant burden and, ultimately, improved outcomes for individuals at risk of AD.
Investigator:
Brown, Rebecca
Institution:
University of Pennsylvania
Project Title:
Trajectories of Cognition in Middle Age: Implications for Alzheimer's Disease and Related Dementias in the U.S.
Date of Approval:
March 16, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Polygenic risk scores (PRS) for dementia and aging-related conditions are known to be associated with cognitive outcomes in older age, but little is known about their relationship to mid-life cognitive decline. We plan to use raw genetic data to derive novel PRS from GWAS sources (including Lambert Alzheimer’s disease PRS, with and without APOE; a PRS for coronary artery disease; a longevity PRS) and evaluate their predictive accuracy for cognitive outcomes in middle age relative to existing PRS. Specifically, we want to create a measure of genetic risk associated with three outcomes: age-related cognition; telomere shortening; and methylation/epigenetic clocks. To achieve this, we will combine the HRS Genotype data with other HRS datasets (Harmonized Cognitive Assessment Protocol (HCAP) (2016 Early V1.0); 2008 Telomere Data; Epigenetic Clocks; 2016 Venous Blood Study (VBS)) to which we already have access. Once we have approved NIAGADS genomics data access, we will additionally request access to the HRS-NIAGADS Cross-Reference File (Genotype Data v3, 2006-2012) to link the genomics and HRS datasets. In our ongoing analyses, we would like to update our PRS models by incorporating the most recent GWAS summary statistics. For Alzheimer's disease, this requires access to the full summary statistics from the Kunkle et al., 2019 GWAS. We would also like access to the full summary statistics from the Farrell et al., 2024 GWAS and the Rajabli et al., 2025 GWAS to identify genetic modifiers of tauopathy by comparing progressive supranuclear palsy GWAS results with cross-ancestry Alzheimer’s disease GWAS results.
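For readers unfamiliar with how a PRS is derived from GWAS summary statistics, the basic form is a weighted sum of effect-allele dosages. The sketch below assumes the simplest unpruned, unthresholded version; the variant identifiers, weights, and function name are hypothetical and do not come from any of the GWAS named above.

```python
def polygenic_risk_score(dosages, weights):
    """Weighted sum of effect-allele dosages over variants present in both inputs.

    dosages: variant ID -> effect-allele dosage (0, 1, 2, or an imputed fraction)
    weights: variant ID -> per-allele effect size from GWAS summary statistics
    Variants missing from either mapping are skipped.
    """
    shared = dosages.keys() & weights.keys()
    return sum(dosages[v] * weights[v] for v in shared)


# Hypothetical variant IDs and effect sizes, for illustration only
person = {"var1": 2, "var2": 1, "var3": 0}
gwas_weights = {"var1": 0.5, "var2": 0.25, "var4": 0.75}
print(polygenic_risk_score(person, gwas_weights))
```

Real pipelines typically add steps this sketch omits, such as allele alignment between datasets, linkage disequilibrium pruning or shrinkage, and handling of the APOE region separately.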
Non-Technical Research Use Statement:
There is evidence to suggest that differences in people’s genetic code might contribute to differences in age-associated cognitive changes. For example, some people develop memory problems in middle age, and other people experience no changes in memory. Researchers think this may be partially explained by differences in people’s genetic code. We might be able to predict who could experience age-related cognitive changes based on their DNA sequence. If we know which people have experienced memory problems, we can see what their DNA has in common compared to the DNA of people who don’t have any memory problems. Then, we can test this by looking at the DNA of a different group of people; evaluating if their DNA has the same things in common as the group of people with memory problems (vs. no memory problems); and predicting whether they will develop memory problems. The long-term goal of this work is to help identify people who might be at risk for developing memory problems and help them access preventative care or interventions to minimize future cognitive impairment.
Investigator:
Carpanini, Sarah
Institution:
Cardiff University
Project Title:
HLA typing of WES data from Alzheimer's disease cases and healthy controls
Date of Approval:
January 27, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
We have recently undertaken association testing between HLA haplotype and AD risk using higher-resolution whole-exome sequencing (WES) data, a warranted approach given how similar HLA alleles are to one another. Previous studies have fine-mapped the HLA region in AD, but this is incredibly difficult in regions of high linkage disequilibrium and requires large sample sets. We have utilized a dataset of >2200 AD cases and controls with available WES data and employed an algorithm to HLA type our dataset bioinformatically, validated against a PCR-based HLA typing service for 11 main HLA genes. We now wish to validate our findings in a replication cohort. We will HLA type the ADSP discovery cohort of 5,096 AD cases and 4,965 controls to further explore the HLA association with AD. We will use the algorithm HLA-HD (PMID: 28419628) to HLA type all individuals. Including sex as a covariate, we will test for an association between HLA type and AD using logistic regression modelling, assessing statistical significance using permutation testing.
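The permutation step described above can be sketched as follows. This is a simplified stand-in, not the investigators' method: it uses the absolute difference in allele-carrier frequency between cases and controls as the test statistic rather than a logistic regression coefficient, and it does not adjust for sex. The function name, parameters, and toy data are hypothetical.

```python
import random


def permutation_p_value(carrier, is_case, n_perm=2000, seed=0):
    """Permutation p-value for a carrier-frequency difference between groups.

    carrier: list of 0/1 flags, 1 if the individual carries the HLA allele
    is_case: list of 0/1 flags, 1 for AD cases and 0 for controls
    """
    rng = random.Random(seed)

    def stat(labels):
        cases = [c for c, lab in zip(carrier, labels) if lab]
        ctrls = [c for c, lab in zip(carrier, labels) if not lab]
        return abs(sum(cases) / len(cases) - sum(ctrls) / len(ctrls))

    observed = stat(is_case)
    labels = list(is_case)
    hits = 0
    for _ in range(n_perm):
        # Shuffling case/control labels simulates the null of no association
        rng.shuffle(labels)
        if stat(labels) >= observed - 1e-12:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero
    return (hits + 1) / (n_perm + 1)


# Toy data: every case carries the allele, no control does
carrier = [1] * 10 + [0] * 10
is_case = [1] * 10 + [0] * 10
print(permutation_p_value(carrier, is_case))
```

Shuffling labels preserves the group sizes, so each permuted dataset has the same number of cases and controls as the observed one.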
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is the most common form of dementia, and it is known that the immune system plays a major role in the development of disease. An individual's risk for developing AD is partially determined by changes in their genes. Recent studies comparing the genetic information from healthy individuals and those with AD have identified that small changes in genes alter risk of developing disease. The genes encoding the Human Leukocyte Antigens (HLA) are some of these disease-influencing genes. The HLA genes encode proteins that tell the difference between your own cells and foreign cells. HLA genes are very difficult to study as they come in many different forms. In the brain, HLA genes are mainly found on microglia (the immune cells of the brain). We have already used a dataset of >2000 participants living with AD and elderly well controls and identified novel changes in HLA genes that alter risk of developing AD. We would now like to replicate our findings in another dataset and request access to the ADSP discovery cohort.
Investigator:
Carrasquillo, Minerva
Institution:
Mayo Clinic Florida
Project Title:
Targeted plasma proteome, transcriptome, and e/pQTL analyses identify potential novel biomarkers for Alzheimer's disease in African Americans
Date of Approval:
February 3, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
The objective of this study is to identify genetic variants that influence plasma levels of transcripts and proteins that could serve as diagnostic blood-based biomarkers of Alzheimer’s disease in African Americans. Plasma samples were collected from the Florida Consortium of African Americans Alzheimer’s disease studies (FCA3DS). Targeted DNA sequencing was performed across ten Alzheimer’s disease-associated loci to assess genetic variation. Plasma transcript levels and total tau protein concentrations were previously measured in this cohort using a custom nanoString panel and Simoa assays. Plasma proteome data were generated using the SomaScan 7k platform. Phased haplotypes generated with SHAPEIT4 were used to infer local ancestry via RFMix v2, referencing five superpopulations from the 1000 Genomes Project. Using ancestry-specific allelic dosages estimated with Tractor, association testing was conducted between genetic variants and molecular endophenotypes. Receiver operating characteristic analysis was performed to evaluate the additive classification accuracy of age, sex, and the most significant expression/protein QTLs in distinguishing Alzheimer’s disease cases from cognitively unimpaired controls. We plan to utilize the requested dataset to validate our findings as these datasets include plasma proteome measurements, pQTL summary statistics, and plasma cell free RNA transcriptomics from healthy control, presymptomatic AD, early symptomatic AD, and symptomatic AD participants.
Non-Technical Research Use Statement:
African Americans are historically underrepresented in Alzheimer’s disease research and may have disproportionately limited access to neurology clinics where Alzheimer’s disease is often diagnosed and treated. Blood-based biomarkers could offer a means for patients to obtain accessible and accurate diagnostic information related to Alzheimer’s disease. This study seeks to discover novel blood-based biomarkers to aid in diagnosis of Alzheimer’s disease specifically for African Americans.
Investigator:
Carter, Gregory
Institution:
The Jackson Laboratory
Project Title:
Prioritization of Genetic Variants Contributing to Late-Onset Alzheimer’s Disease
Date of Approval:
January 6, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
The goal of this collaborative project is to identify and prioritize genetic variations that contribute to late-onset Alzheimer’s disease (AD), with a particular focus on developing experimental models to understand the biology of and develop new treatments for AD. We use a variety of approaches to analyze high-throughput genomic and genetic data, including statistical methods for identifying causal variants and genetic interaction networks, as well as epigenetic analyses of cellular regulation. By investigating and quantifying the effects of these mutations individually and interactively, we hope to begin to understand the biology of AD and unravel the complexity of AD genetic risk. Our analysis will use the human genetic data in the Alzheimer’s Disease Sequencing Project (ADSP), from which we will identify high-priority candidate genes and variants for further study in experimental models. Thus the goals of this work are twofold: to understand how brain irregularities progress into full AD, and to provide the research community with a valuable mouse model for further studies and therapeutic testing. The described research will pilot novel analytical tools for the study of the biology of AD and provide insights into the genetics of neurodegeneration. We will use independent evidence to identify candidates with putative functional roles. Transcriptome data from Alzheimer’s studies will be used to identify candidate genes, and ENCODE and other regulatory data will be used to identify putative regulatory regions. Variants will be ranked based on computationally predicted mutation severity and differential expression in AD. Candidates in regulatory regions will be prioritized based on expression differences in nearby genes. Finally, functional data sources such as Gene Ontology, Allen Brain Atlas, and mouse phenotypes will be used to determine the potential role in neurodegeneration. The result will be a list of candidate variants prioritized for study in models.
Since these data will be integrated with a focus on the role of individual genes generated from a population-wide analysis, we do not foresee creating any additional risk to individual participants.
Non-Technical Research Use Statement:
While many genetic loci have been identified as contributing to the risk of late-onset Alzheimer’s disease, the biological underpinnings and interdependence of these mutations are generally poorly understood. By investigating and quantifying the effects of these mutations individually and interactively, we hope to begin to unravel the complexity of genetic risk. Data from the Alzheimer’s Disease Sequencing Project will be essential in this approach, allowing us to identify the best candidate genes and perform computational analysis to design advanced experimental models of AD. These models will be an experimental basis for understanding the biology of Alzheimer’s and performing early-stage testing of candidate therapeutics.
Investigator:
Castel, Stephane
Institution:
Variant Bio, Inc.
Project Title:
Global Characterization of Germline Immune Variation
Date of Approval:
October 1, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Genetic variation in immune-related genes, including HLA, Fc receptors, and T/B cell receptors, significantly influences vaccine response and susceptibility to infectious diseases. For example, HIV vaccine efficacy has been linked to immunoglobulin genotypes and B cell profiles. Similarly, T cell vaccine design depends on accurate knowledge of HLA allele distribution, yet global data remain incomplete, particularly for non-European populations. To address this, Variant Bio, with Gates Foundation support, is conducting a study to map global immune gene diversity. These genes are underrepresented in current datasets due to both limited sequencing in diverse populations and the technical complexity of these highly variable, repetitive regions. We are benchmarking bioinformatic methods for detecting germline variation in immune genes using WGS data, comparing short-read results to targeted and long-read datasets, including fully phased genomes. We aim to analyze WGS data from diverse global cohorts, including the LASI-DAD study and other public and internal Variant Bio datasets. Population-level allele frequencies will be shared via peer-reviewed publication and databases such as The Allele Frequency Net Database, supporting broader immunogenomics research and vaccine development. Specific Aims: 1) Benchmark methods for detecting variation in HLA, Fc receptor, and T/B cell receptor genes. 2) Develop a global map of germline immune variation by geography and ancestry. 3) Publicly share allele frequencies and methods to enable downstream application. Study Design: This retrospective analysis will use existing WGS datasets. No new sample collection is planned and no phenotypic data is being requested. Bioinformatic Plan: WGS data will be processed using the GATK Best Practices pipeline and imputed/phased using Beagle. Targeted immune loci will be analyzed with specialized software. Methods will be validated against long-read and haplotype-resolved genome references.
Emphasis will be placed on characterizing accuracy across ancestries, particularly underrepresented groups.
Non-Technical Research Use Statement:
Variation in immune genes, including HLA, T cell receptors, B cell receptors, and Fc receptors, plays a key role in how people respond to infections and vaccines. Yet most existing genetic data comes from individuals of European ancestry, limiting our understanding of global immune diversity. This Gates Foundation–funded project aims to characterize immune gene variation in underrepresented populations. Using whole genome sequencing data, including from the LASI-DAD study, we will analyze variation in key immune genes to build a broader map of global diversity. Advanced computational methods will identify variants in these complex regions, benchmarked against newer sequencing technologies and complete genomes. Findings, including population-level allele frequencies, will be shared via publications and public databases. This will support more equitable vaccine development, especially for T cell–based vaccines that rely on accurate HLA allele data.
Investigator:
Chan, Kwun Chuen Gary
Institution:
University of Washington
Project Title:
Genetic Architecture of Alzheimer’s Disease Neuropathology and Clinical Severity Using NACC Phenotypes and Admixture-Informed Genome-wide association studies
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Genome-wide association studies (GWAS) have identified over 80 loci influencing Alzheimer’s disease (AD) risk, yet their impact on quantitative neuropathological traits (such as neuritic and diffuse plaque density, Braak staging of neurofibrillary degeneration, cerebrovascular pathology, and synaptic integrity) remains poorly defined. Similarly, the extent to which these variants contribute to clinical heterogeneity, including dementia severity, cognitive trajectory, and resilience, is not fully understood. Genetic ancestry and population structure further complicate interpretation, as most GWAS have been conducted in European-ancestry cohorts, potentially masking ancestry-specific effects or inflating signals due to unmodeled admixture. We hypothesize that: 1) Established AD risk loci exert pleiotropic effects on specific neuropathological features, independent of overall disease risk; 2) Genetic ancestry and local admixture modulate the effect size and penetrance of these variants on both pathology and clinical outcomes; and 3) Integrating fine-scale genetic structure with quantitative trait GWAS will enhance power, reduce bias, and reveal population-specific mechanisms. Aim 1: Conduct ancestry-aware GWAS of neuropathological traits using NACC postmortem data harmonized with ADSP genetic data. Aim 2: Investigate local genetic admixture and its interaction with AD risk loci. Aim 3: Integrate genetic, pathological, and clinical data to model disease heterogeneity. We anticipate the following: 1) Novel locus-trait associations linking AD risk variants to specific neuropathological outcomes; 2) Evidence of ancestry-dependent genetic regulation of AD pathology, informing inclusive trial design; and 3) A framework for dissecting clinical heterogeneity through genetically informed pathological subtypes.
These findings will refine mechanistic models of AD progression, prioritize therapeutic targets with pathology-specific efficacy, and improve risk prediction across diverse populations.
Non-Technical Research Use Statement:
This project aims to better understand how genes linked to Alzheimer’s disease relate to the specific brain changes seen in the condition and why people with the same genetic risk can experience different symptoms or rates of decline. Using clinical and neuropathological data from the National Alzheimer’s Coordinating Center and genetic information from the Alzheimer’s Disease Sequencing Project, we will examine whether known Alzheimer’s‑related genetic variants are associated with measurable neuropathological features such as plaque and tangle burden, blood vessel changes, and markers of synaptic health. We will also study whether these genetic effects differ across individuals with different ancestries or mixed genetic backgrounds. By combining genetic, clinical, and brain‑based information, our goal is to clarify how genetic factors contribute to disease heterogeneity and to improve understanding across diverse populations.
Investigator:
Chang, Timothy
Institution:
University of California, Los Angeles
Project Title:
Rare Genetic Risk and Gene Networks in Tauopathy
Date of Approval:
December 23, 2021
Request status:
Closed
Research use statements:
Technical Research Use Statement:
The objectives of this proposal are to identify rare genetic risk factors that are unique to or shared by Alzheimer’s disease (AD) and Progressive Supranuclear Palsy (PSP). We hypothesize we will identify a set of rare genetic risk factors associated with AD and another set of rare genetic risk factors associated with PSP, some of which may be shared between AD and PSP. To determine shared or unique genetic risk, we will compare AD to controls and PSP to controls. Both analyses will use the same controls and have a similar number of cases. AD subjects will be included from the Alzheimer’s Disease Sequencing Project, the Alzheimer’s Disease Neuroimaging Initiative, and the Accelerating Medicines Partnership – Alzheimer’s disease. From these studies, we will include adult controls. We will use roughly 1900 whole genomes from PSP subjects. Given the availability of AD sequencing, we will replicate the association of rare genetic risk with an independent holdout AD dataset, which will include AD and controls from the ADSP Follow Up Phase. We will also validate our findings in multi-ethnic cohorts from the ADSP. Traditional rare variant analyses have limited power due to the large number of variants and small variant effect size. Although one solution is to group variants into genes, genes do not act in isolation, but rather interact with one another in networks. Grouping variants in a network can improve power. Additionally, since most genetic risk lies in large noncoding regions of the genome, focusing analyses on noncoding regulatory regions should further increase power. By incorporating network connectivity in rare variant statistical tests and prioritizing functional noncoding variants, we will identify rare genetic risk factors in AD and PSP while overcoming deficiencies in traditional methods.
Analyzing rare variants in protein coding, promoter and distal noncoding regions, we will compare the proposed network and non-coding prioritization methods to traditional gene-based, unprioritized non-coding methods.
Non-Technical Research Use Statement:
Neurodegenerative diseases including Alzheimer’s disease and Progressive Supranuclear Palsy are characterized by abnormal tau protein accumulation and do not currently have disease modifying treatments. Analyzing whole genome sequencing with novel genomic and genome informatic methods may identify rare genetic risk factors that lead to these diseases. The shared or unique rare genetic risk factors of Alzheimer’s disease and Progressive Supranuclear Palsy may become future therapeutic targets.
Investigator:
Chang, Timothy
Institution:
University of California, Los Angeles
Project Title:
PSP and CBD Genetics
Date of Approval:
September 3, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
We plan to analyze whole exome and whole genome sequence data generated from subjects with progressive supranuclear palsy (PSP), corticobasal degeneration (CBD), Alzheimer's disease (AD) and elderly normal controls. The goal is to detect mutations and variants that cause, contribute to risk of, or protect against PSP and/or CBD. We want to compare PSP and CBD genotypes to those from AD and normal controls sequenced by the Alzheimer's Disease Sequencing Project. We would like both whole genome and whole exome data from the Alzheimer's Disease Sequencing Project for AD and normal controls. We would also like whole genome and whole exome data for PSP and CBD generated by the PSP and Tau consortiums. We will use these data to distinguish mutations and variants associated with PSP or CBD from benign variants. All PSP and CBD subjects being sequenced are deceased. The requested data sets will have variants recalled as a batch and combined to evaluate allele frequencies of called variants. The AD and control variant frequencies will then be compared to allele frequencies from PSP and CBD subjects as described above. We will also compare structural variants (insertion-deletions, copy number variants, and chromosomal rearrangements) identified in PSP and CBD subjects to those found in AD and in cognitively normal controls in order to determine structural variants involved in PSP and CBD pathogenesis. All of the listed investigators will be using a joint-called VCF generated from the requested data sets. PSP is a neurodegenerative disease closely related to Alzheimer's disease (AD). PSP, CBD and AD have neurofibrillary tangles as part of the signature neuropathology defining these disorders. PSP and CBD are considered Alzheimer’s Disease Related Disorders (ADRD).
Non-Technical Research Use Statement:
We are attempting to identify all the inherited elements that contribute to progressive supranuclear palsy (PSP) and corticobasal degeneration (CBD) risk. To do this we will analyze DNA sequence data from subjects with AD, PSP, CBD, and subjects who are cognitively normal. The sequence data from these groups will be compared to identify differences that contribute to the risk of developing PSP and CBD, or that protect against these diseases. These DNA differences can be at a single site in the genetic code, or can span multiple sites, changing the copy number of DNA sequences. Both types of genetic variants will be examined.
Investigator:
Chang, Timothy
Institution:
University of California, Los Angeles
Project Title:
Genetic Risk in LASI-DAD
Date of Approval:
September 2, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
We intend to utilize genetic data available in the LASI-DAD cohort, including whole-genome sequencing and TOPMed imputed data, to conduct genetic association studies. In our previous work, detailed in the paper "Improving Genetic Risk Modeling of Dementia from Real-World Data in Underrepresented Populations", we developed polygenic risk scores using electronic health records from UCLA Health across diverse ancestry groups (Hispanic Latino American, African American, and East Asian American). Our objective in using the LASI-DAD genetic data is to examine the correlation of dementia-related genetic risk models with dementia diagnoses and cognitive function. We plan to investigate the association between these genetic risk scores and the five broad cognitive domains (orientation, memory, executive functioning, language/fluency, and visuospatial function). These analytical results can be compared to our and others’ previous work.
Non-Technical Research Use Statement:
The research will examine how genetic factors affect memory and cognitive abilities as individuals grow older. Using the data from the LASI-DAD study, which includes comprehensive genetic information, we aim to explore whether genetic risks for dementia are linked to actual dementia diagnoses and specific cognitive functions. Utilizing the LASI-DAD data would help us compare the results with our previous work, where we developed genetic risk models for dementia in diverse groups like Hispanic Latino, African American, and East Asian American populations. Gaining insight into these differences may enhance our understanding of genetic influences on dementia across diverse populations, contributing to improved diagnosis.
Investigator:
Chen, Han
Institution:
New York University
Project Title:
Efficient IBD mapping for Alzheimer's Disease and related brain imaging phenotypes
Date of Approval:
June 15, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Alzheimer’s Disease (AD) is a highly heritable complex human disease, but only a limited number of its genetic risk factors have been identified. In this project, we propose to leverage identical-by-descent (IBD) segments in genetic association mapping for AD and related brain imaging endophenotypes. IBD segments are expected to better tag haplotype effects and to provide genetic information complementary to the additively coded genotypes commonly used in genome-wide association studies. First, we will develop the statistical and computational framework for efficient genetic association mapping using IBD segments in large samples. We will then apply IBD mapping to identify novel genomic regions associated with AD and related brain imaging endophenotypes from the Alzheimer’s Disease Sequencing Project (ADSP), and combine association evidence from the UK Biobank and ADSP. We expect our novel tools and findings from this project to be made publicly available to facilitate novel discoveries that improve our understanding of the genetic architecture of AD and related brain imaging data.
Non-Technical Research Use Statement:
Identical-by-descent segments inferred from genome-wide data provide information on population history and genetic relatedness that is not sufficiently used in current genetic association approaches. We propose to leverage identical-by-descent segments in genetic association mapping for Alzheimer’s Disease and related brain imaging endophenotypes, to develop efficient statistical methods, and to implement novel computational algorithms that can be applied to large samples. We expect our tools and findings to facilitate genetic research and provide novel insights into the genetic architecture of Alzheimer’s Disease.
Investigator:
Chen, Jingchun
Institution:
University of Nevada, Las Vegas
Project Title:
Classification of Alzheimer’s disease with Genetic Data and Artificial Intelligence
Date of Approval:
November 14, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Alzheimer's disease (AD) is the most common cause of dementia, accounting for 60% to 80% of cases and affecting over six million people in the United States. The disease gradually progresses from mild cognitive impairment (MCI) to dementia, which takes more than a decade. Identifying individuals at high risk of AD earlier is essential for AD prevention and intervention. As the heritability of AD is high (up to 79%), genetic data should be powerful for identifying individuals at high risk. Indeed, the polygenic risk score (PRS), designed to estimate individual genetic liability by integrating large GWAS summary statistics and individual genotype data, has been shown to be promising for AD risk prediction (AUCs up to 84%). However, the prediction accuracy of a single PRS is still not sufficient for MCI and AD classification in clinical practice. We hypothesize that convolutional neural network (CNN) models can improve the classification of AD and MCI by integrating PRSs from multiple traits, multi-omics data (genotyping data, scRNA-seq), clinical data, and imaging data. The objective is to develop advanced AI algorithms and build data-driven models for disease risk assessment, enabling earlier identification of individuals at high risk for MCI and AD. Our long-term goal is to develop and validate a prediction model that can be translated into clinical practice. Our CNN model has recently shown improved performance for AD with PRSs from multiple traits (AUC 92.4%). We want to extend our approach to predicting AD and MCI in different ethnic groups and validate the results with independent datasets. To this end, we would like to apply for multi-omics data in NG00067.v9 from https://dss.niagads.org/datasets/ng00067/. With extensive experience in genetic studies on complex disorders and disease modeling, we are confident that we will achieve the specified goals and promote the integration of genetic data with AI algorithms, facilitating data-driven, personalized care of AD.
We expect to finish this study within 2 years with publication and grant application. We have IRB approval and will follow the rules for data sharing and acknowledgment.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD), the most common form of dementia, usually develops from mild cognitive impairment to dementia. There is currently no treatment to slow the progression of this disorder, but earlier identification of individuals at higher risk may be critical to preventing the disease. We propose a new approach to create models for classification of AD and MCI with artificial intelligence and genetic data. This study will have significant value in personalized medicine for AD risk assessment, classification, and earlier intervention. We have no planned collaborations with researchers outside Cleveland Clinic in the current analytic plans.
Investigator:
Cheng, Feixiong
Institution:
Cleveland Clinic
Project Title:
A Multimodal Infrastructure for Alzheimer’s MultiOme Data Repurposing: Artificial Intelligence, Network Medicine, and Therapeutics Discovery
Date of Approval:
May 22, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
We propose to develop capable and intelligent computer-based toolboxes that enable searching, sharing, visualizing, querying, and analyzing genetics, genomics, multi-omics, and clinical data for AD. The central unifying hypothesis of this project (1U01AG073323-01, pending Council meeting in May 2021) is that a genome-wide, multimodal artificial intelligence (AI) framework to identify novel risk genes and networks from human WGS/WES and multi-omics findings will offer drug targets for targeted therapeutic development in AD. Aim 1 will identify rare coding variant-based risk genes using a sequence and structure-based deep learning model. Aim 2 will identify rare non-coding variant-based risk genes using a multiple kernel learning approach. Aim 3 will test whether GWAS common variants linked to AD pathobiology and endophenotypes are enriched in gene regulatory networks in a cell-type specific manner using a Bayesian framework. These analyses will leverage variants from ethnically diverse WGS/WES and clinical data (i.e., imaging, biomarkers, and cognitive measures) from the Alzheimer's Disease Sequencing Project (ADSP), and publicly available chromatin interactomic data from NIH RoadMap, FANTOM5, and NIH 4D Nucleome. We will validate our findings using WGS/WES data and protein expression data from our existing cohorts: The Cleveland Clinic Lou Ruvo Center for Brain Health Aging and Neurodegenerative Disease Biobank (CBH-Biobank) and the Cleveland Alzheimer's Disease Research Center (CADRC). We will compile information for clinical data harmonization, including functional imaging, AD biomarkers, and cognitive measures for all integrative analyses. No PHI will be collected or used in the data analysis. We have no planned collaborations with researchers outside Cleveland Clinic in the current analytic plans.
Non-Technical Research Use Statement:
It is estimated that more than 16 million people with AD will live in the United States by 2050, and the predisposition to AD involves a complex, polygenic, and pleiotropic genetic architecture. This project will develop intelligent computer-based network medicine and systems biology tools, capable of identifying and validating human genome sequencing findings for novel risk gene discoveries and targeted therapeutic development in AD. The innovative network-based, artificial intelligence toolboxes and novel risk genes and biologically relevant targeted therapeutic approaches developed in this proposal will prove to be novel and effective ways to improve outcomes in long-term brain care for the rapidly growing AD population, an essential goal of AD precision medicine.
Investigator:
Cochran, Nick
Institution:
HudsonAlpha Institute for Biotechnology
Project Title:
Effects of genetic variation in regulatory regions near ADRD-associated genes and replication of genetic findings in early-onset dementias
Date of Approval:
March 30, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
We seek access to ADSP data for two purposes. First, my lab is focused on understanding regulatory mechanisms of rare non-coding variation in neurodegeneration-associated loci. As a part of these efforts, we have generated data internally including 10X multiomics (matched single nucleus RNA-seq and ATAC-seq) and genome-wide restriction fragment resolution HiC data, which, along with publicly available data, we have used to nominate regulatory elements for neurodegeneration-associated genes. One question is if genetic variation in cases is enriched in these regulatory regions compared to controls, which we will use ADSP data to assess. As a second independent effort, we are a part of collaborations with the Yokoyama lab at UCSF and the Kosik lab at UCSB as well as The Multi-Partner Consortium to Expand Dementia Research in Latin America (ReDLat) to analyze genomes for early onset Alzheimer’s and frontotemporal dementia cohorts compared to unaffected controls. A critical part of these efforts is replication of any findings in independent cohorts. Access to Alzheimer's Disease Sequencing Project (ADSP) data is ideal for this purpose. We will analyze ADSP data for association signals identified in our independent cohorts using either single variant or burden analysis approaches. Phenotypic characteristics that will be evaluated in association with genetic variants will be either case/control status or age of symptom onset as available. Although we conduct these projects as collaborations, this application is for analysis of ADSP data at HudsonAlpha. We seek access to all consent groups. Our research does not include the study of population origins or ancestry, and thus qualifies for HMB-designated studies. Our research is also applicable to each of the disease-specific (DS) categories: for example, we are interested in effects in Alzheimer's and related dementias as well as other phenotypes related to aging, brain and memory. 
We also note that this analysis involves methods development research (MDS) (new approaches to understand effects of non-coding variation). Finally, this is a genetics study only (GSO).
Non-Technical Research Use Statement:
Our lab aims to understand the function of the genome to gain confidence in the precise way in which genetic changes lead to risk for disease. We are working to identify stretches of DNA near genes associated with Alzheimer’s and related dementias and/or aging, brain and memory that may be involved in turning these genes on and off. A key test is whether genetic changes in these regions are enriched in people with disease, which we will use these ADSP data to assess. In a second project, we work with members of The Multi-Partner Consortium to Expand Dementia Research in Latin America (ReDLat) to analyze the DNA from patients with early onset Alzheimer’s and frontotemporal dementia. A critical part of this type of work is checking to see if findings from one set of patients are reproducible in different sets of patients. Access to ADSP data would allow us to answer this question. The types of data that will be evaluated in association with genetics will be either whether the individuals assessed have disease or not, or whether their genetics affects when they develop disease.
Investigator:
Conley, Dalton
Institution:
Princeton University
Project Title:
The sociogenomics of human phenotypes: How social and biological factors jointly shape individual behaviors and outcomes related to socioeconomic attainment and demographic outcomes.
Date of Approval:
October 24, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Genetics has been increasingly integrated into research on sociological topics such as fertility, well-being, risk-taking, and longevity (Mills and Tropf, 2020). While the existing literature has established heritability of a set of sociopsychological, behavioral, and health outcomes, estimates from different cohorts or study designs (e.g., twin and family studies, GWAS, and SNP heritability) differ substantially. Missing/hidden heritability prompts discussion on the optimal methodological approach. We seek to understand how the use of family designs impacts the validity of genetic effect estimates. Specifically, we will compare the performance of classic and family-based GWAS and their downstream polygenic scores (PGSs) in predicting a rich set of sociopsychological, behavioral, and health outcomes. Additionally, we will explore to what extent family-based GWAS results yield increased portability to diverse and admixed ancestries. Another important area is gene-environment interactions (G×E). G×E research has employed diverse approaches (Miao et al., 2022), such as differential heritability/variance, genetic correlation, and mean/variance PGS (Johnson et al., 2022) analyses. Despite this, limitations remain; many (early) G×E studies fail to properly control for potential confounders (Keller, 2014). Moreover, which G×E mechanisms, e.g., outcome moderation (i.e., Domingue et al. 2020) vs. variability moderation, underlie the effects is poorly understood. In addition, little is known about the extent to which social changes serve to modify associations between genetic ancestry and self-identified/reviewer classified race. We aim to apply recent methodological advances to the multiple research gaps described above. We will examine a rich set of variables, including SES, early-life experiences, physical development, mental health, medical conditions, and mortality. This work will be collaborative with Professor Sam Trejo, also of Princeton University (Sociology).
Non-Technical Research Use Statement:
This project aims to increase understanding of how genetic and socioenvironmental factors interactively affect social, behavioral, and health outcomes, with an eye towards gaps in the research literature. For one thing, existing efforts at quantifying the genetic effects on individual behaviors/outcomes have sometimes come to substantially different estimates. For another, many existing G×E studies have been improperly designed to answer their intended research question, and few of them have specifically examined which G×E mechanisms explain the observed patterns. This project can help us better understand the biosocial underpinnings of a rich set of individual outcomes and inform policies aimed at reducing social/health disparities. Our research improves the development of tools that identify individuals for early intervention, suggests how the DNA characteristics of a population may influence the effectiveness of health policies, and facilitates evidence-based policymaking that considers not only socioenvironmental factors but also their interactions with one’s genes.
Investigator:
Coppola, Gianfilippo
Institution:
Yale University
Project Title:
AD subtypes
Date of Approval:
March 23, 2023
Request status:
Expired
Research use statements:
Technical Research Use Statement:
Alzheimer’s disease (AD) is a highly heterogeneous disease with diverse clinical manifestations and rates of progression. Identifying AD subtypes will improve our understanding of the underlying disease mechanisms and enable us to predict disease trajectory and develop new disease-modifying treatments. Objective. To identify distinct AD subtypes from WGS data of AD individuals. Analysis plan. We will use 3000 WGS data derived from the ADSP Discovery Case-Control Based Extension Study. We will use the available SNVs and INDELs and infer structural variants (SVs) with our in-house multi-caller pipelines. Rare variants will be retained for further analysis. We will then split the dataset into training and test sets, and use the identified set of genetic variants (i.e. SNVs, INDELs and SVs) as input to a deep neural network (an autoencoder architecture) to learn an unsupervised latent representation of the data. AD subtypes will be identified within this reduced space and characterized using demographic and clinical data. We will then contrast each subtype with the control groups to identify subtype-relevant variants (i.e. putative subtype biomarkers), which will be used as input features to a gradient boosted tree model to generate a subtype predictive model and subtype-specific features. Planned collaboration. Each member of the team will devote effort to specific areas of investigation; nevertheless, all the team members will discuss individual progress and potential challenges through regular meetings. In particular, Dr Coppola (Research Scientist, Department of Pathology, Yale University, USA), together with Dr Dean Palejev (Associate Professor, GATE Institute, Sofia University, Bulgaria), will be involved in the deep learning model generation and validation, and subtype identification; Dr Fredrik Johansson (Assistant Professor, Department of Computer Science & Engineering, Chalmers University of Technology, Sweden) will work on the supervised machine learning model; Dr Alexander Schliep (Associate Professor, Department of Computer Science & Engineering, University of Gothenburg, Sweden) will work on the SVs inference.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is a highly heterogeneous disease with diverse clinical manifestations and rate of progression. The heterogeneity of AD has complicated both clinical trial design and outcomes, and thus the need for better models of AD, and/or better strategies for selection of participants into specific clinical trials is evident. The identification of more homogeneous disease subgroups (i.e. AD subtypes) will improve our understanding of the underlying disease mechanisms, enable us to predict disease trajectory and develop new disease-modifying treatments. We will use a comprehensive set of genetic variants in combination with deep learning algorithms to identify AD subtypes. Subtypes will be characterized using clinical and demographic data. Finally, variants specific to each cluster will be identified and used to train a predictive machine-learning model to classify new individuals.
Investigator:
Coppola, Giovanni
Institution:
Regeneron Pharmaceuticals
Project Title:
Genetic investigation in neurodegenerative conditions
Date of Approval:
February 3, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objective: To leverage large-scale electronic health care record data and biomarkers of neurodegeneration to improve gene discovery in Alzheimer’s disease. Study design: The goal of our work is to integrate common and rare genetic variation across multiple cohorts with the genetic data collected in the ADSP to improve gene discovery in Alzheimer’s disease (AD) and related neurodegenerative conditions. We plan to perform both genome-wide and exome-wide association analyses (GWAS/ExWAS) with phenotypes including any dementia, AD, mild cognitive impairment (MCI), age of onset for cognitive decline, and quantitative readouts of neurodegeneration including cognitive tests and brain MRI measures where available. We will integrate our findings with expression quantitative trait loci and single cell gene expression to understand pathways and mechanisms modulating risk for these phenotypes. Analysis plan: First, we will integrate data from ADSP with electronic health and genetic data at the Regeneron Genetics Center (RGC) to harmonize both genetic variation and phenotype data. Next, we will use standard approaches to perform GWAS/ExWAS, meta-analysis across cohorts, and post-GWAS analyses to annotate our findings across traits. All data will remain anonymized and securely stored; we will not share any of the individual-level data outside of Regeneron or beyond the researchers on our application. We have a secure computational environment to store these data and IT staff dedicated to ensuring we comply with the necessary requirements delineated by NIAGADS.
Non-Technical Research Use Statement:
Therapeutic development in Alzheimer’s disease has greatly benefitted from understanding the genetics of this disorder. However, the role of rare genetic variation and the impact of genetics on biomarkers in Alzheimer’s remain mostly unknown. Our goal is to develop new therapies, and the data from the ADSP will help us prioritize potential molecules and pathways to pursue in Alzheimer’s disease and related conditions.
Investigator:
Corces, Michael Ryan
Institution:
Gladstone Institutes and UCSF
Project Title:
WGS Rare Variants
Date of Approval:
March 31, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objective: To nominate putatively functional rare noncoding variants in AD.
Study Design: We have developed a novel pipeline for noncoding variant prioritization that combines principles from statistical genetics, gene regulation, and machine learning. We have previously used this type of pipeline to prioritize noncoding variants within known genetic risk loci for AD (PMID: 33106633). In brief, this pipeline:
1. Identifies all known LD-expanded variants that have been significantly associated with AD.
2. Filters these variants for those overlapping gene regulatory elements in specific cell types of the brain.
3. Uses machine learning to predict which variants will have strong effects on transcription factor binding.
4. Uses functional genomics technologies including massively parallel reporter assays and CRISPR-based genome editing to pinpoint which of the nominated variants have validated functional effects.
So far, this pipeline has only been applied to common variants identified by GWAS, but we aim to apply this same methodology to nominate functional noncoding rare variants. With this in mind, we will:
1. Download all ADSP WGS datasets to identify all variants discovered in AD cases and controls.
2. Annotate each variant with its frequency both within the ADSP cohorts and within the general population using resources such as gnomAD and TOPMed.
3. Input rare variants implicated in AD (i.e. either only observed in AD or observed more frequently in AD than in the general population) into the above-described pipeline to functionally validate a subset of rare variants as putative noncoding drivers of disease.
4. Link any putative functional variants to their cell type-specific target genes.
The result of this work would be a list of variants with putative functional roles in AD and their putative target genes. In this study, we do not plan to associate any phenotypic characteristics other than AD vs Non-AD. No collaboration is anticipated.
Non-Technical Research Use Statement:
Alzheimer's disease (AD) is driven by both genetic and non-genetic factors. Previous studies have estimated that ~60% of the susceptibility to AD can be attributed to genetic factors. This heritability can be roughly evenly divided between (i) common genetic variation and (ii) rare or structural variation. Of the rare genetic variation driving AD, we understand vanishingly little. This is because >90% of rare genetic variants lie within the noncoding regions of the genome. These noncoding regions harbor gene regulatory elements but do not code for proteins. As we lack a fundamental understanding of how genetic variation impacts the noncoding genome, it has remained challenging to predict which of these noncoding rare variants might have functional effects in AD. We have developed a pipeline to prioritize these rare noncoding variants using a combination of epigenetics and machine learning approaches. We will use this pipeline to nominate putatively functional rare noncoding variants and then use functional genomics assays to validate these predicted effects.
Investigator:
Crane, Paul
Institution:
University of Washington
Project Title:
AI4AD (Artificial Intelligence for Alzheimer’s Disease): Ultrascale Machine Learning to Empower Discovery in Alzheimer’s Disease Biobanks
Date of Approval:
October 8, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to genomic, imaging and cognitive data, in order to 1) identify AD genotypes and endophenotypes that dissect AD’s heterogeneity; 2) relate said genotypes and endophenotypes with clinical progression in pre-dementia patients; 3) identify novel treatment targets for AD by analyzing whole genome and associated phenotypic data. The goals of this multisite initiative (Paul Thompson, USC; Christos Davatzikos, Li Shen, Penn; Andy Saykin, IU; Heng Huang, Pitt; Paul Crane, UW; Adam Brickman, Columbia; Tim Hohman, Vanderbilt; Guyngah Jun, BU; Duygu Tosun, UCSF; Alexander Zaranek, Curii) leverage the promise of machine learning (ML) to contribute to precision diagnostics, prognostication, and targeted and novel treatments. We will develop ML and deep learning methods to apply to large scale biobanks of whole genome sequences (WGS), neuroimaging, cognitive, and clinical data, aiming to discover new genomic features that influence biological processes of AD. We will apply methods of genome representation and tiling to WGS repositories to create inputs for AI methods. We will develop novel, interpretable, biological knowledge guided deep learning methods to discover genomic motifs associated with AD, AD risk, and biological processes of AD as defined by NIA-AA criteria. To quantify subtypes and disentangle biological processes of AD, we will apply computational methods to multimodal MRI and amyloid- and tau-sensitive PET to stratify and subtype patient groups; novel imaging genomics methods will detect genomic markers and pathways that modulate the developing pathology as detected in the images, and that predict future clinical decline or resilience.
We hypothesize that advanced deep learning methods combined with whole genome data will outperform traditional methods and GWAS for predicting AD onset and progression, and will assist with disease subtyping and discovering treatable targets in the genome. A team will rank and repurpose existing and identify novel drugs and targets in the genome based on the discovered genetic motifs affecting AD.
Non-Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to extensive and rich genomic, imaging and cognitive data, in order to 1) identify genotypes and endophenotypes of AD that dissect the heterogeneity of the disease; 2) relate these genotypes and endophenotypes with clinical progression, in pre-dementia patients; 3) identify novel treatment targets for AD, by analyzing whole genome and associated phenotypic data at a previously impossible scale. Collectively, the goals of this highly collaborative multi-site initiative leverage the promise of machine learning to contribute to precision diagnostics, prognostication, and targeted and novel treatments.
Investigator:
Crane, Paul
Institution:
University of Washington
Project Title:
Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
July 23, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD with the ultimate goal to generate harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Crimmins, Eileen
Institution:
University of Southern California
Project Title:
GWAS and Systems Biology Analyses for Aging-Related Conditions: Longevity and Disease
Date of Approval:
March 31, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Research Use Statement: Our project will rely on phenotype and genotype data from the Health and Retirement Study (HRS), a nationally representative longitudinal study of the older adult population in the U.S. This is an ongoing study. The data we have been using since 2016 come from an approved application through dbGaP, cover 15,507 HRS participants, and include single nucleotide polymorphism (SNP) data on just under 2.5 million markers, imputed data on approximately 21 million DNA variants, and phenotype data on disease incidence and prevalence, functioning, biomarkers, mortality, and environmental and behavioral covariates. Our request from NIAGADS would add genetic data on 3,409 participants to the sample we have been using (yielding N=18,916 total with harmonized genetic data through NIAGADS). Data usage will not create additional risk to participants. Aims of the project are to (1) Identify genetic networks and pathways that influence human aging, disease, functioning, and longevity; (2) Develop predictive models of aging-related health outcomes using information from gene networks; and (3) Examine how social and environmental conditions interact with genes within these aging-related gene networks. We will implement statistical models to test for associations between genetic variants and the same phenotype data. In moving forward with the additional samples, we will use the HRS genome-wide data to examine genetic signatures of healthspan, lifespan, and cognitive aging. Using these genetic signatures, we plan to (i) run pathway enrichment analysis to identify influential biological pathways, (ii) use them for predictive modeling of morbidity/mortality risk and cognitive aging, and (iii) incorporate information from social and behavioral data to examine GxE interactions. The overall goal of the project is to identify mechanistic gene and environment networks that contribute to aging acceleration or deceleration.
Non-Technical Research Use Statement:
Non-Technical Summary: Aging is the largest risk factor for morbidity and mortality. Previous research using animal models or case-control studies of centenarians has suggested that variations in the pace of aging may be partially explained by genetic and genomic differences. However, few genetic regulators of human lifespan and healthspan have been identified. Furthermore, there is reason to suggest that the pace of aging may be a polygenic trait, for which multiple genes form complex networks that collectively influence aging and longevity phenotypes. These complex genetic networks may further interact with exogenous factors, causing variation to arise in health outcomes under diverse environments. The goal of this project is to use advanced statistical modeling techniques to understand how gene-gene and gene-environment interactions influence longevity and aging-related conditions.
Investigator:
Cruchaga, Carlos
Institution:
Washington University School of Medicine
Project Title:
The Familial Alzheimer Sequencing (FASe) Project
Date of Approval:
January 21, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The goal of this study is to identify new genes and mutations that cause or increase risk for Alzheimer disease (AD), as well as protective factors. Individuals and families were selected from the Knight-ADRC (Washington University) and the NIA-LOAD study. Only families with at least three first-degree affected individuals were included. Families with pathogenic variants in the known AD or FTD genes, or in which APOE4 segregated with disease, were excluded. At least two cases and one control were selected per family. Cases had an age at onset (AAO) after 65 years, and controls had an age at last assessment greater than the latest AAO within the family. Whole exome sequencing (WES) and whole genome sequencing (WGS) data were generated for 1,235 individuals (285 families), which together with data from our collaborators and the ADSP family-based cohort (3,449 individuals and 757 families) will provide enough statistical power to identify new genes for AD. Dr. Tanzi (Harvard Medical School) will provide WGS from 400 families from the NIMH Alzheimer disease genetics initiative study. We will perform single variant and gene-based analyses to identify genes and variants that increase risk for disease in AD families. Single variant analysis will consist of a combination of association and segregation analyses. We will run family-based gene-based methods to identify genes that show an overall enrichment of variants in AD cases. We will also look for protective and modifier variants. To do this, we will identify families loaded with AD cases that also include individuals with a high burden of known risk variants who do not develop the disease (escapees). We will use the sequence data and the family structure to identify variants that segregate with the escapee phenotype. The most promising variants and genes will be replicated in independent datasets (ADSP case-control, ADNI, Knight-ADRC, NIA-LOAD).
We will perform single variant and gene-based analyses to replicate the initial findings, and survival analysis to replicate the protective variants. We will select the most promising variants/genes for functional studies.
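The per-family selection criteria described in this statement can be written as a small filter. The sketch below is a simplified illustration, not the study's actual procedure: the `Member` record and `select_from_family` function are hypothetical, and the "three first-degree affected individuals" rule is approximated by a plain count of affected members because no pedigree structure is modeled here.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Member:
    """Hypothetical family-member record (field names are assumptions)."""
    person_id: str
    affected: bool
    age_at_onset: Optional[float] = None         # AAO, for affected members
    age_last_assessment: Optional[float] = None  # for unaffected members


def select_from_family(members: List[Member]) -> Optional[Tuple[List[str], List[str]]]:
    """Return (case IDs, control IDs) for one family, or None if it does not qualify."""
    affected = [m for m in members if m.affected and m.age_at_onset is not None]
    # Simplification: counts affected members without checking that they
    # are first-degree relatives of one another.
    if len(affected) < 3:
        return None
    latest_aao = max(m.age_at_onset for m in affected)
    # Cases: age at onset after 65.
    cases = [m.person_id for m in affected if m.age_at_onset > 65]
    # Controls: unaffected and last assessed at an age beyond the latest AAO in the family.
    controls = [
        m.person_id
        for m in members
        if not m.affected
        and m.age_last_assessment is not None
        and m.age_last_assessment > latest_aao
    ]
    if len(cases) < 2 or len(controls) < 1:
        return None
    return cases, controls


# Made-up example family, not real data.
family = [
    Member("p1", True, age_at_onset=70),
    Member("p2", True, age_at_onset=72),
    Member("p3", True, age_at_onset=68),
    Member("p4", False, age_last_assessment=80),
    Member("p5", False, age_last_assessment=71),  # younger than the latest AAO of 72
]
result = select_from_family(family)
```

The exclusion of families carrying known pathogenic variants or APOE4 segregation would be a separate upstream step and is not shown.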
Non-Technical Research Use Statement:
Family-based approaches led to the identification of disease-causing Alzheimer’s Disease (AD) variants in the genes encoding APP, PSEN1 and PSEN2. The identification of these genes led to the Aβ-cascade hypothesis and to the development of drugs that target this pathway. Recently, we have identified rare coding variants in TREM2, ABCA7, PLD3 and SORL1 with large effect sizes for risk for AD, confirming that rare coding variants play a role in the etiology of AD. In this proposal, we will identify rare risk and protective alleles using sequence data from families densely affected by AD. We hypothesize that these families are enriched for genetic risk factors. We already have sequence data from 695 families (2,462 individuals), which combined with the ADSP and the NIMH dataset will lead to a dataset of more than 1,042 families (4,684 individuals). Our preliminary results support the flexibility of this approach and strongly suggest that protective and risk variants with large effect size will be found, which will lead to a better understanding of the biology of the disease.
Investigator:
Cruchaga, Carlos
Institution:
Washington University School of Medicine
Project Title:
Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
October 24, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD with the ultimate goal to generate harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Cuccaro, Michael
Institution:
University of Miami Miller School of Med
Project Title:
Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
May 30, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD with the ultimate goal to generate harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Curtis, David
Institution:
University College London
Project Title:
Developing improved methods to analyse next generation sequence data
Date of Approval:
September 2, 2021
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
The objectives are to identify and characterise genes and genetic variants which increase or decrease the risk of developing Alzheimer's disease. Exome sequenced and whole genome sequenced cases and controls will be analysed. The predicted function of DNA variants will be obtained using software such as VEP, PolyPhen, SIFT. Weighted burden analysis will be performed wherein variants are given higher weights if they are predicted to have a major effect on protein function and/or if they are rare. For each gene, in each subject the weights for the DNA variants possessed by that subject will be summed to produce a score. The scores between cases and controls will be compared using logistic regression and incorporating relevant covariates such as sex, age, principal components. If scores are on average higher in cases this indicates that damage to the gene increases risk of Alzheimer's disease. If scores are higher in controls this indicates that damage to the gene reduces risk. Sets of genes will also be analysed in a similar way. The method has been applied to a smaller ADSP dataset: https://www.biorxiv.org/content/10.1101/596007v1
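The per-gene scoring step of the weighted burden analysis described above can be illustrated with a short sketch. The weighting scheme shown (a functional weight doubled for variants below 1% frequency, multiplied by the alternate allele count) is an assumption chosen for illustration; the statement does not give the actual weights, and the logistic regression comparison of cases and controls is not shown.

```python
from collections import defaultdict
from typing import Dict, Tuple

# variant_id -> (gene, functional weight, allele frequency); a hypothetical layout
VariantInfo = Tuple[str, float, float]


def variant_weight(functional_weight: float, allele_freq: float,
                   rare_af: float = 0.01, rare_multiplier: float = 2.0) -> float:
    """Give higher weight to predicted-damaging and to rare variants.

    The cutoff and multiplier are assumed values, not taken from the statement.
    """
    rarity = rare_multiplier if allele_freq < rare_af else 1.0
    return functional_weight * rarity


def burden_scores(genotypes: Dict[str, Dict[str, int]],
                  variants: Dict[str, VariantInfo]) -> Dict[str, Dict[str, float]]:
    """Sum variant weights per subject and gene.

    genotypes maps subject -> {variant_id: alternate allele count}.
    Scaling by allele count is an assumption made for this sketch.
    """
    scores: Dict[str, Dict[str, float]] = defaultdict(lambda: defaultdict(float))
    for subject, calls in genotypes.items():
        for variant_id, alt_count in calls.items():
            if alt_count == 0 or variant_id not in variants:
                continue
            gene, functional_weight, allele_freq = variants[variant_id]
            scores[subject][gene] += variant_weight(functional_weight, allele_freq) * alt_count
    return {subject: dict(by_gene) for subject, by_gene in scores.items()}


# Made-up annotations and genotypes, not real data.
variants = {
    "v1": ("GENE_A", 10.0, 0.001),  # damaging and rare
    "v2": ("GENE_A", 1.0, 0.2),     # benign and common
    "v3": ("GENE_B", 5.0, 0.005),
}
genotypes = {
    "s1": {"v1": 1, "v2": 2},
    "s2": {"v3": 1, "v2": 0},
}
scores = burden_scores(genotypes, variants)
```

Each subject's per-gene score would then enter a regression of case status on score plus covariates such as sex, age, and principal components.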
Non-Technical Research Use Statement:
We will analyse whether variants in DNA can interfere with the functioning of particular genes and either increase or decrease the risk of developing Alzheimer's disease. We will examine all the variants in a gene observed in large samples of people with and without Alzheimer's disease to see if variants are more commonly seen in one or other group. We will weight the variants so that more attention is paid to those which are rare and those which are predicted to have a major effect on the functioning of the gene. If we see more variants in the people with Alzheimer's disease then this suggests that damaging that gene could increase risk of illness. If the people without disease have more variants in a gene then that could suggest that damaging that gene would actually protect against Alzheimer's disease. Understanding these effects will ultimately assist in the development of methods to treat or prevent the disease.
Investigator:
Curtis, David
Institution:
University College London
Project Title:
Developing improved methods to analyse next generation sequence data
Date of Approval:
April 11, 2024
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
The objectives are to identify and characterise genes and genetic variants which increase or decrease the risk of developing Alzheimer's disease. Exome sequenced and whole genome sequenced cases and controls will be analysed. The predicted function of DNA variants will be obtained using software such as VEP, PolyPhen, SIFT. Weighted burden analysis will be performed wherein variants are given higher weights if they are predicted to have a major effect on protein function and/or if they are rare. For each gene, in each subject the weights for the DNA variants possessed by that subject will be summed to produce a score. The scores between cases and controls will be compared using logistic regression and incorporating relevant covariates such as sex, age, principal components. If scores are on average higher in cases this indicates that damage to the gene increases risk of Alzheimer's disease. If scores are higher in controls this indicates that damage to the gene reduces risk. Sets of genes will also be analysed in a similar way. The method has been applied to a smaller ADSP dataset: Curtis D, Bakaya K, Sharma L, Bandyopadhyay S. Weighted burden analysis of exome-sequenced late onset Alzheimer's cases and controls provides further evidence for a role for PSEN1 and suggests involvement of the PI3K/Akt/GSK-3β and WNT signalling pathways. Ann Hum Genet 2020 https://doi.org/10.1111/ahg.12375
Non-Technical Research Use Statement:
We will analyse whether variants in DNA can interfere with the functioning of particular genes and either increase or decrease the risk of developing Alzheimer's disease. We will examine all the variants in a gene observed in large samples of people with and without Alzheimer's disease to see if variants are more commonly seen in one or other group. We will weight the variants so that more attention is paid to those which are rare and those which are predicted to have a major effect on the functioning of the gene. If we see more variants in the people with Alzheimer's disease then this suggests that damaging that gene could increase risk of illness. If the people without disease have more variants in a gene then that could suggest that damaging that gene would actually protect against Alzheimer's disease. Understanding these effects will ultimately assist in the development of methods to treat or prevent the disease.
Investigator:
Dai, Yulin
Institution:
University of Texas Health Science Center at Houston
Project Title:
Assessing polygenic risks for Alzheimer's diseases in National Institute on Aging Genetics of Alzheimer’s Disease Data Storage Site (NIAGADS) cohort.
Date of Approval:
March 30, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives of the proposed research: Alzheimer's disease (AD) is a progressive neurological disorder that affects millions globally. Despite significant strides in developing treatments aimed at specific neuropathological markers, a substantial subset of individuals retains normal cognitive function even in the presence of AD-related neuropathology. This phenomenon suggests the existence of "cognitive resilience" factors that help preserve cognitive function independently of the disease's typical pathological processes. The primary objective of our proposed research is to conduct a comprehensive secondary analysis of the extensive genotype and phenotype data curated within the National Institute on Aging Genetics of Alzheimer’s Disease Data Storage Site (NIAGADS). We aim to identify genetic and molecular signatures of individuals with high resilience to AD. Study design: To achieve our objectives, we will develop a novel computational framework that integrates polygenic risk scores (PRS), incorporating additional clinical variables into clinical risk scores. This will allow for a more precise measurement of AD risk at the individual level within the ADSP and provide a comprehensive assessment of AD risk and resilience factors. Analysis plan: Following our previous GRPa-PRS framework [1], we will identify participants within the top quantile of AD risk who, despite their elevated risk, do not develop AD. These individuals will be classified as highly resilient. Given that the effects of resilience are generally subtle, we will aggregate genetically regulated effects at the pathway level, enabling us to characterize resilience at the individual level. Following this, we will conduct a comparative analysis of the identified resilient vs. AD individuals to further understand the underlying genetic and molecular basis of their resilience. To further validate our findings, we will link these results with additional molecular evidence from the ROS/MAP cohort.
1. GRPa-PRS: A risk stratification method to identify genetically regulated pathways in polygenic diseases. Alzheimers Dement. 2. Polygenic resilience scores capture protective genetic effects for Alzheimer’s disease. Translational Psychiatry, 12(1), 296.
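The stratification idea in the analysis plan (score each participant, take the highest-risk group, and flag those without AD as resilient) can be sketched as follows. This uses a plain additive polygenic risk score and is not the GRPa-PRS method itself; the function names, the SNP labels, and the `top_fraction` parameter are assumptions made for illustration.

```python
from typing import Dict, List


def polygenic_risk_score(dosages: Dict[str, float],
                         effect_sizes: Dict[str, float]) -> float:
    """Additive PRS: sum of effect size times risk-allele dosage.

    SNPs without a known effect size are ignored.
    """
    return sum(effect_sizes[snp] * dose
               for snp, dose in dosages.items()
               if snp in effect_sizes)


def resilient_individuals(scores: Dict[str, float],
                          has_ad: Dict[str, bool],
                          top_fraction: float = 0.2) -> List[str]:
    """Return participants in the top-risk fraction who do not have AD.

    top_fraction is a hypothetical parameter; the statement does not say
    which quantile defines high risk.
    """
    ranked = sorted(scores, key=scores.get, reverse=True)
    n_top = max(1, int(len(ranked) * top_fraction))
    return [subject for subject in ranked[:n_top] if not has_ad[subject]]


# Made-up effect sizes, dosages, and outcomes, not real data.
effect_sizes = {"snp1": 0.5, "snp2": -0.25}
one_score = polygenic_risk_score({"snp1": 2, "snp2": 1, "snp3": 1}, effect_sizes)

scores = {"a": 2.0, "b": 1.5, "c": 0.3, "d": -0.1}
has_ad = {"a": True, "b": False, "c": False, "d": False}
resilient = resilient_individuals(scores, has_ad, top_fraction=0.5)
```

In practice the comparison between resilient and AD individuals would also need to account for age, since an unaffected participant may simply not yet have developed disease.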
Non-Technical Research Use Statement:
Alzheimer's disease (AD) is a progressive neurological disorder that poses substantial societal and economic challenges, affecting over 55 million people worldwide. Significant research efforts and therapeutic developments have focused on addressing AD's neuropathological features, such as beta-amyloid plaques and neurofibrillary tangles. Despite these efforts, clinical-pathological studies have revealed that a notable proportion of individuals maintain normal cognitive function even in the presence of these neuropathological markers. Through our GRPa-PRS framework, we aim to stratify individuals with high resilience and identify factors associated with cognitive resilience—the ability to preserve cognitive function despite a high genetic risk for developing AD. The success of this project will help to uncover these resilience factors, which could pave the way for innovative therapeutic strategies that enhance resistance to AD.
Investigator:
Dallett, Carolina
Institution:
Roche
Project Title:
RDS003-Diversity
Date of Approval:
April 5, 2023
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The goal of this project is to identify and prioritize genetic variations that contribute to late-onset Alzheimer’s disease (AD). The planned multi-modal meta-analysis study will integrate genetic information from multiple AD datasets (ADSP, EADB GWAS, ADGC, FinnGen, UK Biobank, CHARGE consortia) with other publicly available multi-omics data including gene expression (GTEx), epigenetics (GEO) and neuro-imaging. The objective here is to extend and refine polygenic risk scores (PRS) to work across ethnicities and to prioritize genes and pathways that are causal in the development of late-onset AD in different ancestry cohorts. A retrospective, case-control analysis set up against the backdrop of other neurodegenerative diseases will also help identify lead variants that are specific to AD, enabling the elucidation of mechanistic pathways that lead to the disease etiology. A well-known limitation of PRS derived from single-ethnicity datasets is that their predictive power declines in cross-ethnicity cohorts. To address this, we want to investigate the possibility of creating orthogonal effect scores (OES) from multimodal analysis for gene prioritization and causality determination. The OES will be used to further refine PRS in different ethnicities and possibly identify common emerging pathways. Linkage disequilibrium (LD), quantitative trait loci (QTL) analysis, tissue colocalization, pathway networks and siRNA screening data, along with functional implications of mutations, evolutionary conservation of genomic regions and overlap with non-coding regulatory sites (e.g., DNase I hypersensitivity sites), will be used in the generation of OES. This orthogonal scoring system will be tested for its ability to refine PRS for achieving higher AD risk prediction in known AD cases. The contrived OES will enable creation of in-silico admixed populations and association of AD risk in these populations.
The ability to predict AD risk correctly in admixed populations will be immensely useful in associating PRS signatures to AD risk in real patients.
Non-Technical Research Use Statement:
The genetic landscape of late-onset AD is complex, and its pathophysiology has been elusive in spite of advances in high-throughput genomic techniques. The main limitations in characterizing the disease are the presence of multiple pathways resulting in the disease pathophysiology, the lack of large data cohorts that encompass multiple ethnicities, which leads to low predictive power for AD risk across studies, and the difficulty in identifying the causal genomic changes from the many AD-associated loci. This study aims at addressing the latter two limitations by integrating AD data containing mutations, expression, modifications and changes in AD-associated gene loci to guide their prioritization in prediction of AD risk. This prioritization method has the potential to lead us to causative genomic changes that can be traced to key AD-related biological pathways and provide the elusive pathophysiological signature for AD across ethnicities.
Investigator:
Davatzikos, Christos
Institution:
University of Pennsylvania
Project Title:
Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
May 8, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD with the ultimate goal to generate harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
DeStefano, Anita
Institution:
Boston University
Project Title:
Therapeutic target discovery in ADSP data via comprehensive whole-genome analysis incorporating ethnic diversity and systems approaches
Date of Approval:
February 3, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objective: Utilize ADSP data sets to identify genes & specific genetic variants that confer risk for or protection from Alzheimer disease. Aim 1: Using combined WGS/WES across the ADSP Discovery, Disc-Ext, and FUS Phases, including single nucleotide variants, small insertion/deletions, and structural variants, we will: Aim 1a. Perform whole genome single variant and rare variant case/control association analyses of AD using ADSP and other available data; Aim 1b. Target protective variant identification via association analysis using selected controls within the ADSP data and performing meta-analysis across association results based on selected controls from non-ADSP data sets; Aim 1c. Perform endophenotype analyses, including cognitive function measures, hippocampal volume and circulating beta-amyloid, in ADSP subjects for whom these measures are available. Meta-analysis will be conducted across ADSP and non-ADSP analysis results. Aim 2: To leverage ethnically-diverse and admixed populations to identify AD variants we will: Aim 2a. Estimate and account for global and local ancestry in all analyses; Aim 2b. Perform admixture mapping in samples of admixed ancestry; and Aim 2c. Perform ethnicity-specific and trans-ethnic meta-analyses. Aim 3: To identify putative therapeutic targets through functional characterization of genes and networks via bioinformatic and integrative ‘omics analyses, we will: Aim 3a. Annotate variants with their functional consequences using bioinformatic tools and publicly available “omics” data; Aim 3b. Prioritize results, group variants with shared function, and identify key genes functionally related to AD via weighted association analyses and network approaches. Analyses will be performed in coordination with the following PIs. Coordination will involve sharing expertise, analysis plans or analysis results. No individual-level data will be shared across institutions.
Philip De Jager, Columbia University; Eric Boerwinkle & Myriam Fornage, U of Texas Health Science Center, Houston; Sudha Seshadri, U of Texas, San Antonio; Ellen Wijsman, U of Washington; William Salerno, Baylor College of Medicine.
Non-Technical Research Use Statement:
This proposal seeks to analyze existing genetic sequencing data generated as part of the Alzheimer’s Disease Sequencing Project (ADSP) including the ADSP Follow-up Study (FUS) with the goal of identifying genes and specific changes within those genes that either confer risk for Alzheimer’s Disease or provide protection from Alzheimer’s Disease. Analytic challenges include analysis of whole genome sequencing data, appropriately accounting for population structure across European ancestry, Hispanic, and African American participants, and interpreting results in the context of other genomic data available.
Investigator:
DeStefano, Anita
Institution:
Boston University
Project Title:
Assessing Alzheimer’s disease risk and heterogeneity using multimodal machine learning approaches
Date of Approval:
August 29, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
The objective of this study is to develop machine learning models using genetic and phenotype data from the NIAGADS database https://dss.niagads.org/. We will develop both unsupervised and supervised learning models to characterize the heterogeneity and risk of Alzheimer’s disease (AD). This is an MPI study in collaboration with Dr. Honghuang Lin at University of Massachusetts Chan Medical School. For the first aim, we will build an expandable multimodal unsupervised machine learning framework to investigate AD heterogeneity. We will perform AD subtyping by harnessing the rich multimodality information across a wide spectrum of data (e.g., genetics, images and blood biomarkers). A Bayesian kernel network will be built to estimate the relative weight of each individual data modality, which would also allow the addition of new data modalities as they become available. The analyses will be performed both within and between ethnic populations. For the second aim, we will build an expandable multimodal supervised machine learning framework to quantify AD risk from longitudinal follow-up of clinically normal elders. We will build a separate deep learning network for each data modality in consideration of its unique feature sets. A multiplicative strategy will then be taken to aggregate information from different modalities with weighted contributions. Feature selection will also be performed to identify the most informative features predictive of AD risk. For the third aim, we will build AD-related gene regulatory networks in post-mortem human brain samples. We will examine the association of multi-omics data with AD, which will be used to assign gene priority based on the combinatorial evidence from each type of omics data. A gene ontology-guided greedy search strategy will then be implemented to build gene regulatory networks, and identify key drivers that might be potential therapeutic targets for AD. 
The analyses will be stratified by ethnic populations and AD phenotypic clusters.
Non-Technical Research Use Statement:
Alzheimer's disease (AD) is the most common form of dementia, characterized by progressive loss of cognitive function. There are very limited treatment options for AD. For the current application, we seek to develop multimodal machine learning models by leveraging the rich collection of AD-related omics data and phenotypic data recently generated from the Alzheimer's Disease Sequencing Project (ADSP). Three aims will be pursued in the current application. For Aim 1, we will build an expandable multimodal unsupervised machine learning framework to investigate AD heterogeneity. For Aim 2, we will build an expandable multimodal supervised machine learning framework to quantify AD risk from longitudinal follow-up of cognitively normal elders. For Aim 3, we will build AD-related gene interaction networks in post-mortem human brain samples. The present application represents an innovative approach to identify individuals at high risk of AD. The outlined strategy will provide new insights into the risk stratification and prevention strategies for AD.
Investigator:
Dugue, Pierre-antoine
Institution:
Monash University
Project Title:
Development and validation of improved epigenetic markers of biological ageing
Date of Approval:
July 1, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objectives: In this project, we aim to develop DNA methylation (DNAm)-based markers that capture additional aspects of biological ageing and ultimately improve disease risk prediction. We will use genome-wide blood DNA methylation data and phenotypic / clinical data from the Health and Retirement Study (HRS), including objective and subjective health indicators, to: i) Carry out epigenome-wide association studies (EWAS) of subjective health and ageing-related biomarkers; ii) Use machine learning methods to develop and validate DNA methylation-based markers of biological ageing and assess their contribution to disease risk prediction. Study design: The project will use multiple prospective cohorts to implement a discovery-validation analytical strategy, and assess the associations of novel methylation-based markers with ageing-related phenotypes including self-reported health variables and disease / mortality risk. Analysis plan: 1) Carry out EWAS of self-reported health variables (e.g. physical and psychological functioning) and ageing-related biomarkers to uncover the molecular mechanisms underlying subjective and objective health. 2) Apply machine learning methods to multiple large-scale datasets using a discovery and validation strategy to generate novel, improved epigenetic markers of ageing. 3) Assess the association of these novel epigenetic markers of ageing with major health-related variables, including lifestyle-related variables (e.g. smoking, alcohol consumption, diet), clinical markers (e.g. CRP, blood pressure, gait speed, grip strength), and risk of disease (e.g. cancers, cardiovascular disease, Alzheimer’s Disease).
Non-Technical Research Use Statement:
Age and ageing are fundamental risk factors for many chronic diseases. Biomarkers of ageing using large-scale molecular data are increasingly being developed and used for disease risk prediction and health assessment. The aim of this project is to develop and validate novel ageing biomarkers using genome-wide blood DNA methylation data. We will further assess the capacity of these markers to predict ageing-related health traits such as functional decline, cancers, cardiovascular diseases, dementia, and mortality. Our ultimate goal is to improve biological age estimation and disease risk prediction by using molecular-level data that can be obtained from one blood draw.
Investigator:
Ebbert, Mark
Institution:
University of Kentucky
Project Title:
Resolving mutations in challenging genomic regions to test association with disease phenotypes
Date of Approval:
January 22, 2020
Request status:
Closed
Research use statements:
Technical Research Use Statement:
A majority of the human genome has been well characterized through the initial Human Genome Project and numerous large-scale sequencing studies such as the 1000 Genomes Project, Alzheimer's Disease Neuroimaging Initiative (ADNI), Alzheimer’s Disease Sequencing Project, and others. There are, however, many genome regions important to human health and disease that are challenging to characterize using standard approaches. We intend to (1) develop and test new methods to characterize mutations in these regions, and (2) test associations between these mutations and disease phenotypes. Data from the ADSP may be combined with other datasets, such as the Alzheimer's Disease Neuroimaging Initiative. All appropriate precautions will be taken to verify proper population stratification and eliminate any sample redundancy. Combining these data will not increase risk to participants, as all individual-level data will remain confidential. We may also use portions of the ADSP data as controls for other diseases such as amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD), though only in situations that do not violate genetic or data-use principles. Specifically, data for which participants consented to use only within Alzheimer's disease studies will not be used for any purpose outside Alzheimer's disease research.
Non-Technical Research Use Statement:
Many regions of the human genome present challenges that prohibit scientists from discovering potential disease-causing mutations. We are developing methods to characterize mutations in these regions to identify new genes involved in disease.
Investigator:
Ebbert, Mark
Institution:
University of Kentucky
Project Title:
Resolving mutations in challenging genomic regions to test association with Alzheimer's disease phenotypes
Date of Approval:
January 21, 2025
Request status:
Expired
Research use statements:
Technical Research Use Statement:
A majority of the human genome has been well characterized through the initial Human Genome Project and numerous large-scale sequencing studies such as the 1000 Genomes Project, Alzheimer's Disease Neuroimaging Initiative (ADNI), Alzheimer’s Disease Sequencing Project, and others. There are, however, many genome regions important to human health and disease that are challenging to characterize using standard approaches. We intend to (1) develop and test new methods to characterize mutations in these regions, and (2) test associations between these mutations and disease phenotypes. Data from the ADSP may be combined with other datasets, such as the Alzheimer's Disease Neuroimaging Initiative. All appropriate precautions will be taken to verify proper population stratification and eliminate any sample redundancy. Combining these data will not increase risk to participants, as all individual-level data will remain confidential. We may also use portions of the ADSP data as controls for other diseases such as amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD), though only in situations that do not violate genetic or data-use principles. Specifically, data for which participants consented to use only within Alzheimer's disease studies will not be used for any purpose outside Alzheimer's disease research.
Non-Technical Research Use Statement:
Many regions of the human genome present challenges that prohibit scientists from discovering potential disease-causing mutations. We are developing methods to characterize mutations in these regions to identify new genes involved in disease.
Investigator:
Engelman, Corinne
Institution:
University of Wisconsin - Madison
Project Title:
AD Risk Prediction
Date of Approval:
March 10, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Currently, no combined measure of rare (minor allele frequency [MAF] less than 0.5%), low frequency (MAF 0.5% to 5%), and common variant (MAF 5% or higher) risk of Alzheimer’s disease (AD) exists. Common variant polygenic risk scores (PRSs) have been a central approach to predicting genetic risk of AD; however, we know that rare and low frequency variants can account for the missing heritability in AD. One objective of this project is to determine the risk for AD based on variants across the full allele frequency spectrum. An additional objective of this project is to utilize GWAS summary statistics from multiple ancestries to calculate the PRS and AD risk. To accomplish these objectives, we will leverage sequencing data from the ADSP (our study is contributing 1,531 samples to the Follow Up phase) and summary statistics from published GWAS from different ancestries. To generate the common variant PRS, we will use these summary statistics and methods such as PRS-CSx. We will determine the carrier status for rare and low frequency AD risk alleles based on the literature and bioinformatics tools. Prediction of AD case-control status and age-at-onset for quantiles of the common variant PRS and carrier status will be characterized with an empirical receiver operating characteristic (ROC) curve. The statistical software R will be used to perform regression analyses and to evaluate the area under the curve (AUC).
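The three allele-frequency classes named in this statement can be written down as a small lookup. The sketch below is illustrative only: the function name `maf_class` and the class labels are assumptions, and the statement itself says the investigators will use R, not Python. Only the 0.5% and 5% thresholds come from the text.

```python
def maf_class(maf: float) -> str:
    """Assign a variant to a frequency class from its minor allele frequency.

    Thresholds follow the statement: rare < 0.5%, low frequency 0.5% to 5%,
    common >= 5%. The label strings are hypothetical.
    """
    # A minor allele frequency is a proportion and cannot exceed 0.5.
    if not 0.0 <= maf <= 0.5:
        raise ValueError(f"MAF must lie in [0, 0.5], got {maf}")
    if maf < 0.005:
        return "rare"
    if maf < 0.05:
        return "low_frequency"
    return "common"


if __name__ == "__main__":
    for value in (0.001, 0.005, 0.02, 0.05, 0.3):
        print(value, maf_class(value))
```

Treating 5% as the start of the common class follows the "5% or higher" wording; a variant at exactly 0.5% falls in the low-frequency class because the rare class is defined as strictly less than 0.5%.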
Non-Technical Research Use Statement:
Currently, risk prediction for the later-onset form of Alzheimer’s disease (AD) focuses on genetic variants that are more common in the population, but ignores less common variants. Prediction is also largely based on data from populations of European ancestry. The goals of this project are to incorporate genetic variants across the full allele frequency spectrum (more and less common genetic variants) and to include more ancestrally diverse populations. To accomplish these goals, we will leverage genetic data from the ADSP and summary statistics (results) from published studies in diverse populations. We will determine the prediction of AD case-control status and age-at-onset.
Investigator:
Ertekin-Taner, Nilufer
Institution:
Mayo Clinic
Project Title:
CLEAR-AD
Date of Approval:
November 19, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
This U19 aims to bridge these knowledge gaps for discovery and validation of Centrally-linked Longitudinal pEripheral biomARkers of AD (CLEAR-AD) in multi-ethnic populations. CLEAR-AD U19 is based on the premise that AD is a complex disorder in which many biological pathways are disrupted due to multi-omic perturbations, which can be detected in brain and reflected in blood as centrally-linked peripheral molecular signatures (CLPMS). The specific aims of CLEAR-AD are: 1) To discover CLPMS of the complex and heterogeneous AD pathophysiology and its co-pathologies. 2) To identify longitudinal CLPMS that detect and predict dynamic neuroimaging, fluid biomarker, and clinical changes across the AD spectrum. 3) To characterize differences and similarities in CLPMS profiles across non-Hispanic White (NHW), African American (AA) and Latino American (LA) participants to uncover biomarker patterns in multi-ethnic groups. 4) To make these vast resources available to the scientific community to amplify and accelerate its impact. In this U19, we will leverage NIH-funded ADNI, MCSA and ADRC cohorts of >3,700 multi-ethnic participants to generate >20,000 multi-omics measures (Omics Core) that will be processed and integrated with >48,000 harmonized AD cognitive, neuroimaging and fluid endophenotypes (Analytic Core). Using these data, we will identify brain region and cell-type specific CLPMS, which reflect biological subtypes of AD and disease stage (Project 1). We will discover longitudinal changes in CLPMS that predict cognitive and A/T/N/V progression (Project 2). We will define longitudinal cognitive and A/T/N/V changes and CLPMS in URP that are either conserved with NHW or population-specific (Project 3).
This U19 will a) Identify the next generation of AD biomarkers with mechanistic insights; b) Establish a precision medicine approach for rigorous multi-omics biomarker discovery and validation in AD; c) Discover molecules that can serve as biomarkers and therapeutic targets; d) Enhance biomarker research in trial-ready multi-ethnic populations; and e) Generate and share a vast and harmonized resource of endophenotype and multi-omics data in NIH-funded cohorts.
Non-Technical Research Use Statement:
There is a clear and immediate need for the discovery of peripheral molecular signatures linked to central disease processes, core and co-pathologies in Alzheimer’s Disease (AD), that will serve as precision medicine blood-based biomarkers for diagnostic, prognostic, theragnostic and therapeutic purposes. AD is a complex disorder in which many biological pathways are disrupted due to multi-omic perturbations, which can be detected in brain and reflected in blood, i.e. centrally-linked peripheral molecular signatures (CLPMS). This U19 will leverage deeply phenotyped, longitudinal NIH-funded multi-ethnic cohorts and cross-disciplinary expertise for multi-omics data generation and its integration with harmonized AD endophenotypes, will share these data and utilize them in integrated U19 projects to discover CLPMS that will serve as the next generation of AD biomarkers.
Investigator:
Ezzati, Ali
Institution:
University of California, Irvine
Project Title:
Advanced Modeling in Alzheimer's Cohorts and Trials
Date of Approval:
June 11, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
We aim to develop robust, calibrated, and reliable predictive models by utilizing machine learning techniques on a rich dataset of imaging, omics, and phenotypic data. Our approach is twofold: 1. We will employ supervised machine learning methods to construct predictive models for defined outcomes such as clinical disease stages, meaningful cognitive decline, and imaging and neuropathologic markers (e.g., amyloid or tau positivity). We acknowledge that a major challenge in implementing predictive models is their integration into real-world settings. Therefore, once validated, we will convert these models into web applications or simplify them into risk scores to improve their usability and practical application. 2. Given the significant heterogeneity in Alzheimer's disease and existing gaps in our mechanistic understanding, we will utilize semi-supervised machine learning to subtype the disease. This method will incorporate genetics, biofluid biomarkers, imaging, and neuropsychological tests. Our methodology offers a significant advantage over the unsupervised machine and deep learning techniques commonly used in the field. By using "partial labeling" and incorporating knowledge of disease states, our semi-supervised approach enables more clinically relevant disease subtyping and results. For this project, we are collaborating with experts from other institutions, including Dr. Christos Davatzikos from the University of Pennsylvania, who is an expert in neuroimaging and machine learning, and Dr. Richard Lipton from Albert Einstein College of Medicine, known for his expertise in epidemiology and the design of trials and cohorts.
Non-Technical Research Use Statement:
Despite the extensive research on Alzheimer's and other dementias, the success rate of randomized clinical trials has been disappointingly low. This is primarily due to the significant clinical variations among patients, which limits our ability to predict their outcomes accurately. To address this challenge, we aim to develop a predictive framework specifically focusing on Alzheimer's and related dementias. We will use several cohorts and will employ machine learning and advanced statistical modeling on various types of data, including demographics, neuropsychological measures, biofluid and imaging biomarkers from existing international cohorts and concluded clinical trials. The framework will enable us to predict disease progression which will facilitate early intervention and personalized care for individuals at risk of developing dementia. Our project seeks to harness the power of machine learning and advanced analytics to improve our understanding of Alzheimer's Disease, enhance prediction accuracy, and ultimately contribute to more effective treatments and interventions.
Investigator:
Falcone, Guido
Institution:
Yale School of Medicine
Project Title:
Genomic analyses to evaluate the contribution of hypertension and hypercholesterolemia to risk of Alzheimer's Disease and cognitive decline in non-demented persons.
Date of Approval:
October 15, 2020
Request status:
Expired
Research use statements:
Technical Research Use Statement:
Novel treatments for Alzheimer’s Disease (AD) are urgently needed. Observational data indicate that hypertension and hypercholesterolemia are associated with increased risk of both AD and cognitive status (CS) in non-demented persons. Because hypertension and hypercholesterolemia can be treated effectively, confirmation of causal links between them and AD/CS would provide an appealing therapeutic opportunity. Because mutations are randomly distributed during meiosis, mutation-disease associations are immune to confounding by postnatal exposures. In this setting, mutations strongly associated with an exposure of interest constitute ideal instrumental variables to evaluate the causal effect of that exposure on an outcome of interest. This is an appealing strategy for hypertension/hypercholesterolemia (exposures of interest) and AD/CS (outcomes of interest) because genetic variation explains a substantial proportion of the variance of these two vascular risk factors. We will combine novel methods in statistical genetics and well-established instrumental variable techniques to test the overarching hypothesis that genetically-determined hypertension and hypercholesterolemia influence risk of both late-onset AD and CS in non-demented persons. Our proposal leverages our team’s expertise and successful track record of impactful contributions in the fields of Aging; the robust research infrastructure available through Yale’s OAIC; and access, through the NIAGADS Data Storage Site and UK Biobank, to clinical and genomic data from 550,990 persons to pursue the following aims: determine whether genetically-determined hypertension and hypercholesterolemia are associated, individually or jointly, with increased risk of late-onset AD; and determine whether genetically-determined hypertension and hypercholesterolemia are associated with CS in community-dwelling individuals not yet diagnosed with dementia.
This administrative supplement to Yale’s OAIC will deploy an innovative strategy for causal inference based on genetic information to clarify whether observed associations between hypertension/hypercholesterolemia and AD/CS reflect true causal relationships.
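The instrumental-variable reasoning in this statement can be illustrated with the Wald ratio, a standard single-variant estimator in which the variant-outcome effect is divided by the variant-exposure effect. This is a minimal sketch under stated assumptions: the statement does not say which estimator the investigators will use, the function name `wald_ratio` is hypothetical, and the effect sizes in the usage example are made-up numbers, not results.

```python
import math


def wald_ratio(beta_exposure: float, beta_outcome: float, se_outcome: float):
    """Wald ratio estimate of the causal effect of an exposure on an outcome.

    beta_exposure: effect of the genetic variant on the exposure
    beta_outcome:  effect of the same variant on the outcome
    se_outcome:    standard error of beta_outcome

    Returns (estimate, standard_error). The standard error uses the common
    first-order approximation that ignores uncertainty in beta_exposure.
    """
    if beta_exposure == 0:
        raise ValueError("variant has no effect on the exposure; not a valid instrument")
    estimate = beta_outcome / beta_exposure
    standard_error = se_outcome / abs(beta_exposure)
    return estimate, standard_error


if __name__ == "__main__":
    # Hypothetical summary statistics for one variant (illustration only).
    est, se = wald_ratio(beta_exposure=0.2, beta_outcome=0.1, se_outcome=0.04)
    low, high = est - 1.96 * se, est + 1.96 * se
    print(f"estimate={est:.3f}, 95% CI=({low:.3f}, {high:.3f})")
    assert math.isclose(est, 0.5)
```

The ratio is only interpretable as a causal effect when the variant is associated with the exposure, is independent of confounders, and affects the outcome only through the exposure; the code checks none of these assumptions.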
Non-Technical Research Use Statement:
Novel treatments for Alzheimer’s Disease are urgently needed. Observational data indicate that hypertension and hypercholesterolemia are associated with increased risk of both late-onset Alzheimer’s Disease and cognitive decline in non-demented persons; however, it is not clear whether these relationships are causative or associative. We will combine novel methods in statistical genetics and well-established instrumental variable techniques to test the overarching hypothesis that genetically-determined hypertension and hypercholesterolemia influence risk of both late-onset Alzheimer’s Disease and cognitive decline in nondemented persons.
Investigator:
Fan, Maoyong
Institution:
Ball State University
Project Title:
How does stock market fluctuations affect senior citizens' portfolio choices?
Date of Approval:
December 18, 2024
Request status:
Closed
Research use statements:
Technical Research Use Statement:
Objectives: To explore the causal effect of stock market fluctuations on senior citizens' portfolio choices using the Health and Retirement Study (HRS). Study design: We investigate how stock market ups and downs affect people's investment decisions. Then, we examine how the relationship between individual portfolio choices and stock market returns is affected by socioeconomic determinants and genetic markers associated with risk-taking behavior. Our goal is to analyze genetic information, stock market fluctuations, and portfolio choices and determine how genetic markers impact seniors' financial decision-making under different market conditions. Analysis plan: We use HRS to construct variables that reflect individuals' financial assets, including stocks, bonds, and other investments. We collect data on stock market returns from CRSP and COMPUSTAT, and create national-level and state-level market returns. By comparing individuals' portfolio choices in different years (corresponding to different market returns) or comparing people's portfolio choices across states (corresponding to different state-level returns), we examine how each individual's portfolio choices change as the stock market fluctuates. We then examine how education or cognition (represented by education- and cognition-related genetic variants) and risk preferences (risky behavior-related genetic variants) affect seniors' financial decision-making under different market conditions. For example, we use an instrumental variable (IV) approach to isolate random variation in financial literacy and education and estimate causal effects of education on portfolio choices among older adults. The IVs are constructed from individuals' genetic variants, either key single nucleotide polymorphisms (SNPs) or the polygenic score (PGS).
Non-Technical Research Use Statement:
The objective of this study is to link genetic information, stock market shifts, and portfolio choices to understand how genetic markers affect senior citizens' financial decisions under varying market circumstances using the Health and Retirement Study (HRS). The study is designed to scrutinize the effect of market volatility on investment choices, and how this connection is further impacted by socioeconomic factors and genetic markers linked to financial literacy and risk-taking behavior. The findings can be used to inform policy and financial education initiatives that target senior citizens and promote healthy financial decision-making. Additionally, the study can highlight the importance of considering genetics in financial decision-making and its potential implications for financial advisors and investment managers.
Investigator:
Fardo, David
Institution:
University of Kentucky
Project Title:
Localizing risk variants and estimating effects in the Alzheimer's Disease Sequencing Project (ADSP) Data (Update to GRCh38)
Date of Approval:
March 27, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
This is an update to a currently-active project through dbGaP in order to garner mapped reads for downstream analysis of regions identified through the scan statistic, endophenotype and causal approaches. The original edited RUS is copied below: We aim to better isolate causal variants within putative ADRD disease genes via two primary approaches. First, we will use an empirical Bayes scan statistic to detect regions of disease variant enrichment. In addition, we will employ novel causal inference methodology to estimate variant-specific causal risk for ADRD. These complementary approaches will allow for discovery of novel ADRD genes as well as enumeration/localization of important variants within putative AD risk genes. We will also employ more conventional approaches (e.g., SKAT, endophenotype development) as appropriate. We have read and approved the Data Use Agreement as signed and submitted on dbGaP and plan to upload results of our findings in a timely manner.
Non-Technical Research Use Statement:
The main goals of the Alzheimer’s Disease Sequencing Project (ADSP) include the identification of novel genomic variants contributing to risk of Late-Onset Alzheimer’s Disease or to protection against Alzheimer’s Disease (AD), as well as providing information as to why at-risk individuals may not develop AD or related dementias, especially in multi-ethnic populations. The aim of our data analysis aligns with these goals to identify novel genomic variants associated with AD. We will aim to do so via a scan-based statistic at each variant, where the statistic is specially designed for the analysis of genomic data. We will also explore alternative methodologies for these discoveries including the calculation of a causal estimate of variants within putative AD genes.
Investigator:
Fardo, David
Institution:
University of Kentucky
Project Title:
Genetic Architecture of Pure Alzheimer's Disease and Mixed Pathology
Date of Approval:
September 17, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: Alzheimer's disease and related dementias (ADRD) present challenges to healthcare systems worldwide, affecting millions of individuals. We aim to unravel mechanisms driving ADRD. Integrating data from diverse omics offers a comprehensive view of the molecular landscape associated with ADRD. This holistic understanding could revolutionize diagnosis, treatment, and prevention. In particular, longitudinal omics analysis has unique characteristics that merit evaluating current and novel methods. Study design & analysis plan: We will perform genome-wide association studies on ADRD and also examine loci previously associated with clinical phenotypes and/or neuropathologies. We will examine how associations are driven by particular neuropathological features, furthering our understanding of underlying mechanisms. The corresponding transcriptome-wide data from Seattle Alzheimer’s Disease Brain Cell Atlas (SEA-AD) will be used to infer possible mechanisms by integrating biological knowledge. We will explore models to harmonize extensive multi-omics data, including transcriptome, epigenome, genome, and clinical data, to capture informative factors and associations for ADRD. We will harness data generated by SEA-AD to characterize the molecular changes linked to phenotypes at cell-type resolution. We will perform abundance analyses, differential gene expression, differential chromatin accessibility, and eQTL analyses in each cell type across multiple brain regions to help identify biological drivers of brain pathology.
Non-Technical Research Use Statement:
Alzheimer’s disease and related dementias affect millions of people worldwide. Our research aims to better understand the underlying causes of these diseases by looking closely at how genes and other biological factors contribute to their development. We will analyze a wide range of biological information—including data from DNA and RNA—collected from multiple brain regions of research participants / donors. We hope to discover new ways to identify the disease earlier, understand how it progresses, and find more effective treatments. This research is a collaborative effort with scientists from the Allen Institute for Brain Science and the University of Washington. Together, we aim to bring new insights that could lead to improved care and hope for those affected by Alzheimer’s and related dementias.
Investigator:
Farrer, Lindsay
Institution:
Boston University
Project Title:
ADSP Data Analysis
Date of Approval:
June 25, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
As part of the Consortium for Alzheimer's Sequence Analysis (CASA: NIA grant UF1-AG047133), we plan to analyze whole exome and whole genome sequence data generated from subjects with Alzheimer's disease (AD) and elderly normal controls. These data will be generated by the National Human Genome Research Institute Large-Scale Sequencing Program. The goal of the planned analyses is to identify genes that have alleles that protect against or increase susceptibility to AD. We will evaluate variants detected in the sequence data for association with AD to identify protective and susceptibility alleles using the whole exome case-control data. We will also evaluate sequence data from multiplex AD families to identify variants associated with AD risk and protection, and evaluate variant co-segregation with AD. The family data will be whole genome data. The family-based data will be used to inform the case-control analysis and vice versa. We also will focus on structural variants (insertion-deletions, copy number variants, and chromosomal rearrangements). Evaluation of structural variants will involve both whole genome and whole exome data. Structural variants will be analyzed with single nucleotide variants detected and analyzed in the case-control and family-based data.
Non-Technical Research Use Statement:
We are attempting to identify all the inherited elements that contribute to Alzheimer's disease risk. To do this we will analyze DNA sequence data from subjects with Alzheimer's disease and elderly subjects who are cognitively normal. The sequence data from these 2 groups will be compared to identify differences that contribute to the risk of developing Alzheimer's disease or that protect against Alzheimer's disease. These DNA differences can be at a single site in the genetic code, or can span multiple sites, changing the copy number of DNA sequences. Both types of genetic variants will be examined.
Investigator:
Fernandez, Victoria
Institution:
ACE Alzheimer Center
Project Title:
GADIR
Date of Approval:
February 10, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The objective of this study is to contribute to our understanding of neurodegenerative diseases by examining the genetic contributors of major dementia neuropathological hallmarks (amyloid-β deposition, tau pathology, TDP-43, hippocampal sclerosis, Lewy body pathology, and cerebrovascular disease, among others). We will generate the largest Iberian database (N=3500) of neuropathologically curated brains (Aim 1) with a subset of those (N≈350) undergoing deep digital phenotyping (Aim 3). We will generate an associated genetic map (Aim 2) in order to elucidate how common and rare genetic variants contribute to specific pathologies. We additionally aim to determine how polygenic risk scores (PRS) and pathway-specific PRS correspond to single and mixed neuropathological profiles, and to clarify the genetic architecture driving co-pathologies that frequently complicate clinical diagnosis. Eventually, we will replicate and fine-map our findings (Aim 4) leveraging available datasets at NIAGADS and other public repositories. Our analysis plan includes genome-wide association testing of ordinal, binary, and quantitative neuropathological traits; rare-variant burden analyses for coding and non-coding regions; PRS and pathway-PRS modeling across multiple dementia-related diseases; unsupervised clustering to identify variant sets defining specific endophenotypes; and pathway and network analyses to interpret significant signals. Colocalization and functional annotation approaches will integrate genomic findings with transcriptomic and proteomic resources. Data obtained from NIAGADS will be used to strengthen replication, broaden meta-analytic power, validate associations across independent neuropathology cohorts, and support functional interpretation using available genetic, expression, and multi-omic datasets. All analyses will use de-identified data in compliance with ethical and data-sharing standards.
Non-Technical Research Use Statement:
Dementia is an immensely challenging and prevalent condition, deeply impacting the lives of over 55 million individuals worldwide. While Alzheimer's disease stands as the most commonly recognized form of dementia, there exist other conditions that present comparable symptoms but distinct underlying pathological characteristics. To provide more effective support to patients and their families, we need to better understand the genetic causes associated with each of these brain pathologies, and to develop advanced tools for early classification and diagnosis. This grant proposal aims to tackle these challenges by establishing the largest Iberian (Spanish and Portuguese) database of dementia neuropathological cases, marked by a modernized and standardized neuropathological classification alongside comprehensive genomic data. Our goal is to delve further into the genetic architecture underpinning these pathological features and to refine existing risk assessment tools for more accurate diagnoses.
Investigator:
Frost, Bess
Institution:
UT Health San Antonio Barshop Institute
Project Title:
Investigating retrotransposon activation and retrotransposon-associated genetic variants associated with human tauopathy
Date of Approval:
October 25, 2022
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Objective: To gain insights into retrotransposon activation in specific cell types, our first objective is to analyze differential transposable element expression in bulk sequenced microglia from Alzheimer’s disease patient brain tissue versus controls (NG00105). Study design: Reads will be aligned to the GRCh38 human reference genome with STAR using parameters optimized for aligning transposon-derived multi-aligning reads. Read counts for transposon and gene loci will be obtained using TEtranscripts. Differential expression of genes and transposons will then be calculated using DESeq2. Analysis plan: Unsupervised machine learning techniques will be applied to cluster transcription counts by variance to make associations between specific retrotransposons and microglial/immune response associated genes. Objective: We have identified multiple candidate non-reference mobile element insertion variants using nanopore long read sequencing of DNA extracted from frontal cortex of patients at Braak 0, III, and V/VI. Our second objective is to utilize the ADSP umbrella whole genome sequencing dataset (NG00067) to determine if our findings are conserved in a larger cohort of patients with Alzheimer’s disease. Study Design: CRAM alignment files aligned to the GRCh38 reference genome from the ADSP discovery (snd10000) and PSP-UCLA (snd10017) WGS data sets will be analyzed with xTea (Chu et al. 2021) to identify the presence of mobile element insertions previously identified via nanopore. Only genomic regions containing insertions of interest will be analyzed. Analysis Plan: Non-reference mobile insertions identified via nanopore will be compared in control, Alzheimer’s disease, and PSP NIAGADS datasets. Insertions meeting the designated criteria will be considered for a replication analysis using cohorts from the ADSP umbrella dataset.
We will determine whether these variants can predict the longitudinal clinical rate of disease progression and correlate with other features such as tau PET positivity, CSF tau, and cognitive testing. We will also consider sex, age, and high-risk genotypes.
Non-Technical Research Use Statement:
Objective 1: Almost half of the human genome is composed of transposable elements, or “jumping genes.” Retrotransposons are activated in human Alzheimer’s disease and related “tauopathies,” as well as in Drosophila and mouse models of tauopathy. In the current study, we will analyze retrotransposon activation specifically in microglia, the immune cells of the brain, in the context of tauopathy. In addition, we will determine if retrotransposon activation correlates with expression of neighboring immune response genes. Objective 2: We have previously identified tau-induced retrotransposon activation as a driver of neurodegeneration. In a preliminary analysis of Alzheimer’s disease patient samples and controls, we have used long-read whole genome DNA sequencing technology to discover non-reference retrotransposon insertions that are unique to Alzheimer’s disease patients. In the current study, we expand these analyses to determine if our findings are conserved in a larger patient cohort, and how these novel insertions relate to disease progression.
Investigator:
Funk, Cory
Institution:
Institute for Systems Biology
Project Title:
Immunity in AD
Date of Approval:
October 2, 2023
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Objective: The HLA region was previously identified in an Alzheimer’s GWAS. The role of the adaptive immune system in Alzheimer’s is not well understood, despite emerging evidence suggesting infectious agents may be contributing to the disease. Study design: Using the data requested from NIAGADS, we will attempt to remap the HLA region to produce better-defined haplotypes for each sample. We will also look at other variants of interest associated with Alzheimer’s and/or the innate and adaptive immune system, to pursue hypotheses around how these two arms of the immune system may interact. We will also use these data to perform association testing, identifying variants associated with AD risk or infection and evaluating their sensitivity to covariates such as APOE genotype, sex, and ancestry. Analysis plan: Data from NIAGADS will be downloaded to an AWS instance. Regions of interest, such as the HLA region on chromosome 6, will be extracted using samtools. Additional genotype data will be imputed using the Michigan Imputation Server and reference data selected to best match the ancestry(s) represented in the data. Association testing will adjust for population structure and genetic relatedness. Variants of interest will be annotated using resources such as the Variant Effect Predictor and the Genotype-Tissue Expression project to facilitate the interpretation of association results. Pathway analyses may be used to better understand potential relationships between implicated genes and genes previously implicated in AD and related disorders. We will perform genome scans in large data sets representing diverse ancestries. We will use imputed genotype data within association signals to fine-map the location of variants associated with Alzheimer’s disease. Association testing across independent data sets will be used to replicate these signals.
We will use variant annotation to describe the potential relationships between implicated variants and gene function, regulation, and pathways. This work will attempt to identify genes involved in the innate and adaptive immune responses in connection with AD.
Non-Technical Research Use Statement:
We will be investigating the possible connections between genes in the immune system and Alzheimer's disease. We will be looking at both the innate and adaptive arms of the immune system. We will also include approaches that consider the potential role of pathogens in contributing to Alzheimer's etiology.
Investigator:
Furman, David
Institution:
Buck Institute
Project Title:
Association of omics markers of tissue aging with health outcomes and mortality
Date of Approval:
January 9, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Using transcriptomic and epigenomic data across multiple tissues and matched blood from the Genotype-Tissue Expression (GTEx) database and the UK Biobank, we have constructed a computational framework that calculates the rate of aging of tissues in an individual using blood gene expression, DNA methylation or blood clinical markers. The next step is to validate and optimize the resulting models to predict organ-specific aging in disease states specific to the organ of interest, and mortality using clinical and omics data from the Health and Retirement Study. Objectives: Validate and optimize blood-based diagnostic tools using clinical and omics data from the Health and Retirement Study. • Determine if the signatures of tissue aging in the blood are associated with disease incidence. • Estimate the association between the blood-based organ aging and mortality. • Build omics predictors of organ age estimations from blood gene expression, DNA methylation or blood clinical markers (i.e. prediction of the prediction). Methods: Using the generated model to predict tissue aging using blood, we will assess the association between tissue age acceleration and clinical data in the Health and Retirement Study cohort. We will study the association between age acceleration and two types of clinical information: Chronic diseases: We expect to observe a positive tissue age acceleration in the tissues associated with the diseases diagnosed in the donors. For example, individuals diagnosed with COPD are expected to have higher age acceleration in the lungs compared to healthy individuals. For all these analyses we will perform logistic regression analysis to examine the binary outcome of disease presence or absence in relation to tissue age acceleration. Mortality: We hypothesize that our models will reveal a significant association between accelerated aging in specific tissues and overall mortality risk.
Cox proportional hazards models will be utilized to estimate the hazard ratios for mortality, with tissue age acceleration as the primary variable of interest. We will adjust these models for potential confounders such as baseline age and sex.
Non-Technical Research Use Statement:
We've developed a computer program using data from a large database to figure out how quickly different parts of our body age, based on information from our blood. We want to see if this can help us understand how diseases develop and how long someone might live. To do this, we'll use information from the Health and Retirement Study, a study on aging and health, to check if the aging patterns we find in blood are connected to diseases people have and how long they live.
Investigator:
Ghezzi, Daniele
Institution:
Institute of Neurology Besta
Project Title:
MitoAD
Date of Approval:
January 31, 2024
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Mitochondrial dysfunction has been hypothesized to be the primary event in Alzheimer's disease (AD) pathology. Extracellular amyloid-β (Aβ) plaques and intracellular neurofibrillary tangles are histopathological hallmarks of the disease. The so-called mitochondrial cascade hypothesis proposes that mitochondrial dysfunction is the primary event in AD pathology. Numerous experiments demonstrated that accumulation of Aβ in mitochondria begins before the occurrence of extracellular deposition, and this leads to mitochondrial dysfunction with increased oxidative stress, impaired mitodynamics and decrease in ATP production that leads to synaptic dysfunction, apoptosis, and neurodegeneration. A few mitochondrial enzymes are closely linked to the degradation of accumulated intra-mitochondrial Aβ, but also to degradation of the mitochondrial targeting sequences that are cleaved off from the imported precursor proteins by the mitochondrial matrix peptidase. If these peptides fail to be cleared from the mitochondrial matrix, they may act as detergent-like, toxic agents, forming pores in the membranes. This project stems from preliminary, original observations from mouse models, yeast KO models, and a screening of a gene encoding a mitochondrial protease in small cohorts of patients with neurodegenerative disorders. In summary, this MitoAD project will focus on answering the following question: Are there variants in mitochondrial protease genes associated with increased risk for AD? We have already selected 7 genes, and we will compare the frequency of: 1) Loss of function variants and 2) Missense variants with frequency <1% in the ADRD database vs. gnomAD database. In case of significant differences in this first analysis, we will then possibly subdivide the ADRD cohort in two groups, i.e. AD and FTD, in order to see if the observed mitochondrial impairment is directly linked to AD/Aβ accumulation or is a more general defect, not specifically associated with neurodegeneration.
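As a rough illustration of the frequency comparison described in the statement above, the sketch below tests whether qualifying alleles are enriched in a cohort relative to a reference panel. The statement does not name a statistical test; Fisher's exact test on a 2x2 table of allele counts is an assumption made here, and the function name `fisher_exact_two_sided` and all counts are hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    Rows: cohort vs. reference panel (assumed layout).
    Columns: alleles carrying a qualifying variant vs. alleles that do not.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x: int) -> float:
        # Hypergeometric probability of seeing x qualifying alleles in row 1.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum the probabilities of all tables at least as extreme as the observed one.
    p = sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-7))
    return min(p, 1.0)


# Hypothetical counts: 8 of 10 cohort alleles vs. 1 of 6 reference alleles.
p_value = fisher_exact_two_sided(8, 2, 1, 5)
print(f"two-sided p = {p_value:.4f}")
```

In practice a per-gene burden test with ancestry matching between cohort and reference would likely be needed; this sketch only shows the shape of the 2x2 comparison.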
Non-Technical Research Use Statement:
Mitochondrial dysfunction has been hypothesized to be the primary event in Alzheimer's disease (AD) pathology. Extracellular amyloid-beta (Aβ) deposition is the key histopathological hallmark of AD, but Aβ accumulation occurs also in mitochondria causing impairment in different mitochondrial pathways. Diverse mitochondrial enzymes have a role in degradation of unfolded proteins including accumulated intramitochondrial Aβ. The scope of this project is to better define the role of these mitochondrial proteases on Aβ processing and to evaluate how/if impairment in this pathway is linked to AD development. We want to exploit the ADRD database to evaluate if variants in selected genes encoding mitochondrial proteases are indeed associated with increased risk for AD (and, in case, if this risk is specific for AD or is common to other dementia neurodegenerative conditions).
Investigator:
Gibbs, Richard
Institution:
Baylor College of Medicine
Project Title:
Therapeutic Target Discovery in ADSP data via Comprehensive Whole-Genome Analysis Incorporating Ethnic Diversity and Systems Approaches
Date of Approval:
June 20, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Objective: Utilize ADSP data sets to identify genes & specific genetic variants that confer risk for or protection from Alzheimer disease. Aim 1: Using combined WGS/WES across the ADSP Discovery, Disc-Ext, and FUS Phases, including single nucleotide variants, small insertion/deletions, and structural variants, we will: Aim 1a. Perform whole genome single variant and rare variant case/control association analyses of AD using ADSP and other available data; Aim 1b. Target protective variant identification via association analysis using selected controls within the ADSP data and performing meta analysis across association results based on selected controls from non-ADSP data sets. Aim 1c. Perform endophenotype analyses including cognitive function measures, hippocampal volume and circulating beta-amyloid in ADSP subjects for which these measures are available. Meta analysis will be conducted across ADSP and non-ADSP analysis results. Aim 2: To leverage ethnically-diverse and admixed populations to identify AD variants we will: Aim 2a. Estimate and account for global and local ancestry in all analyses; Aim 2b. Perform admixture mapping in samples of admixed ancestry; and Aim 2c. Perform ethnicity-specific and trans-ethnic meta-analyses. Aim 3: To identify putative therapeutic targets through functional characterization of genes and networks via bioinformatics and integrative ‘omics analyses, we will: Aim 3a. Annotate variants with their functional consequences using bioinformatic tools and publicly available “omics” data. Aim 3b. Prioritize results, group variants with shared function, and identify key genes functionally related to AD via weighted association analyses and network approaches. Analyses will be performed in coordination with the following PIs. Coordination will involve sharing expertise, analysis plans or analysis results. No individual level data will be shared across institutions.
Anita DeStefano, Boston University; Philip De Jager, Columbia University; Eric Boerwinkle & Myriam Fornage, U of Texas Health Science Center, Houston; Sudha Seshadri, U of Texas, San Antonio; Ellen Wijsman, U of Washington.
Non-Technical Research Use Statement:
This proposal seeks to analyze existing genetic sequencing data generated as part of the Alzheimer’s Disease Sequencing Project (ADSP) including the ADSP Follow-up Study (FUS) with the goal of identifying genes and specific changes within those genes that either confer risk for Alzheimer’s Disease or provide protection from Alzheimer’s Disease. Analytic challenges include analysis of whole genome sequencing data, appropriately accounting for population structure across European ancestry, Hispanic, and African American participants, and interpreting results in the context of other genomic data available.
Investigator:
Goate, Alison
Institution:
Icahn School of Medicine at Mount Sinai
Project Title:
Study of Alzheimer's disease and other dementias (e.g. frontotemporal dementia) and related phenotypes
Date of Approval:
July 8, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Alzheimer's disease (AD) is the most common form of dementia but has no effective prevention or treatment. Developing a comprehensive picture of the genetic architecture of AD including a network level functional assessment of risk/resilience genes is essential to develop novel therapeutic targets. The overarching goals of this study are to use genetic and genomic approaches to: 1) identify genes and variants that are involved in the development of AD and related disorders; 2) identify functional networks enriched for AD or related disorder risk and protective loci; 3) determine how cellular function and physiology is impacted by these genetic factors in disease-relevant cell types and animal models. This study will use publicly available whole genome/exome sequence data generated by the Alzheimer’s Disease Sequencing Project (ADSP) and genome-wide association study (GWAS) data from the International Genomics of Alzheimer’s Project (IGAP) and others. We will apply a suite of case-control and family approaches to investigate genetic association with dichotomous and continuous disease traits. This study will not only further our understanding of the genetic architecture of AD but also provide key information regarding the molecular mechanisms, setting the stage for novel therapeutic development.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is the only disease among the top ten killers in the U.S. without a disease modifying therapy. Genetic studies provide a powerful means to identify genes and pathways that are causally linked to disease etiology. We propose to use genomic and functional approaches to identify genes that alter the risk of AD and investigate how these genes disrupt cellular pathways leading to disease.
Investigator:
Greicius, Michael
Institution:
Stanford University School of Medicine
Project Title:
Examining Genetic Associations in Neurodegenerative Diseases
Date of Approval:
March 31, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We are studying the effects of rare (minor allele frequency < 5%) genetic variants on the risk of developing late-onset Alzheimer’s Disease (AD). We are interested in variants that have a protective effect in subjects who are at an increased genetic risk, or variants that lead to multiple dementias. Our aim is to identify any genetic variants that are present in the “case” group but not the “AD control” groups for both types of variants. The raw data we receive will be annotated to identify SNP locations and frequencies using existing databases such as 1000 Genomes. We will filter the data based on genetic models such as compound heterozygosity, recessive and dominant models to identify different types of variants.
Non-Technical Research Use Statement:
Current genetic understanding of Alzheimer’s Disease (AD) does not fully explain its heritability. The APOE4 allele is a well-established risk factor for the development of AD. However, some individuals who carry APOE4 remain cognitively healthy until advanced ages. Additionally, the cause of mixed dementia pathology development in individuals remains largely unexplained. We aim to identify genetic factors associated with these “protected” and mixed pathology phenotypes.
Investigator:
Greytak, Ellen
Institution:
Parabon NanoLabs
Project Title:
Novel whole-genome analysis methods for Alzheimer's risk prediction
Date of Approval:
July 27, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Despite recent advancements, existing genetic risk prediction models (GRPMs) for late-onset Alzheimer's Disease (LOAD) lack sufficient discrimination ability to support clinical applications. Given the lack of treatments that meaningfully affect disease progression once symptoms have manifested and the socioeconomic consequences at stake, there is a serious unmet need for more accurate GRPMs able to assess a patient's LOAD risk in middle age or earlier, before presymptomatic neurodegeneration begins. Additionally, the lack of diversity in traditional GRPMs further exacerbates inequities in health care for non-Europeans. This project builds on our previous work in which we produced a GRPM for LOAD using single SNPs and epistatic interactions between SNPs in an ensemble model with polygenic risk scores. In this phase we will develop methods for cross-ancestry SNP analysis to select features to be included in a LOAD GRPM that can be generalized to individuals of any ancestry. The ADSP data will be incorporated with data from the Alzheimer's Disease Neuroimaging Initiative, the Alzheimer's Disease Sequencing Project, the UK Biobank and the Framingham Heart Study. We will develop methods for association testing for homogeneous cross-ancestry SNP effects, followed by fine-mapping using diverse subjects to identify genetic features with functional relevance. We will also create and implement methods for association testing for heterogeneous cross-ancestry SNP effects for identification of epistatic interactions across ancestries. Finally, we will train a cross-ancestry AD risk model using a cross-validation framework and replicate our findings in an independent cohort. The model will include age, sex, APOE genotype, and ancestry PCs as covariates and we will test gradient boosting, deep learning, super learning, and ensemble modeling methods.
Non-Technical Research Use Statement:
There are no treatments that significantly slow progression of Alzheimer’s Disease (AD) once symptoms manifest, making early intervention crucial to reduce the burden of this disease. A genetic risk prediction model (GRPM) for determining AD risk early in life would allow early intervention, life planning, and improved patient stratification for clinical trials. Despite advancements, GRPMs for AD lack sufficient discrimination to support such applications. To address this need, Parabon developed a GRPM using machine learning that is able to predict an individual's risk of developing AD at any age and that achieves state-of-the-art prediction accuracy. However, like most genetic risk scores, it was built using European subjects and thus has reduced accuracy in non-Europeans. The goal of this project is to enhance our modeling pipeline to identify genetic variants and interactions both across and within ancestral groups, then build a predictive model and test it in an independent replication set.
Investigator:
Gudmundsdottir, Valborg
Institution:
Icelandic Heart Association
Project Title:
Understanding Genetic Risk Factors for Alzheimer’s Disease in the Iceland Population
Date of Approval:
September 5, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
AGES Proposal Objectives: To conduct a genome-wide association study of common and rare variants, including single nucleotide variants (SNVs), short insertion/deletions (indels), and structural variants (SVs), with Alzheimer’s Disease in an integrative analysis of ADSP and the Age/Gene-Environment Susceptibility (AGES) Study data Analysis Plan: 1. QC plan for pooled samples from ADSP and AGES 1.1. Genotype Level QC - Check read depth coverage and genotype quality for each genotype - Change genotypes to missing if low quality (DP < 10 or GQ < 20) - Allele balance heterozygosity (0.25 < ABhet < 0.75) 1.2. Sample Level QC - Samples without AD status will be excluded - Principal component analysis (PCA) with the reference populations in the 1000 Genomes Project to identify most similar population group - Standard QC, remove samples with: - Missingness > 1% - Heterozygosity > 3 standard deviations from the mean het/hom ratio - Relatedness values indicating duplicate samples 1.3. Variant Level QC - Standard QC, remove variants with: - Minor allele frequency threshold (see Association Tests) - Missingness > 1% - Low Hardy-Weinberg p-values (< 1e-10) - Ti/Tv coefficients > 10 SD from the mean - Keep variants with ABhet ratio between 0.3-0.7 2. General genetic quality analyses (non-phenotypic) - Burden analysis using SKAT-O - Identifying positive selection - Identifying population bottlenecks and expansions (effective population size) 3. Primary Analysis 3.1 Available relevant phenotypes: Provided by The Icelandic Heart Association 3.2. Participants - ADSP release 5 (R5) 58k samples with clear Alzheimer’s Disease status - AGES subjects with clear Alzheimer’s Disease status 4. Potential Secondary Analyses: Copy Number Variation (CNV) and Structural Variant (SV) analyses if a novel SNV is found. CNVs/SVs are detected separately by Smoove and Manta and merged by Svimmer for each sample. 
Further merging across all samples can be done by Svimmer, which provides the input to GraphTyper2 for joint genotyping. Collaboration: This project will be a collaboration between University of Pennsylvania and The Icelandic Heart Association.
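The genotype-level QC thresholds in the statement above can be sketched as follows. This is an illustration only, not the applicants' pipeline: the function name `qc_genotype`, the string encoding of genotypes, and the choice to set heterozygous calls outside the allele-balance window to missing are all assumptions. The statement itself gives only the thresholds (DP < 10, GQ < 20, 0.25 < ABhet < 0.75).

```python
from typing import Optional


def qc_genotype(gt: str, dp: int, gq: int, ab: Optional[float] = None,
                min_dp: int = 10, min_gq: int = 20,
                ab_low: float = 0.25, ab_high: float = 0.75) -> str:
    """Return the genotype unchanged, or './.' if it fails genotype-level QC.

    gt: genotype string such as '0/1' (hypothetical encoding)
    dp: read depth, gq: genotype quality
    ab: allele balance for heterozygous calls, if available
    """
    # Low depth or low genotype quality: set the call to missing.
    if dp < min_dp or gq < min_gq:
        return "./."
    # Assumption: heterozygous calls outside the ABhet window are also
    # set to missing; the statement lists the window but not the action.
    is_het = gt in ("0/1", "1/0", "0|1", "1|0")
    if is_het and ab is not None and not (ab_low < ab < ab_high):
        return "./."
    return gt


calls = [("0/1", 30, 99, 0.5), ("0/1", 8, 99, 0.5), ("0/1", 30, 99, 0.9)]
print([qc_genotype(*c) for c in calls])
```

Keeping the thresholds as keyword arguments makes it easy to apply the stricter variant-level window (0.3 to 0.7) with the same helper.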
Non-Technical Research Use Statement:
This study aims to explore how genetic differences contribute to Alzheimer’s Disease by analyzing integrated data from ADSP and the Age/Gene-Environment Susceptibility (AGES) Study. By studying various types of genetic changes, including small variations in DNA sequences and larger structural differences, this project seeks to identify genetic markers linked to Alzheimer’s. The research team will employ advanced computational methods to ensure data quality and analyze the relationship between specific genetic factors and Alzheimer’s. This includes examining rare mutations, evaluating their potential biological impact, and exploring pathways that might help explain the development of the disease. This collaboration between the University of Pennsylvania and the Icelandic Heart Association represents an important step toward understanding the role of genetics in Alzheimer’s Disease in the Icelandic population and could provide insights for future treatments or interventions.
Investigator:
Habes, Mohamad
Institution:
UT Health San Antonio
Project Title:
Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD with the ultimate goal to generate harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Haines, Jonathan
Institution:
Case Western Reserve University
Project Title:
Alzheimer Disease Sequence Analysis Collaborative (a.k.a. Collaborative Alzheimer Disease REsearch; CADRE)
Date of Approval:
January 30, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We plan to analyze whole exome and whole genome sequence data generated from subjects with Alzheimer disease (AD) and elderly normal controls. The goal of the planned analyses is to identify genes and other functional elements that have variations that protect against or increase susceptibility to AD. We will evaluate variants detected in the sequence data for association with AD to identify protective and susceptibility alleles using the whole exome and whole genome data. We will also evaluate similar sequence data from multiplex AD families to identify variants associated with AD risk and protection, and evaluate variant co-segregation with AD. We also will focus on structural variants (insertion-deletions, copy number variants, and chromosomal rearrangements) detected using both whole genome and whole exome data. All data will be analyzed separately and in an integrated fashion and will incorporate additional genetic and functional data.
Non-Technical Research Use Statement:
We are attempting to identify all the inherited elements that contribute to Alzheimer's disease risk. To do this we will analyze DNA sequence data from subjects with Alzheimer's disease and elderly subjects who are cognitively normal. The sequence data from these two groups will be compared to identify differences that contribute to the risk of developing Alzheimer's disease or that protect against Alzheimer's disease. These DNA differences can be at a single site in the genetic code, or can span multiple sites, changing the copy number of DNA sequences. Both types of genetic variants will be examined.
Investigator:
Hatchwell, Eli
Institution:
Population Bio
Project Title:
Mutational Spectrum of Causal Genes for Neurological/Neurodegenerative Diseases and Endometriosis Identified via High Resolution Genome Wide Copy Number Analysis
Date of Approval:
September 12, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
While single gene rare variants have been shown to play a significant role in Early-Onset Alzheimer’s Disease (EOAD), their role in Late-Onset (LOAD) has not been emphasised. The gene discovery methodology we have developed at Population Bio allows for unbiased exploration of highly informative genomic variants in any cohort of interest. Our approach is based on ultra-high resolution copy number variant (CNV) analysis. We have invested heavily in such analysis on normal populations. These are used as comparators for cohorts of interest, such as LOAD. In our LOAD work, this analysis generated a list of CNVs which were either absent in the normal populations we studied or else present at significantly higher frequency in the LOAD cohort. Such CNVs are routinely annotated to determine if they overlie known genes and/or regulatory regions. As an example, we have discovered a deletion in 3% of our LOAD cases, which is present in <= 1% of normals. This deletion disrupts a transcription factor binding site in the intron of a gene, which, via GeneHancer, is known to control exon 1 of the gene. The gene in question is novel to LOAD, and is an important metabolic gene, with known biology. It is vital that we validate this finding by analysis of independent LOAD datasets. In addition, we wish to validate other genes discovered in the same manner. We have very deep experience of analyzing WGS/WES datasets. Our focus will be to pull out of the available WGS/WES datasets all the variants for the candidate genes of interest.
Such variants, including SNVs, indels and CNVs (called using a variety of tools we have experience with), will be analyzed by reference to databases of normal individuals: (i) CNVs, by reference to our own internal database but also gnomAD (https://gnomad.broadinstitute.org) CNV data and DGV (http://dgv.tcag.ca); (ii) SNVs/indels, by reference to gnomAD. These analyses will allow us to determine whether there exists a mutational burden for our candidate genes of interest in independent LOAD cohorts, and will serve as validation/refutation. The main phenotype of interest will be definitive diagnoses of LOAD, based on neuropathological and clinical cognitive analyses.
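The comparison described above, a CNV seen more often in LOAD cases than in normal controls, can be illustrated with a one-sided Fisher's exact test. The statement does not name the test that Population Bio uses, so the choice of test is an assumption. The function name and the carrier counts below are hypothetical, chosen only to mirror the "3% of cases versus 1% of normals" example.

```python
from math import comb


def fisher_exact_greater(case_carriers: int, case_total: int,
                         ctrl_carriers: int, ctrl_total: int) -> float:
    """One-sided Fisher's exact p-value for carrier enrichment in cases.

    Sums hypergeometric probabilities of seeing at least `case_carriers`
    carriers among the cases, given the table margins.
    """
    carriers = case_carriers + ctrl_carriers
    total = case_total + ctrl_total
    denom = comb(total, case_total)
    upper = min(carriers, case_total)
    p_value = 0.0
    for k in range(case_carriers, upper + 1):
        # k carriers among cases, the remaining cases are non-carriers
        ways = comb(carriers, k) * comb(total - carriers, case_total - k)
        p_value += ways / denom
    return p_value


# Hypothetical counts: 30 of 1000 cases (3%) vs 10 of 1000 controls (1%).
print(fisher_exact_greater(30, 1000, 10, 1000))
```

An exact test is a natural fit for rare CNVs because carrier counts are small and large-sample approximations can be unreliable.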
Non-Technical Research Use Statement:
Most of the common conditions that affect large numbers of the general population have a genetic basis. While progress has been rapid in the field of cancer, the same cannot be said for common, non-cancer conditions, such as Late-Onset Alzheimer's Disease (LOAD). It is now fairly clear that not all cases of LOAD represent the same disease in terms of cause. Our approach has been to consider common diseases as collections of rare subgroups, each of which has a specific cause and which, in due course, will have a specific treatment. We have pioneered and implemented a method to rapidly uncover potentially causal genes in common disorders and will use the data generated from this study to strengthen our discoveries, by validating a set of novel candidate genes we have identified in LOAD. Our project will allow us to: 1. Define subsets of disease. 2. Work with pharmaceutical companies to develop drugs that will specifically target each subset of disease. In some cases, disease progression may be halted by the therapies developed. In some cases, reversal and/or cure may be possible.
Investigator:
Higgins-Chen, Albert
Institution:
Yale University
Project Title:
TranslAGE: Translating Geroscience to Humans by Validating and Refining Aging Biomarkers in Longitudinal Studies
Date of Approval:
October 29, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The TranslAGE research project aims to evaluate and optimize DNA methylation (DNAm)-based aging biomarkers for geroscience clinical trials. We will benchmark existing biomarkers to identify those that are: (1) responsive to aging interventions, (2) prognostic for age-related diseases and outcomes, and (3) stable over time without intervention. This will guide the development of next-generation biomarkers. Study Design: We will analyze DNAm and phenotypic data from the Health and Retirement Study (HRS), evaluating how various DNAm biomarkers relate to mortality, disease, and physical and cognitive function. All pipelines and variable names will be harmonized with other major studies, including the Baltimore Longitudinal Study of Aging (BLSA) and Framingham Heart Study, enabling cross-study comparison. Analysis Plan: 1. Data Harmonization: Core variables (age, sex, death, follow-up time) will be standardized using a common data dictionary. 2. Biomarker Computation: Multiple DNAm biomarkers will be calculated using https://github.com/HigginsChenLab/methylCIPHER and Z-scored for comparability. 3. Model Fitting: Cox models for mortality and disease onset, Logistic regression for incident disease, Linear models for continuous traits like cognition, BMI, gait speed. 4. Performance Evaluation: Predictive performance will be assessed using hazard ratios, concordance index (C-index), and AUCs. Biomarkers will be ranked across domains, and visualizations will summarize associations. 5. Cross-Study Integration: Summary statistics will be compared with those from external cohorts, intervention studies (e.g., effect sizes), and test-retest reliability datasets (e.g., intraclass correlation) 6. Composite Development: Top biomarkers with strong prognostic power, responsiveness, and stability will be combined into a composite score. Novel mortality, morbidity, and frailty predictors will be trained in external datasets and validated in HRS. 
Planned collaborations: Sofiya Milman at Albert Einstein College of Medicine (relating DNAm biomarkers to frailty), Jessica Lasky-Su at Brigham and Women’s Hospital (relating DNAm proxies of serum proteins and metabolites to aging outcomes).
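Steps 2 and 4 of the analysis plan above (Z-scoring biomarkers, then ranking them by concordance index) can be sketched in plain Python. This is a simplified illustration, not the TranslAGE or methylCIPHER code: the function names are hypothetical, the toy data are invented, and the C-index shown is the basic Harrell form, which treats a pair as comparable only when the earlier time is an observed event.

```python
from statistics import mean, pstdev


def z_score(values):
    """Standardize biomarker values to mean 0 and unit (population) SD."""
    mu, sd = mean(values), pstdev(values)
    return [(v - mu) / sd for v in values]


def concordance_index(times, events, scores):
    """Harrell's C: share of comparable pairs in which the subject with
    the earlier observed event also has the higher biomarker score."""
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # Comparable only if subject i had an event before time j.
            if events[i] and times[i] < times[j]:
                comparable += 1
                if scores[i] > scores[j]:
                    concordant += 1.0
                elif scores[i] == scores[j]:
                    concordant += 0.5  # ties in score count as half
    return concordant / comparable


# Toy data (invented): follow-up years, death indicator, raw biomarker.
times = [2, 4, 6, 8]
events = [1, 1, 0, 1]
raw = [3.0, 2.0, 1.5, 0.5]
print(concordance_index(times, events, z_score(raw)))
```

Because Z-scoring preserves rank order, it leaves the C-index unchanged. Its purpose in the plan is to make effect sizes such as hazard ratios comparable across biomarkers.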
Non-Technical Research Use Statement:
The TranslAGE knowledgebase aims to improve how we measure biological aging, helping scientists evaluate new treatments that target the aging process. We will use DNA methylation from the Health and Retirement Study and other studies to identify biomarkers that predict future health, respond to anti-aging therapies, and remain stable over time. We will calculate many different biological age scores from blood samples and test how well each one predicts outcomes like memory, physical ability, disease, and mortality. We will combine this information with results from other studies to identify the best scores that not only predict future age-related disease, but are also responsive to treatments intended to prevent future disease. We will also use insights from this study to develop better biological age scores that may eventually be used to assess whether interventions are successfully modifying the aging process to promote healthier, longer lives.
Investigator:
Hohman, Timothy
Institution:
Vanderbilt University Medical Center
Project Title:
Genetic Drivers of Resilience to Alzheimer's Disease
Date of Approval:
March 16, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
“Asymptomatic” Alzheimer’s disease (AD) is a phenomenon in which 30% of individuals over age 65 meet criteria for autopsy-confirmed pathological AD (beta-amyloid plaques and tau aggregation) but do not clinically manifest cognitive impairment.1-3 The resilience that underlies asymptomatic AD is marked by both protection from neurodegeneration (brain resilience)4 and preserved cognition (cognitive resilience). Our central hypothesis is that genetic effects allow a subset of individuals to endure extensive AD neuropathology without marked brain atrophy or cognitive impairment. We are uniquely positioned to identify resilience genes by leveraging the Resilience from Alzheimer’s Disease (RAD) database, a local resource in which we have harmonized a validated quantitative phenotype of resilience across 8 large AD cohort studies. Our strong interdisciplinary team represents international leaders in genetics, neuroscience, neuropsychology, neuropathology, and psychometrics who will leverage the infrastructure and rich resources of the AD Genetics Consortium, IGAP, ADSP, and our recently established and harmonized continuous metric of resilience to fulfill the following aims: Aim 1. Identify and replicate common genetic variants that predict cognitive resilience (preserved cognition) and brain resilience (protection from brain atrophy) in the presence of AD pathology. We hypothesize that common genetic variation will explain variance in resilience above and beyond known predictors like education. Replication analyses will leverage age of onset data from IGAP to demonstrate that resilience loci predict a later age of AD onset. Aim 2. Identify and replicate rare and low-frequency genetic variants that predict cognitive and brain resilience. Rare and low-frequency variants with large effects have been identified in AD case/control studies, providing new insight into the genetic architecture of AD.
Aim 3: Identify sex-specific genetic drivers of cognitive and brain resilience to AD pathology. Our preliminary results highlight sex differences in the downstream consequences of AD neuropathology, including sex-specific genetic markers of resilience.
Non-Technical Research Use Statement:
As the population ages, late-onset Alzheimer’s disease (AD) is becoming an increasingly important public health issue. Clinical trials targeted at reducing AD progression have demonstrated that patients continue to decline despite therapeutic intervention. Thus, there is a pressing need for new treatments aimed at novel therapeutic targets. A shift in focus from risk to resilience has tremendous potential to have a major public health impact by highlighting mechanisms that naturally counteract the damaging effects of AD neuropathology. The goal of the present project is to characterize genetic factors that protect the brain from the downstream consequences of AD neuropathology. We will identify both rare and common genetic variants using a robust metric of resilience developed and validated by our research team. The identification of such genetic effects will provide novel targets for therapeutic intervention in AD.
Investigator:
Hohman, Timothy
Institution:
Vanderbilt University Medical Center
Project Title:
Alzheimer's Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
July 8, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD with the ultimate goal to generate harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Hohman, Timothy
Institution:
Vanderbilt University Medical Center
Project Title:
Estimating Heterogeneous Treatment Effects in Asymptomatic Alzheimer’s Disease.
Date of Approval:
July 15, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Alzheimer’s disease (AD) is a neurodegenerative disorder that progressively impairs memory and cognitive functions. Detecting and intervening in the asymptomatic stages of Alzheimer’s is crucial for slowing or preventing the onset of clinical symptoms. However, responses to preventive treatments often vary between individuals due to genetic heterogeneity. Understanding this variability can lead to more effective treatment strategies that are tailored to an individual's genetic profile. This project aims to estimate and make inferences about heterogeneous treatment effects (HTEs) in asymptomatic AD based on genetic information. Specifically, we will explore how genetic variations influence an individual's response to early interventions aimed at preventing the progression to symptomatic Alzheimer's. By leveraging machine learning algorithms and comprehensive genomic information in the A4 cohort, this project seeks to develop methods that can identify subgroups of individuals who are more or less likely to benefit from anti-amyloid treatments. The findings will provide critical insights into the genetic basis of treatment response variability in AD, paving the way for more personalized interventions in the early stages of the disease. This work aligns with the growing focus on precision medicine and has the potential to improve outcomes by offering more tailored therapeutic approaches for individuals at risk of developing Alzheimer's.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is a neurological disorder that gradually worsens over time, leading to memory loss and cognitive decline. Detecting and treating Alzheimer’s before symptoms appear is crucial to slowing down or preventing its progression. However, not everyone responds to early treatments in the same way, and much of this difference may be linked to genetics. This project is focused on understanding how a person’s genetic makeup can influence their response to treatments aimed at preventing Alzheimer’s. Specifically, we are studying how genetic differences affect the success of preventive treatments. By using advanced data analysis techniques and studying large genetic databases, we aim to identify which groups of people are more likely to benefit from early interventions. The goal of this research is to provide new insights into how genes affect treatment outcomes, allowing doctors to offer more personalized, effective treatments to individuals at risk of developing Alzheimer’s disease.
Investigator:
Holstege, Henne
Institution:
Amsterdam UMC
Project Title:
Searching for Alzheimer-related genetic variants and genes
Date of Approval:
December 8, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The purpose of this study is to find new Alzheimer-related variants and genes by combining exome and whole genome data from healthy controls and Alzheimer patients from different studies. Data will be analyzed using association, burden, and variance component statistics.
Non-Technical Research Use Statement:
Some individuals develop dementia, while others do not. A large part is likely determined by one’s genes: Alzheimer’s disease has a heritability of up to 80%. What are the key genetic factors that determine whether one will get Alzheimer’s disease? In this study, we will thoroughly explore genomic data of a large group of healthy persons and dementia patients to answer this question.
Investigator:
Hsu, Stephen
Institution:
Michigan State University
Project Title:
Machine Learning Methods for the Genetics of Alzheimer’s Disease
Date of Approval:
January 20, 2022
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Our goal is to use machine learning (ML) to understand and predict the genetic causes and risk of Alzheimer's disease (AD). Our study is purely data informatic in nature and follows common ML practices, including data cleaning and quality control of both samples and genotypes, pre-processing (such as correcting for covariates like age, sex and general population structure), training and parameter optimization, and evaluation using hold-out sets. Our primary goal is to increase the predictive power for disease status from only the genetic information. We use several different ML algorithms to build predictors, such as compressed sensing (also known as LASSO), neural networks, and horseshoe Bayesian regression. The underlying genetic architectures are studied through dissecting and analyzing the trained predictors (which loci are important? how important are they, in particular the polygenic importance beyond APOE? what are the genetic correlations for these loci within and across different ancestries or other population groups? etc.), informing both fundamental disease research and future predictor algorithm designs. A priori, we will at least use the phenotypic characteristics sex, age/age of onset, race, ethnicity, AD status, and family history. We will also investigate whether a more informative case variable can be constructed as a function of the mentioned variables in conjunction with AD status comments. Other phenotypic characteristics may also be used in the continuous improvement of our predictor algorithms. All analysis will be performed on high-performance computing clusters at Michigan State University (MSU), where the data will be stored under strict security, accessible only to the PI and three other MSU staff (who also sign the Data Transfer Agreement), in accord with regulations. We will publish all scientific results in peer-reviewed journals and make developed general algorithms public.
Published predictors will be made available, both to the public and returned to NIAGADS.
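To illustrate the LASSO predictor mentioned above, here is a minimal coordinate-descent sketch in plain Python. It is not the investigators' implementation; the function names, the toy genotype-like matrix, and the penalty value are hypothetical. It assumes the usual objective (1/(2n))·||y − Xβ||² + λ·||β||₁ and columns that are not all zero.

```python
def soft_threshold(rho: float, lam: float) -> float:
    """Shrink rho toward zero by lam; values within [-lam, lam] become 0."""
    if rho > lam:
        return rho - lam
    if rho < -lam:
        return rho + lam
    return 0.0


def lasso_cd(X, y, lam: float, n_iter: int = 200):
    """Fit LASSO coefficients by cyclic coordinate descent.

    X: list of rows (samples x features), y: list of responses.
    Assumes every column of X has at least one non-zero entry.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            rho = 0.0
            z = 0.0
            for i in range(n):
                # Prediction using every feature except j.
                partial = sum(X[i][k] * beta[k] for k in range(p) if k != j)
                rho += X[i][j] * (y[i] - partial)
                z += X[i][j] ** 2
            beta[j] = soft_threshold(rho / n, lam) / (z / n)
    return beta


# Toy example (invented): feature 0 carries signal, feature 1 is noise.
X = [[1, 0], [-1, 0], [0, 1], [0, -1]]
y = [2.0, -2.0, 0.1, -0.1]
print(lasso_cd(X, y, lam=0.1))
```

The L1 penalty drives weakly associated coefficients exactly to zero, which is what makes the trained predictor usable for asking which loci matter.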
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is affecting more than 6 million Americans with an enormous impact on their lives and surrounding families. We aim to improve existing and develop new methods to predict the AD risk of an individual from his/her genetic information. Such a predictor can inform both life and treatment decisions, and its inner workings shed light on the genetic causes of AD. As such, it furthers the basic disease understanding and both the development and employment of preventive medications. Our methods use modern machine learning techniques (e.g. LASSO and neural networks) which are trained on carefully processed trait and genetic data. The amount of data is a crucial factor for success and the more than 30,000 participating individuals in the NIAGADS database constitute a state-of-the-art resource for this type of research. All data is handled under strict security policies while all scientific results will be made publicly available through publications and downloadable files.
Investigator:
Hu, William
Institution:
Rutgers Biomedical and Health Sciences
Project Title:
Genomic and social determinants of cognitive decline and resilience in the Health and Retirement Study
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Both positive (social engagement, leisure activities) and negative (isolation, widowhood) social determinants have been identified as influencing cognitive decline in the Health and Retirement Study, but known genetic risk factors for Alzheimer’s disease, cardiovascular disease, and frailty are often not taken into account. Leveraging the expertise of the Asian Resource Center for Minority Aging Research, we propose to examine the impact of introducing genomic markers into our current models linking social determinants and cognitive decline, and to identify interactions predictive of vulnerability as well as resilience to cognitive decline.
Non-Technical Research Use Statement:
People’s behaviors can contribute to or compensate for genetic risks for age-related conditions such as dementia and frailty, and we will identify positive and negative behaviors associated with genetic risks for Alzheimer’s disease and related conditions.
Investigator:
Ichikawa, Osamu
Institution:
SUMITOMO PHARMA CO., LTD.
Project Title:
Understanding the genetic mechanism of Alzheimer's Disease
Date of Approval:
January 27, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The purpose of our study is to identify effective therapeutic targets for Alzheimer’s disease (AD) through stratifying this heterogeneous disorder into subtypes. We will (i) define specific patient segments based on known genetic risks such as ApoE genotype and AD-related phenotypes, and (ii) identify novel genetic factors and understand the biological and pathological mechanism for the specific segment. The specific patient segments will be defined and characterized by the known genetic risks such as ApoE genotype, and AD-related phenotypes, including symptoms, clinical progression, brain imaging and Braak stages. The whole genome sequencing data and the whole exome sequencing data will be analyzed to identify novel genetic variants or genes associated with AD in a case-control cohort for each specific segment. These findings will be confirmed with family-based association analyses. We plan to analyze the whole genome sequencing data and the whole exome sequencing data with several phenotype-variant analysis approaches. First, we will stratify and characterize the patients by the known genetic risks and AD-related phenotypes, including symptoms, clinical progression, brain imaging and Braak stages. Then, we will analyze common variants and rare variants, including loss-of-function mutations, to identify novel genetic variants associated with each specific segment. These findings will be confirmed with family-based association analyses. Systems biology approaches will be used to determine perturbation of specific genes or pathways related to the phenotypes and to understand the molecular mechanism in each segment by integrating public data such as gene expression data that could be useful to identify the relevant brain region and cell types. All data will remain anonymized and securely stored, and only those listed on our application and their staff will have access to these data.
We will not share any of the individual level data outside of Sumitomo Dainippon Pharma nor beyond the researchers on our application. We will adhere to all agreements through the DSS NIAGADS. We have a secure computational environment where we will use these data.
Non-Technical Research Use Statement:
The purpose of our study is to identify effective therapeutic targets for Alzheimer’s disease. Alzheimer’s disease is a heterogeneous disease that results from different combinations of genetic and environmental factors. Stratifying this heterogeneous disorder into subtypes based on genetic factors and objective phenotypes is an important step toward discovering effective therapeutic targets. The whole genome sequencing data and the whole exome sequencing data will be analyzed to identify novel genetic variants associated with disease and/or subtypes.
Investigator:
Jacquemont, Sebastien
Institution:
SAINTE-JUSTINE UNIVERSITY HOSPITAL CTR
Project Title:
Understanding Rare Genetic Variation and Disease Risk: A Global Neurogenetics Initiative
Date of Approval:
July 28, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Copy number variants (CNVs), deletions or duplications of DNA segments, are associated with greatly elevated rates of neurodevelopmental psychiatric disorders (NPDs), including Autism Spectrum Disorders (ASD), Psychosis Spectrum (PS) conditions, and ADHD. ‘Genetics first’ or ‘bottom-up’ studies, in which participants are ascertained based on genetic etiology, can potentially overcome challenges posed by the genetic, mechanistic, and phenotypic heterogeneity of behaviorally defined (idiopathic) NPDs. While efforts to date have been successful in elucidating underlying neural substrates in a handful of such CNVs, critical gaps in our knowledge remain: 1) It is unknown if the growing number of CNVs identified as associated with the same psychiatric disorders (e.g., ASD and PS) show commonalities at the macroscale brain level; 2) the guiding principles underlying relationships between CNV-related variations in brain architecture and the function of genes encompassed in these CNVs are unknown; 3) We propose to assemble the largest neuroimaging genomic dataset to date in order to identify the neural and shared molecular underpinnings of CNVs. Our multidisciplinary team, with expertise in medical and statistical genetics, mathematical modelling, and brain imaging, will address the following aims: Aim 1: Characterize the effect of the most well-established recurrent NPD-associated CNVs on brain structure and function. Aim 2: Beyond recurrent CNVs: Investigate effects on brain structure and function of global CNV burden genome-wide. CNVs will be identified using the same pipeline across all datasets (combining multiple CNV callers). Multimodal neuroimaging data, including T1-weighted structural images, T1w/T2w ratio images, resting-state functional MRI, and diffusion MRI, will be processed using the same harmonized pipelines developed by ENIGMA.
Non-Technical Research Use Statement:
Neurodevelopmental disorders (NDs) such as autism spectrum disorder (ASD) and schizophrenia (SZ) represent a major health burden, each affecting about 1% of the population. The genetic contribution to NDs is as high as 60-80%. With the routine implementation of genetic testing in neurodevelopmental clinics worldwide, rare mutations that contribute significantly to neurodevelopmental symptoms are identified in 10 to 40% of children with NDs. However, the effects of these mutations on brain structure and function are mostly unknown. We will comprehensively chart and characterize the effects of these mutations on brain structure and function. Such progress will provide insight into the mechanisms by which mutations may lead to neurodevelopmental and psychiatric disorders.
Investigator:
Jaffe, Andrew
Institution:
Neumora Therapeutics
Project Title:
Comparisons of pre- and post-mortem microglial populations
Date of Approval:
July 21, 2022
Request status:
Closed
Research use statements:
Technical Research Use Statement:
In this study, we propose to directly compare and analyze pre-mortem microglial cells obtained during surgical resection from Young et al [PMID: 34083789] with post-mortem microglia from Lopes et al [PMID: 34992268, Dataset NG00105] to better define the transcriptional landscape of human microglia and the effects of tissue processing. We have previously re-processed and re-analyzed bulk and single cell data from Young et al. to identify expression quantitative trait loci (eQTLs) and develop RNA deconvolution models to partition bulk microglia profiles (like those measured by Dataset NG00105) into cell fractions of 7 important microglial subpopulations/cell states including “homeostatic”, “stress”, and “chemokine/cytokine” using the single cell RNA-seq (scRNA-seq) data from Young et al. We propose to perform this RNA deconvolution in Lopes et al, and test whether any of these cell populations – particularly those related to neuroinflammation – are more prevalent in neurodegenerative disorders like Alzheimer’s disease (AD) or Parkinson’s disease (PD). We will also test whether these cell subtype fractions identified in pre-mortem tissue are consistent in postmortem tissue. As validation, we will perform supervised clustering of the NG00108 snRNA-seq data (in mouse) and test whether any AD-associated microglial cell subtypes are enriched in the 5xFAD genotype. Lastly, we propose to combine genotype and RNA data from Lopes et al (NG00105) and Young et al and perform an eQTL mega-analysis to double the discovery sample size of microglial eQTLs. We hypothesize that this mega-analysis will produce a much larger number of significant eQTLs, as the GTEx project [PMID: 32913098] found approximately 3000 eGenes in 100-subject discovery datasets (the approximate sample size of each of Young et al and Lopes et al) and approximately 7000 eGenes in 200 subjects (the combined sample size in this proposal). 
We will also assess clinical relevance by performing colocalization analysis of this larger eQTL map with genome-wide association studies (GWAS) of neurodegenerative disorders. Overall, this proposal will compare and contrast two recent large-scale genomic efforts profiling human microglia.
Non-Technical Research Use Statement:
This proposal will compare and contrast two recent large-scale genomic efforts profiling human microglia, one from premortem human brain tissue (Young et al, PMID: 34083789) and one from postmortem brain tissue (Lopes et al, PMID: 34992268, Dataset: NG00105). We will specifically assess the distribution of various microglial cell states – derived from single cell RNA-seq data – and determine if all of these states are represented in microglia from postmortem tissue. We will perform validation analyses of these cellular states in a mouse model of AD (Dataset: NG00108). Assuming the pre- and post-mortem datasets are comparable, we will combine these datasets and perform joint analysis of genotype and phenotype to better understand variation in microglia gene expression.
Investigator:
Jaiswal, Siddhartha
Institution:
Stanford University
Project Title:
Clonal Hematopoiesis in NIAGADS
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Approximately 15-20% of people age 70 or older carry a cancer-associated mutation in a substantial proportion of their blood cells, even though the vast majority do not have cancer. This condition has been termed Clonal Hematopoiesis of Indeterminate Potential (CHIP). Past research has demonstrated that CHIP is associated with an increased risk of both all-cause mortality and several age-associated diseases, such as atherosclerotic cardiovascular disease. Using data from TOPMed, we found a surprising correlation: after controlling for competing risks, such as death, age, sex, and APOE genotype, CHIP is associated with protection from AD and AD-related pathologies, and the degree of protection is proportional to the size of the mutant clone. We would like to improve our understanding of this phenomenon by leveraging the richly annotated whole genome sequencing (WGS) and whole exome sequencing (WES) datasets provided by NIAGADS under the Alzheimer’s Disease Sequencing Project (ADSP). Understanding this phenomenon could lead to possible treatments and a greater ability to predict who will suffer from AD and its related pathologies. NIAGADS has already performed the screening, recruitment, consent, and specimen collection. We are requesting access to datasets that include de-identified phenotypic and genetic information from both healthy controls and patients with AD and AD-related pathologies, specifically Dementia and Frontotemporal Dementia. We plan to control for and examine the following phenotypic characteristics: presence or absence of diagnosed neurological disease, age of disease onset (or age at the first visit for control groups), age at specimen collection, sex, ethnic origin, and measures of cognitive and clinical decline. These phenotypes will be evaluated against genetic information (WES, WGS, single-nucleotide polymorphisms, and polygenic risk scores). 
Specifically, we plan to use mutation callers such as Mutect or VarScan to identify somatic mutations (from the WES data, etc.).
Non-Technical Research Use Statement:
Approximately 15-20% of people age 70 or older carry a cancer-associated mutation in a substantial proportion of their blood cells, even though the vast majority do not have cancer. This condition has been termed Clonal Hematopoiesis of Indeterminate Potential (CHIP). Past research has demonstrated that CHIP is associated with an increased risk of both all-cause mortality and several age-associated diseases, such as atherosclerotic cardiovascular disease. Using data from TOPMed and the Alzheimer’s Disease (AD) Sequencing Project, we found a surprising correlation: CHIP is associated with protection from AD and AD-related pathologies, and the degree of protection is proportional to the size of the mutant clone. We would like to improve our understanding of this phenomenon by leveraging the richly annotated whole genome and whole exome sequencing datasets provided by NIAGADS to replicate this association. Understanding this phenomenon could lead to possible treatments and a greater understanding of individual AD risk.
Investigator:
Jiang, Rong
Institution:
Duke Health
Project Title:
Gene and Stress on Hearing Loss in Older Adults
Date of Approval:
December 4, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
The objective of this research is to investigate the genetic and epigenetic mechanisms underlying hearing loss (HL), with a specific focus on how psychosocial stress and related environmental exposures interact with genetic/epigenetic markers to affect HL risk and progression, especially in older adults. We will conduct a cohort-based study leveraging the HRS dataset, integrating genomic, epigenomic (e.g. DNA methylation), and environmental data to evaluate both main effects and interactions. Our analysis plan will include (1) genome-wide association studies (GWAS) and epigenome-wide association studies (EWAS) of HL; (2) joint tests of genetic, epigenetic, and environmental factors, with a particular focus on psychosocial stress; and (3) integrative approaches to develop a predictive model to improve HL prediction and identify at-risk subgroups. The primary phenotypes of interest include self-rated hearing difficulties and auditory measures from the pure tone audiometry test. Psychosocial stress includes measures from the Leave Behind Questionnaires, together with sociodemographic, lifestyle and health variables. This project will improve our understanding of how genetic susceptibility and stress exposures jointly impact HL risk, with the long-term goal of identifying biomarkers for early detection and potential intervention targets. At this stage, no external collaborators are planned, though we anticipate future opportunities for collaboration to validate findings across institutions or cohorts.
Non-Technical Research Use Statement:
We plan to study how genes and epigenetic changes (such as DNA methylation) interact with stress to affect hearing loss (HL) in older adults. Using Health and Retirement Study (HRS) data, we will investigate whether stress and genetics together contribute to higher risk of hearing problems. This research could help identify people at greater risk and provide insights into strategies to prevent or reduce HL through both medical and public health approaches.
Investigator:
Jin, Lei
Institution:
University of Florida
Project Title:
Assess the impact of common human TMEM173 alleles on Alzheimer's disease
Date of Approval:
March 6, 2025
Request status:
Expired
Research use statements:
Technical Research Use Statement:
Objectives: Affecting more than 3 million people per year, Alzheimer's Disease is the only leading cause of death without a treatment or cure. Genetics plays a major role in developing Alzheimer's Disease. For example, APOEe4 is the strongest genetic risk factor for sporadic Alzheimer's disease. The APOEe4 allele increases the disease risk by 3 times in heterozygotes and by 15 times in homozygotes. Notably, ~40% of African Americans have at least one APOEe4 allele. Yet, African Americans with APOEe4 do not have an elevated risk of developing Alzheimer's. The underlying protective mechanism in Africans is unknown. The TMEM173 gene encodes a protein called STING that is critical in host defense, anti-tumor immunity, and tissue inflammation. Besides the WT allele, the human TMEM173 gene contains two additional common alleles: R71H-G230A-R293Q (HAQ) and G230A-R293Q (AQ). In East Asians, WT/HAQ (34.3%), not WT/WT (22.0%), is the most common TMEM173 genotype. Intriguingly, the AQ allele is exclusively carried by Africans (~40%). The objective is to explore/establish an association between the common African AQ allele and protection against Alzheimer’s Disease. Study design: Our hypothesis is that the common African-specific TMEM173 AQ allele is associated with a decreased risk of Alzheimer’s disease. Using the Data Portal, https://dss.niagads.org/datasets/, we will select the relevant dataset. For example, one of the datasets we are interested in is (https://dss.niagads.org/sample-sets/snd 10003/) ADGC African American samples (Accession Number: snd 10003), which has 1648 controls and 1290 cases. We will download the sequence file, analyze the TMEM173 alleles in the 1648 controls and 1290 cases, calculate the odds ratio with p values, and determine if the TMEM173 AQ alleles impact AD incidence in Africans. We will also compare ADSP cohorts with Caucasians, Asians, and Hispanics for the common TMEM173 alleles. 
Analysis plan: The analyzed cohorts consist of cases and controls. We will compare the allele frequency differences between cases and controls. We will match gender, age, and APOEe4 allele but will not conduct detailed phenotypic characterization.
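The case/control allele comparison described above could be sketched as follows. This is a minimal illustration, not the investigators' actual pipeline: the function name and the AQ allele counts are hypothetical, the test used is an uncorrected Pearson chi-square on a 2x2 allele table (an assumption; the statement only says "odds ratio with p values"), and matching on gender, age, and APOEe4 is not modeled. Only the cohort sizes (1290 cases, 1648 controls) come from the statement.

```python
import math


def allele_odds_ratio(case_alt, case_ref, ctrl_alt, ctrl_ref):
    """Odds ratio, 95% CI (Woolf's log method), and Pearson chi-square
    p-value (1 degree of freedom) for a 2x2 allele-count table."""
    odds_ratio = (case_alt * ctrl_ref) / (case_ref * ctrl_alt)

    # Standard error of log(OR) from the four cell counts.
    se_log_or = math.sqrt(1 / case_alt + 1 / case_ref + 1 / ctrl_alt + 1 / ctrl_ref)
    ci_low = math.exp(math.log(odds_ratio) - 1.96 * se_log_or)
    ci_high = math.exp(math.log(odds_ratio) + 1.96 * se_log_or)

    # Pearson chi-square statistic without continuity correction.
    n = case_alt + case_ref + ctrl_alt + ctrl_ref
    chi2 = n * (case_alt * ctrl_ref - case_ref * ctrl_alt) ** 2 / (
        (case_alt + case_ref)
        * (ctrl_alt + ctrl_ref)
        * (case_alt + ctrl_alt)
        * (case_ref + ctrl_ref)
    )
    # Survival function of chi-square with 1 df: erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return odds_ratio, (ci_low, ci_high), p_value


# Cohort sizes from the statement; each person contributes two alleles.
N_CASES, N_CONTROLS = 1290, 1648
# Hypothetical AQ allele counts, for illustration only.
case_aq, ctrl_aq = 400, 600

result = allele_odds_ratio(
    case_aq, 2 * N_CASES - case_aq, ctrl_aq, 2 * N_CONTROLS - ctrl_aq
)
print(result)
```

An odds ratio below 1 with a confidence interval excluding 1 would be consistent with the protective hypothesis; with real data, a regression model including the matching variables as covariates may be more appropriate.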
Non-Technical Research Use Statement:
Genetic factors influence the risk of Alzheimer’s disease. For example, just one copy of the APOEe4 allele increases the odds of Alzheimer’s disease by ~3-fold. About 40% of Africans have at least one copy of the APOEe4 allele. Yet, Africans with APOEe4 do not have an elevated risk of developing Alzheimer's. The underlying protective mechanism is unknown. Accumulating evidence suggests that the TMEM173 gene promotes neuroinflammation and neurodegeneration in diseases such as Parkinson’s and Alzheimer’s. Intriguingly, while the majority of people have the WT allele of the TMEM173 gene, many have the R71H-G230A-R293Q (HAQ) or G230A-R293Q (AQ) alleles. More East Asians are WT/HAQ (34.3% of East Asians) than WT/WT (22.0%). Meanwhile, the AQ allele is African-specific. Less than 1% of non-Africans have the AQ allele, while ~40% of Africans carry the AQ allele. One copy of the AQ allele can functionally suppress TMEM173-promoted tissue inflammation in mice. Here, we explore the potential protective role of the AQ allele in Alzheimer’s disease.
Investigator:
Jinwal, Umesh
Institution:
University of South Florida, College of Pharmacy
Project Title:
Characterize the Role of Shroom-3 in Alzheimer's Disease
Date of Approval:
February 5, 2020
Request status:
Expired
Research use statements:
Technical Research Use Statement:
Objectives of the proposed research: Alzheimer's disease (AD), the most common type of dementia, is a neurodegenerative disease that generally affects people older than 45 years. AD patients show a persistent cognitive decline that leads to total disability at the end stage. Tau protein is one of the major proteins linked to AD progression; it accumulates in neurons and forms paired helical filaments. As a result, Tau protein loses its capability to bind to microtubules, leading to neurodegeneration. We have performed Cdc37 chaperone-based mass spectrometry to identify novel proteins linked to AD. We found that the Shroom-3 interaction with Cdc37 was completely abolished in AD brain tissues compared to normal human brain tissues. These data provide strong evidence for a potential role of Shroom-3 in AD. Currently, there is no genomic data available on Shroom-3 in AD cases. Hence, with this data access we aim to perform genomic analysis for Shroom-3 and identify any potential mutations (SNPs) in Shroom-3 in AD. After analyzing Shroom-3, we will look at Cdc37 chaperones and other related proteins to fully characterize Shroom-3 and associated proteins. Study design: As a pilot study, we will aim for a sample size of n=100. Depending on available data and information for analysis, we will group samples by sex, ethnicity, and age group. Depending on the results of the pilot study with n=100, the sample size will be increased. We will use bioinformatic software to compare gene sequences from AD patients with those from healthy individuals (wild-type gene sequences) to identify any potential mutations/single nucleotide polymorphisms (SNPs). Based on results, we will plan cellular and animal model studies for further characterization. 
Analysis plan, including the phenotypic characteristics that will be evaluated in association with genetic variants: We will carefully examine aligned sequences from AD and healthy individuals for changes in a particular nucleotide or set of nucleotides to identify mutations/SNPs in different groups (gender, ethnicity, and age).
Non-Technical Research Use Statement:
Alzheimer's disease (AD), the most common type of dementia, is a neurodegenerative disease that generally affects people older than 45 years. AD patients show a persistent cognitive decline that leads to total disability at the end stage. Tau protein is one of the major proteins linked to AD progression; it accumulates in neurons and forms paired helical filaments. As a result, Tau protein loses its capability to bind to microtubules, leading to neurodegeneration. We have performed Cdc37 chaperone-based mass spectrometry to identify novel proteins linked to AD. We found that the Shroom-3 interaction with Cdc37 was completely abolished in AD brain tissues compared to normal human brain tissues. These data provide strong evidence for a potential role of Shroom-3 in AD. Currently, there is no genomic data available on Shroom-3 in AD cases. Hence, with this data access we aim to perform genomic analysis for Shroom-3 and identify any potential mutations (SNPs) in Shroom-3 in AD. We will also look at Cdc37 chaperones and related proteins to fully characterize Shroom-3 and associated proteins.
Investigator:
Johansson, Fredrik
Institution:
Chalmers University of Technology
Project Title:
AD subtypes
Date of Approval:
August 10, 2021
Request status:
Closed
Research use statements:
Technical Research Use Statement:
Alzheimer’s disease (AD) is a highly heterogeneous disease with diverse clinical manifestations and rates of progression. Identifying AD subtypes will improve our understanding of the underlying disease mechanisms, enable us to predict disease trajectory and develop new disease-modifying treatments. Objective. To identify distinct AD subtypes from WGS data of AD individuals. Analysis plan. We will use 3000 WGS samples derived from the ADSP Discovery Case-Control Based Extension Study. We will use the available SNVs and INDELs and infer structural variants (SVs) with our in-house multi-caller pipelines. Rare variants will be retained for further analysis. We will then split the dataset into training and test sets, and use the identified set of genetic variants (i.e. SNVs, INDELs and SVs) as input to a deep neural network (an autoencoder architecture) to learn an unsupervised latent representation of the data. AD subtypes will be identified within this reduced space and characterized using demographic and clinical data. We will then contrast each subtype with the control groups to identify subtype-relevant variants (i.e. putative subtype biomarkers), which will be used as input features to a gradient boosted tree model to generate a subtype predictive model and subtype-specific features. Planned collaboration. Each member of the team will devote effort to specific areas of investigation; nevertheless, all team members will discuss individual progress and potential challenges through regular meetings. In particular, Dr Coppola (Research Scientist, Department of Pathology, Yale University, USA), together with Dr Dean Palejev (Associate Professor, GATE Institute, Sofia University, Bulgaria), will be involved in the deep learning model generation and validation, and subtype identification; Dr Fredrik Johansson (Assistant Professor, Department of Computer Science & Engineering, Chalmers University of Technology, Sweden) will work on the supervised machine learning model; Dr Alexander Schliep (Associate Professor, Department of Computer Science & Engineering, University of Gothenburg, Sweden) will work on the SV inference.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is a highly heterogeneous disease with diverse clinical manifestations and rates of progression. The heterogeneity of AD has complicated both clinical trial design and outcomes, and thus the need for better models of AD, and/or better strategies for selection of participants into specific clinical trials, is evident. The identification of more homogeneous disease subgroups (i.e. AD subtypes) will improve our understanding of the underlying disease mechanisms, enable us to predict disease trajectory and develop new disease-modifying treatments. We will use a comprehensive set of genetic variants in combination with deep learning algorithms to identify AD subtypes. Subtypes will be characterized using clinical and demographic data. Finally, variants specific to each cluster will be identified and used to train a predictive machine-learning model to classify new individuals.
Investigator:
Johnson, Emma
Institution:
Washington University School of Medicine
Project Title:
Investigating the Multivariate Genetic Architecture of Aging-Related Traits
Date of Approval:
January 30, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Aging encompasses a range of phenotypes, including lifespan, healthspan, aging-related diseases, and other aging-related outcomes. Aging-related phenotypes are correlated with each other but not perfectly. We will develop a multivariate genomic model of aging-related outcomes and diseases in order to 1) understand whether aging-related phenotypes are best described individually, clustered in small groups, or as a single group and 2) identify genetic variants that underlie processes shared across multiple aging-related phenotypes. First, we will examine genetic correlations among aging-related phenotypes such as Alzheimer's disease, longevity, epigenetic aging, telomere length, parental lifespan, healthspan, frailty, and other aging-related diseases (e.g. Parkinson's disease). We will then examine patterns of genomic covariation among aging-related phenotypes using Genomic Structural Equation Modeling. Finally, we will perform multivariate genome-wide association studies and derive polygenic scores from these GWAS.
Non-Technical Research Use Statement:
A wide range of characteristics and behaviors are related to aging, including lifespan, aging-related diseases such as Alzheimer's disease and Parkinson's disease, and frailty. Aging-related characteristics are genetically correlated, meaning that some genetic variants affect more than one aging-related phenotype (e.g. some genetic variants may confer risk for Alzheimer's disease and may be related to shorter lifespan). We aim to identify how different aspects of aging are genetically related to each other, and whether we can use broader groups of correlated phenotypes to identify genetic variants related to certain aspects of aging.
Investigator:
Jun, Gyungah
Institution:
Boston University School of Medicine
Project Title:
AI4AD (Artificial Intelligence for Alzheimer’s Disease): Ultrascale Machine Learning to Empower Discovery in Alzheimer’s Disease Biobanks
Date of Approval:
January 12, 2021
Request status:
Expired
Research use statements:
Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to genomic, imaging and cognitive data, in order to 1) identify AD genotypes and endophenotypes that dissect AD’s heterogeneity; 2) relate said genotypes and endophenotypes with clinical progression in pre-dementia patients; 3) identify novel treatment targets for AD by analyzing whole genome and associated phenotypic data. The goals of this multisite initiative (Paul Thompson, USC; Christos Davatzikos, Li Shen, Penn; Andy Saykin, IU; Heng Huang, Pitt; Paul Crane, UW; Adam Brickman, Columbia; Tim Hohman, Vanderbilt; Gyungah Jun, BU; Duygu Tosun, UCSF; Alexander Zaranek, Curii) leverage the promise of machine learning (ML) to contribute to precision diagnostics, prognostication, and targeted and novel treatments. We will develop ML and deep learning methods to apply to large scale biobanks of whole genome sequences (WGS), neuroimaging, cognitive, and clinical data, aiming to discover new genomic features that influence biological processes of AD. We will apply methods of genome representation and tiling to WGS repositories to create inputs for AI methods. We will develop novel, interpretable, biological knowledge guided deep learning methods to discover genomic motifs associated with AD, AD risk, and biological processes of AD as defined by NIA-AA criteria. To quantify subtypes and disentangle biological processes of AD, we will apply computational methods to multimodal MRI and amyloid- and tau-sensitive PET to stratify and subtype patient groups; novel imaging genomics methods will detect genomic markers and pathways that modulate the developing pathology as detected in the images, and that predict future clinical decline or resilience. 
We hypothesize that advanced deep learning methods combined with whole genome data will outperform traditional methods and GWAS for predicting AD onset and progression, and will assist with disease subtyping and discovering treatable targets in the genome. A team will rank and repurpose existing drugs, and identify novel drugs and targets in the genome, based on the discovered genetic motifs affecting AD.
Non-Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to extensive and rich genomic, imaging and cognitive data, in order to 1) identify genotypes and endophenotypes of AD that dissect the heterogeneity of the disease; 2) relate these genotypes and endophenotypes with clinical progression, in pre-dementia patients; 3) identify novel treatment targets for AD, by analyzing whole genome and associated phenotypic data at a previously impossible scale. Collectively, the goals of this highly collaborative multi-site initiative leverage the promise of machine learning to contribute to precision diagnostics, prognostication, and targeted and novel treatments.
Investigator:
Kamboh, M. Ilyas
Institution:
University of Pittsburgh
Project Title:
Genetics of Alzheimer's Disease and Endophenotypes
Date of Approval:
March 31, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objectives: We are requesting access to the NIAGADS datasets to augment our ongoing studies on the genetics of Alzheimer’s disease (AD) and AD-related endophenotypes, carried out by Kamboh and his group since 1995. We are conducting GWAS using array genotypes, whole-exome sequencing and whole-genome sequencing on datasets derived from the University of Pittsburgh ADRC and ancillary population-based longitudinal studies on dementia and biomarkers. Available phenotypes include AD and non-AD dementia, age-at-onset, disease progression and survival, neuroimaging, cognitive decline, and plasma biomarkers for the core ATN and non-ATN pathologies. We also plan to expand on gene-gene interaction and sex-stratified analyses, which require the actual genotype data. The NIAGADS datasets will be used for replication and meta-analysis, and for gene-gene interaction and sex-stratified analyses. Study Design: A case-control design will incorporate a diverse cohort of individuals with AD and age-matched controls. For quantitative traits (neuroimaging and plasma biomarkers, cognitive performance measures, indicators of disease progression), linear regression analyses will be performed to identify genetic loci. To ensure the findings are robust and inclusive, participants from diverse demographic backgrounds will be included, enabling the exploration of potential genetic variations across populations. Analysis Plan: We will conduct GWAS and targeted analyses of candidate genes on different AD and AD-related phenotypes. Primary phenotypic variables include AD disease status, age-at-onset, last age for controls, APOE genotype, cognitive decline trajectories, sex, and race. Analyses will evaluate the influence of specific genetic variants on disease risk, cognitive performance, and biomarker levels, considering both individual and interactive effects of the APOE genotype. Results will be adjusted for potential confounders, such as demographic factors, to ensure valid associations. 
Detailed analytical methods are described in our published papers for case-control (PMID: 32651314; 35694926), quantitative traits (PMID: 30361487; 37666928), and cognitive decline (PMID: 37089073; 30954325).
Non-Technical Research Use Statement:
Our research group at the University of Pittsburgh (Pitt) has been working on the genetics of Alzheimer’s disease (AD) and AD-related endophenotypes for almost three decades, using data derived largely from the University of Pittsburgh Alzheimer’s Disease Research Center and ancillary dementia studies. We are requesting access to the NIAGADS genotype and phenotype datasets to augment our sample size and increase power to detect novel genetic associations with AD and related endophenotypes.
Investigator:
Katt, Moriah
Institution:
West Virginia University
Project Title:
Machine Learning-Driven Identification of Alzheimer’s Disease-Specific Targets
Date of Approval:
June 10, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
The objective of this study is to prioritize Alzheimer’s disease (AD)-associated genetic targets using computational and machine learning approaches applied to genome-wide association study (GWAS) summary statistics derived from the NIAGADS dataset (NG00075). Specifically, we will analyze variant-level association signals from the International Genomics of Alzheimer’s Project (IGAP) meta-analysis (Kunkle et al., 2019) and integrate these signals into gene-level representations to identify genes that may be associated with AD pathology and potential molecular targets. Using statistical aggregation and machine learning modeling approaches, we will evaluate patterns of genetic association across variants mapped to genes and genomic regions to prioritize candidate genes that may be linked to AD biological mechanisms. These analyses will leverage association statistics and allele frequency information available in the controlled-access IGAP summary statistics. Allele frequency information provides population-level context for variant-disease associations and can support prioritization of candidate genes whose associated variants may reflect biological processes relevant to AD across broader segments of the population represented in the GWAS. This study is a secondary computational analysis of existing, de-identified GWAS summary statistics. The dataset contains variant-level association metrics describing the statistical relationship between genetic variants and AD. No individual-level genotype data, clinical records, or personally identifiable information will be accessed. All analyses will be conducted using secure institutional computing systems in compliance with NIAGADS data use policies. Computational analyses will be performed on a HIPAA-compliant High-Performance Computing cluster operated by Research Computing at West Virginia University that provides a secure institutional environment for data storage and analysis. 
The phenotypic characteristic evaluated in association with genetic variants is AD case-control status as defined in the original IGAP meta-analysis, and variants will be analyzed in relation to their reported statistical association with AD.
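One simple way to integrate variant-level signals into gene-level representations, as described above, is a minimum-p-value score per gene. The sketch below is an assumption for illustration, not the method the investigators state they will use: the gene labels and p-values are hypothetical, and the Šidák adjustment treats variants within a gene as independent, which linkage disequilibrium generally violates.

```python
from collections import defaultdict


def gene_min_p(variants):
    """Collapse variant-level p-values into one score per gene.

    variants: iterable of (gene, p_value) pairs.
    Returns {gene: Sidak-adjusted minimum p-value}, where the adjustment
    1 - (1 - p_min) ** m accounts for testing m variants in the gene
    under an independence assumption.
    """
    by_gene = defaultdict(list)
    for gene, p in variants:
        by_gene[gene].append(p)

    scores = {}
    for gene, p_values in by_gene.items():
        m = len(p_values)
        scores[gene] = 1.0 - (1.0 - min(p_values)) ** m
    return scores


def rank_genes(scores):
    """Return gene names ordered from strongest to weakest signal."""
    return sorted(scores, key=scores.get)


# Hypothetical variant-to-gene mappings and p-values, for illustration only.
example = [
    ("GENE_A", 1e-6),
    ("GENE_A", 0.2),
    ("GENE_B", 0.01),
]
scores = gene_min_p(example)
print(rank_genes(scores))
```

Gene-level scores of this kind could then serve as input features or labels for a downstream machine learning model; methods that model linkage disequilibrium explicitly would be preferable for a real analysis.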
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is a progressive neurodegenerative disorder that impacts millions of people worldwide. Previous large genetic studies have identified many differences in people’s DNA that are associated with an increased risk of developing AD. By studying these genetic differences across large groups of individuals, researchers can identify genes that may be connected to the biological changes that occur in the brain during the disease. In this project, we will analyze existing genetic research data to identify genes that are strongly associated with AD. The long-term goal of this work is to help identify molecular targets that could inform future therapeutic delivery strategies aimed at improving the ability of treatments to localize to and remain in affected brain tissue.
Investigator:
Kim, Jong Hun
Institution:
KOREA UNIVERSITY RESEARCH AND BUSINESS FOUNDATION
Project Title:
Discovery of APOE-Interacting Genes Through Trans-Ancestry and Sex-Stratified Analysis to Elucidate Alzheimer's Disease Risk Mechanisms and Stratify ARIA Risk Using Proxy Outcomes
Date of Approval:
July 20, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objectives: This project identifies ancestry- and sex-specific APOE ε4 modifier genes—variants that amplify or attenuate APOE ε4’s effect on AD risk and ARIA susceptibility from anti-amyloid immunotherapy. Aim 1: Trans-ancestry sex-stratified GWIS to construct an APOE-Wide Epistasis Map. Aim 2: Mechanistic validation via eQTL/pQTL colocalization and epistasis network. Aim 3: Explainable AI (XAI) integrating modifier SNPs, multi-omics subtypes, and ARIA proxy outcomes to stratify pre-treatment ARIA risk. Study Design: Multi-cohort secondary analysis using NIAGADS-controlled ADSP data exclusively. Individual-level data from all 15 ADC cohorts (NG00022–NG00151) and multi-ancestry ADSP WGS (NG00067, NG00166) span European, African American, Hispanic/Latino, and South/East Asian ancestries. Functional datasets (eQTL/pQTL: NG00102, NG00118, NG00120, NG00130) support Aim 2; imaging and neuropathology datasets (NG00103, NG00147, NG00175) enable Aim 3 ARIA proxy development. No prospective recruitment. Multi-dataset rationale: GWIS requires 4–8× more samples than standard GWAS (Gauderman 2002); no single cohort is independently powered—all 15 ADC cohorts must be pooled. Trans-ancestry GWIS requires ancestry-matched datasets (NG00100/African, NG00106/South Asian, NG00141/Hispanic) because population-specific LD cannot be imputed from summary statistics. Functional datasets (eQTL, pQTL, methylation) are non-redundant—each covers a distinct regulatory layer for Aim 2. All datasets are AD-specific; non-AD neurodegeneration data are excluded. Analysis Plan: Phenotypes: AD case/control (primary); APOE ε4 × SNP interaction; lobar microbleed count (ARIA-H proxy); SVD score (WMH, lacunar infarcts, perivascular spaces); longitudinal cognitive decline. Covariates: age, sex, top 20 ancestry PCs, stratum. Methods: logistic GWIS; trans-ancestry meta-analysis (METAL/MR-MEGA); sex-stratified/X-chromosome analyses; eQTL/pQTL colocalization (COLOC2/SMR); XGBoost XAI with 5-fold CV and SHAP.
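The meta-analysis step listed above can be illustrated with a fixed-effect inverse-variance combination of per-cohort interaction estimates. This is a generic sketch of that weighting scheme, not the METAL or MR-MEGA software named in the statement. The effect sizes and standard errors are made up for illustration.

```python
import math

def inverse_variance_meta(estimates):
    """Fixed-effect meta-analysis of (beta, standard_error) pairs, one per cohort."""
    weights = [1.0 / (se * se) for _, se in estimates]
    w_sum = sum(weights)
    beta = sum(w * b for w, (b, _) in zip(weights, estimates)) / w_sum
    se = math.sqrt(1.0 / w_sum)
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal p-value
    return beta, se, z, p

# Hypothetical APOE e4 x SNP interaction estimates from two ancestry strata.
beta, se, z, p = inverse_variance_meta([(0.2, 0.1), (0.4, 0.1)])
```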
Non-Technical Research Use Statement:
Alzheimer’s disease affects tens of millions worldwide. Lecanemab, approved in 2024, slows Alzheimer’s progression by removing amyloid plaques—but causes dangerous brain side effects (ARIA: Amyloid-Related Imaging Abnormalities) especially in APOE ε4 carriers, who also most need treatment. Currently, doctors cannot predict which APOE ε4 carriers will benefit versus be harmed. Our research identifies modifier genes controlling how dangerous APOE ε4 is. We leverage the ADSP’s diverse dataset spanning 15+ cohorts across European, African American, Hispanic/Latino, and Asian ancestries—a scale statistically necessary because detecting gene–gene interactions requires 4–8× more samples than standard genetic studies. Population-specific patterns allow high-confidence modifier identification. MRI-based brain bleeds and vascular markers serve as validated ARIA surrogates available at scale. The result is an explainable AI tool that predicts—before treatment begins—which APOE ε4 patients face high ARIA risk and which will benefit from lecanemab, enabling precision Alzheimer’s therapy.
Investigator:
Klein, Robert
Institution:
Icahn School of Medicine at Mount Sinai
Project Title:
Polygenic risk for dementia with Lewy bodies
Date of Approval:
October 24, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Dementia with Lewy Bodies (DLB) is an understudied Alzheimer’s-related dementia characterized neuropathologically by the accumulation of Lewy bodies in the brain. Like other dementias, definitive diagnosis can only be made upon autopsy after death, though numerous clinical features such as visual hallucinations, Parkinsonism, and REM sleep behavior disorder are associated with the condition. DLB is thought to account for approximately 5% of dementia diagnoses. A major barrier to understanding the natural history and pathology of this condition is the lack of definitive diagnoses that can be made during a patient’s lifetime. Recently, a polygenic risk score (PRS) was developed that can help identify people at higher risk of developing DLB based on their genetic profile (Chia et al, 2021). Intriguingly, the genetic variants associated with DLB identified in this study are also associated with risk of either Alzheimer’s disease or Parkinson’s disease, consistent with other observations that have identified pathological features in DLB similar to those two. This study will examine the association of the DLB polygenic risk score with various phenotypic measures in the ADSP. The goal of this analysis is to determine if specific features are enriched in people with a high propensity for DLB based on genetics. To the extent that these people have another diagnosed dementia, this would suggest the possibility of a misdiagnosis. Specifically, we will examine individuals whose data is shared through the NIA’s NIAGADS Data Sharing Service. For each individual, we will compute the DLB PRS previously described [3]. We will then ask if the score correlates with a diagnosis of Alzheimer’s disease, presence of amyloid, Braak stage, or Parkinson’s Disease Braak stage. We will also look at subsets of the DLB PRS consisting only of SNPs associated with Alzheimer’s disease or Parkinson’s disease.
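A polygenic risk score of the kind described above is commonly computed as a weighted sum of risk-allele dosages. The sketch below shows that calculation and how a subset score (for example, only SNPs also associated with Alzheimer's disease) could be formed. The SNP identifiers, weights, and dosages are placeholders, not values from Chia et al.

```python
def polygenic_score(dosages, weights, subset=None):
    """Weighted sum of allele dosages over SNPs present in both inputs.

    dosages: {snp_id: dosage in [0, 2]} for one individual
    weights: {snp_id: per-allele effect size}
    subset:  optional set of snp_ids to restrict the score to
    """
    shared = dosages.keys() & weights.keys()
    if subset is not None:
        shared &= subset
    return sum(weights[snp] * dosages[snp] for snp in shared)

# Placeholder weights and one individual's dosages.
weights = {"snp1": 0.5, "snp2": -0.25, "snp3": 1.0}
dosages = {"snp1": 2, "snp2": 1, "snp3": 0}

full_score = polygenic_score(dosages, weights)
ad_only_score = polygenic_score(dosages, weights, subset={"snp1"})
```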
Non-Technical Research Use Statement:
Dementia with Lewy bodies (DLB) is an understudied dementia with features similar to both Alzheimer’s disease and Parkinson’s disease. Recent studies have developed a polygenic risk score that can predict who is at higher risk of DLB on the basis of their genetics. Genetic factors in this score have also been associated with Alzheimer’s disease and Parkinson’s disease. Here we will ask how the DLB risk score correlates with Alzheimer’s diagnosis and various features of the brains of people with Alzheimer’s to better understand the relationship between DLB, Alzheimer’s, and Parkinson’s.
Investigator:
Klopack, Eric
Institution:
Indiana University
Project Title:
Health and Aging Associations with Omic Data in the Health and Retirement Study
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
This study will investigate the biological pathways linking social and demographic factors to aging and health outcomes, including mortality, disease morbidity, physical functioning, and cognitive functioning. It will link publicly available data from the Health and Retirement Study (HRS) to epigenetics data available from the National Institute on Aging Genetics of Alzheimer's Disease Data Storage Site (NIAGADS). This study will examine links between social factors (e.g., social stress, social position, discrimination, social support and isolation) to health outcomes (e.g., mortality, disease morbidity, physical functioning, and cognitive functioning) via epigenetics, with a particular focus on aging. This study will utilize machine learning and regression techniques to identify epigenetic markers associated with aging and health and/or sociodemographics. This study will also calculate epigenetic clocks and surrogates not available in the public data releases. This project will also investigate menopause timing and characteristics, sex hormones, hysterectomy and oophorectomy, and other sex and aging factors as predictors and outcomes of epigenetic factors. This will include epigenome-wide analyses, machine learning analyses, and clock and surrogate-based analyses. This project will also investigate how DNAm pipeline procedures may affect epigenetic results, including how decisions surrounding normalization procedures affect EWAS and epigenetic clock results.
Non-Technical Research Use Statement:
This project will identify epigenetic factors linking social and demographic factors to health outcomes. It will also examine how sex and aging related factors may affect or be affected by epigenetics. It will also examine bioinformatic and biostatistical decisions that might affect epigenetic results.
Investigator:
Knowles, David
Institution:
New York Genome Center
Project Title:
Learning the Regulatory Code of Alzheimer's Disease Genomes
Date of Approval:
December 20, 2024
Request status:
Expired
Research use statements:
Technical Research Use Statement:
Our overarching objective is to apply machine learning techniques to predict and interpret the functional effects of genetic variants including Single Nucleotide Variants (SNVs), indels and Structural Variants (SVs) from AD WGS data at the levels of DNA regulation and RNA processing, and link these effects directly to pathways and network context. We will leverage WGS generated by the ADSP and others together with harmonized endophenotypes and clinical data, multi-omics data from the AMP-AD, functional genomics data from Roadmap Epigenomics, PsychENCODE and GTEx Projects, and microglia- and monocyte-specific transcriptomic and single-cell RNA-seq data sets. Our central hypothesis is that many AD-associated genetic risk or protective variants influence pre- and post-transcriptional gene regulation, resulting in changes to gene expression and cellular pathways/networks, and ultimately contribute to protein aggregation in AD. The objective of this aim is to leverage deep-learning-based models capable of predicting functional effects of genomic variants on pre- and post-transcriptional gene regulation. We will train existing and novel sequence-based deep learning models of epigenomic state and RNA regulation and processing specific to AD-relevant cell types and states. In silico mutagenesis under these trained models will be used to calculate functional impact “delta scores” for every SNV, indel and SV detected from AD WGS. We will use these delta scores to empower non-coding rare variant tests of association with AD at the regulatory region, gene and pathway levels. We will conduct functional fine-mapping through the integration of (i) the CNN delta scores, (ii-iii) expression and splicing quantitative trait loci (eQTL and sQTL), (iv) AD endophenotypes and (v) multi-ethnic AD WGS data. We will use probabilistic ML methods, combined with cell-type-specific and single-cell RNA-seq datasets, to build gene regulatory networks.
This NIH-funded project is a close collaboration with Dr. Towfique Raj at Mount Sinai Medical School.
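The "delta score" idea in the technical statement above can be sketched as the difference between a model's prediction for the alternate-allele sequence and for the reference sequence. In the sketch below, `toy_model` is a hypothetical stand-in (GC fraction) for a trained sequence-based deep learning model, and the sequence and variant are invented.

```python
def apply_snv(sequence, position, alt):
    """Return a copy of the sequence with the base at a 0-based position replaced."""
    return sequence[:position] + alt + sequence[position + 1:]

def toy_model(sequence):
    """Hypothetical stand-in for a trained model: fraction of G/C bases."""
    return sum(base in "GC" for base in sequence) / len(sequence)

def delta_score(model, ref_sequence, position, alt):
    """In silico mutagenesis: prediction(alt sequence) minus prediction(ref sequence)."""
    return model(apply_snv(ref_sequence, position, alt)) - model(ref_sequence)

ref = "ACGTACGTAA"                          # invented reference window
score = delta_score(toy_model, ref, 0, "G")  # A>G at position 0
```

With a real model, the same pattern would be applied to every observed variant, with the window centered on the variant position.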
Non-Technical Research Use Statement:
Despite decades of research and enormous investment, no disease-modifying treatment is available for Alzheimer’s disease (AD). Combining population-scale data collection, human genetics and machine learning provides a way forward to uncover and characterize new causal cellular processes involved in AD. Effectively integrating diverse genomic data to better understand AD represents a substantial computational challenge, both in terms of data scale and analysis complexity. We will train machine learning models to predict epigenomic signals from the genomic sequences to estimate the functional impact of any genetic variant. These analyses will highlight variants and genes involved in AD. However, genes do not operate in a vacuum, so robust machine learning will be used to learn cell-type- and disease-specific networks. Such pathways will be prime candidates for future functional and therapeutic studies of AD.
Investigator:
Konermann, Silvana
Institution:
Arc institute
Project Title:
Modeling Alzheimer’s disease risk and associated molecular phenotypes
Date of Approval:
August 8, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
The objective of the proposed research is to determine the relationship between Alzheimer’s disease (AD) genetic risk and associated molecular phenotypes. Genotype data will be used to compute a polygenic risk score (PRS) for disease-affected and control (non-disease-affected) participants. Statistical regression and mediation analyses will be used to model variation of molecular phenotypes with respect to PRS and, where available, pathology stage or cognitive impairment. Molecular phenotypes to be analyzed include bulk/single-cell/single-nucleus transcriptome, epigenome, proteome, metabolome, lipidome, amyloid, and tau. Molecular phenotypes of participants, including controls, will be matched with molecular phenotypes of in vitro cellular models, informing the design of in vitro perturbation experiments that recapitulate the genetic drivers of AD risk.
Non-Technical Research Use Statement:
Our goal is to determine the relationship between human genetic profiles associated with Alzheimer’s disease (AD) risk and specific measurable characteristics of human cells. Using multiple statistical analysis methods, we will build quantitative models that describe how those characteristics vary as a function of AD genetic risk. The models we build will help us design in vitro cellular systems that reflect different levels of AD risk, enabling experiments that inform new strategies for treating or preventing AD.
Investigator:
Koran, Mary Ellen
Institution:
Mayo Clinic Arizona
Project Title:
Alzheimer's Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
April 28, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD with the ultimate goal to generate harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Kulminski, Alexander
Institution:
Duke University
Project Title:
ApoE2 and protective molecular signatures in Alzheimer’s disease and aging
Date of Approval:
November 20, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objective: to identify personalized polygenic profiles, comprised of the APOE e2 allele, other SNPs in the APOE region, and SNPs spread through the entire genome, with stronger protection in aging and Alzheimer’s Disease (AD) framework, and identify the role of AD risk factors in these profiles using ADSP and 10 more studies. Study design: Aim 1 will identify molecular signatures of aging-related traits (AD, cardiovascular diseases, longevity, etc.), defined as differences in linkage disequilibrium (LD) patterns between affected and unaffected subjects using methods of LD analysis. In Aim 2 we will dissect heterogeneity in the molecular signatures using methods of stratification analyses. We will examine the impact of age at onset, sex, race/ethnicity, Braak stage, AD risk factors (diabetes, lipids, hypertension, body mass index, education), and other factors. Aim 3 will identify personalized polygenic profiles of aging-related traits using traditional and advanced bio-demographic methods. In Aim 4 we will perform bioinformatics analysis and characterize transcription pathways using summary statistics and individual-level data from the expression quantitative trait loci studies. In some cases, we may need to pool several datasets to increase power of the analyses in a mega sample. This will be done by pooling individuals’ records for genotypes and selected phenotypes described above from different studies. This pooling will not create any additional risks to participants because neither genetic nor phenotypic information for the same individual will increase. This research is consistent with data use restrictions for ADSP. We will not conduct non-genetic research, will not investigate individual pedigree structures, population origins, ancestry, individual participant genotypes, perceptions of racial/ethnic identity, variables that could be considered as stigmatizing an individual or group, or issues such as non-maternity. 
The research is designed to protect data confidentiality and follow local and institutional policies and procedures for data handling. The results of this research will be broadly shared with the scientific community.
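Aim 1 above defines molecular signatures as differences in linkage disequilibrium (LD) patterns between affected and unaffected subjects. A minimal way to picture this is to compute pairwise LD (D and r²) separately in each group and compare. The sketch assumes phased haplotype counts for two biallelic SNPs are available, which is a simplification, and the counts are invented. It is not the investigators' LD analysis method.

```python
def ld_from_haplotypes(n_AB, n_Ab, n_aB, n_ab):
    """Return (D, r_squared) for two biallelic SNPs from phased haplotype counts."""
    n = n_AB + n_Ab + n_aB + n_ab
    p_AB = n_AB / n
    p_A = (n_AB + n_Ab) / n
    p_B = (n_AB + n_aB) / n
    d = p_AB - p_A * p_B
    denom = p_A * (1 - p_A) * p_B * (1 - p_B)
    r2 = d * d / denom if denom > 0 else 0.0  # undefined for monomorphic SNPs
    return d, r2

# Invented haplotype counts for the same SNP pair in each group.
d_cases, r2_cases = ld_from_haplotypes(50, 0, 0, 50)           # complete LD
d_controls, r2_controls = ld_from_haplotypes(25, 25, 25, 25)   # no LD
r2_difference = r2_cases - r2_controls
```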
Non-Technical Research Use Statement:
The growing population of elderly individuals worldwide raises serious concerns about the future burden of geriatric conditions, especially Alzheimer’s disease, cardiovascular diseases, and other common aging-related diseases. These diseases can cluster in families, suggesting that they may have a genetic origin. Understanding their genetic origin could lead to breakthroughs in preventing or curing such diseases. Despite continuing efforts, understanding of their genetic basis remains very limited. A particular challenge is to better understand the genetic basis of Alzheimer’s disease and its relationship to other aging-related diseases, and to identify genetic variants that could help protect against such diseases. This project focuses on identifying personalized polygenic profiles involving the Alzheimer’s disease protective genetic variant, the APOE e2 allele, that could strengthen protective effects against Alzheimer’s disease, and on investigating which factors can improve this protection. This research will facilitate the development of interventional strategies aiming to promote healthy aging.
Investigator:
Kulminski, Alexander
Institution:
Duke University
Project Title:
Personalized genetic profiles of risk and resilience in Alzheimer’s and vascular diseases
Date of Approval:
July 8, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objective: to identify personalized genetic profiles of risks and resilience to Alzheimer’s disease (AD) and vascular diseases in the disease-specific and pleiotropic contexts in prioritized loci leveraging information from the AD-centered pleiotropic meta-analysis planned in this project and previous analyses by our and other research groups, and identify the role of AD risk and other factors in these profiles using ADSP and 13 more studies. Study design: Aim 1 will identify specific and pleiotropic loci for AD and vascular traits from new analyses and the existing publications by: (i) performing pleiotropic genome-wide analysis focused on AD, cardiovascular diseases (CVD), and AD risk factors and (ii) identifying promising loci from this analysis and the results of previous analyses by our and other research groups. Aim 2 will dissect heterogeneity leveraging the analysis of molecular signatures defined as differences in linkage disequilibrium patterns in affected and unaffected subjects. Aim 3 will identify personalized genetic profiles of AD-specific and pleiotropic risks and resilience. Aim 4 will leverage biological, bioinformatics, and omics analyses to make sense of statistical inferences. In some cases, we may need to pool several datasets to increase power of the analyses in a mega sample. This will be done by pooling individuals’ records for genotypes and selected phenotypes described above from different studies. This pooling will not create any additional risks to participants because neither genetic nor phenotypic information for the same individual will increase. This research is consistent with data use restrictions for ADSP. We will not conduct non-genetic research, will not investigate individual pedigree structures, population origins, ancestry, individual participant genotypes, perceptions of racial/ethnic identity, variables that could be considered as stigmatizing an individual or group, or issues such as non-maternity. 
The research is designed to protect data confidentiality and follow local and institutional policies and procedures for data handling. The results of this research will be broadly shared with the scientific community.
Non-Technical Research Use Statement:
The growing population of elderly individuals worldwide raises serious concerns about the future burden of geriatric conditions, especially Alzheimer’s disease, cardiovascular diseases, and other common aging-related diseases. These diseases can cluster in families, suggesting that they may have a genetic origin. Understanding their genetic origin could lead to breakthroughs in preventing or curing such diseases. Despite continuing efforts, understanding of their genetic basis remains very limited. A particular challenge is to better understand the genetic basis of Alzheimer’s disease and its relationship to other aging-related diseases, and to identify genetic variants that could help protect against such diseases. This project focuses on identifying personalized genetic profiles of risk and resilience to AD and vascular diseases. This research will facilitate the development of interventional strategies aiming to promote healthy aging.
Investigator:
Lai, Dongbing
Institution:
Indiana University School of Medicine
Project Title:
Evaluating the genetic risk of Alzheimer's disease and related dementia
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
The estimated heritability of Alzheimer's disease and related dementia (ADRD) is ~60%-80%. Except for APOE, most ADRD genes have small effects, but their effects can be collectively measured by using polygenic scores (PGS). PGS with high predictability can be used to identify high-risk individuals before they show any symptoms, so that early intervention programs can be applied to prevent ADRD or delay its onset. In addition, for PGS with high predictability, the genetic variants included in calculating them are likely to be ADRD related; hence PGS provide another way to identify ADRD-associated genes, especially those that have small effects and thus are not genome-wide significant. The objectives of this study are to derive PGS for ADRD with high predictability and to use PGS to understand the genetic mechanisms of ADRD in diverse populations. We will use publicly available large-scale genome-wide association studies (GWAS) of ADRD as the discovery datasets. The requested datasets will be used as the target datasets. Different PGS programs/pipelines, including those developed by us, will be evaluated to identify the one with the best performance. Furthermore, we will use the requested datasets to develop methods to adjust for genotyping platform/batch effects in different datasets as well as population stratification, with the goal of harmonizing PGS distributions so that uniform PGS thresholds defining high/low risk in different datasets can be determined. Additionally, we will evaluate the effects of imputation quality on PGS calculation by comparing PGS calculated using imputation dosages with those calculated using genotypes obtained by whole genome sequencing. A Cox proportional hazards model or logistic regression will be used with sex as a covariate. We will also perform sex-stratified analyses because ADRD has different prevalence in males and females.
For the PGS that have high predictability, we will perform fine mapping to identify genes impacted by genetic variants included in calculating PGS. Then network and pathway analyses will be performed to elucidate the genetic mechanism of ADRD.
Non-Technical Research Use Statement:
Genetic factors play significant roles in Alzheimer's disease and related dementia (ADRD). Most ADRD genes have small effects and individually they cannot be used to predict ADRD risk. However, their effects can be collectively measured by using polygenic scores (PGS). Furthermore, if PGS have high predictability, then genes included in calculating PGS are likely to be ADRD related; therefore, PGS can also be used to identify ADRD genes and understand the genetic mechanism of ADRD. In this study, we will systematically evaluate the performance of different PGS programs to identify the one with the best performance. We will also develop methods to adjust for genotyping platform/batch effects and genetic ancestries. Additionally, as many genetic variants are imputed, we will evaluate the effects of imputation quality on PGS predictability. Our goals are to derive PGS that can be used to identify individuals at high risk for ADRD so that early intervention can be applied, and to understand the genetic mechanisms of ADRD so that novel therapeutic strategies can be developed.
Investigator:
Lambert, Jean-Charles
Institution:
Institut Pasteur de Lille
Project Title:
Searching for Alzheimer-related genetic variants and genes
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
The purpose of this study is to find new Alzheimer related variants and genes, by combining exome and genome data from healthy controls and Alzheimer patients from different studies. Data will be analyzed using association, burden and variance component statistics.
Non-Technical Research Use Statement:
Some individuals develop dementia, while others do not. A large part is likely determined by one's genes: Alzheimer’s disease has a heritability of up to 80%. What are the key genetic factors that determine whether one will get Alzheimer's disease? In this study, we will thoroughly explore genomic data of a large group of healthy persons and dementia patients to answer this question.
Investigator:
Landman, Bennett
Institution:
Vanderbilt University
Project Title:
Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
June 2, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD with the ultimate goal to generate harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Lange, Christoph
Institution:
Harvard University
Project Title:
FBAT-approaches for region-based analysis, using haplotype information, meta-analysis approaches of Alzheimer's disease studies and development of Polygenic risk score models for Alzheimer's disease
Date of Approval:
September 5, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Using the haplotype-algorithm for FBAT, we will develop a general testing framework that will allow for the implementation of region-based association tests, e.g. SKAT, burden, multi-variate and, using a permutation approach, the calculation of exact p-values. All association test statistics will be computed based on the exact genetic variance/covariance matrix. Permutation/simulation-based p-values are obtained using our new haplotype-algorithm. We will also develop higher criticism approaches for the region-based association analysis of the AD samples. Recent developments in theoretical statistics have shown that higher criticism approaches are, by far, the most powerful statistical techniques to detect association signals in very sparse data, for instance rare variant WGS data. In the framework that we developed for the previous funding cycle, the higher criticism approaches will be implemented based on the exact genetic variance/covariance matrix. Permutation/simulation-based p-values will be obtained using our haplotype-algorithm. We will also develop meta-analysis approaches to combine locus-specific and region-based association findings for AD across studies. Using reported association findings in the literature, we will develop polygenic risk score models/integrated risk models that are based on the methodology of marker assisted selection, and evaluate their performance in terms of prediction of AD in simulation studies. We will evaluate all of our approaches by application to the requested data set (NG00067).
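The general idea of a permutation-based p-value for a region-level burden statistic can be sketched as follows. This is a simplified case-control label permutation for unrelated individuals, swapped in for illustration. It is not the family-based haplotype algorithm described above, and the burden counts and labels are invented.

```python
import random

def burden_statistic(burdens, labels):
    """Mean rare-allele burden in cases (label 1) minus mean burden in controls (label 0)."""
    cases = [b for b, y in zip(burdens, labels) if y == 1]
    controls = [b for b, y in zip(burdens, labels) if y == 0]
    return sum(cases) / len(cases) - sum(controls) / len(controls)

def permutation_p_value(burdens, labels, n_permutations=2000, seed=0):
    """Two-sided permutation p-value obtained by shuffling case/control labels."""
    rng = random.Random(seed)
    observed = abs(burden_statistic(burdens, labels))
    shuffled = list(labels)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if abs(burden_statistic(burdens, shuffled)) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_permutations + 1)

# Invented per-person counts of rare alleles in one region.
burdens = [3, 2, 3, 2, 0, 0, 1, 0]
labels = [1, 1, 1, 1, 0, 0, 0, 0]
p_value = permutation_p_value(burdens, labels)
```

Shuffling labels while keeping group sizes fixed preserves the burden distribution, so the permuted statistics approximate the null distribution without distributional assumptions.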
Non-Technical Research Use Statement:
We will develop haplotype-based approaches for the region-based analysis of WGS data in family-based designs for Alzheimer's Disease. We will develop meta-analysis approaches to aggregate locus specific and region-based associations with Alzheimer's Disease across studies. Furthermore, we will develop polygenic risk score/ integrated risk model approaches that model age-at-onset and non-affection status as the primary phenotype for AD, thereby achieving better performances than standard approaches.
Investigator:
Leavitt, Blair
Institution:
University of British Columbia
Project Title:
Targeted analysis of non-coding variants in Alzheimer’s Disease
Date of Approval:
October 24, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Background The gene progranulin is critical for maintenance of brain health. Progranulin is thought to regulate components of the autophagic/lysosomal pathway. Normal expression of progranulin is needed to prevent neurodegenerative disease caused by aberration in this pathway. Mutations that disrupt the function of progranulin cause Frontotemporal Dementia, while mutations that reduce progranulin expression are thought to increase risk for the development of Alzheimer’s disease. However, mutations that reduce progranulin expression remain to be fully characterized. Objectives This research project seeks to evaluate the effect of variation in progranulin’s regulatory regions on the risk of developing Alzheimer’s disease. In doing so, we hope to uncover novel Alzheimer’s risk variants and broaden the current understanding of the role of progranulin in Alzheimer’s disease. Study Design The first phase of this study involves the computational prediction of thousands of variants with the potential to alter progranulin expression. Briefly, transcription factor ChIP-seq data and transcription factor binding site sequence information has been utilized to identify regions within the progranulin gene where variation is expected to alter transcription factor binding. Variation in these transcriptionally active ‘regulatory’ sites is expected to modify progranulin expression. We will query these regulatory sites in Alzheimer’s patient and control genomes to identify novel variants that are associated with increased or decreased risk of Alzheimer’s. Variants of interest will be further characterized to determine the effect of the variant on progranulin expression. Analysis Plan We will assess enrichment of the most common variants (MAF > 0.005) by chi-squared test. Since most of the variants are thought to be rare, enrichment for these variants will be assessed using a rare variant association test called the Optimized Sequence Kernel Association Test (SKAT-O). 
Our primary analysis will assess whether any variants are enriched in case or control populations. Subsequently, we intend to evaluate if any variants are capable of modifying age of onset or symptom severity.
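The chi-squared comparison described for the more common variants can be sketched as a 2x2 test on allele counts. This is a minimal sketch under stated assumptions: the table layout (alternate/reference allele counts in cases and controls), the absence of a continuity correction, and both function names are illustrative choices, not details given in the statement. SKAT-O for rare variants is not shown.

```python
import math


def chi_squared_2x2(case_alt, case_ref, control_alt, control_ref):
    """Pearson chi-squared statistic for a 2x2 allele-count table.

    No continuity correction is applied (an assumption of this sketch).
    """
    n = case_alt + case_ref + control_alt + control_ref
    row_case = case_alt + case_ref
    row_control = control_alt + control_ref
    col_alt = case_alt + control_alt
    col_ref = case_ref + control_ref
    if 0 in (row_case, row_control, col_alt, col_ref):
        raise ValueError("every row and column total must be non-zero")
    cross = case_alt * control_ref - case_ref * control_alt
    return n * cross ** 2 / (row_case * row_control * col_alt * col_ref)


def chi_squared_p_value_1df(statistic):
    """Upper-tail p-value of a chi-squared statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(statistic / 2.0))
```

For example, `chi_squared_p_value_1df(chi_squared_2x2(20, 80, 10, 90))` gives the p-value for a variant carried on 20 of 100 case alleles and 10 of 100 control alleles.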
Non-Technical Research Use Statement:
Maintenance of brain health during aging is a complicated process, involving many players. One such player is a protein called progranulin. Progranulin works to regulate components of the waste disposal system inside brain cells. Without enough functional progranulin, the waste builds up in brain cells and causes dementia. In fact, progranulin mutations are a common genetic cause of Frontotemporal Dementia. Furthermore, some progranulin mutations are thought to increase risk for the development of Alzheimer’s disease. Despite progranulin’s important role in maintaining brain health during aging, we still do not know much about the role of progranulin mutations in Alzheimer’s. To address this knowledge gap, we intend to perform the most comprehensive search to date for progranulin mutations that increase risk for Alzheimer’s. Any progranulin mutations that appear to increase risk for Alzheimer’s will be characterized further. Ultimately, this study seeks to expand our current understanding of the role of progranulin in the development of Alzheimer’s disease.
Investigator:
Lee, Brian
Institution:
Drexel University
Project Title:
LEGENNDS
Date of Approval:
August 12, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The primary outcome variable of interest is the presence of dementia as determined from the core HRS interview that includes a Telephone Interview for Cognitive Status (TICS). Secondary outcomes of interest include cognitive scores and change in scores. The primary predictor variables of interest are polygenic risk scores for autism, ADHD, and intelligence. The covariates considered in this study include age at study entry, sex, race/ethnicity, socioeconomic status at baseline, education, and genetic ancestry. Cox proportional hazards models will evaluate time to event, operationalized as the time from study entry to the time of the first instance of dementia status. Those who did not receive a status of dementia by the end of the follow-up period were termed censored. Similarly, individuals who died before receiving a diagnosis of dementia were also censored, as well as those who were lost to follow-up. Each censored individual also had a time to event from the time of study entry to the time of censoring. Statistical models will estimate the association of polygenic risk scores for autism, ADHD, and intelligence and dementia/cognitive scores and change in scores. Our collaboration includes researchers at U Pitt (Andrea Rosso, Yicheng Cai) and the University of Haifa (Stephen Levine).
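The censoring rules in this statement can be made concrete with a small sketch that turns one participant record into the (time, event) pair a Cox model expects. The record fields below are hypothetical and expressed in years of age; they are not HRS variable names, and the model fitting itself is not shown.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Participant:
    # Hypothetical fields for illustration only, not HRS variable names.
    entry_age: float               # age at study entry
    dementia_age: Optional[float]  # age at first dementia status, if any
    last_seen_age: float           # age at death, loss to follow-up, or study end


def time_to_event(p: Participant) -> Tuple[float, bool]:
    """Return (follow-up time, event indicator) for one participant.

    The event indicator is True when dementia was observed and False when
    the participant was censored (death, loss to follow-up, or end of study).
    """
    if p.dementia_age is not None and p.dementia_age <= p.last_seen_age:
        return p.dementia_age - p.entry_age, True
    return p.last_seen_age - p.entry_age, False
```

A participant who enters at 60 and is first classified with dementia at 72 contributes (12.0, True); one who enters at 60 and dies at 70 without that status contributes (10.0, False).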
Non-Technical Research Use Statement:
Emerging evidence suggests that certain neurodevelopmental disorders – autism, attention-deficit/hyperactivity disorder (ADHD), and intellectual disability – may increase the risk of later life neurodegenerative disorders such as Alzheimer’s disease or related dementia (ADRD). The goal of this proposed study is to elucidate the genetic link between autism, ADHD, and intellectual disability and ADRD. As part of this, we will examine the relationship between genetic susceptibility to neurodevelopmental disorders and future risk of ADRD. This study will make use of genetic data from over 18,000 participants in the Health and Retirement Study.
Investigator:
Lee, James
Institution:
University of Minnesota
Project Title:
Recent Selection for Behavioral Traits
Date of Approval:
June 6, 2024
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Objectives of the proposed Research: 1) To test for and measure secular trends for a range of traits in humans, with a focus on behavioral and health traits. 2) To test whether the strength and direction of these trends changes between generations. 3) To test hypothesized mediators, including age at first birth and SES. Study design and Analysis Plan: Using cohorts that have completed their fertility we will run regressions with the fertility rate as the dependent variable and polygenic scores as the independent variable. From this we will calculate the selection differential and the strength of selection, after adjusting for the missing variance of the polygenic scores. We will test the role of different moderators by splitting the sample according to the moderators. Analyses will be done and reported with and without the use of sampling weights. We intend to study selection of a range of behavioral and health related traits including the Big Five personality traits, occupational status, ADHD, BMI, educational attainment, cognitive performance, smoking cessation, smoking initiation, height, schizophrenia, depression and autism. We will derive our own polygenic scores from available summary statistics, not limiting ourselves to what is available in the Polygenic Score Data provided by HRS. Secondary analyses will include: 1) measuring change in the polygenic scores between cohorts, with a focus on the difference between those born before, during and after the Second World War. 2) Estimate genotypic change using phenotypes available in the HRS that are closest to our genotypic traits, which include the Big Five personality traits, occupational status, ADHD, BMI, educational attainment, cognitive performance, smoking cessation, smoking initiation, height, schizophrenia, depression and autism. We will not collaborate with researchers from other institutions.
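One common way to express the selection differential referred to above is the Robertson-Price identity, in which the differential equals the covariance between relative fitness and the trait. The sketch below applies it to offspring counts and polygenic scores. It is an assumption that this matches the investigators' regression-based estimate, the function name is hypothetical, and the adjustment for the missing variance of the polygenic scores and any sampling weights are omitted.

```python
def selection_differential(offspring_counts, scores):
    """Covariance between relative fitness and a polygenic score.

    Relative fitness is each person's offspring count divided by the
    cohort mean, so it averages to 1 by construction.
    """
    n = len(offspring_counts)
    if n == 0 or n != len(scores):
        raise ValueError("inputs must be non-empty and of equal length")
    mean_offspring = sum(offspring_counts) / n
    if mean_offspring == 0:
        raise ValueError("mean offspring count must be positive")
    relative_fitness = [count / mean_offspring for count in offspring_counts]
    mean_score = sum(scores) / n
    return sum(
        (w - 1.0) * (z - mean_score) for w, z in zip(relative_fitness, scores)
    ) / n
```

When the scores are standardized to unit variance, this covariance is also the slope from regressing relative fitness on the score.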
Non-Technical Research Use Statement:
Many traits affect and are associated with the number of children we have. Illnesses and education can get in the way of reproduction, for example. This results in our culture, society and environment selecting for certain traits in future generations. Although the speed of this process is extremely slow, its direction and exact strength is unclear for many traits. We would like to measure this effect.
Investigator:
Lee, Jonghun
Institution:
TAKEDA PHARMACEUTICAL COMPANY LTD
Project Title:
Identification of genetic risks and potential target for stratified Alzheimer's disease patient groups
Date of Approval:
July 8, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The goal of analyzing ADSP umbrella cohort data is to identify variants, genes and pathways associated with Alzheimer’s disease (AD), and to stratify patients by genetic risk. The procedure is described below. 1) Identification and validation of genetic risks: The whole genome and whole exome sequencing data will be analyzed to identify genetic variants or genes associated with phenotypes in the case-control cohort, such as AD status and Braak stages. Several methods will be applied, such as VEP [William McLaren et al, 2016], LOFTEE [Karczewski, 2015] and PEXT scoring [Beryl B.C et al., 2020] for variant annotation, and SAIGE-GENE [Wei Zhou et al., 2020] and KGWAS [Kexin Huang 2024] for the association test. The association will also be tested for other endophenotypes, such as cognitive scores and brain volumes, that are available in a subset of the cohort. Replication and meta-analysis will be conducted on UK Biobank and Tohoku Medical Megabank Organization (ToMMo) cohort data. 2) Patient stratification in ADSP cohort: Leveraging the increased sample size, we will stratify the cohort by genetic risks such as ApoE types, or phenotypes such as Braak stages, and compare the effect size of variants or genes among the patient groups. In addition, the genetic risk score (GRS) will be calculated using LDpred2 [Florian Prive, 2020], RapidoPGS [Guillermo Reales, 2020], and PRSice2 [Choi, S.W., 2020], and validated in independent cohorts and compared to available clinical endophenotypes. Then we will examine the effect of the GRS on extensive phenotypes in UK Biobank and ToMMo. Last, the NG00130 proteome will be used for unsupervised classification of patients. By overlapping the protein signature-based groups with genetic signals, we will find causal pathologic pathways and targets for each subgroup. 3) Identify CSF proteomic biomarkers for AD patient stratification: The large CSF proteomic data (NG00130) will be used to identify compositions of proteins that classify patients into AD subtypes. In addition, downstream pathway engagement of relevant pathologies of the subtypes will be analyzed.
Non-Technical Research Use Statement:
The aim of our study is to identify variants or genes that are potentially causal for Alzheimer’s disease in all or a subset of patients. Specifically, WES and WGS data will be analyzed to investigate common and rare variants associated with disease status and intermediate phenotypes. In addition, the patients will be stratified and sub-grouped by their genetic and proteomic signatures. Last, we will incorporate other large biobanks such as UK Biobank or ToMMo to investigate the genetic effects on extensive phenotypes potentially linked to symptoms appearing in patient subgroups.
Investigator:
Lee, Kun Ho
Institution:
Chosun University
Project Title:
Alzheimer's disease (AD) subtype analysis using genome sequencing data
Date of Approval:
November 26, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives of the Proposed Research Alzheimer’s disease (AD) is a common degenerative disease, causing irreversible dementia. Early diagnosis is difficult due to a long asymptomatic period and requires invasive, expensive procedures. A screening method to classify high-risk groups for early AD diagnosis is needed. Study Design Early AD risk prediction can use genomic variants like the Polygenic Risk Score (PRS), which predicts high-risk groups but shows performance differences due to genetic heterogeneity and ethnic specificity. To address this, ethnicity-specific analysis is considered and validated with different ethnic datasets. This study aims to develop Korea-specific PRS models for early AD risk prediction using genomic data from a Korean cohort and the ADSP. Trans-ethnic genomic data will be created by combining GARD and ADSP data, including African American (AA), non-Hispanic Whites (NHW), and East Asian (EA) data. Cross-validation (CV) analysis will divide data into training and test sets. Genomic variants' importance (e.g., p-values, BLUP) will be calculated, and selected variants applied to PRS. PRS models will be evaluated using CV-divided test data to select the best model. Trans-ethnic and ethnicity-specific PRS models will be validated using reserved validation data. Analysis Plan The proposal aims to identify ethnicity differences in genomic prediction built with Caucasian-centric GWA SNVs and improve the model for trans-ethnic groups, particularly East Asians. A Bayesian machine learning approach transfers genetic risk model knowledge from the NHW dataset to other ethnic groups for better accuracy. Genotype datasets from all ancestry groups are used together. Instead of trans-ethnic meta-analysis, the approach by Gim et al. is adopted. Each ethnic group dataset is divided for cross-validation. Training datasets are analyzed to evaluate p-values and BLUP of SNVs. 
Summary statistics are used to build the prediction model and apply nested-CV for model selection. The best model for each ethnic group is tested using the test dataset. Data is analyzed similarly by learning from ethnic-specific variants and building a prediction model with the new method.
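Two building blocks of the plan above, scoring individuals with a PRS and dividing samples for cross-validation, can be sketched as follows. This is only a minimal illustration: the additive weighted-sum form of the PRS is the usual definition, but the function names and the interleaved fold assignment are assumptions, and the Bayesian transfer-learning step and nested CV described in the statement are not shown.

```python
def polygenic_score(dosages, weights):
    """Weighted sum of effect-allele dosages (0, 1, or 2) for one individual.

    The weights stand in for per-variant effect estimates (for example
    GWAS betas or BLUPs) selected on the training folds.
    """
    if len(dosages) != len(weights):
        raise ValueError("dosages and weights must have the same length")
    return sum(d * w for d, w in zip(dosages, weights))


def k_fold_indices(n_samples, k):
    """Split sample indices 0..n_samples-1 into k interleaved test folds."""
    if not 1 < k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    return [list(range(fold, n_samples, k)) for fold in range(k)]
```

In an ethnicity-specific analysis, the fold split would be applied within each ancestry group so that every test fold reflects that group's allele frequencies.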
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is the leading cause of dementia and is irreversible once symptoms appear. A long asymptomatic period of AD complicates early diagnosis, which requires invasive and costly procedures like CSF extraction or PET scans. Therefore, a screening method to identify high-risk groups for early AD diagnosis is necessary. One approach uses the Polygenic Risk Score (PRS), whose calculation is based on multiple genomic variants associated with AD. However, PRS predictions vary significantly (60-80%) due to genetic heterogeneity and ethnic specificity. Thus, data from multiple ethnicities must be analyzed. Although Asia accounts for over 50% of global dementia cases, most large-scale AD cohorts are predominantly White, lacking studies on Asians. This study aims to develop trans-ethnic and ethnicity-specific PRS models for early AD risk prediction using genomic data from the GARD cohort, centered on Koreans, and the ADSP, which includes various European ethnicities. It investigates AD’s genetic heterogeneity due to ethnic differences and proposes methods to adjust for variability.
Investigator:
Lee, Wan-Ping
Institution:
University of Pennsylvania
Project Title:
Understanding Genetic Risk Factors for Alzheimer’s Disease in the Iceland Population
Date of Approval:
May 12, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: To conduct a genome-wide association study of common and rare variants, including single nucleotide variants (SNVs), short insertion/deletions (indels), and structural variants (SVs), with Alzheimer’s Disease in an integrative analysis of ADSP and the Age/Gene-Environment Susceptibility (AGES) Study data.
Analysis Plan:
1. QC plan for pooled samples from ADSP and AGES
1.1. Genotype Level QC
- Check read depth coverage and genotype quality for each genotype
- Change genotypes to missing if low quality (DP < 10 or GQ < 20)
- Allele balance heterozygosity (0.25 < ABhet < 0.75)
1.2. Sample Level QC
- Samples without AD status will be excluded
- Principal component analysis (PCA) with the reference populations in the 1000 Genomes Project to identify most similar population group
- Standard QC, remove samples with:
  - Missingness > 1%
  - Heterozygosity > 3 standard deviations from the mean het/hom ratio
  - Relatedness values indicating duplicate samples
1.3. Variant Level QC
- Standard QC, remove variants with:
  - Minor allele frequency threshold (see Association Tests)
  - Missingness > 1%
  - Low Hardy-Weinberg p-values (< 1e-10)
  - Ti/Tv coefficients > 10 SD from the mean
- Keep variants with ABhet ratio between 0.3-0.7
2. General genetic quality analyses (non-phenotypic)
- Burden analysis using SKAT-O
- Identifying positive selection
- Identifying population bottlenecks and expansions (effective population size)
3. Primary Analysis
3.1. Available relevant phenotypes: Provided by The Icelandic Heart Association
3.2. Participants
- ADSP release 5 (R5) 58k (or most recent release) samples with clear Alzheimer’s Disease status
- AGES subjects with clear Alzheimer’s Disease status
4. Potential Secondary Analyses: Copy Number Variation (CNV) and Structural Variant (SV) analyses if a novel SNV is found. CNVs/SVs are detected separately by Smoove and Manta and merged by Svimmer for each sample. Further merging for all samples can be done by Svimmer, which provides the input of GraphTyper2 for joint genotyping.
Collaboration: This project will be a collaboration between University of Pennsylvania and The Icelandic Heart Association.
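The genotype-level QC thresholds listed in this plan (DP < 10 or GQ < 20 set to missing, heterozygous allele balance between 0.25 and 0.75) can be sketched as two small checks. The function names are hypothetical, and defining ABhet as the reference-read fraction at a heterozygous call is an assumption; because the bounds are symmetric, using the alternate-read fraction would give the same result.

```python
def mask_genotype(genotype, dp, gq, min_dp=10, min_gq=20):
    """Return None (missing) when DP < min_dp or GQ < min_gq, else the genotype."""
    if dp < min_dp or gq < min_gq:
        return None
    return genotype


def het_allele_balance_ok(ref_reads, alt_reads, low=0.25, high=0.75):
    """Check that a heterozygous call has low < ABhet < high.

    ABhet is taken here as ref_reads / (ref_reads + alt_reads), which is
    an assumption of this sketch.
    """
    total = ref_reads + alt_reads
    if total == 0:
        return False  # no reads, so the balance cannot be evaluated
    ab_het = ref_reads / total
    return low < ab_het < high
```

In practice these checks would be applied per sample and per site while streaming a VCF, before the sample-level and variant-level filters.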
Non-Technical Research Use Statement:
Understanding Genetic Risk Factors for Alzheimer’s Disease in the Iceland Population. This study aims to explore how genetic differences contribute to Alzheimer’s Disease by analyzing integrated data from ADSP and the Age/Gene-Environment Susceptibility (AGES) Study. By studying various types of genetic changes, including small variations in DNA sequences and larger structural differences, this project seeks to identify genetic markers linked to Alzheimer’s. The research team will employ advanced computational methods to ensure data quality and analyze the relationship between specific genetic factors and Alzheimer’s. This includes examining rare mutations, evaluating their potential biological impact, and exploring pathways that might help explain the development of the disease. This collaboration between the University of Pennsylvania and the Icelandic Heart Association represents an important step toward understanding the role of genetics in Alzheimer’s Disease in the Icelandic population and could provide insights for future treatments or interventions.
Investigator:
Li, Qingqin
Institution:
Janssen Research & Development, LLC
Project Title:
Target identification and validation in Alzheimer’s Disease with Whole-Genome and Whole-Exome Sequence Data
Date of Approval:
March 31, 2023
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Aim 1: Identify novel genes and replicate existing gene associations for Alzheimer’s disease (AD). Aim 1a: Common variant genome-wide association analysis. With this approach, we will leverage existing consortium GWAS summary statistics where it makes sense (or request leave-one/N summary association statistics out if we see a need to use a different version of phenotype definition from the same cohort) and augment them with additional datasets available internally. Aim 1b: Rare variant gene-level genetic burden analysis. We will aim to use the ADSP analysis pipeline (but reserve the option to use an alternative pipeline) to contribute the whole genome sequencing (WGS) data generated from the internal galantamine samples to ADSP-led consortium analysis. We will perform case-control and/or family-based genetic analyses and/or quantitative trait genetic analyses using AD traits such as diagnosis, age of onset, amyloid positivity, tau positivity, CSF biomarker endophenotypes, disease progression, etc. (where the phenotype is available) as the outcome of interest. Covariates include age, sex, and principal components. ADSP, UKB, and FinnGen will be analyzed separately and combined with a meta-analysis. Biobank cases will be defined using ICD-9/ICD-10 codes, and proxy cases and controls will be carefully defined using questionnaire data on the parental history of AD. Both true and proxy cases will be considered to maximize the number of AD cases. Aim 2: Prioritize novel gene associations identified in Aim 1. We will perform genetic fine-mapping and leverage tissue and cell-type specific datasets (e.g. GTEx, AD Knowledge Portal including AMP-AD, internal datasets, MiGA, Harari et al snRNA-Seq) to prioritize targets for further functional and analytical interrogation. Furthermore, multi-omics-based network approaches will be used to identify disease-related molecular modules and tissue-specific regulatory circuits.
Aim 3: Utilize single-nuclei sequencing data to more fully catalog cell type heterogeneity in the brains of individuals with AD and how this differs from the brains of uninjured, cognitively unimpaired individuals.
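Aim 1b states that ADSP, UKB, and FinnGen will be analyzed separately and then combined with a meta-analysis. The statement does not say which method will be used; the sketch below assumes a fixed-effect inverse-variance weighted combination of per-cohort effect estimates, and the function name and example numbers are illustrative only.

```python
import math


def fixed_effect_meta(estimates):
    """Inverse-variance weighted fixed-effect meta-analysis.

    `estimates` is a list of (beta, standard_error) pairs, one per cohort.
    Returns the pooled beta and its standard error.
    """
    if not estimates:
        raise ValueError("at least one cohort estimate is required")
    weights = [1.0 / (se * se) for _, se in estimates]
    total_weight = sum(weights)
    pooled_beta = sum(w * beta for w, (beta, _) in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    return pooled_beta, pooled_se
```

For instance, `fixed_effect_meta([(0.2, 0.1), (0.4, 0.1)])` pools two equally precise cohorts into their average effect with a smaller standard error than either alone.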
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is a common, progressive, neurodegenerative disorder with a strong genetic component with heritability estimates ranging from 58–79% for late-onset AD and over 90% for early-onset AD. To date, there is only one approved treatment option intended to mediate the disease progression of AD, while all others treat symptoms associated with AD. Genetic association studies are important to highlight key biological mechanisms contributing to the etiology of AD and provide insights into potential pathways that can ultimately be targeted for future therapeutic development. The aim of this study is to perform a retrospective analysis of genetic data collected from large-scale population-based and case-control cohorts including the UK Biobank, the Alzheimer’s Disease Sequencing Project (ADSP), FinnGen, and Janssen internal cohorts. We will also integrate them with available multi-modal datasets including but not limited to, Microglia Genomic Atlas, Harari et al snRNA-Seq, and neuroimaging data to identify novel and existing evidence for genetic determinants of AD.
Investigator:
Li, Victor On-Kwok
Institution:
the University of Hong Kong
Project Title:
Identification of early biomarkers for Alzheimer’s Disease (AD)
Date of Approval:
July 25, 2023
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Project Title: Identification of early biomarkers for Alzheimer’s Disease (AD) (1) Objectives of the proposed research This research project aims to develop a data-driven causal graph framework for the identification of biomarkers for the early detection of subjects who may potentially develop Alzheimer’s Disease (AD), while accounting for potential confounders, including genetic, sociodemographic, environmental, clinical, and behavioral factors. (2) Study design We will use machine learning techniques to identify biomarkers, such as mutations in the blood, which are highly correlated with the onset of AD. Then we use a causal AI technique to identify the most causal of such biomarkers. Our hypothesis is that these most causal biomarkers can be used to identify presymptomatic subjects who may potentially develop AD. We track such presymptomatic subjects who eventually develop AD to test our hypothesis. (3) Analysis plan This is an interdisciplinary project, marrying AI and neuroscience. We develop a framework that utilizes AI and big-data to speed up the search for the early biomarkers, by incorporating domain knowledge on the complex causal pathological pathways and co-morbidities. We shall evaluate sociodemographic, environmental, clinical, and behavioral factors in association with genetic variants.
Non-Technical Research Use Statement:
Identification of early biomarkers for Alzheimer’s Disease (AD) This will proceed in two stages. First we will use artificial intelligence techniques to identify biomarkers, such as mutations in the blood, which are highly correlated with the onset of AD. The top such markers will then be tracked. Our hypothesis is that these top markers will be able to detect the early onset of AD in presymptomatic subjects. We track presymptomatic subjects who eventually develop AD to test our hypothesis.
Investigator:
Lichtarge, Olivier
Institution:
Baylor College of Medicine
Project Title:
Integrating the impact of exome variations
Date of Approval:
July 15, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
BACKGROUND, OBJECTIVES: Our group developed a method to estimate the impact of missense mutations, that we call the “Action” of missense mutations. This method is better than current state-of-the-art approaches at matching experimental data on mutational loss of function, not just in our own controls but also in blind competitions assessed objectively by independent judges (CAGI 2011, and 2012-13). When we used Action on head and neck cancer patient data (TCGA) we obtained significant separation of patient survival among those with a high Action and those with a low Action in somatic TP53 mutations. However, mutations in other genes may also correlate with patient outcome, such as the mutations of IDH1 in glioblastomas (Nobusawa et al., Clin Cancer Res, 2009). Therefore, we plan to integrate mutation impact information over the human proteome and identify how severely they affect the pathways associated with each cancer type. In addition, we would like to test the same principles in data from complex diseases such as Alzheimer’s Disease. To do so, we developed a network diffusion method that uses current information of protein interactions (in a physical or broader sense) in order to project the dysfunction of a protein to its near neighbors (Lisewski et al., Physica A, 2010). Putting these together, our hypothesis is that the diffusion of Action to the human protein network can identify novel Alzheimer’s disease-associated genes and provide a better stratification of patient outcome. STUDY DESIGN, ANALYSIS PLAN: To test our hypothesis we need to access “Individual germline variant data” of patients. For each individual, we will score the germline missense mutations by Action and treat it as the potential dysfunction on the protein. Then, we will diffuse this action over the network and measure the effect on each gene and on each pathway.
When we compare these data to those from healthy individuals (1000 Genomes Project), we can i) identify genes associated with each disease, ii) identify the pathways that most affect the disease, and iii) measure the severity of the mutational damage to these genes or pathways. USE RESTRICTIONS: We will follow all restrictions described.
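The idea of projecting a protein's dysfunction score onto its network neighbors can be illustrated with a generic random walk with restart. This is a stand-in chosen for illustration, not the diffusion method of Lisewski et al. cited in the statement; the function name, the restart probability, and the toy three-protein network are all assumptions, and the adjacency lists are assumed to be symmetric (undirected edges).

```python
def diffuse_scores(adjacency, initial_scores, restart=0.5, iterations=50):
    """Random walk with restart over an undirected network.

    `adjacency` maps each node to a list of its neighbors and must be
    symmetric. `initial_scores` holds per-node impact scores; nodes that
    are absent start at 0. Each step spreads a node's score equally
    among its neighbors and mixes the result with the initial scores.
    """
    initial = {node: initial_scores.get(node, 0.0) for node in adjacency}
    current = dict(initial)
    for _ in range(iterations):
        updated = {}
        for node, neighbors in adjacency.items():
            incoming = sum(current[nb] / len(adjacency[nb]) for nb in neighbors)
            updated[node] = restart * initial[node] + (1.0 - restart) * incoming
        current = updated
    return current


# Toy chain A - B - C with all of the mutational impact placed on A.
toy_network = {"A": ["B"], "B": ["A", "C"], "C": ["B"]}
toy_result = diffuse_scores(toy_network, {"A": 1.0})
```

In the toy chain, the diffused score decreases with distance from the mutated protein, so B receives more than C while A retains the largest share.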
Non-Technical Research Use Statement:
My group is interested in developing computational tools that predict i) disease-associated genes, ii) disease-causing mutations, and iii) the impact of an individual’s mutations on the phenotype. We make these predictions by comparing the mutational patterns of the cases with those expected either by random chance or given the purifying selection observed in human polymorphisms. Here, we request access to the database of the NIA Genetics of Alzheimer’s Disease Data Storage Site (NIAGADS) Data Sharing Service (DSS), in order to obtain protein mutation data from patients and healthy individuals.
Investigator:
Lin, Honghuang
Institution:
University of Massachusetts Chan Medical School
Project Title:
Assessing Alzheimer’s disease risk and heterogeneity using multimodal machine learning approaches
Date of Approval:
August 12, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The objective of this study is to develop machine learning models using genetic and phenotype data from the NIAGADS database https://dss.niagads.org/. We will develop both unsupervised and supervised learning models to characterize the heterogeneity and risk of Alzheimer’s disease (AD). This is an MPI study in collaboration with Dr. Anita DeStefano at Boston University School of Public Health. For the first aim, we will build an expandable multimodal unsupervised machine learning framework to investigate AD heterogeneity. We will perform AD subtyping by harnessing the rich multimodality information across a wide spectrum of data (e.g., genetics, images and blood biomarkers). A Bayesian kernel network will be built to estimate the relative weight of each individual data modality, which would also allow the addition of new data modalities as they become available. The analyses will be performed both within and between ethnic populations. For the second aim, we will build an expandable multimodal supervised machine learning framework to quantify AD risk from longitudinal follow-up of clinically normal elders. We will build a separate deep learning network for each data modality in consideration of its unique feature sets. A multiplicative strategy will then be taken to aggregate information from different modalities with weighted contributions. Feature selection will also be performed to identify the most informative features predictive of AD risk. For the third aim, we will build AD-related gene regulatory networks in post-mortem human brain samples. We will examine the association of multi-omics data with AD, which will be used to assign gene priority based on the combinatorial evidence from each type of omics data. A gene ontology-guided greedy search strategy will then be implemented to build gene regulatory networks, and identify key drivers that might be potential therapeutic targets for AD. 
The analyses will be stratified by ethnic populations and AD phenotypic clusters.
Non-Technical Research Use Statement:
Alzheimer's disease (AD) is the most common form of dementia characterized by progressive loss of cognitive function. There are very limited treatment options for AD. For the current application, we seek to develop multimodal machine learning models by leveraging the rich collection of AD-related omics data and phenotypical data recently generated from the Alzheimer's Disease Sequencing Project (ADSP). Three aims will be pursued in the current application. For Aim 1, we will build an expandable multimodal unsupervised machine learning framework to investigate AD heterogeneity. For Aim 2, we will build an expandable multimodal supervised machine learning framework to quantify AD risk from longitudinal follow up of cognitively normal elders. For Aim 3, we will build AD-related gene interaction networks in post-mortem human brain samples. The present application represents an innovative approach to identify individuals at high risk of AD. The outlined strategy will provide new insights into the risk stratification and prevention strategies for AD.
Investigator:
Liu, Qian
Institution:
The University of Nevada, Las Vegas
Project Title:
Multi-faceted genomic and epigenomic profiling in Alzheimer's disease using Oxford Nanopore sequencing
Date of Approval:
December 4, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: This project aims to investigate structural variants (SVs) and DNA methylation patterns that may contribute to the development of Alzheimer’s disease (AD) by leveraging the strengths of both long-read and short-read sequencing technologies. Both genomic variations, such as single-nucleotide variants (SNVs) and SVs, and epigenomic variants, such as 5-methylcytosine (5mC), may spontaneously contribute significantly to AD risk. Although previous studies have identified AD-associated SNVs and differential methylation, their ability to resolve complex genomic regions, especially repetitive sequences, which constitute over 50% of the human genome, is understudied due to the limitations of short-read sequencing. Also, the potential synergy between genetic variants and epigenetic modifications in AD has not been characterized. This study will incorporate long-read technologies to address these issues. Study Design: We will analyze whole-genome sequencing datasets and associated phenotypic data from individuals diagnosed with AD and matched unaffected controls. SVs and DNA methylation events will be identified separately using short-read and long-read data. Long-read sequencing will be prioritized for its superior ability to detect large-scale genomic rearrangements and methylation signals in complex and repetitive regions. The integration of both sequencing technologies will enhance sensitivity, resolution, and accuracy. Analysis Plan: The analysis will include genome-wide detection and annotation of SVs and 5mC methylation, followed by comparative analysis between AD cases and controls. We will detect SVs and 5mC, and identify differentially methylated regions (DMRs) and AD-linked SV regions (SVRs) in AD patients after considering age and sex. These AD-associated patterns will be annotated with functional genomic elements and evaluated for their potential contribution to AD pathogenesis.
Finally, we will investigate the co-occurrence of SVs and aberrant methylation patterns (within 100 kb of each other; the distance is adjustable), aiming to identify synergistic signatures that may improve our understanding of disease mechanisms and inform the development of diagnostic or therapeutic strategies.
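As an illustration only, the co-occurrence search described above (pairing SVs with aberrant methylation regions that lie within an adjustable distance of each other) could be sketched as follows. The function name `cooccurring_pairs`, the `(chrom, start, end)` tuple layout, and the `max_gap` parameter are assumptions made for this sketch and are not taken from the applicants' pipeline.

```python
from collections import defaultdict


def cooccurring_pairs(svs, dmrs, max_gap=100_000):
    """Return (sv, dmr) pairs on the same chromosome separated by <= max_gap bases.

    Each interval is a hypothetical (chrom, start, end) tuple with start <= end.
    Overlapping intervals are treated as having a gap of 0.
    """
    # Group methylation regions by chromosome so each SV is only compared
    # against regions on its own chromosome.
    dmrs_by_chrom = defaultdict(list)
    for dmr in dmrs:
        dmrs_by_chrom[dmr[0]].append(dmr)

    pairs = []
    for sv in svs:
        chrom, sv_start, sv_end = sv
        for dmr in dmrs_by_chrom.get(chrom, []):
            _, dmr_start, dmr_end = dmr
            # Distance between the two intervals; 0 when they overlap.
            gap = max(0, dmr_start - sv_end, sv_start - dmr_end)
            if gap <= max_gap:
                pairs.append((sv, dmr))
    return pairs


svs = [("chr1", 1_000_000, 1_005_000), ("chr2", 500, 900)]
dmrs = [
    ("chr1", 1_100_000, 1_100_500),
    ("chr1", 1_200_000, 1_200_100),
    ("chr2", 200_000, 200_100),
]
default_pairs = cooccurring_pairs(svs, dmrs)
wider_pairs = cooccurring_pairs(svs, dmrs, max_gap=200_000)
```

The pairwise comparison is quadratic per chromosome, which is adequate for a sketch; a genome-scale analysis would more plausibly rely on sorted intervals or an existing interval tool.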
Non-Technical Research Use Statement:
Alzheimer's disease (AD) is an incurable neurodegenerative disorder with a strong genetic contribution (an estimated heritability ranging from 58% to 79%). Existing studies have focused on single-nucleotide variants, structural variants (SVs), and differential DNA methylation using short-read sequencing. However, short-read sequencing cannot reliably detect SVs and DNA methylation in repeat regions, which compose >50% of the human genome. Furthermore, the synergy of genomic variants and DNA methylation remains underexplored due to technical challenges. We want to investigate SVs and DNA methylation that are unique to AD patients compared to unaffected controls using long-read sequencing. To enhance the investigation, we will combine the data generated by both long-read sequencing and short-read sequencing. Our research is expected to offer novel insights to advance the understanding of AD development and to improve diagnostic or therapeutic approaches.
Investigator:
Lo, Cecilia
Institution:
University of Pittsburgh
Project Title:
Exploring the shared genetic etiologies of CHD and Alzheimer’s disease
Date of Approval:
October 2, 2023
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Congenital heart disease (CHD) affects approximately 1% of infants born each year. While CHD was previously fatal, surgical palliation now allows most patients to survive into adulthood. With more adults living with CHD, there is increasing appreciation for continuing health problems among CHD patients, such as high risk for dementia and Alzheimer’s disease. Recent studies show that APOE modifies neurodevelopmental outcomes in the CHD population (Gaynor JW, J Thoracic Cardiovascular Surgery, 2014) and that CHD patients have higher risk for Alzheimer’s disease (Bagge CN, Circulation, 2018). We hypothesize that CHD and Alzheimer’s have shared genetic causes and modifiers. Further insights into the genetic causes for CHD and dementia may reveal novel genetic relationships between the two diseases and provide possibilities for improvements in long-term neurological outcomes for CHD patients. We have performed whole exome sequencing at 80x coverage on a discovery cohort of over 600 CHD patients recruited at the University of Pittsburgh Children’s Hospital and obtained access to a cohort of ~4000 healthy older individuals sequenced by the Medical Genome Reference Bank (MGRB) for use as population-matched controls. Here we will perform case-control association analysis with human next-generation sequencing data to identify SNVs, indels, and CNVs associated with CHD. We request access to sequencing data from the Alzheimer’s Disease Sequencing Project to perform a separate case-control analysis, comparing the Alzheimer’s cohort to the MGRB controls. We will then compare genes and variants that are significantly associated with each disease to identify shared pathways involved in disease pathogenesis. Processing and statistical analysis will be performed at the Pittsburgh Supercomputing Center using GATK, bcftools, PLINK, SKAT, and MAGMA, as well as custom shell, Python, and R scripts.
These studies should help us elucidate the shared genetic etiology of CHD and Alzheimer’s disease. We intend to publish or share any findings from this study with the scientific community by presenting at national scientific meetings.
Non-Technical Research Use Statement:
Congenital heart disease (CHD) affects approximately 1% of infants born each year. While CHD was previously fatal, surgical palliation now allows most patients to survive into adulthood. With more adults living with CHD, there is increasing appreciation for continuing health problems among CHD patients, such as high risk for dementia and Alzheimer’s disease. We hypothesize that CHD and Alzheimer’s have shared genetic causes and modifiers. Further insights into the genetic causes for CHD and dementia may reveal novel genetic relationships between the two diseases and provide possibilities for improvements in long-term neurological outcomes for CHD patients. Here we will compare genes and variants that are significantly associated with each disease based on case-control analysis to identify shared pathways involved in disease pathogenesis. In the future, we will study the functional consequences of such mutations using cells and mouse models. These studies should help us to elucidate the shared genetic etiology of CHD and Alzheimer’s disease.
Investigator:
Lodato, Michael
Institution:
University of Massachusetts Chan Medical School
Project Title:
Analysis of somatic mutations in Alzheimer's disease
Date of Approval:
January 6, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Our objective is to study somatic mutations in the genome of human neurons in Alzheimer's disease (AD). Our study design is as follows. We will identify somatic mutations present in the single-cell whole-genome sequencing data in this dataset using established algorithms. We will then characterize those mutations in various ways, including 1) location of mutations (exons, introns, intergenic regions), 2) mutation types (substitutions, indels, aneuploidy), and 3) mutation signatures (exact nature of base changes, for example C>T vs. C>A mutations, small vs. large deletions). Our analysis plan is to compare patterns of mutation in advanced AD neurons to neurotypical controls. AD stage was determined by the authors using the Braak staging system. The Braak staging system uses the intensity and brain region distribution of staining of Tau neurofibrillary tangles in the brain to assign the severity of AD pathology. Donors with significant staining in the prefrontal cortex are considered late-stage. We will not analyze germline genetic markers in this study.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) and other neurodegenerative diseases are characterized by age-related loss of neurons in the brain, and afflict about half of individuals over the age of 85. Scientists currently have a poor understanding of the causes of these disorders. This study will examine how changes in DNA damage and somatic mutation during healthy aging relate to AD, with the goal of better understanding the causes of these diseases so that treatments and cures can be developed.
Investigator:
Lodato, Michael
Institution:
University of Massachusetts Chan Medical School
Project Title:
Somatic mutation analysis during aging
Date of Approval:
January 7, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Introduction: Our lab studies somatic mutation in the human brain. Somatic mutation is the process by which mutations occur in cells of the developing or postnatal body. Somatic mutations that occur in proliferative cells are inherited by all cells derived from that mutated founder. We refer to these variants as clonal somatic mutations. Somatic mutations occurring in post-mitotic cells are restricted to the cell in which they occurred. We refer to these variants as non-clonal somatic mutations. In the human brain, most clonal somatic mutations reflect embryonic development, making them useful for lineage tracing studies. Non-clonal somatic mutations in the brain accumulate during life in differentiated cells like neurons and oligodendrocytes. Studying the molecular features of these variants, for example the type of base change comprising a substitution, or the nucleotide context in which a mutation occurs, nominates mechanisms responsible for generating a given mutation. Study Design: We have performed single-cell whole-genome sequencing (scWGS) on several postmortem human donors from across the human lifespan to profile changes in the burden, molecular signature, and distribution of somatic mutations during life. Many of the donors we studied in our lab were also part of the cohort of the dataset we currently request from NIAGADS, published in Cell by Ganz et al. We aim to integrate the data from Ganz et al. with our own to increase the statistical power of our study. Analysis Plan: Data will be mapped to the human genome using the BWA algorithm, and mutations will be identified using the SCAN2 pipeline and other pipelines as needed to broaden our study. Mutation burden, distribution, and signatures will be compared across cell types, during aging, and across collection sites (Boston Children's Hospital for these data, UMass Chan Medical School for our data). The only phenotypic data we would share would be age, sex, cause of death, and QC data (RIN, etc.).
Planned Collaboration: We may share these data with Dr. Zhiping Weng, also at UMass Chan Medical School. We often collaborate with her computational biology group to analyze genomic data.
Non-Technical Research Use Statement:
DNA damage has long been thought to contribute to human aging. Our lab uses cutting-edge techniques to study specific types of DNA damage, called somatic mutations, in the human brain at high resolution. We will add the data in this collection to our own to increase the power of our study to learn new things about human brain aging, possibly leading to new anti-aging interventions.
Investigator:
Lu, Qiongshi
Institution:
University of Wisconsin-Madison
Project Title:
Dissect the genetic architecture for sociological traits through integrative analysis of GWAS and functional annotations
Date of Approval:
February 29, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Genome-wide association studies (GWAS) have identified tens of thousands of associations for numerous complex traits. However, despite the identification of associated genetic variants, interpretation of GWAS findings remains challenging. The complex structure of linkage disequilibrium in the human genome, coupled with weak effect sizes of common genetic variants, hinders our ability to identify biologically functional genetic variants and understand their functional mechanism. Recent advances in epigenetic and transcriptomic functional annotations have accelerated discoveries in a variety of human genetics applications including GWAS downstream analysis. In this project, we leverage integrative genomic functional annotations in GWAS data to dissect the genetic architecture of complex traits. Specifically, we will integrate the requested GWAS data with epigenetic and transcriptomic annotation data in public repositories (e.g. Epigenomics Roadmap Project, ENCODE, and GTEx) to explore the underlying genetic architecture of various sociogenomics traits available in the HRS, examine shared genetic components among these traits, leverage pleiotropy and functional annotation information to prioritize genetic variants affecting these phenotypes, and produce robust and interpretable genetic prediction models. We think that integrating functional annotation information can effectively reduce noise and spurious associations in the non-functional regions in the human genome. More importantly, the tissue-specific nature of epigenetic and transcriptomic data would provide novel insights into the genetic basis and functional pathways of sociogenomic phenotypes. Finally, using better prioritized variants and annotation-informed effect size estimates can improve the prediction accuracy of polygenic risk scores, which enhances the statistical power in studying the genetic relationship among multiple phenotypes.
Non-Technical Research Use Statement:
Overwhelming evidence indicates that common genetic variants account for a substantial proportion of phenotypic variance in many complex behavioral phenotypes. As a systematic and robust approach, GWAS can effectively identify genetic variants associated with human traits. In this study, we employ genetic data from HRS to identify genetic variants associated with a variety of sociological phenotypes. Then, we will apply state-of-the-art statistical and computational methods to help interpret our findings. Specifically, we will integrate external annotation information of the human genome to fine-map causal variants at identified genetic loci, identify related tissue and cell types for sociological traits, and identify candidate risk genes. Further, by jointly modeling multiple traits, we dissect the shared and distinct genetic architecture among related sociological traits.
Investigator:
Luo, Sheng
Institution:
Duke University
Project Title:
Integrative multimodal modeling for personalized prediction of Alzheimer's disease progression
Date of Approval:
January 22, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives of the proposed research: In this project, we aim to develop an integrative modeling framework to leverage complex multi-modal data for disease modeling, biomarker discovery, and personalized risk prediction in Alzheimer’s disease and related dementias, as well as in aging- and vascular-related processes that contribute to cognitive decline. Study design: We will analyze data from ADSP, which includes whole-genome sequencing and rich phenotypic information from multiple cohorts encompassing diverse ancestries. Participants with available cognitive assessments, fluid biomarkers, neuropathology, cardiovascular measures, and neuroimaging data will be included. We will use harmonized longitudinal and cross-sectional data across cohorts to enable integrative analyses of genetic, clinical, and imaging measures. Analysis plan: Genetic variants, fluid biomarkers, neuropathology measures, cardiovascular risk factors, and neuroimaging-derived features will be jointly analyzed to derive latent disease profiles. We will use advanced statistical approaches, for example, multivariate functional mixed models for longitudinal clinical assessments, nonparametric modeling for imaging data, and factor analysis. The resulting profiles will then be evaluated for associations with cognitive decline and symptom onset, informing predictive models for dynamic risk estimation, biomarker validation, and individualized risk stratification across diverse ancestry groups.
Non-Technical Research Use Statement:
This project aims to use large-scale genetic, clinical, and imaging data to better understand Alzheimer’s disease and related dementias. By integrating information on genes, brain imaging, biomarkers, and health measures, we will identify patterns that indicate early disease, discover new determinants of risk, and predict how cognitive health may change over time. Our approach will also help stratify individuals by risk and improve personalized predictions of disease progression, ultimately supporting earlier interventions and more targeted care for diverse populations.
Investigator:
Ma, Da
Institution:
Wake Forest University School of Medicine
Project Title:
Neuroimage Genomic analysis for Alzheimer's Subphenotypes
Date of Approval:
May 8, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Objective: The objective of the proposed study is to establish the connection between Alzheimer’s Disease-related genomic markers and neuroimaging phenotypes and their association with the clinical onset of dementia. We hypothesize that a) genomic factors are associated with diverse Alzheimer’s Disease-related neuropathological and clinical progression patterns; and b) the genotype-phenotype interaction is dynamic along the Alzheimer’s Disease progression trajectory, which in turn regulates the clinical progression of dementia. Study design: We plan to develop data-driven computational models using multi-modal imaging-genomics information, to test these hypotheses with the following two Specific Aims: (1) construct clinically relevant computational neuroimaging-genomic fingerprints to characterize distinctive subtypes of Alzheimer’s Disease neuropathological patterns, and (2) construct clinically explainable subtype-aware AI models with effective genomic-neuroimaging information fusion to achieve accurate prediction of disease progression of Alzheimer’s Disease. Analysis plan: I will construct and validate harmonized models by utilizing the available data from the Alzheimer's Disease Sequencing Project Phenotype Harmonization Consortium, which is a multi-institutional effort that harmonized phenotypical data of 22k participants collected from 31 AD-related cohorts to produce a large-scale, racially diverse, standardized set of clearly defined data. 1. We will develop semi-supervised machine-learning-based classification frameworks to explore the complex genotype-phenotype associations that determine distinctive neuroimaging-based pathological progression patterns. 2. We will also develop machine-learning model predictions of future AD-specific neuropathological biomarkers. More specifically, we aim to predict the progression of cortical Aβ levels for identifying pre-symptomatic subjects, and progression of tau levels for symptomatic subjects.
Non-Technical Research Use Statement:
Alzheimer’s Disease (AD) is a complex neurodegenerative disease with multiple variations of pathologies that affect brain function, eventually leading to cognitive decline. Individual variations in our genes might be associated with different subtypes of the disease. Thus, it is important to explore the disease characteristics within the various AD subtypes to achieve personalized diagnosis and precision medicine, and eventually to develop effective treatments for AD. The objective of this proposal is to study the connection between AD-related genomic markers and neuroimaging phenotypes and their association with the clinical onset of dementia.
Investigator:
Maillard, Pauline
Institution:
UC Davis
Project Title:
Alzheimer's Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
May 16, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD with the ultimate goal to generate harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Malkova, Anna
Institution:
University of Iowa
Project Title:
Micro-homology Templated Insertions in Alzheimer's Disease
Date of Approval:
May 8, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The objective of our research is to characterize genomic rearrangements associated with various human diseases including Alzheimer’s. The overarching hypothesis guiding our research is that repair of DNA double-strand breaks (DSBs) by using ‘risky’ inaccurate pathways can lead to genomic destabilization. Our focus is on two DSB repair pathways: break-induced replication (BIR) and microhomology-mediated BIR (MMBIR). BIR is initiated by a broken DNA end invading into a homologous template followed by extensive DNA synthesis that is highly mutagenic. Interruption of BIR leads to initiation of MMBIR, a template-switching event that often leads to complex genomic rearrangements and has been linked to neurological conditions and to cancer. The overall goal of our proposed research is to define the molecular mechanisms of MMBIR, and to identify factors that inhibit or promote cells entering into MMBIR. We aim to achieve this using our MMBSearch tool to detect MMBIR events that are often missed by other methods in human WGS analyses. Using MMBSearch we will analyze data from NIAGADS, specifically neurological disease-associated whole genome sequencing (WGS) and whole exome sequencing (WES) data, to detect MMBIR events associated with neurodegenerative disorders. The results of these analyses will be used to determine the frequency of MMBIR in various types of human cells and their association with neurodegenerative disorders. In addition, we will identify chromosomal locations where MMBIR events are especially abundant and specific features in humans that predispose them to MMBIR. We will identify genetic variations predisposing cells to MMBIR, which may uncover that specific SNPs, structural variations, certain gene mutations, etc. are associated with MMBIR events. We specifically hypothesize that mutations in DNA repair, DNA replication, chromatin maintenance, and DNA damage checkpoint genes could promote MMBIR.
These studies will shed light on the etiology and mechanism of MMBIR to potentially develop biomarkers for early detection and design targeted therapies to treat human disorders.
Non-Technical Research Use Statement:
The goal of our research is to understand the underlying mechanisms of genomic instability that lead to human disease. In particular, we are interested in investigating the molecular mechanism of an essentially uncharacterized DNA repair pathway, microhomology-mediated break-induced replication (MMBIR), that has been implicated in DNA mutations and found in a variety of human cancers and in association with neurological diseases. We have recently described a diagnostic pattern of mutations associated with MMBIR using a yeast model, which has allowed us to develop a novel algorithm to search for MMBIR events in sequenced human genomes. We are planning to apply this new algorithm to identify MMBIR events in human genome databases. The proposed research will allow us to further understand mechanisms leading to various human diseases, including cancer and neurological diseases, and to refine our software that is aimed to detect MMBIR in human genomes. The proposed research will be focused on analyzing the data from the NIAGADS database.
Investigator:
Masters, Colin
Institution:
The Florey Institute, The University of Melbourne
Project Title:
The Australian Imaging Biomarkers and Lifestyle (AIBL) Flagship Study of Ageing: Detecting and Preventing Alzheimer’s disease: Towards Lifestyle Interventions-Somatic mutation in Alzheimer's Disease
Date of Approval:
May 15, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Project Title: The Australian Imaging Biomarkers and Lifestyle (AIBL) Flagship Study of Ageing: Detecting and Preventing Alzheimer’s disease: Towards Lifestyle Interventions - Somatic mutation in Alzheimer's Disease (sub-project) Objectives -- Somatic Mutation in AD is a project to identify non-congenitally acquired genetic risks associated with disease onset of sporadic Alzheimer’s disease (AD). Somatic mutation can be any form of alteration in DNA that occurs after conception. As opposed to congenital mutation, it is generally not hereditary unless the germ cells are involved. These alterations can (but do not always) cause disease. We aim to identify somatic variants that contribute to sporadic AD. We believe that the detection of somatic mutations can overcome the flaws of the large genome-wide multiple testing and increase the signal-to-noise ratio to pinpoint the rare genetic determinants that were largely neglected by current genetic association studies. Study design -- We have collected 20 paired human brain microglial DNAs (treated as “tumour”) and whole blood DNAs (treated as “normal”) to call somatic mutations in tumour-normal mode using the software MuTect2 (Broad Institute). The sequence has been obtained from the whole genome. Hundreds of rare genetic variants have been found to be associated with AD. Analysis plan -- We’d like to validate our results using datasets like NG00067, NG00105 and NG00106. However, it would be ideal if we could access the alignment data (i.e., BAM files) as well, because somatic calling is technically not simply a difference between normal (germline) and reference; it also calls tumour against normal (germline) alongside alignment. MuTect2 is developed to identify somatic mutations. It works with or without matching normal. Once we get access to the alignment data, we will reprocess all samples using the MuTect2 pipeline without a matched normal.
We'll call somatic mutations using those datasets and validate the rare genetic determinants that contribute to sporadic AD.
Non-Technical Research Use Statement:
Somatic Mutation in Alzheimer's disease is a project to identify non-congenitally acquired genetic risks associated with the onset of sporadic Alzheimer’s disease (AD). We believe that detection of somatic mutations can pinpoint the rare genetic determinants that were largely neglected by current genetic association studies. In our pilot study, we have identified hundreds of rare genetic mutations that are strongly associated with AD. We'd like to validate our results using an independent cohort. We plan to reprocess NIH datasets using our own pipeline, but we would need to access the raw data rather than the processed data. This research will greatly accelerate the research on the molecular genetics of AD.
Investigator:
Mather, Karen
Institution:
UNSW Sydney
Project Title:
Investigating the relationships between polygenic risk scores and dementia and cognition across and within populations of different ancestry
Date of Approval:
December 19, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Background: The Cohort Studies of Memory in an International Consortium (COSMIC) studies the factors linked to cognitive decline and dementia in a diverse range of populations from around the world. COSMIC is headed by the Centre for Healthy Brain Ageing (CHeBA), University of New South Wales (UNSW), Australia (see https://cheba.unsw.edu.au/consortia/cosmic). At present, there is limited knowledge available regarding the genetic factors associated with ageing-related complex phenotypes and diseases in non-European based cohorts, particularly in low- and middle-income countries, and whether specific ancestry-based genetic association results are generalizable to populations of other ancestries. In this study, we aim to study the genetic factors associated with dementia and related phenotypes to appraise if they can be used to predict age-related cognitive performance and decline and dementia in a wide range of diverse cohorts. We will use data collected by COSMIC Consortium studies but also from external studies wherever possible. Hence, we are applying to access NIAGADS data to include as many ancestry-diverse studies as we can in this work. The data from these cohorts/studies will be analyzed by meta-analysis. Objectives: To assess if dementia/cognitive and other polygenic risk scores (PRS) generated from different ancestries (European, non-European and trans-ethnic) predict age-related cognitive performance and decline and dementia across populations of different ancestries, including studies from low- and middle-income countries. Study Design and Analysis Plan: Participants will be adults aged 45 and above without dementia at baseline. Different PRS (constructed using different methods, e.g., SBayesRC, and using different GWAS p-value thresholds using PLINK) will be generated using available GWAS summary statistics. Both cross-sectional and longitudinal cognitive data will be used where available, with priority given to tests of memory.
PRS-cognitive analyses will be performed using appropriate mixed models. Covariates will include age, sex, years of education and any study-specific covariates (e.g. PCs for population stratification).
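For readers unfamiliar with p-value thresholding, the idea behind constructing a PRS at a given GWAS p-value threshold can be sketched as below. This is a simplified illustration, not the PLINK or SBayesRC procedure named in the statement: it omits LD clumping and effect-size shrinkage, and the function name `polygenic_score`, the variant IDs, and the data layout are all hypothetical.

```python
def polygenic_score(dosages, summary_stats, p_threshold):
    """Sum effect size * allele dosage over variants passing a p-value threshold.

    dosages: hypothetical dict mapping variant ID -> effect-allele dosage (0 to 2).
    summary_stats: hypothetical dict mapping variant ID -> (beta, p_value)
        taken from GWAS summary statistics.
    """
    score = 0.0
    for variant, (beta, p_value) in summary_stats.items():
        # Keep only variants below the threshold that were genotyped in this person.
        if p_value < p_threshold and variant in dosages:
            score += beta * dosages[variant]
    return score


# Made-up summary statistics and one individual's dosages, for illustration only.
summary_stats = {
    "rs1": (0.5, 1e-9),
    "rs2": (-0.25, 0.01),
    "rs3": (0.125, 0.3),
}
dosages = {"rs1": 2, "rs2": 1, "rs3": 2}

# One score per threshold, mirroring the use of several p-value thresholds.
scores = {t: polygenic_score(dosages, summary_stats, t) for t in (5e-8, 0.05, 1.0)}
```

Computing the score at several thresholds yields one PRS per threshold for each participant; each could then enter the mixed models as a predictor alongside the listed covariates.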
Non-Technical Research Use Statement:
Most human genetic association studies have been undertaken in populations of European ancestry, despite >75% of the world’s population being of Asian or African ancestry. To date, most genetic variants for dementia have been identified using populations mainly of European ancestry and from high-income countries, despite more than 60% of dementia cases living in low- and middle-income countries. In addition, many of the non-white genetic studies have had small sample sizes and lack replication. We need to increase our understanding of the genetic risk for dementia and its related traits in under-represented populations, such as the multi-ethnic cohorts of the Cohort Studies of Memory in an International Consortium (COSMIC). The current project aims to examine if polygenic risk scores for dementia, cognitive and other related phenotypes generated from populations of different ancestries predict performance and decline on cognitive tests and incident dementia in older adults from multi-ethnic populations using the COSMIC Consortium and external studies.
Investigator:
Mayeux, Richard
Institution:
Columbia University
Project Title:
Alzheimer's Disease Sequencing Project
Date of Approval:
July 29, 2022
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
For this study, we will analyze data from whole genome sequencing (WGS) from the Alzheimer's Disease Sequencing Project (ADSP) discovery-replication phase families and independent case-control data from the ADSP extension study. We will also analyze WGS and whole exome sequencing (WES) data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) study and the ADSP follow-up study (ADSP-FUS) as they become available. The overall goal of this project is to identify and annotate causal variants related to LOAD using sequencing data generated from families multiply affected by the disease and validate the results in independent case-control datasets. Using families as discovery and unrelated individuals as replication and having the ability to genotype additional family members can provide direct evidence of causality by establishing which variants co-segregate in families and are associated in the general population with disease.
Non-Technical Research Use Statement:
Analyses of whole genome, whole exome and targeted resequencing will continue to provide important new information regarding potential risk conferring genes, biochemical pathways involved in Alzheimer's disease and targets that may be suitable for pharmacological manipulation. While whole exome and targeted sequencing are powerful technologies, analysis of whole genomes will provide more information and allow discovery of rare, high risk variants.
Investigator:
McCauley, Jacob
Institution:
University of Miami
Project Title:
Identification of genetic risk factors for Inflammatory Bowel Disease in a Hispanic cohort
Date of Approval:
August 25, 2022
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
The objective of the proposed research is to elucidate the genetic risk factors for Inflammatory Bowel Disease (IBD) in a Hispanic cohort. In order to fulfill this objective, we are requesting access to whole genome sequence (WGS) data for 500 Hispanic controls which were sequenced by the National Human Genome Research Institute as part of the Alzheimer Disease Sequencing Project Discovery Case-Control Based Extension Study. More specifically, these data will be combined with WGS data from our Hispanic cohort of ~1600 IBD cases and ~900 controls ascertained through the Crohn's and Colitis Center at the University of Miami in Miami, Florida and Cedars Sinai Medical Center in Los Angeles, California. Drs. Jacob McCauley and Maria Abreu will serve as the primary investigators for this application at the University of Miami and Dr. Dermot McGovern will serve as the primary investigator at Cedars Sinai Medical Center. The use of these data by the noted not-for-profit organizations will be limited to biomedical purposes as related to IBD, in accordance with data use limitations. The analyses to be conducted in this combined Hispanic sample of ~1600 IBD cases and ~1400 controls include case-control association for replication of single variant IBD associations previously identified in European populations as well as assessment for homogeneity of effect size across populations. Polygenic risk scores will be utilized to test for cumulative variant associations with IBD. Fine-mapping of single variant associations will be done using several parallel approaches. Firstly, trans-ethnic meta-analysis with summary statistics from published European and African American studies will be conducted. Linkage disequilibrium (LD) structure around associated variants will be assessed using knowledge of replication and information on LD from ancestral source populations. Secondly, the Bayesian FINEMAP algorithm will also be considered. 
Additional analyses will include admixture mapping for identification of novel signals and gene-based tests for cumulative association of rare variants.
Non-Technical Research Use Statement:
The objective of the proposed research is to elucidate the genetic risk factors for Inflammatory Bowel Disease (IBD) in a Hispanic cohort. This project combines whole genome sequence data from 500 Hispanic controls which were sequenced as part of the Alzheimer Disease Sequencing Project with ~1600 IBD patient and ~900 control samples which were ascertained by the Crohn’s and Colitis Center at the University of Miami and Cedars Sinai Medical Center. To facilitate our objective, several analyses will be conducted in this combined sample of ~1600 IBD patients and ~1400 controls. First, we will identify DNA segments that contain a genetic variant that occurs more often in IBD patients than in healthy controls. These segments may be novel to Hispanics or have been identified previously in populations of European ancestry. These DNA segments can be large and often the exact location of the predisposing variant within each segment is unclear. Therefore, additional analyses will be done to narrow the width of these DNA segments and identify the predisposing variant(s) within each segment.
Investigator:
Mez, Jesse
Institution:
Boston University
Project Title:
Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
March 16, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD with the ultimate goal to generate harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Michaelis, Elias
Institution:
University of Kansas
Project Title:
Analysis of genome-wide sequencing data from NIAGADS: Searching for gene variants related to gender-Alzheimer's disease (AD) association
Date of Approval:
August 25, 2020
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: Analyze the approximately 3,500 DNA sequences from GWAS at NIAGADS for associations between gender, AD, and frequency of SNPs in chromosomal DNA using the PLINK software analytical tool. Rationale: We have performed such analyses of the DNA sequences (VCF files) made available by ADNI and have identified a significantly higher incidence of SNPs (p < 10^-7) in a few chromosomal genes in males vs. females with the diagnosis of AD. We would like to perform similar analyses on the DNA sequences (in VCF files) of the more than 3,000 sequenced DNAs at NIAGADS. Plan: The SNP association analyses will be performed on the DNA sequences using information about the gender and diagnosis of each individual whose DNA sequence we analyze. We will use the PLINK software to generate data tables (tab-delimited PLINK association files) and Manhattan plots of genome-wide associations (gnuplot). No other research sites will participate in the planned analysis of the DNA sequences. All work will be performed at the University of Kansas AD Center.
Non-Technical Research Use Statement:
Variations in the sequences of DNA in our chromosomes and their association with the incidence of Alzheimer's disease (AD) have been identified in the last 10 years and have brought about new thinking regarding possible causes of AD. These variations in DNA do not directly cause the disease but increase the likelihood of the onset of AD in some individuals late in their life. For many years, it has been known that among various populations there is differential incidence of AD between males and females. In our initial study of a relatively small number of individuals with or without AD, we identified that there was a significant association between a few of the variants in DNA sequences and the incidence of AD in males as compared with females. The study planned will use the DNA sequences in the NIAGADS repository to conduct a similar analysis for variants in DNA sequences. The NIAGADS sequences represent a substantially larger population than that which we analyzed previously and should allow us to explore the possible association of gender and AD with variants of DNA.
Investigator:
Monti, Stefano
Institution:
Boston University
Project Title:
Longevity Consortium
Date of Approval:
January 30, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We study human exceptional longevity and healthy aging and we have generated many results that connect genetic variants to these traits, to intermediate molecular profiles, including gene expression, serum proteomics and metabolomics, and to nutrition. We would like to use the genetic, methylation and phenotypic data generated in the HRS for replication of our results linking genetics, multi-omics profiles of cognitive aging, and nutrition. We will use these data to investigate the associations between genetic variants, multi-omics profiles, longevity and healthy cognitive aging, and nutrition using mixed effect models adjusted for genome-wide principal components and including random effects with a variance-covariance matrix that depends on the genetic relationship matrix. We will investigate the relations between genetic variants and multi-omics profiles using our pipeline for yQTL that uses mixed effect models as above. We will integrate the various results using mediation analysis. We will ask for access to deidentified data and not biological specimens.
Non-Technical Research Use Statement:
Our research focuses on understanding why some people live longer and age more healthily, especially when it comes to brain function and nutrition. The study has found many links between genes and these healthy aging traits by looking at different biological factors like gene activity and blood proteins. We aim to leverage the HRS data to confirm our findings. We will analyze this information to explore how genes, biological markers, and nutrition work together to influence long life and healthy brain aging. The data used will be anonymous, ensuring privacy, and no physical samples will be needed.
Investigator:
Moore, Jason
Institution:
Cedars-Sinai Medical Center
Project Title:
Artificial Intelligence Strategies for Alzheimer's Disease Research
Date of Approval:
March 13, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The goal of this project is to develop artificial intelligence (AI) approaches for extracting unforeseen patterns from clinical, genetic, genomic, and imaging data that could lead to ideas for new drug development or drug repurposing. Our proposed AI methods and software will be open-source, user-friendly, and freely available for all to use. Specifically, we will analyze ADSP data sets using three novel informatics methods to tailor our automated machine learning (AutoML) tool to the analysis of Alzheimer’s disease (AD) data. First, we will develop a Multi-Modal Interaction (M2I) feature selection algorithm for identifying genetic interactions that are predictive of AD (AIM 1). Second, we will develop a Knowledge-driven Multi-omics Integration (KMI) algorithm for combining omics features for AI analysis of AD (AIM 2). Third, we will develop a Multidimensional Brain Imaging Omics (MBIO) integration framework for the joint analysis of multisource large-scale data for predicting AD. Finally, we will integrate all three biomedical informatics methods into our open-source AutoML software package and apply it to the ADSP data sets. We expect our methods will reveal new biomarkers for AD that will open the door for better treatments and clinical decision support.
Non-Technical Research Use Statement:
The goal of this project is to develop artificial intelligence (AI) approaches for extracting unforeseen patterns from clinical, genetic, genomic, and imaging data that could lead to ideas for new drug development or drug repurposing. We will develop three biomedical informatics methods with focuses on genetics, genomics and imaging respectively. We will integrate these methods into our open-source AutoML software package, and apply it to the ADSP data sets. We expect our methods will reveal new biomarkers for AD that will open the door for better treatments and clinical decision support.
Investigator:
Moorjani, Priya
Institution:
UC Berkeley
Project Title:
Alzheimer's Disease Sequencing Project Umbrella Study
Date of Approval:
October 25, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
A central goal of human genetics is to understand the link between genotypic and phenotypic variation, including disease risk and local adaptations. In order to perform these analyses reliably, we need to characterize population structure reliably. The historical signatures of our past such as population mixtures, contractions, and expansions, as well as human diseases and natural selection, have left traces in our genomes. In this proposal, our objective is to develop and apply computational methods to reliably characterize population history, including admixture (recent and ancient events including Neanderthal and Denisovan gene flow), founder events and natural selection in the diverse populations in the ADSP dataset. We will then use these inferences to reliably characterize disease associations and identify signals of natural selection, leveraging local and global ancestry inferences and estimates of relatedness across samples in a linear mixed model or other frameworks. We will look for associations between Alzheimer's disease and other cognitive traits and identify genes and genomic regions/pathways associated with these traits. We will also study the distribution of archaic ancestry (from Neanderthals, Denisovans or other unknown archaic hominins) across the genome to find regions of exceptionally high (or low) archaic ancestry that might provide hints about the function of these regions. Further, we will characterize the differences in mutation patterns across different human populations and archaic and modern human lineages. Together, this project will provide insights about our evolutionary past, mutation patterns, genes associated with local adaptations and Alzheimer’s disease, and the evolution of cognitive traits in a diverse multi-ethnic cohort.
Non-Technical Research Use Statement:
Evolutionary history shapes our genes and traits. The historical signatures of our past such as population mixtures, contractions, and expansions, as well as human diseases and natural selection, have left traces in our genomes. In this proposal, our objective is to develop and apply computational methods to reliably characterize population history, including admixture, founder events and natural selection in multi-ethnic individuals. Additionally, we will use these insights to reliably map genes and pathways associated with Alzheimer's disease and other traits in the diverse individuals in ADSP.
Investigator:
Mormino, Elizabeth
Institution:
Stanford
Project Title:
Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
March 3, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD with the ultimate goal to generate harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Murray, Melissa
Institution:
Mayo Clinic Jacksonville
Project Title:
Genetics of Young Onset Alzheimer’s Disease
Date of Approval:
November 25, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Historically, young onset Alzheimer’s disease (YOAD) is defined as onset of clinical symptoms before the age of 65, and 90% of such cases are not associated with mutations in the three main Aβ-related genes (APP, PSEN1 and PSEN2). The objective of this project is to deeply characterize known and new genetic components of YOAD in the largest pathologically confirmed cohort in the world and to evaluate the impact of SNVs, SVs and repeat expansions. Due to their earlier onset age and strong heritability, we hypothesize that YOAD patients are enriched in rare pathogenic variants within the Aβ and tau pathways. In addition, patients with YOAD are often misdiagnosed with frontotemporal dementia (FTD) due to clinical symptom overlap. In that context, we also hypothesize that there is genetic overlap between both diseases. We generated whole-genome sequencing data from over 900 YOAD patients, including more than 400 autopsy-confirmed YOAD cases, over 1000 FTD patients and 800 controls. We are requesting access to the ADSP whole-genome sequencing data (raw and VCF) to i) perform gene-based, single variant and pathway association analyses in ADSP YOAD and late onset data to replicate our findings, and ii) increase our YOAD cohort size for assessing the overlap and differences between FTD and YOAD patients. Single nucleotide variants, as well as structural variants, will be assessed. To do so, we will use the already generated SNV VCFs and also generate structural variant calls using our Mayo pipeline. We will utilize several commonly used software programs, such as PLINK/SEQ and the SKAT package, to perform our association analyses. All analyses will be done at the single variant, gene, structural variant, and pathway levels. Using these approaches, we hope to identify novel mutations/genes/pathways that are related to both AD and FTD and will benefit the larger scientific community working on neurodegenerative disorders.
Non-Technical Research Use Statement:
We aim to identify new risk factors for young onset Alzheimer’s disease (age at onset before 65). To do so we utilize deep phenotyping and genetic approaches. Upon completion of our work, we will obtain a comprehensive understanding of young onset Alzheimer’s disease genetics. Altogether our project will identify potential new therapeutic targets for young onset Alzheimer’s disease and will pave the way for individualized therapy development not only for young onset Alzheimer’s disease, but also for the more common late onset AD.
Investigator:
Myers, Richard
Institution:
HudsonAlpha Institute for Biotechnology
Project Title:
Replication of risk factors for early-onset dementias
Date of Approval:
August 18, 2020
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
We are a part of collaborations with the Yokoyama lab at UCSF and the Kosik lab at UCSB to analyze genomes for early onset Alzheimer’s and frontotemporal dementia cohorts compared to unaffected controls. A critical part of these efforts is replication of any findings in independent cohorts. Access to Alzheimer's Disease Sequencing Project (ADSP) data is ideal for this purpose. We will analyze ADSP data for association signals identified in our independent cohorts using either single variant or burden analysis approaches. Phenotypic characteristics that will be evaluated in association with genetic variants will be either case/control status or age of symptom onset as available. Although we conduct these projects as collaborations, this application is for analysis of ADSP data at HudsonAlpha.
Non-Technical Research Use Statement:
We work together with the Yokoyama lab at UCSF and the Kosik lab at UCSB to analyze the DNA from patients with early onset Alzheimer’s and frontotemporal dementia in comparison to people without these diseases. A critical part of this type of work is checking to see if findings from one set of patients are reproducible in different sets of patients. Access to Alzheimer's Disease Sequencing Project (ADSP) data would allow us to answer this question. We will analyze ADSP data for association signals identified in our independent sample sets. The types of data that will be evaluated in association with genetics will be either whether the individuals assessed have disease or not, or the age at which they develop disease.
Investigator:
Nicolas, Gael
Institution:
University of Rouen
Project Title:
Searching for Alzheimer-related genetic variants and genes
Date of Approval:
October 24, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The purpose of this study is to find new Alzheimer-related variants and genes by combining exome and genome data from healthy controls and Alzheimer patients from different studies. Data will be analyzed using association, burden and variance component statistics.
Non-Technical Research Use Statement:
Some individuals develop dementia, while others do not. A large part is likely determined by genes: Alzheimer’s disease has a heritability of up to 80%. What are the key genetic factors that determine if one will get Alzheimer’s disease? In this study, we will thoroughly explore genomic data of a large group of healthy persons and dementia patients to answer this question.
Investigator:
Oh, Edwin
Institution:
University of Nevada Las Vegas
Project Title:
Genomic data analyses to understand genetic risks and protective factors of Alzheimer's disease
Date of Approval:
October 2, 2023
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The overall objective of the proposed research is to use dataset NG00067 to conduct genetic research for better understanding of genetic protective and risk factors underlying the Alzheimer’s disease (AD) continuum and use artificial intelligence techniques to improve individual-level AD prediction with genomic features. Our central hypotheses are 1) loss-of-function single nucleotide polymorphisms (SNPs) of AD-risk genes, and their interactive genes at the protein level, might confer a protective effect against AD; and 2) a deep convolutional neural network (CNN) can deal with a large and collinear feature space, thus the CNN classifier could potentially improve the performance of individual-level AD continuum prediction with genetic features over non-deep classifiers. To test these hypotheses, we plan to examine single loss-of-function mutation SNPs of AD-risk genes, and their protein-level interactors, as potential protective factors for AD (Aim 1). More specifically, AD-associated genetic risks/hazards will be obtained from the previous literature. Their protein-level interactive genes will be obtained from the Reactome database. Loss-of-function SNPs associated with these genes will be further obtained from the gnomAD database. Phenotypic comparisons along the AD continuum will be conducted among 0-copy, 1-copy, and 2-copy mutation carriers of these SNPs. In addition, we plan to develop a CNN-based deep learning technique to derive high-order meaningful genetic features from the whole-genome or whole-exome sequencing data that could improve the individual-level disease prediction (Aim 2). For subjects with brain imaging or CSF biomarker data available from the NACC database, we plan to further jointly evaluate the associations among genetic risks/protectors, brain imaging changes, and CSF biomarkers along the AD continuum (Aim 3) to better understand the pathophysiology underlying AD onset and progression. A separate data request has been made to the NACC database.
Non-Technical Research Use Statement:
The purpose of this study is 1) to better understand genetic protective and risk factors underlying Alzheimer's disease, and 2) to improve individual-level disease predictions with artificial intelligence techniques. We seek to identify AD protective genetic features through screening the disease phenotypes of loss-of-function mutation carriers in subjects along the AD spectrum. We also would like to take advantage of the deep neural network in capturing high-order meaningful features from large, high-dimensional, and collinear feature spaces. Therefore, we will utilize/adapt the well-established deep convolutional neural network classifier with genetic features to predict individual-level disease status. These results will help us to better identify subjects at risk of AD, and improve the accuracy and efficacy of personalized diagnosis, treatment, and prevention.
Investigator:
OShea, Deirdre
Institution:
University of Miami Jackson Health System
Project Title:
Developing a DNAm Biomarker for Cognitive Aging: Addressing Disparities and Promoting Community Engagement
Date of Approval:
March 30, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The proposed study aims to investigate cognitive aging and cognitive health differences across racial and ethnic groups, with a focus on the role of DNA methylation (DNAm) patterns associated with cognitive performance in later life. This research addresses the public health challenge posed by higher rates of cognitive impairment and dementia observed in African Americans, Hispanics/Latinos, and Native Americans compared to non-Hispanic Whites. The study hypothesizes that socioeconomic factors (SES), such as education and occupation, are associated with DNAm patterns linked to cognitive health. The study design involves a retrospective analysis of data from the Health and Retirement Study (HRS) with linked DNAm data. The analysis plan prioritizes cross-sectional cognitive phenotypes, with longitudinal data incorporated where available. Using elastic net linear regression, we will develop DNAm-based biomarkers of general cognitive ability and domain-specific cognitive performance. We will examine associations between SES indicators and DNAm cognitive scores and explore whether DNAm patterns statistically account for group differences in cognitive outcomes, with race/ethnicity evaluated as a secondary and exploratory factor. Longitudinal analyses are planned to evaluate whether DNAm cognitive scores are associated with subsequent cognitive change and risk of cognitive impairment, using mixed-effects modeling approaches, subject to data availability and statistical power. In addition to quantitative analyses, the study includes a qualitative questionnaire component developed separately to assess community knowledge, perceived barriers, and attitudes toward epigenetic biomarkers and dementia risk among older adults from diverse backgrounds. Data from this survey are investigator-collected and analyzed independently from NIAGADS DSS data. 
This study aligns with NIAGADS’s mission by leveraging genomic and epigenomic data to advance understanding of cognitive aging and by generating DNAm-derived cognitive measures that can support future mechanistic and translational research.
Non-Technical Research Use Statement:
Our research seeks to understand how biological changes measured in blood relate to cognitive health in older adults. We are studying whether differences in life experiences such as education and other socioeconomic factors are reflected in patterns of DNA methylation and whether these patterns are associated with cognitive performance later in life. Using data from large, long-running studies of aging, we aim to develop a blood-based measure of cognitive functioning based on DNA methylation. This measure may help identify individuals at higher risk for cognitive impairment and improve understanding of how cognitive health varies across individuals and groups. We are also developing a separate survey to better understand how older adults from diverse backgrounds perceive dementia risk and emerging biological markers. By combining biological research with community perspectives, our goal is to ensure that advances in cognitive aging research are relevant, understandable, and useful for a broad range of communities.
Investigator:
Osmond, Matthew
Institution:
University of Toronto
Project Title:
Disentangling human geographic history in India with genetic genealogies
Date of Approval:
June 5, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: The objective of this research is to disentangle the geographic history of humans in India using genetic genealogies. Study design: We will first combine the provided VCF with publicly available VCFs (of worldwide contemporary and ancient genomes) to infer the genetic genealogies of the samples. We will do this with two different open-source programs, Relate (Speidel et al. 2019) and tsinfer (Kelleher et al. 2019). We will then use the genealogies to infer the spatial locations of the genetic ancestors of each sample back in time, from their sampling time to the time of the most recent common ancestor in each genealogy. We will do this with two different open-source programs developed by our group, spacetrees (Osmond & Coop 2024) and sparg (Deraje, Kitchens, et al. 2025). Analysis plan: The inferred locations of the genetic ancestors will allow us to visualize an estimate of the geographic history of humans in and around India. This may help support or refute existing hypotheses of migration and admixture, and suggest new ones. The spatial population genetics of India is complex, with several waves of migration both within the country and from outside. For instance, it is well known that North Indians have a larger proportion of ancestry from West Eurasians compared to South Indians, while the latter harbour ancestry from ancient indigenous South Indians (who are related to the hunter-gatherers from the Andaman Islands). More recent studies have found significant amounts of archaic hominin ancestry (from Neanderthals and Denisovans), larger than observed in most other worldwide populations. Estimating the timing and location of these admixtures (and many more) is still an area of ongoing research. We think our analysis will provide new insight into these questions. Collaboration: This project will be led by a PhD student in my group, Puneeth Deraje (University of Toronto).
We plan to collaborate with Professor Stephen Wright (University of Toronto), who specializes in empirical population genomics.
Non-Technical Research Use Statement:
The geographic movements of our genetic ancestors, from one generation to the next, leave patterns in our genomes. We can therefore use these patterns to infer something about the spatial history of our genetic ancestors. We have developed two new methods for estimating the location of shared genetic ancestors from a set of sampled genomes. We propose to use these methods on a large sample of human genomes from India to help disentangle the complex geographic history of humans in this area. We think our results will provide new visuals of and insight into the spatial history of modern-day Indians.
Investigator:
Oukraintseva, Svetlana
Institution:
Duke University
Project Title:
Genetics of Aging, Health, and Longevity: Focus on Regulatory Mechanisms and Functional Variants Connecting Aging and Alzheimer's Disease
Date of Approval:
September 11, 2024
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
The objective of this project is to find common regulatory and rare functional genetic variants involved in both aging and Alzheimer’s disease (AD), and to suggest new genetic targets for AD prevention. We plan to: (i) evaluate collective effects of genetic interactions using the Interaction Polygenic Risk Score (IPRS) newly developed in our group, which integrates the additive and interaction effects of genes on AD and aging traits and thus offers a significant methodological advantage; (ii) leverage the whole exome sequencing (WXS) data to find rare functional variants associated with aging and AD; (iii) focus on genetic regulators of translation that influence levels of proteins and provide a connection between genes and phenotypes; and (iv) explore biological pathways involved in aging and AD. For this, we will conduct only secondary analyses of existing genetic and phenotypic data collected in the Alzheimer's Disease Sequencing Project (ADSP), as well as in other studies, including the Framingham Cohort (a.k.a. Framingham Heart Study (FHS)), Cardiovascular Health Study (CHS), Alzheimer’s Disease Neuroimaging Initiative (ADNI), and UK Biobank. The current request refers to the ADSP. The analyses will be performed using relevant statistical methods and software. The project does not involve any contact with or participation of study subjects.
Non-Technical Research Use Statement:
The objective of this project is to significantly improve our understanding of the heterogeneity of Alzheimer’s disease (AD) and common genetic mechanisms in aging and AD, and find new genetic targets for AD prevention, with emphasis on regulatory and rare functional variants involved in both aging and AD. This objective will be addressed by conducting secondary analyses of existing human data collected in existing human studies, containing genetic and phenotypic information on thousands of individuals.
Investigator:
Palejev, Dean
Institution:
Sofia University
Project Title:
AD subtypes
Date of Approval:
April 7, 2021
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Alzheimer’s disease (AD) is a highly heterogeneous disease with diverse clinical manifestations and rates of progression. Identifying AD subtypes will improve our understanding of the underlying disease mechanisms and enable us to predict disease trajectory and develop new disease-modifying treatments. Objective. To identify distinct AD subtypes from WGS data of AD individuals. Analysis plan. We will use 3000 WGS samples derived from the ADSP Discovery Case-Control Based Extension Study. We will use the available SNVs and indels and infer structural variants (SVs) with our in-house multi-caller pipelines. Rare variants will be retained for further analysis. We will then split the dataset into training and test sets, and use the identified set of genetic variants (i.e. SNVs, indels and SVs) as input to a deep neural network (an autoencoder architecture) to learn an unsupervised latent representation of the data. AD subtypes will be identified within this reduced space and characterized using demographic and clinical data. We will then contrast each subtype with the control groups to identify subtype-relevant variants (i.e. putative subtype biomarkers), which will be used as input features to a gradient boosted tree model to generate a subtype predictive model and subtype-specific features. Planned collaboration. Each member of the team will devote effort to specific areas of investigation; nevertheless, all team members will discuss individual progress and potential challenges through regular meetings. In particular, Dr Coppola (Research Scientist, Department of Pathology, Yale University, USA), together with Dr Dean Palejev (Associate Professor, GATE Institute, Sofia University, Bulgaria), will be involved in the deep learning model generation and validation, and subtype identification; Dr Fredrik Johansson (Assistant Professor, Department of Computer Science & Engineering, Chalmers University of Technology, Sweden) will work on the supervised machine learning model; Dr Alexander Schliep (Associate Professor, Department of Computer Science & Engineering, University of Gothenburg, Sweden) will work on the SVs inference.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is a highly heterogeneous disease with diverse clinical manifestations and rate of progression. The heterogeneity of AD has complicated both clinical trial design and outcomes, and thus the need for better models of AD, and/or better strategies for selection of participants into specific clinical trials is evident. The identification of more homogeneous disease subgroups (i.e. AD subtypes) will improve our understanding of the underlying disease mechanisms, enable us to predict disease trajectory and develop new disease-modifying treatments. We will use a comprehensive set of genetic variants in combination with deep learning algorithms to identify AD subtypes. Subtypes will be characterized using clinical and demographic data. Finally, variants specific to each cluster will be identified and used to train a predictive machine-learning model to classify new individuals.
Investigator:
Pan, Wei
Institution:
University of Minnesota
Project Title:
Powerful and novel statistical methods to detect genetic variants associated with or putative causal to Alzheimer’s disease
Date of Approval:
March 25, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
We have been developing more powerful statistical methods to detect common variant (CV)- or rare variant (RV)-complex trait associations and/or putative causal relationships for GWAS and DNA sequencing data. Here we propose applying our new methods, along with other suitable existing methods, to the existing ADSP sequencing data and other AD GWAS data provided by NIA, hence requesting approval for accessing the ADSP sequencing and other related GWAS/genetic data. We have the following two specific Aims: Aim 1. Association testing under genetic heterogeneity: For complex traits, genetic heterogeneity, especially of RVs, is ubiquitous, as is well acknowledged in the literature; however, there is barely any existing methodology to explicitly account for genetic heterogeneity in association analysis of RVs based on a single sample/cohort. We propose using secondary and other omic data, such as transcriptomic or metabolomic data, to stratify the given sample, then apply a weighted test to the resulting strata, explicitly accounting for genetic heterogeneity in that causal RVs may be different (with varying effect sizes) across unknown and hidden subpopulations. Some preliminary analyses have confirmed power gains of the proposed approach over the standard analysis. Aim 2. Meta-analysis of RV tests: Although it has been well appreciated that it is necessary to account for varying association effect sizes and directions in meta-analysis of RVs for multi-ethnic cohorts, existing tests are not highly adaptive to varying association patterns across the cohorts and across the RVs, leading to power loss. We propose a highly adaptive test based on a family of SPU tests, which cover many existing meta-analysis tests as special cases. Our preliminary results demonstrated possibly substantial power gains.
Non-Technical Research Use Statement:
We propose applying our newly developed statistical analysis methods, along with other suitable existing methods, to the existing ADSP sequencing data and other AD GWAS data to detect common or rare genetic variants associated with Alzheimer’s disease (AD). The novelty and power of our new methods are in two aspects: first, we consider and account for possible genetic heterogeneity with several subcategories of AD; second, we apply powerful meta-analysis methods to combine the association analyses across multiple subcategories of AD. The proposed research is feasible, promising and potentially significant to AD research. In addition, our proposed analyses of the existing large amount of ADSP sequencing data and other AD GWAS data with our developed new methods are novel, powerful and cost-effective.
Investigator:
Paré, Guillaume
Institution:
McMaster University
Project Title:
Rare Variant Polygenic Risk Scores for Alzheimer's Disease in Hispanic/Latinx Populations
Date of Approval:
May 2, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Late-onset Alzheimer’s Disease (LOAD) affects over 46 million people worldwide, a number expected to double by 2030 and triple by 2050. Accumulating evidence supports a strong genetic component underpinning AD etiology. However, genetic studies of AD have focused primarily on assessing the impact of common variants (either in the APOE epsilon 4 allele or traditional polygenic scores) in European populations. Indeed, there is a sparsity of evidence demarcating the role of rare coding variants on LOAD in European and non-European populations alike, which could provide invaluable insight toward the genetic determinants of LOAD since rare variants are collectively numerous and more deleterious relative to common variants. Using the ethnically diverse ADSP WES data, we aim to systematically demarcate the effect of rare variants on LOAD by constructing a rare variant polygenic risk score (rvPRS), which captures the gene-based burden of rare variants across the genome. The Rare Variant EXome CALIBration using External Repositories (RV-EXCALIBER) method will be used to conduct a case-control exome-wide association study using gene burden testing to delineate genes that harbour an enrichment of rare, deleterious variants in LOAD cases relative to controls from the genome aggregation database (gnomAD). Gene-based effect sizes will be used to construct an rvPRS in an independent ADSP validation population, which will be used in multivariable logistic regression models to predict LOAD status after adjusting for age, sex, APOE epsilon 4 allele, and principal components of ancestry. We also aim to assess the transferability of the predictive power of the rvPRS across the European, African, and Hispanic ancestries present within the ADSP.
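As a rough illustration of how a gene-based score of this kind could be assembled, the sketch below sums gene-level effect sizes over the genes in which an individual carries at least one qualifying rare variant. This is a minimal sketch under stated assumptions, not the RV-EXCALIBER implementation: the function name `rare_variant_prs`, the carrier-indicator weighting, and the gene labels and effect sizes are all hypothetical placeholders.

```python
def rare_variant_prs(gene_effects, carrier_genes):
    """Toy rare-variant score for one individual.

    gene_effects:  dict mapping gene label -> gene-based effect size
                   (e.g. a log odds ratio from burden testing; values here are made up).
    carrier_genes: set of gene labels in which the individual carries
                   at least one qualifying rare variant.

    Assumption: each carried gene contributes its effect size once,
    regardless of how many rare variants fall in that gene.
    """
    return sum(beta for gene, beta in gene_effects.items() if gene in carrier_genes)


# Hypothetical effect sizes for three placeholder genes
effects = {"GENE_A": 0.5, "GENE_B": -0.25, "GENE_C": 1.0}

# An individual carrying qualifying variants in GENE_A and GENE_C
score = rare_variant_prs(effects, {"GENE_A", "GENE_C"})
print(score)
```

In a workflow like the one described, a score of this form would then enter a logistic regression alongside covariates such as age, sex, APOE status, and ancestry principal components.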
Non-Technical Research Use Statement:
Late-onset Alzheimer’s Disease (LOAD) affects over 46 million people worldwide, a number expected to double by 2030 and triple by 2050. Evidence supports that genetic factors underlie AD development. To date, genetic studies have primarily focused on a small subset of all genetic variants that occur commonly in the population. However, it has been shown that variants that are rare can aid in disease prediction and are better at identifying genes that cause disease. Using the ADSP population, we aim to develop a score based on rare variants that can help identify individuals at high risk for AD from various ethnic backgrounds.
Investigator:
PARIDA, LAXMI
Institution:
IBM Thomas J Watson Research Center
Project Title:
WAGE ADSP Data Analysis
Date of Approval:
June 17, 2022
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
The “Watson Alzheimer’s Genetics Experiment” (WAGE) is a collaboration between IBM TJ Watson Research Center, the Center for Genomics of Alzheimer’s Disease (CGAD U54 AG052427) and AD geneticists at the University of Pennsylvania, Indiana University, Columbia University, and Indiana University (Alzheimer’s Disease Neuroimaging Initiative [ADNI]). We plan to analyze whole genome sequence data generated from subjects with Alzheimer's disease (AD) and elderly normal controls obtained from “R3 17K WGS Project Level VCF” and “phenotypes/pedigree for all subjects”. These data were generated by the National Human Genome Research Institute Large-Scale Sequence Program, the Alzheimer’s Disease Neuroimaging Initiative, and National Institute on Aging funded investigators. The goal of the planned analyses is to identify genes that have alleles that protect against or increase susceptibility to AD. We will examine both single nucleotide variants and structural variants (indels, deletions, insertions, etc.). We will use whole genome sequence data for AD cases from the Alzheimer’s Disease Sequencing Project (ADSP). We will use control data from the ADSP and ADNI.
Non-Technical Research Use Statement:
We seek to understand what machine learning algorithms can tell us about Alzheimer’s disease. We are applying machine learning algorithms to all the inherited elements that contribute to Alzheimer's disease risk and characterizing their statistical power to resolve genome-wide significant alleles. To do this we will analyze DNA sequence data from subjects with Alzheimer's disease and elderly subjects who are cognitively normal. The sequence data from these 2 groups will be compared to identify differences that contribute to the risk of developing Alzheimer's disease or that protect against Alzheimer's disease. These DNA differences can be at a single site in the genetic code, or can span multiple sites, changing the copy number of DNA sequences. Both types of genetic variants will be examined.
Investigator:
Park, Peter
Institution:
Harvard Medical School
Project Title:
Examining the association between clonal hematopoiesis and Alzheimer's Disease
Date of Approval:
December 3, 2019
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Recent projects characterizing genomic variation across large numbers of individuals have revealed that somatic mutations driving clonal expansion in hematopoietic cells occur as part of human aging. This phenomenon is associated with a number of adverse outcomes, including increased mortality, cardiovascular disease risk, and risk of hematological malignancy. The aim of this proposal is to assess what (if any) association clonal hematopoiesis (CH) has with Alzheimer’s disease (AD) or dementia. We will use the available exome and whole-genome sequencing to look for somatic mutations associated with CH. In general, distinguishing germline mutations from somatic mutations is non-trivial within a single sample. However, somatic and germline variants are expected to differ in their variant allele fraction distributions. Additionally, many somatic mutations associated with CH are thought to cause severe developmental disease when they occur in the germline (e.g., loss of function in DNMT3A is associated with Tatton-Brown-Rahman syndrome). The poor prognosis of affected patients should make germline mutations in these genes rare. After identifying participants with CH, we will use standard statistical methods (e.g. Fisher's exact test) to determine if CH has any association with AD phenotype. We will also look for sex-, race-, ethnicity-, and APOE-specific effects.
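The association test mentioned above can be sketched as a two-sided Fisher's exact test on a 2x2 table of CH status against AD status. The version below is a small standard-library sketch, not the investigators' code; the function name `fisher_exact_two_sided` and the counts in the example are invented for illustration.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Rows could be CH carriers / non-carriers and columns AD cases / controls.
    The p-value sums the hypergeometric probabilities of every table with the
    same margins that is no more likely than the observed one.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Probability that the top-left cell equals x, given fixed margins
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    p_observed = prob(a)
    # Small relative tolerance guards against floating-point ties
    total = sum(prob(x) for x in range(lowest, highest + 1)
                if prob(x) <= p_observed * (1 + 1e-9))
    return min(1.0, total)


# Invented counts: 3 CH carriers with AD, 1 without; 1 non-carrier with AD, 3 without
p_value = fisher_exact_two_sided(3, 1, 1, 3)
print(round(p_value, 4))
```

With real cohort sizes, a library routine would normally be preferred; the hand-rolled version is shown only to make the logic of the test explicit.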
Non-Technical Research Use Statement:
Recent projects characterizing genomic variation across large numbers of individuals have revealed that somatic mutations driving clonal expansion in hematopoietic cells occur as part of human aging. This phenomenon is associated with a number of adverse outcomes, including increased mortality, cardiovascular disease risk, and risk of hematological malignancy. The aim of this proposal is to assess what (if any) association clonal hematopoiesis (CH) has with Alzheimer’s disease (AD) or dementia.
Investigator:
Park, Young Ho
Institution:
Seoul National University Bundang Hospital
Project Title:
Diagnosis of Alzheimer’s disease using GWAS, blood transcriptome and blood biomarkers
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
- Objectives of the proposed research: The primary objective of this research is to validate the findings from our analysis of whole genome sequencing data in Koreans by replicating them using the ADSP dataset, particularly focusing on individuals from diverse ethnic backgrounds. Our previous analyses aimed to identify associations between genetic variants and Alzheimer's disease (AD), with a specific focus on early-onset Alzheimer's disease (EOAD), by analyzing whole genome sequencing data and performing both variant-level and gene-level statistical analyses. By leveraging the requested ADSP data, we aim to confirm these associations and improve the accuracy of our findings. - Study design: This research follows a cross-sectional design. We will utilize data from EOAD patients and control groups recruited at Seoul National University Bundang Hospital to perform rare-variant association analyses, focusing on their association with AD diagnosis. Based on the results of our analysis, we will conduct replication studies using the ADSP dataset, which includes participants of various ethnicities and comprehensive phenotypes. This study is part of our larger project, "Diagnosis of Alzheimer’s disease using GWAS, blood transcriptome and blood biomarkers", which aims to integrate genetic findings with transcriptome data and other biomarkers to improve the diagnostic accuracy of AD. - Analysis plan, including the phenotypic characteristics that will be evaluated in association with genetic variants: The analysis using the ADSP dataset will focus on replicating previous findings related to EOAD diagnosis using whole genome sequencing data in Koreans. We will evaluate the association between variants identified in our analysis and various phenotypic characteristics in the ADSP dataset, such as cognitive function and biomarkers, including β-amyloid, phosphorylated tau, and other Alzheimer-related markers measured by different tools. 
- Collaborations: There are no planned collaborations with researchers at other institutions for this study.
Non-Technical Research Use Statement:
This study aims to better understand the genetic factors associated with early-onset Alzheimer's disease (EOAD), particularly among diverse populations. By analyzing whole genome sequencing data from Koreans, we have identified certain genetic variants linked to EOAD. We will now replicate these findings using data from the Alzheimer's Disease Sequencing Project (ADSP), which includes individuals of various ethnicities. Through this research, we hope to improve the accuracy of Alzheimer's disease diagnosis by identifying genetic risk factors that contribute to the development of the disease, ultimately aiding in early detection and treatment.
Investigator:
Park, Young Ho
Institution:
Seoul National University Bundang Hospital
Project Title:
Genome-wide search for pleiotropy in ischemic stroke, normal pressure hydrocephalus, and neuropathology endophenotypes linked to Alzheimer’s disease
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
- Objectives of the proposed research: Accumulating evidence indicates that ischemic stroke (IS) and normal pressure hydrocephalus (NPH) share common risk factors and critical mechanisms with Alzheimer’s disease (AD). This study aims to identify pleiotropy between (1) IS and AD-associated neuropathology endophenotypes (NPEs) and (2) NPH and AD-associated NPEs. Based on the identified pleiotropic variants or loci, we will also investigate shared pathways underlying biological mechanisms of IS, NPH, and AD. - Study design: We will conduct genome-wide pleiotropy analyses using summary statistics obtained from previous genome-wide association studies (GWAS) for IS, NPH, and AD-associated NPEs. Along with GWAS summary statistics for NPEs (Shade et al., 2024, Nat Genet), we will use GWAS summary statistics for IS (Malik et al., 2018, Nat Genet) and NPH (Rasanen et al., 2024, Neurology), generated in individuals of European ancestry. - Analysis plan: Two separate genome-wide pleiotropy analyses will be conducted: (1) pleiotropy GWAS for IS and AD-associated NPEs and (2) pleiotropy GWAS for NPH and AD-associated NPEs. We will evaluate pleiotropy for variants with minor allele frequency (MAF) > 0.01 and Z-scores < 80 using the R package PLACO, which examines the composite null hypothesis that a given variant is associated with a maximum of one trait, such that rejecting this hypothesis implies that the variant influences both traits and is thus pleiotropic. For top-ranked loci containing genome-wide significant pleiotropic variants, we will differentiate horizontal pleiotropy from mediated pleiotropy by conducting Bayesian colocalization analysis using the R package COLOC. We also plan to conduct pathway enrichment analyses for top-ranked pleiotropic variants to identify pathways underlying biological mechanisms shared by IS, NPH, and AD. - Planned collaboration: There are no planned collaborations with researchers at other institutions for this study.
Non-Technical Research Use Statement:
Accumulating evidence indicates that ischemic stroke (IS) and normal pressure hydrocephalus (NPH) share common risk factors and critical mechanisms with Alzheimer’s disease (AD). Although large-scale genome-wide association studies (GWAS) have identified many variants or loci associated solely with IS, NPH, or AD, only a few studies have investigated shared genetic factors or pathways underlying their biological mechanisms. This study aims to identify variants or loci that simultaneously influence (1) AD-associated neuropathology endophenotypes (NPEs) and IS or (2) AD-associated NPEs and NPH. Using GWAS summary statistics for each disease, we will conduct genome-wide pleiotropy analyses, differentiate horizontal pleiotropy (direct genetic effect on AD or through mechanisms that bypass IS or NPH) from mediated pleiotropy (genetic effect on AD through IS or NPH), and investigate biological pathways implicated in shared mechanisms of IS, NPH, and AD. Our findings may provide insight into the genetic basis and underlying mechanisms shared by IS, NPH, and AD.
Investigator:
Parrado, Antonio
Institution:
Janssen R&D
Project Title:
Extensive search for variants that protect or elevate the risk of Alzheimer's Disease
Date of Approval:
December 18, 2020
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: The aim of our analyses of the Alzheimer’s Disease Sequencing Project (ADSP) cohort data is to improve the identification of reliable gene-drug targets to develop effective therapeutic medicines to prevent or slow down the progression of Alzheimer’s disease in patients. Study design: Our goal is to identify novel and reliable gene targets to develop effective therapeutic medicines to prevent or reduce the human burden caused by Alzheimer’s disease. To identify variants that confer risk of or provide protection against Alzheimer’s disease, it is essential to obtain deep sequencing (whole-genome and whole-exome) in families with highly penetrant variants and in case-control populations. We will capitalize on the Discovery Phase and Extension Phase cohorts that include WGS data in families by performing family-based association analyses. Additionally, we will perform case-control association analyses on the whole-exome sequence data from the Discovery Phase Case/Control, Extension Case/Control, FUS1, and FUS2 cohorts. Analysis Plan: We plan to analyze the WGS and WES data with several phenotype-variant analysis approaches. We will perform common variant, rare variant gene-based (i.e. stop-gain, frameshift, putatively deleterious non-synonymous, and splice-site variants), pathway-based, and sex-stratified analyses. We plan to perform association analysis with a dichotomous outcome (i.e. affected/unaffected) and with neuropathology quantitative measures (where available). We have expertise in several analysis software packages for the proposed analysis; they include PLINK, PLINKSeq, MENDELSCAN, and varianttools. We plan to analyze the ADSP cohort and other Alzheimer's disease cohorts independently (i.e. UKBB) and to combine the summary statistics (odds ratios and p-values) by meta-analysis. The planned research is consistent with the data use limitations/restrictions for the requested dataset(s), and we promise to follow all regulations within. 
Our proposed research will support all conditions specified in the Data Use Agreements associated with the study and will not violate relevant privacy or consent policies.
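The statement above mentions combining odds ratios and p-values across cohorts by meta-analysis without naming a method. One common option, shown here purely as an assumption, is an inverse-variance fixed-effect meta-analysis in which each cohort's standard error is back-calculated from its odds ratio and two-sided p-value. The function names and the example numbers are hypothetical, and the sketch requires every odds ratio to differ from 1 and every p-value to lie strictly between 0 and 1.

```python
from math import exp, log, sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()


def se_from_or_and_p(odds_ratio, p_value):
    """Recover the standard error of log(OR) from a two-sided Wald p-value."""
    z = _STD_NORMAL.inv_cdf(1 - p_value / 2)
    return abs(log(odds_ratio)) / z


def fixed_effect_meta(studies):
    """Inverse-variance fixed-effect meta-analysis.

    studies: list of (odds_ratio, two_sided_p_value) pairs, one per cohort.
    Returns (pooled_odds_ratio, pooled_two_sided_p_value).
    """
    weight_sum = 0.0
    weighted_beta_sum = 0.0
    for odds_ratio, p_value in studies:
        se = se_from_or_and_p(odds_ratio, p_value)
        weight = 1.0 / se ** 2          # inverse-variance weight
        weight_sum += weight
        weighted_beta_sum += weight * log(odds_ratio)
    pooled_beta = weighted_beta_sum / weight_sum
    pooled_se = sqrt(1.0 / weight_sum)
    z = pooled_beta / pooled_se
    pooled_p = 2 * (1 - _STD_NORMAL.cdf(abs(z)))
    return exp(pooled_beta), pooled_p


# Hypothetical summary statistics for the same variant in two cohorts
pooled_or, pooled_p = fixed_effect_meta([(1.5, 0.05), (1.5, 0.05)])
print(round(pooled_or, 3), pooled_p < 0.05)
```

Two cohorts with identical, concordant estimates keep the same pooled odds ratio while the pooled p-value shrinks, which is the behaviour that motivates combining cohorts in the first place.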
Non-Technical Research Use Statement:
The aim of our analyses of the Alzheimer’s Disease Sequencing Project (ADSP) cohort data is to unravel the genetic architecture of AD, with the objective of identifying reliable gene-drug targets through various family-based and population-based statistical analyses, then prioritizing molecular targets and developing effective therapeutic medicines to prevent or slow down the progression of Alzheimer’s disease in patients.
Investigator:
Pascoal, Tharick
Institution:
University of Pittsburgh
Project Title:
Exploring the association between attention-deficit/hyperactivity disorder and cognitive decline
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: ADHD persists throughout the lifespan and has been linked with higher risk for MCI and AD dementia based on large epidemiological studies. Recent findings from our group indicated a correlation between higher genetic liability for ADHD and progressive cognitive decline, as well as the development of AD pathophysiology, in cognitively unimpaired older adults with amyloid deposition at baseline. We aim to further explore the association between ADHD and cognitive decline by investigating the role of genetic risk for ADHD in the cognitive profile, as well as amyloid and tau deposition, in individuals diagnosed with AD or MCI. For that, we will calculate the ADHD-PRS, which is a valid biomarker of ADHD pathology, in individuals that are part of the Alzheimer’s Disease Sequencing Project. Study design: This study will analyze cross-sectional and longitudinal data from cohorts included in the Alzheimer’s Disease Sequencing Project. Analysis plan: We will calculate ADHD-PRS based on the latest GWAS. We will use linear and mixed regression models to test the association between ADHD-PRS and cognitive function (executive function, language, memory, and visuospatial) in cognitively unimpaired, MCI, and AD individuals. Analyses will be controlled for age, sex, and ancestry. Linear and mixed effect models will be used to evaluate the association between ADHD-PRS and fluid biomarkers (amyloid and phosphorylated tau), as well as neuropathology markers of amyloid deposition and neurofibrillary degeneration. We will conduct sensitivity analysis to explore the confounding effects of education, vascular risk factors (using clinical data or post-mortem markers of vascular brain injury), and psychiatric comorbidities (by calculating PRS of major depressive disorder, bipolar disorder, autism spectrum disorder, and schizophrenia). 
We hypothesize that higher ADHD-PRS will be associated with longitudinal decline in memory and executive function, as well as higher markers of tau pathology. Based on prior results from our group, we also hypothesize that these findings will be observed in individuals with amyloid deposition at baseline.
Non-Technical Research Use Statement:
ADHD is a common neurodevelopmental disorder that persists throughout the lifespan. Recent large epidemiological studies have indicated an increased risk for AD and MCI among individuals with ADHD. The underlying mechanisms linking ADHD and cognitive decline remain unclear, but prior data published by our group suggest that individuals with ADHD have reduced resilience to amyloid pathology, leading to a decline in cognition at lower pathological levels. The main goal of this project is to further investigate the association between ADHD and cognitive decline, tau, and amyloid deposition in individuals diagnosed with AD or MCI. For that, we plan to utilize GWAS data from the Alzheimer’s Disease Sequencing Project to calculate the ADHD polygenic risk score (ADHD-PRS), which is a validated marker of ADHD pathology. We will evaluate the association between ADHD-PRS and cognitive function, as well as fluid biomarkers of amyloid and tau pathology. As a secondary goal, we aim to investigate the confounding role of the genetic risk for other psychiatric disorders in these associations.
Investigator:
Pathak, Gita
Institution:
Institute for Genomic Health, Genetics and Genomic Sciences at Mount Sinai
Project Title:
Multi-modal analysis of psychiatric and dementia outcomes
Date of Approval:
June 15, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
a. Objectives of the Proposed Research This study aims to investigate the relationship between psychiatric traits and age-related cognitive decline, addressing a critical knowledge gap in understanding how mental health influences aging outcomes. b. Study Design The study employs a multi-level investigative approach combining epidemiological, genetic, and molecular methodologies. The design incorporates three complementary components: first, identification of phenotypic associations between psychiatric traits and MCI/AD through comprehensive clinical assessment; second, investigation of genetic architecture through analysis of coding and non-coding variants, genetic correlation assessments, polygenic scoring, and Mendelian randomization for causal inference; and third, examination of molecular mechanisms through genetically regulated epigenetic and proteomic processes. The study design enables stratified analyses by sex and ethnicity while controlling for demographic and lifestyle confounders, providing a comprehensive framework for understanding the psychiatric-cognitive decline relationship across multiple biological levels. c. Analytical Plan The analytical approach will proceed in sequential phases, beginning with statistical modeling to identify psychiatric traits significantly associated with MCI and AD outcomes while adjusting for demographic and lifestyle factors. Genetic analyses will employ polygenic risk scores and Mendelian randomization techniques to establish causal relationships between psychiatric conditions (particularly depression and alcohol use disorder) and cognitive outcomes. Molecular analyses will focus on identifying shared genetic loci between psychiatric and cognitive phenotypes, followed by investigation of genetically regulated methylation and proteomic markers as potential mediators. 
The analysis plan includes development of molecular weights to aid causal inference analyses and determination of effect directionality, with stratified results reported by sex and ethnicity to identify population-specific risk patterns and potential intervention targets.
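As a rough illustration of the Mendelian randomization step named in the analytical plan, the sketch below computes a fixed-effect inverse-variance-weighted (IVW) causal estimate from per-variant summary statistics. The choice of IVW is an assumption made for illustration; the statement does not say which MR estimator will be used. The function name and the numbers in the usage lines are hypothetical.

```python
import math


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect IVW Mendelian randomization estimate.

    Equivalent to a weighted regression through the origin of the
    variant-outcome effects on the variant-exposure effects, with
    weights 1 / se_outcome**2. Returns (estimate, standard_error).
    """
    if not (len(beta_exposure) == len(beta_outcome) == len(se_outcome)):
        raise ValueError("all inputs must have one entry per variant")
    numerator = sum(
        bx * by / se ** 2
        for bx, by, se in zip(beta_exposure, beta_outcome, se_outcome)
    )
    denominator = sum(
        bx ** 2 / se ** 2 for bx, se in zip(beta_exposure, se_outcome)
    )
    if denominator == 0:
        raise ValueError("exposure effects are all zero")
    return numerator / denominator, math.sqrt(1.0 / denominator)


# Made-up summary statistics for three instruments (illustrative only).
estimate, std_error = ivw_estimate(
    beta_exposure=[0.1, 0.2, 0.4],
    beta_outcome=[0.05, 0.10, 0.20],
    se_outcome=[0.01, 0.02, 0.02],
)
print(estimate, std_error)
```

In practice an analysis of this kind would also check instrument strength and horizontal pleiotropy, which this sketch does not attempt.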
Non-Technical Research Use Statement:
This research examines how mental health conditions like depression and anxiety may increase the risk of memory problems and Alzheimer's disease as people age. Using genetic data and biological markers, we'll study whether psychiatric conditions directly cause cognitive decline or if they share common underlying causes. The study will identify which mental health factors pose the greatest risk for dementia, particularly looking at differences between men and women and various ethnic groups. Results could help better predict and prevent cognitive decline by addressing mental health early in life, potentially improving outcomes for millions facing both psychiatric and age-related brain conditions.
Investigator:
Pendergrass, Rion
Institution:
Genentech
Project Title:
Genetic Analyses Using Data from the Alzheimer’s Disease Sequencing Project (ADSP) and related studies
Date of Approval:
June 25, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The purpose of our study is to identify novel genetic factors associated with Alzheimer’s Disease, corticobasal degeneration (CBD) and progressive supranuclear palsy (PSP). This includes identifying genetic factors associated with the risk of these conditions, as well as genetic risk factors associated with age-at-onset (AAO) for these conditions. We will also evaluate genetic associations with sub-phenotypes individuals have within these broad disease categories, such as their Braak staging results, which provide insights into the level of severity of Alzheimer’s. Thus we are requesting access to the set of genomic Whole Exome and Whole Genome Sequences (WES and WGS) that have just been released through the National Institute on Aging Genetics of Alzheimer’s Disease Data Storage Site (DSS NIAGADS). The findings from our genetic association testing have the potential for identification of new therapeutic targets for Alzheimer's Disease, CBD, and PSP. The findings from our studies also have the potential for identification of genetic and phenotypic biomarkers that will be beneficial for subsetting patients in new ways. We will use standard genetic epidemiological methods to handle the WGS and WES data. All data will remain anonymized and securely stored, and only those listed on our application and their staff will have access to these data. We will not share any of the individual level data outside of Genentech nor beyond the researchers on our application. We will adhere to all data use agreement stipulations through the DSS NIAGADS. We have a secure computational environment called Rosalind within Genentech where we will use these data. We have IT security staff that constantly monitor all our research computing, assuring safety and privacy of all of our stored data. We will not collaborate with researchers at other institutions.
Non-Technical Research Use Statement:
Genetic variation allows us to understand more of the genetic contribution to risk and protection from diseases such as Alzheimer’s and dementia. This information also allows us to identify important biological contributors to disease for developing effective treatment strategies, and identifying groups of individuals that would benefit most from new treatments. Our exploration of this relationship between genotype and disease traits and outcomes through these datasets will allow us to pursue important new findings for disease treatment.
Investigator:
Pendergrass, Rion
Institution:
Genentech
Project Title:
Genetic Analyses Using Data from MiGA and related studies
Date of Approval:
April 10, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The purpose of our study is to identify novel genetic factors associated with age-related neurodegeneration. This includes identifying genetic factors associated with the risk of these conditions, as well as genetic risk factors associated with age-at-onset (AAO) for these conditions. The findings from our analyses have the potential for identification of new therapeutic targets for Alzheimer's Disease and other age-related neurodegenerative diseases. The findings from our analyses also have the potential for identification of genetic and phenotypic biomarkers that will be beneficial for subsetting patients in new ways. Using the data we have requested, we will identify genes driving neurodegenerative diseases by finding dysregulated genes in cases using total and allele specific gene expression profiles. Genotypes and RNA-seq reads will be used to generate allele specific expression (ASE). RNA-seq counts and ASE from controls will be used to model the variance of both total and ASE gene expression. Total gene expression vs ASE specifically from cases will be used to identify dysregulated genes in single individuals. These will then be compared to pathways and known disease-associated genes. Case/control status, genotype, and RNA-seq data will all be evaluated together through quantitative trait loci (QTL) analyses and additional statistical association analyses. All data will remain anonymized and securely stored, and only those listed on our application and their staff will have access to these data. We will not share any of the individual level data outside of Genentech nor beyond the researchers on our application. We will adhere to all data use agreement stipulations through the NIAGADS. We have a secure computational environment called Rosalind within Genentech where we will use these data. We have IT security staff that constantly monitor all our research computing, assuring safety and privacy of all of our stored data. 
We will not collaborate with researchers at other institutions.
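To make the allele specific expression (ASE) idea in the statement above concrete, here is a minimal sketch of one common way to flag allelic imbalance at a single heterozygous site: an exact two-sided binomial test of reference versus alternate read counts against an expected 50/50 split. This is an assumption chosen for illustration, not the applicants' stated method (they describe modelling variance from controls), and the function name is hypothetical.

```python
from math import comb


def allelic_imbalance_p(ref_reads, alt_reads, expected_ref_fraction=0.5):
    """Exact two-sided binomial p-value for allelic imbalance.

    Sums the probabilities of every outcome that is no more likely than
    the observed reference read count. Suitable for modest read depths;
    very deep sites would need a log-space or library implementation.
    """
    total = ref_reads + alt_reads
    if total == 0:
        raise ValueError("no reads cover this site")
    p = expected_ref_fraction
    probabilities = [
        comb(total, k) * p ** k * (1 - p) ** (total - k)
        for k in range(total + 1)
    ]
    observed = probabilities[ref_reads]
    # Small relative tolerance so ties in probability are counted.
    tail = sum(q for q in probabilities if q <= observed * (1 + 1e-9))
    return min(1.0, tail)


# Balanced expression versus complete allelic imbalance at depth 10.
print(allelic_imbalance_p(5, 5))
print(allelic_imbalance_p(10, 0))
```

A real pipeline would also need to account for reference mapping bias and overdispersion, which a plain binomial model ignores.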
Non-Technical Research Use Statement:
Genetic variation and gene expression data allow us to understand more of the genetic contribution to risk and protection from diseases such as Alzheimer’s and dementia. This information also allows us to identify important biological contributors to disease for developing effective treatment strategies, and identifying groups of individuals that would benefit most from new treatments. Our exploration of this relationship between genotype, disease traits, gene expression, and outcomes through these datasets will allow us to pursue important new findings for disease treatment.
Investigator:
Pericak-Vance, Margaret
Institution:
University of Miami
Project Title:
Collaboration on Alzheimer Disease Research
Date of Approval:
June 3, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We plan to analyze GWAS, whole exome and whole genome sequence data generated from subjects with Alzheimer's disease (AD) and elderly normal controls. The goal of the planned analyses is to identify genes that have alleles that protect against or increase susceptibility to AD. We will evaluate variants detected in the sequence data for association with AD to identify protective and susceptibility alleles using the whole exome and whole genome case-control data. We will also evaluate sequence data from multiplex AD families to identify variants associated with AD risk and protection, and evaluate variant co-segregation with AD. The family data will be whole genome data. The family-based data will be used to inform the case-control analysis and vice versa. We will also focus on structural variants (insertion-deletions, copy number variants, and chromosomal rearrangements). Evaluation of structural variants will involve both whole genome and whole exome data. Structural variants will be analyzed together with the single nucleotide variants detected and analyzed in the case-control and family-based data.
Non-Technical Research Use Statement:
We are attempting to identify all the inherited elements that contribute to Alzheimer's disease risk. To do this we will analyze DNA sequence data from subjects with Alzheimer's disease and elderly subjects who are cognitively normal. The sequence data from these two groups will be compared to identify differences that contribute to the risk of developing Alzheimer's disease or that protect against Alzheimer's disease. These DNA differences can be at a single site in the genetic code, or can span multiple sites, changing the copy number of DNA sequences. Both types of genetic variants will be examined.
Investigator:
Pettine, Warren
Institution:
University of Utah
Project Title:
Benchmarking and fine-tuning of genomic foundation models.
Date of Approval:
January 30, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: To develop and validate methods for assessing how genomic foundation models generalize to neuropsychiatric conditions, specifically Alzheimer's disease and related dementias (ADRD). We aim to:
- Evaluate foundation models' ability to capture disease-relevant genomic patterns
- Develop fine-tuning approaches to improve model performance
- Create benchmarks for assessing model generalization to psychiatric genetics
- Establish reproducible pipelines for model evaluation and optimization
Study Design: This secondary data analysis study will utilize NIAGADS datasets to:
- Extract embedding representations from pre-trained genomic foundation models
- Analyze model performance across different cohorts and phenotypes
- Develop and test fine-tuning methods
- Create standardized evaluation metrics
Analysis Plan:
Phase 1: Data Preparation
- Harmonize genomic and phenotypic data across cohorts
- Format data for compatibility with foundation models
- Implement quality control measures
- Create validation datasets
Phase 2: Model Evaluation
Phenotypic characteristics to be evaluated:
- Clinical diagnosis status (AD/ADRD vs. control)
- Cognitive performance metrics
- Fluid biomarker levels
- Neuropathological findings
- Cardiovascular risk factors
- Neuroimaging metrics (DTI, FLAIR, PET, T1)
Analysis methods:
- Embedding space analysis
- Layer-wise representation analysis
- Linear separability testing
- Correlation analysis
- Target decoding performance
- Distribution matching assessment
Phase 3: Fine-tuning Development
- Implement distribution matching techniques
- Develop data augmentation strategies
- Create iterative optimization procedures
- Establish performance benchmarks
All data access and analysis will be conducted under approved institutional protocols with appropriate data security measures in place.
Non-Technical Research Use Statement:
Our research aims to improve how artificial intelligence (AI) systems understand genetic information related to Alzheimer's disease and dementia. Using existing, anonymized genetic data from research participants, we will test and enhance AI systems' ability to identify genetic patterns associated with these conditions. Just as AI has been trained to understand language and images, we will develop methods to help AI better process genetic information. The goal is to create tools that help researchers better understand the genetic factors contributing to Alzheimer's disease, potentially supporting future advances in early detection and treatment.
Investigator:
Piras, Ignazio
Institution:
Translational Genomics Research Institute
Project Title:
Generation of an Alzheimer’s Disease Polygenic Risk Score using a neuropathological confirmed cohort
Date of Approval:
March 30, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objective of the proposed research: Generate an AD polygenic risk score (PRS) using neuropathologically confirmed AD cases and controls (ND). The PRS will be used to assess the risk of AD in other neurological and psychiatric diseases and to identify potential risk factors in large population studies accounting for the genetic risk. Study design: We will select the neuropathologically confirmed AD and ND samples, and their genotypes will be used to conduct a genome-wide association study (GWAS), whose results will be used to estimate the PRS. Analysis plan: VCF files will undergo quality control with the following inclusion criteria: QUAL >30, DP >10, and GQ >20. We will exclude variants with allelic balance for heterozygous calls <0.3 or >0.7. We will include only SNPs with genotype rate >95%, MAF >0.05, and Hardy-Weinberg equilibrium p > 1.0E-06, and only samples with genotype rate >95%. Sex-mismatch checks will be conducted using the SNPs located on the X chromosome. Duplicated samples will be identified using the pi-hat metric, removing samples with pi-hat >0.7. Outliers will be identified by identity-by-state statistics. The GWAS will be conducted using BOLT-LMM (PMID:29892013), adjusting for the top 10 PCs and for sex. PRS will be estimated using the method described by Marees et al. (PMID:29484742) and the PRSice-2 software (PMID:31307061). A wide range of neurological and psychiatric diseases will be used as target datasets to assess the risk of AD. The PRS will also be used to investigate AD risk factors accounting for the genetic risk. For the method by Marees et al., we will select the significant SNPs (p<5.0E-08), and the number of risk alleles present in each individual in the target dataset will be counted, multiplied by the corresponding GWAS effect size, and finally summed across all SNPs to give each individual's PRS. The method implemented in PRSice-2 uses a wide range of p-value thresholds, optimizing the cutoff according to prediction accuracy for the target phenotype. 
This project will be conducted in collaboration with Drs. Eric Reiman and Valentina Ghisays (Banner Alzheimer's Institute, Phoenix, AZ). Only the PI (Dr. Piras) will have access to the data.
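The quality-control thresholds and the score calculation in the analysis plan above can be sketched in a few lines. The version below is a hedged illustration only: the function names and the dictionary layout are hypothetical, and the score weights each allele count by a per-allele GWAS effect size (for example a beta or log odds ratio), which is the usual polygenic score weighting.

```python
def passes_genotype_qc(qual, depth, genotype_quality, alt_read_fraction=None):
    """Apply the stated call-level filters: QUAL >30, DP >10, GQ >20.

    alt_read_fraction is the allelic balance of a heterozygous call
    (fraction of reads supporting the alternate allele); pass None for
    homozygous calls, where the balance filter does not apply.
    """
    if qual <= 30 or depth <= 10 or genotype_quality <= 20:
        return False
    if alt_read_fraction is not None and not 0.3 <= alt_read_fraction <= 0.7:
        return False
    return True


def passes_snp_qc(call_rate, minor_allele_frequency, hwe_p_value):
    """Apply the stated SNP-level filters: call rate, MAF, and HWE."""
    return (
        call_rate > 0.95
        and minor_allele_frequency > 0.05
        and hwe_p_value > 1.0e-6
    )


def polygenic_score(allele_counts, effect_sizes):
    """Sum of effect-allele counts (0, 1, or 2) times GWAS effect sizes.

    Only SNPs present in both dictionaries contribute to the score.
    """
    shared = set(allele_counts) & set(effect_sizes)
    return sum(allele_counts[snp] * effect_sizes[snp] for snp in shared)


# Hypothetical individual and effect sizes, for illustration only.
counts = {"snp_a": 2, "snp_b": 1, "snp_c": 0}
betas = {"snp_a": 0.10, "snp_b": -0.20, "snp_c": 0.50}
print(passes_genotype_qc(50, 20, 30, alt_read_fraction=0.5))
print(polygenic_score(counts, betas))
```

Tools such as PRSice-2 additionally perform clumping and p-value threshold selection, which this sketch leaves out.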
Non-Technical Research Use Statement:
The research aims to develop a polygenic risk score (PRS) for Alzheimer's disease (AD) using neuropathologically confirmed AD cases and controls. This PRS will help assess AD risk in other neurological and psychiatric conditions and identify potential risk factors, considering genetic predisposition. The study will use Whole Genome Sequencing (WGS) and microarray data from AD and non-AD (ND) samples. A genome-wide association study (GWAS) will generate the necessary genetic data for PRS calculation. Quality control and filtering will be applied to ensure the accuracy of the genetic data. The PRS will be calculated using genome-wide significant SNPs and analyzed with PRSice-2 software. The score will be applied to various neurological and psychiatric datasets to evaluate the risk of AD and explore potential risk factors.
Investigator:
Plagnol, Vincent
Institution:
Genomics
Project Title:
Integrative analysis of genetic association data for individual genome interpretation and translational biological research
Date of Approval:
July 15, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Genomics aims to transform the development of new medicines using the power of genomics. Our in-house research engine contains the largest collection of human genotype and phenotype data available, and we are continuously seeking to expand it with the aim of understanding the nature and extent of pleiotropy in the human genome and its impact on human health and disease. One area of focus for Genomics is the generation of polygenic risk scores (PRSs) to inform personalised treatment and screening pathways. It is a widely understood issue that PRS performance in individuals with non-European genetic ancestries is lower than that in individuals with European genetic ancestries, which then impacts the utility of Integrated Risk Tools (IRTs) that combine PRSs with non-genetic factors to generate a measure of absolute disease risk. Our previously published work has already investigated this issue (see Weale et al. 2021, https://doi.org/10.1016/j.amjcard.2021.02.032). To address these issues further, we intend to use these GWAS data to improve the performance of our PRS and subsequently our IRTs by combining them with data that are already available to us. We also plan to combine these data with other in-house resources, to better detail the mechanistic link between genetics and health outcomes. This research will improve the identification and characterisation of novel drug targets. Bespoke analyses within patient subgroups stratified based on major risk alleles or combinations of different endpoints will enable better definition of the patient populations that would benefit from therapeutic interventions, and more precise evaluation of potential safety concerns. We will also use these individual level data to explore training and testing IRTs (using independent subsets of the data), which combine PRS with the individual level phenotypes available within ADSP including cognition scores, fluid biomarkers and neuroimaging data. 
Finally, for individuals within the randomised controlled trial cohorts we will explore whether constructed PRS and IRTs are associated with drug response using the available phenotype measures.
Non-Technical Research Use Statement:
Genetic data has the potential to transform health care, by better predicting diseases, understanding and recommending treatment options. This is particularly critical for Alzheimer’s disease, which greatly benefits from early detection. However, much of the data underpinning this revolution have been generated in individuals of European descent. These data cannot be readily used in non-European populations, which limits broad adoption and widens an already existing gap in access to modern genomic technologies. With this project we are interested in assessing how these genetic predictors can be used in individuals of diverse ancestry. We want to improve the design of these tools so that they become more precise, and usable by a wider range of individuals.
Investigator:
Pottier, Cyril
Institution:
Mayo Clinic
Project Title:
Genetics of Young Onset Alzheimer's Disease
Date of Approval:
March 14, 2023
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Historically, young onset Alzheimer’s disease (YOAD) is defined as onset of clinical symptoms before the age of 65, and 90% of such patients do not carry mutations in the three main Aβ-related genes (APP, PSEN1 and PSEN2). The objective of this project is to deeply characterize known and new genetic components of YOAD in the largest pathologically confirmed cohort in the world and to evaluate the impact of SNVs, SVs and repeat expansions. Due to their earlier onset age and strong heritability, we hypothesize that YOAD patients are enriched in rare pathogenic variants within the Aβ and tau pathways. In addition, patients with YOAD are often misdiagnosed with frontotemporal dementia (FTD) due to clinical symptom overlap. In that context, we also hypothesize that there is genetic overlap between both diseases. We have generated whole-genome sequencing data from over 900 YOAD patients, including more than 400 autopsy confirmed YOAD cases, over 1000 FTD patients and 800 controls. We are requesting access to the ADSP whole-genome sequencing data (raw and VCF) to i) perform gene-based, single variant and pathway association analyses in ADSP YOAD and late onset data to replicate our findings, and ii) increase our YOAD cohort size for assessing the overlap and differences between FTD and YOAD patients. Single nucleotide variants, as well as structural variants, will be assessed. To do so, we will use the already generated SNV VCFs but also perform structural variant calling using our Mayo pipeline. We will utilize several commonly used software programs, such as Plink-seq and the SKAT package, to perform our association analyses. All analyses will be done at the single variant, gene, structural variant, and pathway levels. Using these approaches, we hope to identify novel mutations/genes/pathways that are related to both AD and FTD and will benefit the larger scientific community working on neurodegenerative disorders.
Non-Technical Research Use Statement:
We aim to identify new risk factors for young onset Alzheimer’s disease (age at onset before 65). To do so we use deep phenotyping and genetic approaches. Upon completion of our work, we will obtain a comprehensive understanding of young onset Alzheimer’s disease genetics. Altogether our project will identify potential new therapeutic targets for young onset Alzheimer’s disease and will pave the way for individualized therapy development not only for young onset Alzheimer’s disease, but also for the more common late onset AD.
Investigator:
Rademakers, Rosa
Institution:
Mayo Clinic
Project Title:
Frontotemporal lobar degeneration association study using sequence data
Date of Approval:
October 8, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
We hypothesize that patients with AD and FTD share common genetic factors. We generated whole-genome sequencing data from over 500 FTD patients and 800 controls and identified new genetic variants associated with FTD. Our findings add to the growing body of genetic factors associated with both diseases (TREM2, MAPT, GRN, and C9ORF72). We have now extended our cohort of whole-genome sequenced FTD and controls. We are requesting access to the ADSP whole-genome sequencing data (raw and VCF) to i) use the ADSP controls in order to increase our statistical power to detect risk variants in FTD by increasing the size of our control set, and ii) to perform gene-based, single variant and pathway association analyses in ADSP data with candidate FTD/ALS genes to determine their impact across dementias. Single nucleotide variants, as well as structural variants, will be assessed. To do so, we will call variants combining our FTD data with ADSP data starting from the raw files and using the same pipeline as the one already used for our FTD genome data. ADSP raw files will be processed through the Mayo Clinic Genome-GPS (GGPS) analytic pipeline and the ANNOVAR variant annotation pipeline. In addition, we will also run the ADSP’s data with specific pipelines more accurate for complex genomic regions such as the HLA risk region, e.g. HISAT2. We will also use new genotyping technologies such as long-read sequencing to impute structural variants in the ADSP dataset and compare them to our findings in our FTD cohort. We will utilize several commonly used software programs, such as Plink-seq and SKAT package, to perform our association analyses. Control data will be used to perform a large association study on FTD patients and controls. Then, association studies within the ADSP dataset will be performed. All analyses will be done at the single variant, gene, structural variant, and pathway levels. 
Using these approaches, we hope to identify novel mutations/genes/pathways that are related to both AD and FTD and will benefit the larger scientific community working on neurodegenerative disorders.
Non-Technical Research Use Statement:
Frontotemporal dementia (FTD) is the second most common form of early-onset dementia after Alzheimer’s disease (AD). While AD patients initially present with memory problems, FTD patients usually present with changes in personality and behavior and sometimes language problems. However, clinical and genetic overlaps between both diseases have been reported. We propose to perform in-depth genetic studies to identify genetic factors that cause or increase the risk for FTD and comprehensively assess the genetic overlap between AD and FTD. The identification of new disease genes will provide novel insight into the disease biology; may improve genetic counseling in patients, and could provide new targets for future therapeutic interventions.
Investigator:
Raffield, Laura
Institution:
University of North Carolina at Chapel Hill
Project Title:
Genomic and Multi-Omic Analysis of Alzheimer's Disease in Diverse Populations
Date of Approval:
March 27, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We are requesting data from the NIAGADS portal for utilization in our project concerning genomic and multiomic analysis of Alzheimer's disease in diverse populations. This is relevant to funded NIA grant R01AG075884 as well as pending NIA applications. We will use deidentified data from NIAGADS, as well as from other data sources including the TOPMed consortium and newly generated data from cohorts such as the Jackson Heart Study, to identify putative risk factors and biological mechanisms for identified genetic loci for Alzheimer's disease risk. We will utilize statistical and genetic analysis methods (including polygenic risk score construction, colocalization using GWAS summary statistics, and association analysis for gene transcripts, proteins, and metabolites) to identify putative risk factors for Alzheimer's disease.
Non-Technical Research Use Statement:
Black Americans face a disproportionately high risk of developing cognitive impairment and Alzheimer’s disease and related dementias (ADRD), compared to non-Hispanic White adults, but the biological mediators underlying this elevated risk are not well understood. Additionally, most efforts to identify risk biomarkers have not included diverse populations, making results less relevant to all Americans. High throughput multi-omics data from blood samples in diverse participants, including through newly funded grants and existing funded data from NIA, may allow us to identify predictors of ADRD and incident cognitive impairment risk across diverse US populations, including in Black adults underrepresented in ADRD research. Integration of genetic data may allow us to further clarify the genes, proteins, and metabolites through which Alzheimer’s disease genetic risk variants function, as well as improve risk prediction in populations with substantial non-European ancestry.
Investigator:
Raffington, Laurel
Institution:
Max Planck Institute for Human Development
Project Title:
DNA-methylation profiles of child development in adult aging
Date of Approval:
June 3, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: Biological aging has been quantified in DNA-methylation samples of older adults and applied as so-called “methylation profile scores” (MPSs) in separate target samples, including children. This research indicates that biological aging is affected by childhood environments and shows overlap with developmental processes. Because the MPSs were computed using algorithms developed in adults, these studies indicate a molecular link between childhood environments, development, and adult biological aging. Yet, if MPSs can be used to connect development and aging, previous research has only traveled one way, deriving MPSs developed in adults and applying them to samples of children. Researchers have not yet probed whether MPSs that reflect childhood development are associated with physical and psychological aging later in life. Study design: Here we examine whether MPSs quantifying childhood socioeconomic contexts and developmental processes – computed in adults – are related to their adult health. We will use published algorithms developed in child methylation samples and apply them to methylation data from the HRS. We focus on MPSs of socioeconomic contexts, age, gestational age, birthweight, and pubertal timing measured in children and adolescents. We will test whether these MPSs computed in adults are related to their health, mortality, retrospective reports of childhood poverty, as well as their MPSs of biological aging. Analysis plan: In preregistered multiple regression analyses, we will test for associations of MPSs of childhood exposures and development with physical health (BMI, menopause, functional limitations, multimorbidity, mortality), mental health (cognitive impairment, depression), childhood poverty index, and biological aging (PhenoAge, GrimAge and DunedinPACE). Sex and cell composition will be included as covariate controls, and we will test whether associations hold after accounting for substance use, racial identity, and adult SES.
Non-Technical Research Use Statement:
Biological aging has been quantified in DNA-methylation samples of older adults and applied as so-called “methylation profile scores” (MPSs) in separate target samples, including samples of children. Research shows that biological aging is affected by childhood environments and shows overlap with “developmental processes” (e.g., puberty, cognition). Because the MPSs were computed using algorithms developed in adults, these studies indicate a molecular link between childhood environments, development, and adult biological aging. Researchers have not yet probed if MPSs that reflect exposures in childhood or child development are associated with physical and psychological aging later in life. Here we examine whether MPSs quantifying childhood socioeconomic contexts and child developmental processes – computed in adults from the Health and Retirement study – are related to their adult physical and psychological health. By applying MPSs developed in child samples to older adults, we can potentially unravel how early-life factors contribute to later-life health and aging.
Investigator:
Raj, Towfique
Institution:
Icahn School of Medicine at Mount Sinai
Project Title:
Learning the Regulatory Code of Alzheimer's Disease Genomes
Date of Approval:
January 7, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Our overarching objective is to apply machine learning techniques to predict and interpret the functional effects of genetic variants, including single nucleotide variants (SNVs), indels and structural variants (SVs), from AD WGS data at the levels of DNA regulation and RNA processing, and to link these effects directly to pathways and network context. We will leverage WGS generated by the ADSP and others together with harmonized endophenotypes and clinical data, multi-omics data from the AMP-AD, functional genomics data from the Roadmap Epigenomics, PsychENCODE and GTEx Projects, and microglia- and monocyte-specific transcriptomic and single-cell RNA-seq data sets. Our central hypothesis is that many AD-associated genetic risk or protective variants influence pre- and post-transcriptional gene regulation, resulting in changes to gene expression and cellular pathways/networks, and ultimately contribute to protein aggregation in AD. The objective of this aim is to leverage deep-learning-based models capable of predicting functional effects of genomic variants on pre- and post-transcriptional gene regulation. We will train existing and novel sequence-based deep learning models of epigenomic state and RNA regulation and processing specific to AD-relevant cell types and states. In silico mutagenesis under these trained models will be used to calculate functional impact “delta scores” for every SNV, indel and structural variant (SV) detected from AD WGS. We will use these delta scores to empower non-coding rare variant tests of association with AD at the regulatory region, gene and pathway levels. We will conduct functional fine-mapping through the integration of (i) the CNN delta scores, (ii-iii) expression and splicing quantitative trait loci (eQTL and sQTL), (iv) AD endophenotypes and (v) multi-ethnic AD WGS data. We will use probabilistic ML methods, combined with cell-type-specific and single-cell RNA-seq datasets, to build gene regulatory networks.
This NIH-funded project is a close collaboration with Dr. David Knowles at the New York Genome Center/Columbia University.
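As an illustration of the in silico mutagenesis step, a delta score can be read as the model's prediction for the variant-carrying sequence minus its prediction for the reference sequence. The sketch below is a minimal example under stated assumptions: `toy_model` is a hypothetical stand-in (GC fraction) for a trained sequence model, and the function names and the example window are not taken from the project.

```python
from typing import Callable


def apply_variant(sequence: str, pos: int, ref: str, alt: str) -> str:
    """Return `sequence` with `ref` at 0-based `pos` replaced by `alt`.

    Covers SNVs and simple indels written in a VCF-like ref/alt form.
    """
    if sequence[pos:pos + len(ref)] != ref:
        raise ValueError("reference allele does not match the sequence")
    return sequence[:pos] + alt + sequence[pos + len(ref):]


def delta_score(model: Callable[[str], float], sequence: str,
                pos: int, ref: str, alt: str) -> float:
    """Prediction for the mutated sequence minus prediction for the reference."""
    return model(apply_variant(sequence, pos, ref, alt)) - model(sequence)


def toy_model(sequence: str) -> float:
    """Hypothetical placeholder for a trained network: fraction of G/C bases."""
    return sum(base in "GC" for base in sequence) / len(sequence)


window = "ACGTACGTAC"  # made-up reference window
snv_delta = delta_score(toy_model, window, 3, "T", "G")         # T>G substitution
deletion_delta = delta_score(toy_model, window, 4, "AC", "A")   # 1-bp deletion
```

In practice the placeholder would be replaced by a trained model's prediction function, and the same subtraction would be applied per predicted track (for example, per cell type).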
Non-Technical Research Use Statement:
Despite decades of research and enormous investment, no disease-modifying treatment is available for Alzheimer’s disease (AD). Combining population-scale data collection, human genetics and machine learning provides a way forward to uncover and characterize new causal cellular processes involved in AD. Effectively integrating diverse genomic data to better understand AD represents a substantial computational challenge, both in terms of data scale and analysis complexity. We will train machine learning models to predict epigenomic signals from genomic sequences in order to estimate the functional impact of any genetic variant. These analyses will highlight variants and genes involved in AD. However, genes do not operate in a vacuum, so robust machine learning will be used to learn cell-type- and disease-specific networks. Such networks will be prime candidates for future functional and therapeutic studies of AD.
Investigator:
Ratnapriya, Rinki
Institution:
Baylor College of Medicine
Project Title:
Microglia-associated expression quantitative trait loci (eQTLs) and causal variants relevant to Age-related Macular Degeneration (AMD)
Date of Approval:
July 15, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The objectives of our genomics study are to analyze eQTLs from microglia (resident immune cells) and to identify causal variants in neurodegenerative disease. We will perform a robust eQTL analysis by integrating all available human microglia genotype and gene expression datasets, restricted to samples with a normal (healthy) phenotype. We will then investigate the identified microglia eQTLs for association with a neurodegenerative disease, namely age-related macular degeneration (AMD). We will run the eQTL detection software on genotype and gene expression data from the healthy-phenotype samples. After selecting the conditionally significant eQTL associations, we will test for pleiotropic association between the expression level of a gene and a complex trait of interest using summary-level data from AMD GWAS and the analyzed eQTLs. We will also perform genetic colocalisation analysis to test whether potentially related phenotypes share common causal variant(s) in a given region.
Non-Technical Research Use Statement:
Dysregulated immune function and neuroinflammation have become recognized as common underlying mechanisms in aging and various neurodegenerative diseases, including AMD. Microglia serve as the resident immune cells in both the brain and retina, akin to macrophages, and have recently been identified as significant contributors to AMD pathogenesis. Our objective is to access brain microglia data to investigate microglia-associated expression quantitative trait loci (eQTLs) and causal variants relevant to AMD. We plan to utilize existing genotype and gene expression data from microglia to perform eQTL analysis.
Investigator:
Ravassard, Philippe
Institution:
ICM Paris Brain Institute
Project Title:
lnc-AD : Organoid-based lncRNA discovery platform for Alzheimer's Disease
Date of Approval:
March 4, 2025
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: Our research aims to discover novel long non-coding RNAs (lncRNAs) involved in familial and sporadic forms of Alzheimer’s Disease (AD). Using a patient-derived organoid platform, we previously identified thousands of lncRNAs associated with AD, the majority of which were unknown. Leveraging sequencing data from the NIAGADS database, along with data obtained from the DIAN database (Dominantly Inherited Alzheimer Network), we aim to validate those novel lncRNAs and identify which may be involved in the disease process. Design: Aim 1: Using bulk and single-cell RNAseq datasets, we will assess the expression of the lncRNAs of interest in patient samples, and identify those which are dysregulated in familial and sporadic AD patients. The single-cell sequencing datasets will allow us to achieve cell-type resolution. Aim 2: Using whole-genome sequencing (WGS) from patients and controls, we will determine if some of those lncRNAs are associated with known SNPs or rare variants linked to AD. Altogether, this work will advance our understanding of the genetic bases of AD by shedding light on an underexplored genomic category which may play causative roles in disease development, and provide novel therapeutic targets. Analysis plan: Raw files from the bulk (NG00083, previously obtained) and single-cell (NG00108) RNAseq datasets will be aligned to a custom genome containing the novel lncRNAs we identified. To undertake differential analyses, we will need to access phenotypic information regarding disease status, as well as risk-modifying variant status. RIN, age at death, post-mortem interval, and sex may be used for correcting expression data. We have previously obtained approval from DIAN to access their datasets. All these datasets were previously analyzed together in two recent publications (PMIDs: 29880032 and 37085492).
To appropriately make use of the WGS data from the AD sequencing project (NG00067), we will need to access information regarding age at disease onset, educational attainment, and available clinical scores. All data will be processed locally, and results from our analyses will be shared in agreement with the policies.
Non-Technical Research Use Statement:
Our research is focused on finding new long non-coding RNAs (lncRNAs) that play a role in Alzheimer’s Disease (AD). lncRNAs are an understudied type of gene that is nevertheless known to be implicated in many biological functions. We previously found many unknown lncRNAs linked to AD using a patient-derived organoid platform. We now wish to validate these findings using molecular data from the NIAGADS and DIAN databases. We have two main goals: 1. We will use RNA sequencing data to study the expression of these lncRNAs in patients and identify which ones are abnormal. Single-cell sequencing will help us see how these lncRNAs behave in different cell types. 2. We will use whole-genome sequencing to check if these lncRNAs are connected to common or rare genetic variants known to be associated with AD. This work will improve our understanding of the genetic causes of AD and of lncRNA biology, and may reveal new targets for treatment.
Investigator:
Reitz, Christiane
Institution:
Columbia University
Project Title:
Endolysosomal Pathways in Alzheimer's Disease
Date of Approval:
May 30, 2023
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Endosomal dysfunction is implicated in the pathogenesis of Alzheimer’s disease (AD). Variants in SORL1, which encodes a receptor for the retromer complex (the master regulator of endosomal trafficking), have been strongly linked to AD. Disrupting retromer-SORLA pathway-mediated regulation of endosomal trafficking triggers AD’s molecular/cellular pathologies in various ways: i) disrupting the retromer-SORLA complex reduces glutamate receptor recycling to the surface of dendritic spines, triggering the early stages of the neurodegenerative process; ii) with SORLA as the intermediary, retromer traffics APP away from its amyloidogenic cleavage in early endosomes. Retromer core or SORLA depletion causes an endosomal traffic jam and thereby enhances APP processing to Aβ; iii) dysfunction of the retromer trafficking pathway, by depleting either SORLA or VPS26b, accelerates tau secretion; iv) a recent study showed that retromer dysfunction in neurons triggers morphological alterations that phenocopy microglia abnormalities observed in the AD brain, and that the microglial pathology can be partially rescued by neuronal retromer gene therapy. These studies provide a strong rationale to investigate the retromer-SORLA pathway as a functional druggable target to slow the course of AD by restoring endosomal function. A critical step for this is to identify individuals carrying pathogenic variants in SORL1 and other endosomal trafficking genes and characterize their (endo)phenotypic profiles including cognitive clinical course and AD biomarker profiles. The goal of this proposal is to determine which SORL1 variants and variants in other endosomal trafficking genes are truly pathogenic and may be valuable drug development candidates by identifying variant carriers in ADSP WGS data and assessing critical endophenotypes in these individuals including cognitive profiles and biofluidic biomarker profiles.
Non-Technical Research Use Statement:
Intracellular trafficking of critical proteins is an important mechanistic pathway in the pathogenesis of Alzheimer’s disease (AD). A master regulator of intracellular endosomal trafficking is the retromer complex. An increasing number of studies indicate that the retromer may be a valuable functional druggable target to slow the course of AD, and a critical step for this is to identify individuals carrying pathogenic variants in retromer-related genes and characterize their (endo)phenotypic profiles including cognitive clinical course and AD biomarker profiles. The goal of this proposal is to determine which genetic variants in endosomal trafficking genes are truly pathogenic and may be valuable drug development candidates by identifying variant carriers in ADSP WGS data and assessing critical endophenotypes in these individuals including cognitive profiles and biofluidic biomarker profiles.
Investigator:
Reitz, Christiane
Institution:
Columbia University
Project Title:
U01AG079850_Genetics of neuropsychiatric symptoms in AD
Date of Approval:
July 8, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Neuropsychiatric Symptoms (NPS) (e.g. aggression, psychosis, anxiety, apathy, depression, agitation, sleep disturbances, repetitive behaviors) occur in 85% of AD patients, and are associated with greatly increased suffering of patients and families. Despite this, our understanding of the etiology of NPS in AD is inadequate, with treatments for NPS often being ineffective and associated with serious adverse effects. This knowledge gap is particularly egregious in underserved racial and ethnic groups. The aim of the current project (U01AG079850) is to collate, harmonize, and analyze the AD-associated NPS data collected on ADSP/ADSP-FUS samples. We plan to (1) expand the racially and ethnically diverse datasets of the ADSP-FUS and related efforts to include harmonized NPS data, creating the largest and most diverse genomic resource on NPS in AD to date and allowing researchers to assess a wide range of additional critical hypotheses through these resources; (2) utilize these harmonized data to identify and describe genetic determinants, pathways, and polygenic effects underlying specific NPS in AD; (3) explore the shared genetic architecture across AD-associated NPS and with primary psychiatric disorders; and (4) disentangle the role of ancestry in NPS genetic risk. These analyses will include, in particular, early-onset samples already recruited and whole-genome sequenced under the READR, EFIGA and NIA-FBS initiatives, which have a particularly high prevalence of NPS. We anticipate that this work will lead to a better understanding of the genetic basis of NPS in AD, which is vital to infer the mechanistic pathways underlying these highly disabling symptoms and develop more effective pharmacological targets. To collate NPS data on all ADSP-FUS cohorts, we closely collaborate with the ADSP-Phenotype Harmonization Consortium. Creation of refined harmonized NPS phenotypes will be conducted by Dr. Ted Huey’s group at Brown University.
Genomic data analyses will be conducted by Dr. Reitz’s group at Columbia University and Dr. Beecham’s group at the University of Miami.
Non-Technical Research Use Statement:
Although neuropsychiatric symptoms (e.g. aggression, psychosis, anxiety, apathy, depression, and sleep disturbances) occur in ~85% of Alzheimer disease patients and are associated with accelerated decline, increased cost, out-of-home placement, and greatly increased suffering of patients and families, our understanding of their etiology is still inadequate, with treatments often being ineffective and even associated with serious adverse effects (including increased mortality). This knowledge gap is particularly egregious in underserved racial and ethnic groups such as Hispanics and African-Americans. We propose to expand the racially and ethnically diverse ADSP-FUS and related resources to include harmonized neuropsychiatric symptom data allowing researchers to assess a variety of additional critical hypotheses, and to utilize these harmonized data to identify ancestry-specific genetic determinants, molecular pathways, and polygenic effects underlying neuropsychiatric symptoms in Alzheimer disease.
Investigator:
Rexach, Jessica
Institution:
UCLA
Project Title:
Defining the unique immunogenetic landscape of PSP compared to related dementias
Date of Approval:
November 26, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objective: This research centers on the investigation of the genetic aspects of Progressive Supranuclear Palsy (PSP) compared to Alzheimer's Disease (AD) and unaffected controls through the analysis of whole genome sequencing data. The objective of this study is to comprehensively analyze the immunogenetic landscape in PSP and AD individuals compared to a control group in this dataset.
Non-Technical Research Use Statement:
In our study, we aim to decipher the genetic underpinnings of Progressive Supranuclear Palsy (PSP) related to immune function. We will use sequencing data to run genomics analyses that are focused on a specific region of DNA called the HLA locus and additional immune genes, which play crucial roles in our immune system. Our goal is to compare the genetic makeup of the HLA locus in people affected by PSP with that of people with Alzheimer’s disease (AD) and the general population. In simple terms, we're looking for clues in the DNA that could explain how changes in the immune system might influence why some individuals develop these conditions.
Investigator:
Rice, Kenneth
Institution:
University of Washington
Project Title:
Alzheimer’s Disease Sequencing Project Coordinating Center
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The Alzheimer’s Disease Sequencing Project Coordinating Center (ADSP CC) is requesting access to the ADSP umbrella dataset and related data to support the CC’s general aims of scientific and administrative support of the ADSP. We propose to: 1. Analyze the molecular and/or phenotype data to identify potential collaborative research projects and develop appropriately protected data sets for ADSP-related training opportunities or workshops to advance the ADSP’s programmatic goals. 2. Analyze (or provide input on analysis plans of) molecular and/or phenotype data to generate resources such as QC filters/metrics, ancestry PCs, local ancestry inference, relatedness measures, annotations, and standardized phenotypes. 3. Explore the creation of an ADSP catalog with summaries of available data and resources shared with the scientific community to facilitate discovery and use of ADSP data. This catalog would be limited to summary-level information that can be shared publicly. 4. Complete a landscape analysis of ADSP and related data, including data formats, elements, and documentation that are behind access control, and identify opportunities to support data interoperability and partnerships with other AD/ADRD research programs. This could help us develop and implement a data model that defines standards and structure for data and metadata. 5. Investigate the feasibility of performing analyses of ADSP data in secure, NIH-designated cloud-based platforms that meet the NIH Security Best Practices for Users of Controlled-Access Data (see Cloud Use Statement and Cloud Server Provider Information). Accessors will not use the requested data to investigate individual identity in any analyses. Consent and other Data Use Limitations (DUL) for each study will be respected in all analyses. This application will support collaboration between ADSP CC investigators at the University of Washington and MPIs Anita DeStefano and Michael Hawrylycz.
Non-Technical Research Use Statement:
The Alzheimer’s Disease Sequencing Project Coordinating Center (ADSP CC) will characterize and analyze the available ADSP and related data to facilitate broader use of this valuable resource. The CC’s activities will include developing documentation, analysis resources, and training materials.
Investigator:
Richards, Brent
Institution:
The Lady Davis Institute
Project Title:
Comparative Analysis of Plasma Proteomic Profiles in Healthy Adults and Individuals with Neurodegenerative Diseases
Date of Approval:
September 3, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: (1) Identify disease-specific pQTLs; (2) identify age-specific pQTLs; (3) identify causal disease pQTLs through Mendelian randomization & colocalization. Study design & analysis plan: • Disease-specific analyses: We will use linear or logistic regression to ascertain the role of pQTLs in disease. We will investigate the associations of the 7,028 aptamers in each group separately: (i) 1,076 cognitively normal individuals, (ii) 1,001 clinically diagnosed Alzheimer’s disease patients, and (iii) 1,030 individuals with other neurodegenerative diseases. Each linear model will be adjusted for age, sex, and disease-relevant confounders. • Age-specific analyses: Again using linear or logistic regression models, we will stratify our analysis by age, setting a threshold to distinguish individuals considered old and young, which will allow us to detect whether age affects specific pQTLs. • Causal pQTLs through Mendelian randomization & colocalization: A recent study in the UKB claimed a predictive and causal role for GFAP, NEFL, GDF15 and LTBP2 in Alzheimer’s disease [1]. We will use the R package TwoSampleMR [2] to estimate the causal effect of pQTLs on disease outcomes. We will focus on cis-pQTLs since they are less prone to bias from horizontal pleiotropy. For a single SNP, the Wald ratio will be calculated, and for multiple SNPs the inverse variance weighted (IVW) approach will be employed. We will select independent SNPs (r2 < 0.001) and perform the necessary harmonization steps for the MR analysis. Finally, to account for linkage disequilibrium (LD) bias, we will perform a colocalization analysis using both coloc [3] and coloc.susie [4].
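For readers unfamiliar with the two MR estimators named above, the following is a minimal sketch of the textbook formulas, not the TwoSampleMR implementation (which is an R package with additional options). It assumes harmonized per-SNP effect estimates; all numbers and names are made up for illustration. The Wald ratio is the SNP-outcome effect divided by the SNP-exposure effect, and the fixed-effect IVW estimate is a weighted average of those ratios with weights proportional to the squared exposure effect over the outcome variance.

```python
import math
from typing import List, Tuple


def wald_ratio(beta_exposure: float, beta_outcome: float,
               se_outcome: float) -> Tuple[float, float]:
    """Single-instrument causal estimate and its first-order standard error."""
    estimate = beta_outcome / beta_exposure
    std_error = se_outcome / abs(beta_exposure)
    return estimate, std_error


def ivw_fixed_effect(beta_exposure: List[float], beta_outcome: List[float],
                     se_outcome: List[float]) -> Tuple[float, float]:
    """Fixed-effect inverse variance weighted estimate across independent SNPs."""
    # Weight for each SNP: squared exposure effect divided by outcome variance.
    weights = [bx ** 2 / se ** 2 for bx, se in zip(beta_exposure, se_outcome)]
    numerator = sum(bx * by / se ** 2
                    for bx, by, se in zip(beta_exposure, beta_outcome, se_outcome))
    estimate = numerator / sum(weights)
    std_error = math.sqrt(1.0 / sum(weights))
    return estimate, std_error


# Hypothetical summary statistics for two independent cis-pQTL instruments.
bx = [0.10, 0.20]   # SNP effect on protein level
by = [0.05, 0.10]   # SNP effect on disease (e.g. log odds)
se = [0.01, 0.02]   # standard error of the SNP-disease effect

single_estimate, single_se = wald_ratio(bx[0], by[0], se[0])
ivw_estimate, ivw_se = ivw_fixed_effect(bx, by, se)
```

Both instruments here imply the same ratio, so the IVW estimate coincides with the single-SNP Wald ratio; with real data the ratios differ and the weighting matters.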
Non-Technical Research Use Statement:
The study aims to understand how specific genetic variations related to proteins impact neurological diseases, particularly Alzheimer's. Over 7,000 protein measurements will be analyzed in three groups: healthy individuals, Alzheimer's patients, and those with other neurodegenerative diseases. The effect of age on these genetic variations will also be explored. The study will replicate results from a recent proteomic study in the UK Biobank, using advanced statistical methods to assess the causal role of these genetic markers in Alzheimer's. The study will prioritize certain genetic variations and use different statistical techniques to analyze their effects. To ensure accuracy, it will also address potential biases in the analysis.
Investigator:
Ridge, Perry
Institution:
Brigham Young University
Project Title:
Alzheimer's Disease Genetics: mitochondrial, haplotype-based analyses, and genetic replication
Date of Approval:
January 14, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: In our lab we are working on a variety of Alzheimer’s disease (AD) genetics research projects: mitochondrial genetics of AD, haplotype-based analyses in the nuclear genome, AD genetics in Pacific Islanders (PI), and replication of disease variants identified in our novel dataset (the Cache County Study). Our objectives include: 1) develop a large dataset of AD mitochondrial genomes (~60,000 total samples), 2) identify mitochondrial genetic variants associated with AD, 3) explore the relationship between mitochondrial inheritance of AD and the mitochondrial genome, 4) identify haplotypes that explain observed AD variants, 5) replicate variants we identify in samples from the Cache County Study, and 6) test PI-specific AD genetic variants for association in the races represented in the ADSP. Study Design/Analysis Plan: 1. Using our published approach, assemble, annotate, and deposit in NIAGADS whole mitochondrial genome sequences from the ADSP and a variety of sources. 2. Estimate haplotypes in the nuclear genome using ShapeIt and our own novel tool in development. 3. Using an evolutionary based method, TreeScanning, assess the effects of mitochondrial and nuclear haplotypes associated with AD status, including both risk and protective variation. We have published several papers using this approach to study the relationship between the mitochondrial genome and AD. 4. Also using TreeScanning, conduct association studies between mitochondrial haplotypes and maternal family history of AD. 5. Use standard statistical methods to replicate associations from the Cache County Study and PIs. 6. Use SNP data to estimate genetic variances.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is the most common form of dementia, is fatal, and causes a substantial burden to both affected individuals and loved ones. We seek to push the boundaries of our current understanding by identifying novel targets. Where a majority of approaches focus on single variants in the nuclear genome, we seek to identify mitochondrial-based targets and haplotypes responsible for influencing disease risk in the nuclear genome. We are sequencing whole genomes from the Cache County Study in large risk and protective pedigrees and will use these samples to confirm our findings. In addition to the work described above, we are studying the AD genetics of Pacific Islanders (PIs). PIs are unrepresented in the ADSP. We will evaluate the genetics we discover in PIs in the populations represented in the ADSP and augment the ADSP with PI genomes.
Investigator:
Rose, Evan
Institution:
University of Chicago
Project Title:
Colorism
Date of Approval:
March 14, 2023
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Our study will investigate the impact of skin color-based discrimination (colorism) on socioeconomic and health outcomes. We will do so by measuring how genetic variants that increase melanin production are associated with surveyed outcomes in the Health and Retirement Study (HRS), including employment, income, education, and medical history. The results will add quantitative evidence to the large body of qualitative and critical literature on colorism and enhance our understanding of how colorism contributes to structural inequality. To measure the causal impact of colorism on life outcomes, we will use genetic variant data from the Genotype Data Version 3 (2006-2012 Samples) available via NIAGADS and outcomes from HRS survey questions, such as income and years of education. We will assemble a list of SNPs from prior studies that have been shown to cause darker skin, and study the impacts of these SNPs on individuals in the HRS. The specific SNPs of interest include rs16891982, rs1426654, and rs1800404, which according to correspondence with Amanda Kuzma are available in the Genotype Data (at least via HRC imputation). To estimate effects, we will fit regression models that relate life outcomes to the presence of SNPs while controlling for any confounding variables, including genetic principal components. If colorism leads to worse life outcomes, we would expect to see a negative slope between the effect size of the SNP and the predicted outcomes. Our analysis can be considered a version of Mendelian randomization (MR). We will perform the work in Python and R. Our study will bring new statistical evidence to a large body of work demonstrating that colorism is a widespread form of discrimination in America. Using Mendelian randomization with variants that modulate genetically predisposed skin color, we can directly isolate the causal effect of colorism on social inequalities.
Our findings have the potential to support the lived experiences of people who experience skin color-based discrimination and improve public knowledge that colorism is an important contributor to inequality in our society, and one that we can better address through policy.
Non-Technical Research Use Statement:
Our study will investigate the impact of skin color-based discrimination (colorism) on socioeconomic and health outcomes. We will do so by measuring how genetic variants that increase melanin production are associated with surveyed outcomes in the Health and Retirement Study (HRS), including employment, income, education, and medical history. The results will add quantitative evidence to the large body of qualitative and critical literature on colorism and enhance our understanding of how colorism contributes to structural inequality. Our findings have the potential to support the lived experiences of people who experience skin color-based discrimination and improve public knowledge that colorism is an important contributor to inequality in our society, and one that we can better address through policy.
Investigator:
Rosso, Andrea
Institution:
University of Pittsburgh
Project Title:
LEGENNDS
Date of Approval:
November 26, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The primary outcome variable of interest is the presence of dementia as determined from the core HRS interview, which includes a Telephone Interview for Cognitive Status (TICS). Secondary outcomes of interest include cognitive scores and change in scores. The primary predictor variables of interest are polygenic risk scores for autism, ADHD, and intelligence. The covariates considered in this study include age at study entry, sex, race/ethnicity, socioeconomic status at baseline, education, and genetic ancestry. Cox proportional hazards models will evaluate time to event, operationalized as the time from study entry to the first instance of dementia status. Those who do not receive a status of dementia by the end of the follow-up period will be treated as censored. Similarly, individuals who die before receiving a diagnosis of dementia, and those who are lost to follow-up, will also be censored. Each censored individual will have a time to event measured from study entry to the time of censoring. Statistical models will estimate the association of polygenic risk scores for autism, ADHD, and intelligence with dementia, cognitive scores, and change in scores. This is a collaboration with Drexel University (PI: Brian K Lee).
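The censoring rules described above can be sketched as a small function that turns each participant's record into a (duration, event) pair, the input a Cox proportional hazards model expects. This is an illustrative sketch only: the field names, the use of calendar years, and the example records are assumptions, not HRS variable names, and no model is fitted here.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Participant:
    entry_year: float                # study entry
    dementia_year: Optional[float]   # first dementia status, if any
    death_year: Optional[float]      # death, if observed
    last_contact_year: float         # last completed interview


def time_to_event(p: Participant, study_end_year: float) -> Tuple[float, bool]:
    """Return (follow-up duration, event indicator) for one participant.

    Event is True only if dementia is observed by the end of follow-up.
    Otherwise the participant is censored at the earliest of death,
    last contact (loss to follow-up), or the end of the study.
    """
    if p.dementia_year is not None and p.dementia_year <= study_end_year:
        return p.dementia_year - p.entry_year, True
    censor_times = [study_end_year, p.last_contact_year]
    if p.death_year is not None:
        censor_times.append(p.death_year)
    return min(censor_times) - p.entry_year, False


# Hypothetical records covering the cases named in the statement.
with_dementia = time_to_event(Participant(2000, 2008, None, 2012), 2014)
died_first = time_to_event(Participant(2000, None, 2006, 2006), 2014)
lost_to_follow_up = time_to_event(Participant(2000, None, None, 2010), 2014)
followed_to_end = time_to_event(Participant(2000, None, None, 2016), 2014)
```

The resulting pairs, together with the polygenic risk scores and covariates, would then be passed to a survival analysis routine.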
Non-Technical Research Use Statement:
Emerging evidence suggests that certain neurodevelopmental disorders – autism, attention-deficit/hyperactivity disorder (ADHD), and intellectual disability – may increase the risk of later-life neurodegenerative disorders such as Alzheimer’s disease or related dementias (ADRD). The goal of this proposed study is to elucidate the genetic link between autism, ADHD, and intellectual disability and ADRD. As part of this, we will examine the relationship between genetic susceptibility to neurodevelopmental disorders and future risk of ADRD. This study will make use of genetic data from over 18,000 participants in the Health and Retirement Study.
Investigator:
Roussos, Panagiotis
Institution:
Icahn School of Medicine at Mount Sinai
Project Title:
Higher Order Chromatin and Genetic Risk for Alzheimer's Disease
Date of Approval:
November 21, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Alzheimer's disease (AD) is the most common form of dementia and is characterized by cognitive impairment and progressive neurodegeneration. Genome-wide association studies of AD have identified more than 70 risk loci; however, a major challenge in the field is that the majority of these risk factors are harbored within non-coding regions where their impact on AD pathogenesis has been difficult to establish. Therefore, the molecular basis of AD development and progression remains elusive and, so far, reliable treatments have not been found. The overarching goal of this proposal is to examine and validate AD-related changes in chromatin accessibility and the 3D genome at the single cell level. Based on recent data from our group and others, we hypothesize that genotype-phenotype associations in AD are causally mediated by cell type-specific alterations in the regulatory mechanisms of gene expression. To test our hypothesis, we propose the following Specific Aims: (1) perform multimodal (i.e., within-cell) profiling of chromatin accessibility and the transcriptome at the single cell level to identify cell type-specific AD-related changes in the 3D genome; (2) fine-map AD risk loci to identify causal variants, regulatory regions and genes; (3) functionally validate putative causal variants and regulatory sequences using novel approaches that combine massively parallel reporter assays, CRISPR and single cell assays in neurons and microglia derived from induced pluripotent stem cells; and (4) develop and maintain a community workspace that provides for the rapid dissemination and open evaluation of data, analyses, and outcomes. Overall, our multidisciplinary computational and experimental approach will provide a compendium of functionally and causally validated AD risk loci that has the potential to lead to new insights and avenues for therapeutic development.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) affects half the US population over the age of 85 and, despite decades of research, reliable treatments for AD have not been found. The overarching goal of our proposal is to generate multiscale genomics (gene expression and epigenome regulation) data at the single cell level and perform fine mapping to detect and validate causal variants, transcripts and regulatory sequences in AD. The proposed work will bridge the gap in understanding how risk variants affect enhancer activity and transcript expression, thus illuminating AD molecular mechanisms and providing new targets for future therapeutic development.
Investigator:
Roussos, Panagiotis
Institution:
Icahn School of Medicine at Mount Sinai
Project Title:
GENESIS Consortium
Date of Approval:
May 12, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The objective of the proposed research is to elucidate the cell-type-specific molecular mechanisms through which genetic risk variants contribute to serious mental and neurological disorders (SMND), including Alzheimer's disease (AD), Parkinson’s disease (PD), and schizophrenia (SCZ). While genome-wide association studies (GWAS) have identified thousands of risk loci, the majority are non-coding and lie in regions of high linkage disequilibrium, making it challenging to define causal variants and their biological mechanisms. To overcome these limitations, we propose to establish the BrainCellQTL Consortium—a multi-institutional effort to generate and harmonize brain single-cell expression QTL (eQTL) and other molecular QTL data. This study will aggregate and harmonize over 10,000 single-cell transcriptomic libraries from more than 3,000 unique postmortem human brain donors across multiple cohorts. Our integrative analysis plan involves: (1) applying quality control pipelines to single-cell omics and genotype data, (2) harmonizing datasets and establishing a cross-cohort cell type taxonomy, (3) performing multi-ancestry QTL mapping for gene expression and epigenomic traits, (4) colocalizing molecular QTLs with GWAS loci for SMND, and (5) identifying credible causal variants, genes, and regulatory mechanisms. Phenotypic characteristics of interest include cell-type-specific gene expression and chromatin accessibility patterns across brain regions relevant to SMND, with an emphasis on shared and distinct pathways across disorders. We will also develop a publicly accessible web portal and QTL browser to disseminate results, and provide tools and data to the research community.
Non-Technical Research Use Statement:
The goal of this research is to create a detailed map of how genetic variants influence gene activity in specific brain cell types. This map, called a cell type-specific QTL atlas, will help researchers understand how genetic differences contribute to serious mental and neurological disorders such as Alzheimer’s disease, Parkinson’s disease, and schizophrenia. By combining genetic data with information on gene expression and molecular changes in the brain, we aim to uncover the mechanisms that drive these conditions. This work will also help identify important genes and pathways involved in disease, support future genetic studies, and promote consistent data sharing and collaboration across research groups. The GENESIS Consortium will lead this effort by using cutting-edge single-cell data to overcome past challenges and improve our understanding of brain disorders at the level of individual cell types. The results will be shared with the scientific community through public tools and resources.
Investigator:
Rychkova, Anna
Institution:
Alector
Project Title:
Genetic analysis of Alzheimer’s disease risk factors
Date of Approval:
October 12, 2023
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
At Alector we are focused on developing antibody-based therapies for cancer and neurodegenerative disorders such as Alzheimer's disease. Our main therapeutic hypotheses are that the immune system plays a critical role in neurodegenerative diseases, and that redirecting aberrant immune cell activity in the brain could improve healthy function. We are thus very interested in untangling the role of microglia in Alzheimer’s disease and understanding the underlying biological pathways. Large GWAS of Alzheimer's disease (AD) have uncovered a number of loci that are associated with the disease; however, the mechanism of their involvement in the pathology is largely unknown. To better understand the role of various AD-associated SNPs, we are looking for large datasets with both genotype and transcriptomics data in various cell types, and the Microglia Genomic Atlas study (MiGA) is an excellent resource of such data for microglia. With these data in hand we plan to perform the following analysis: we will test for a linear relationship between AD risk factors (risk allele loads) and mRNA levels to identify transcriptional signatures associated with each SNP. This analysis will be conducted using plink and R, correcting for covariates such as gender, age, and population structure. We will follow with functional annotation using gene set enrichment analysis to further characterize the impact of risk factors. In addition, we are performing similar analyses of various other cell types (monocytes, macrophages, neurons). Through comparative analysis we aim to identify cell-type-specific mechanisms that might be involved in the disease pathology. Overall, mining data from MiGA and other datasets will help us better understand the mechanism of action of AD risk factors and aid Alector with its biomarker selection strategy as well as antibody screening.
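As an illustration only, the covariate-adjusted linear association described in the statement above can be sketched as an ordinary least squares fit of expression on risk allele dosage. The statement names plink and R; this stand-in uses plain Python, and the toy data, the choice of covariates, and the helper names `solve_linear_system` and `ols_fit` are all hypothetical.

```python
from typing import List, Sequence


def solve_linear_system(a: Sequence[Sequence[float]], b: Sequence[float]) -> List[float]:
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b]
    m = [[float(v) for v in a[i]] + [float(b[i])] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("design matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def ols_fit(y: Sequence[float], columns: Sequence[Sequence[float]]) -> List[float]:
    """Fit y = b0 + sum_j b_j * columns[j] via the normal equations.

    Returns [b0, b1, ...] with b0 the intercept.
    """
    n = len(y)
    design = [[1.0] + [float(col[i]) for col in columns] for i in range(n)]
    p = len(design[0])
    xtx = [[sum(design[i][j] * design[i][k] for i in range(n)) for k in range(p)]
           for j in range(p)]
    xty = [sum(design[i][j] * y[i] for i in range(n)) for j in range(p)]
    return solve_linear_system(xtx, xty)


# Toy data, invented for illustration (not from MiGA or any real cohort).
dosage = [0, 1, 2, 0, 1, 2, 0, 1]          # risk allele count per sample
age = [60, 72, 65, 80, 55, 70, 68, 77]     # covariate
sex = [0, 1, 0, 1, 1, 0, 0, 1]             # covariate, arbitrary 0/1 coding
# Expression built as an exact linear function so the fit is easy to check.
expression = [2.0 + 0.5 * d + 0.01 * a - 0.3 * s for d, a, s in zip(dosage, age, sex)]

coef = ols_fit(expression, [dosage, age, sex])
dosage_effect = coef[1]  # expression change per risk allele, adjusted for covariates
```

In a real analysis the per-allele effect would be accompanied by a standard error and a multiple-testing correction across genes, and population structure would typically enter as additional covariate columns; none of that is shown here.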
Non-Technical Research Use Statement:
Understanding the role of myeloid immune cells in neurodegenerative disorders and cancer is central to Alector. Large datasets of samples from patients with Alzheimer's disease and healthy controls are an invaluable resource for scientists striving to understand the biological mechanisms leading to disease and find ways to cure it. The Microglia Genomic Atlas study is one of the rare resources with a large number of microglia samples that have both gene expression and genetic variation data. By performing statistical analysis of this dataset in combination with data from other cell types, we will gain a better understanding of the mechanisms of action of Alzheimer's disease risk factors and help Alector develop treatments for patients.
Investigator:
Sadowski, Martin
Institution:
New York University School of Medicine
Project Title:
APOE4 and Klotho Interaction in AD
Date of Approval:
January 21, 2022
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Objectives of the proposed research: The objective of the proposed research is to analyze the effect of Klotho-VS heterozygosity and other genetic covariates identified during the study on the rate of Alzheimer's Disease progression as measured by neuropsychological data and brain MRI volumes in the context of the APOE genotype. Study design: The design of this study involves the creation of a repository consisting of clinical and genetic data from the participants of the NACC's Uniform Data Set. Only participants who transitioned from a clinical diagnosis of mild cognitive impairment to Alzheimer's Disease will be considered. Analysis plan, including the phenotypic characteristics that will be evaluated in association with genetic variants: This study will conduct a secondary analysis of the neuropsychological data (MMSE, MoCA, CDR-SB, ADAS-11) and brain MRI volumes in association with APOE genotype and other genetic predictors of clinical decline such as Klotho-VS heterozygosity. The effect of Klotho-VS heterozygosity in the context of a specific APOE genotype will be determined using a linear mixed model approach comparing disease progression over time between genetic variables, as well as demographic variables such as sex and age. There will be no collaboration with researchers at other institutions.
Non-Technical Research Use Statement:
The set of genes that controls the rate of Alzheimer's disease (AD) progression, whether by accelerating or attenuating it, is unknown. Our recently published work demonstrated the feasibility of statistical modeling of longitudinal clinical data from AD patients to uncover the effect of certain genes on the rate of disease progression. We showed that AD patients who harbor the e4 allele of the apolipoprotein E gene show an accelerated clinical course of AD in comparison to e4 non-carriers. In this project we plan to use statistical modeling of longitudinal clinical data to correlate the trajectory of disease progression in patients with their individual genetic makeup. The genetic factor whose effect on AD progression we plan to study next is Klotho-VS heterozygosity.
Investigator:
Safo, Sandra
Institution:
University of Minnesota
Project Title:
Innovative Machine and Deep Learning Analyses of Alzheimer's Disease Omics and Phenotypic Data
Date of Approval:
October 27, 2023
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
AD is the most common cause of dementia and presents a substantial and increasing economic and social burden. Our ability to diagnose and classify AD from cognitive normals (CN), or discriminate among individuals with AD, early mild cognitive impairment (EMCI), or late mild cognitive impairment (LMCI), is essential for the prevention, diagnosis, and treatment of AD. Since individuals with MCI have a high chance of converting to AD, effectively discriminating those who convert to AD (MCI-C) from those who do not convert (MCI-NC) is important for early diagnosis of AD. The heterogeneity of AD has motivated attempts to classify distinct subgroups of AD to better inform the underlying physiology. There is evidence to suggest that using data across multiple modalities (e.g. genetics, imaging, metabolomics) has the potential to classify AD subgroups better than using a single modality. We will apply machine and deep learning methods to gain deeper insight into AD and ADRD pathobiology. We will use datasets that include genomics, genetics, metabolomics, and phenotypic data for this purpose. Data will be divided into discovery and validation sets. On the discovery set, state-of-the-art ML and DL methods for integrative analysis that we and others have developed will be coupled with resampling techniques to determine candidate molecular signatures and pathways discriminating the AD groups considered. Molecular scores will be developed from these candidate biomarkers. The clinical utility of the scores beyond well-known clinical risk factors for AD will be ascertained. We will validate our findings using the validation data. We will visually and quantitatively compare the risk scores across several clinical variables and outcomes. We will use (un)supervised clustering methods to identify molecular clusters, and we will investigate molecular clusters differentiating MCI to AD converters from non-converters. We may explore differences across ethnic subgroups.
We will also innovatively apply our multimodal molecular subtyping methods to discover, reproduce, and characterize novel molecular subgroups of AD; this will allow for better risk stratification.
Non-Technical Research Use Statement:
We have been developing novel machine learning (ML) and deep learning (DL) methods that leverage genomics, other omics (including proteomics and metabolomics), clinical, and epidemiological data to better understand the pathogenesis of complex diseases. By integrating data from different sources, we have identified molecular signatures contributing to the risk of the development of complex diseases beyond established risk factors. We propose to innovatively apply these and other existing methods to data pertaining to Alzheimer’s disease (AD) and Alzheimer’s disease related dementias (ADRD). A deeper understanding of the genes, genetic pathways, and other molecular signatures of AD is essential and could facilitate the identification of potential therapeutic targets for the disease.
Investigator:
Sajjadi, Seyed Ahmad
Institution:
University of California, Irvine
Project Title:
Identifying genetic variants associated with multiple pathologic changes in Alzheimer’s Disease and Related Dementias
Date of Approval:
July 14, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Multiple pathologic changes are common in old age and are associated with dementia. While Alzheimer’s disease neuropathologic changes (ADNC) remain the most common pathology observed in older adults, there is also wide recognition of the role of cerebrovascular neuropathologic changes, Lewy body pathology, TAR DNA-binding protein 43 (TDP-43), and hippocampal sclerosis in age-related cognitive decline. These pathologies often do not exist in isolation, and individuals harboring multiple types of pathologies are found to have worse clinical outcomes compared to individuals with any one specific pathology. The genetic mechanisms and risk factors underlying the presence of multiple pathologies, however, are not well understood. While studies have shown shared risk alleles across various pathologies, it remains unclear how multiple pathologies are linked and why some individuals with any one pathology develop comorbid pathologies while others do not. Identification of genetic variants distinct to and shared between pathologies will highlight potential mechanistic links and dissect differences across pathologic changes. The goal of the planned analyses is to identify genetic variants associated with comorbid pathologies and to assess the relationships of genomic and RNA-level information with clinical outcomes across multiple pathologies. We will use integrated clinical, neuropathologic, and sequencing data from the ADSP-PHC and the 90+ study to 1) identify new and previously identified risk alleles associated with neuropathologic changes, 2) relate genomic information with transcriptomic data from publicly available RNA sequencing datasets, and 3) determine the associations of the genomic and transcriptomic information with clinical outcomes (cognitive, motor, and neuropsychiatric symptoms) across different pathologic changes. We will perform separate analyses on the ADSP-PHC and 90+ study datasets. All analyses will be adjusted for age at death, sex, and ethnicity.
Non-Technical Research Use Statement:
With advancing age, the presence of multiple brain pathologies is common and associated with dementia. While Alzheimer’s disease is the most common pathology observed in older adults, other changes like cerebrovascular pathology, Lewy body pathology, TDP-43 pathology, and hippocampal sclerosis also play a role in cognitive decline. People with multiple types of brain changes tend to have worse outcomes. Why some individuals develop multiple pathologies while others do not is still unclear, but genetics may provide some insight into the mechanisms underlying and linking multiple pathologies. By studying genetic variants associated with these brain changes and their impact on clinical outcomes, we aim to uncover new insights. Our research, using data from the ADSP-PHC and the 90+ study, will explore these connections and help shed light on the complex interplay between genetics, brain changes, and functional decline.
Investigator:
Salas Diaz, Lucas
Institution:
Dartmouth College
Project Title:
Human fetal derived microglia display profound age-related changes in epigenetic and transcriptomic features
Date of Approval:
November 25, 2024
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Microglia (MG) are the principal immune cells of the central nervous system, constituting 10% of all cells in the brain. MG cells perform many critical functions involved in normal CNS homeostasis and in neuroinflammation and degenerative diseases. MG cells are derived from embryonic yolk sac progenitors and persist through self-renewal into adulthood. Their behavior and dynamics evolve significantly with aging. As individuals age, microglia undergo profound phenotypic changes. The lack of markers to differentiate between monocyte-derived and resident microglia makes it difficult to assess age-related ontogenetic shifts. Research shows that microglia can be differentiated by unique DNA methylation patterns, forming a "memory trace" of their fetal state, quantified through a fetal cell origin (FCO) score. The FCO score helps determine the epigenetic age of microglia. While generally high in younger individuals, indicating fetal origin, FCO scores decrease significantly in those over 60, suggesting epigenetic remodeling with aging. We hypothesize that aging leads to a large-scale ontogenetic shift in microglial populations, alongside changes in DNA methylation and transcriptomic features. To investigate, microglia samples have been sequenced using RNA-seq. The analysis builds on the following observations and steps: 1. DeWitte microglia show strong FCO variation with age; 2. DeWitte also generated RNA-Seq data on N=50 of the same samples; 3. Obtain RNA-Seq information from paired samples, then compare expression in samples stratified by high/low FCO or by age (<60 vs >60); 4. Obtain DEGs for FCO high cells and rank all DEGs by logFC from high to low expression; 5. Determine the genes in the FCO that discriminate fetal and adult stem cells; 6. Use the fetal gene list and the ranked DEGs from the high/low FCO analysis to perform a gene enrichment analysis, asking whether fetal genes are enriched in the leading edge of overexpressed genes in the FCO high/low ranked list; 7. Determine if fetal genes are enriched in differentially expressed microglial genes; 8. Explore specific gene transcription methylation correlations.
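For illustration, the ranked-list enrichment question in step 6 can be sketched with an unweighted running-sum statistic in the style of gene set enrichment analysis. This is a simplified stand-in, not the investigators' pipeline: standard GSEA weights each hit by its ranking metric and assesses significance by permutation, neither of which is done here. The gene names and the function name `enrichment_score` are hypothetical.

```python
from typing import List, Sequence, Set, Tuple


def enrichment_score(ranked_genes: Sequence[str], gene_set: Set[str]) -> Tuple[float, List[str]]:
    """Unweighted running-sum enrichment score over a ranked gene list.

    Walks the list from top to bottom, stepping up by 1/n_hits at each
    gene in `gene_set` and down by 1/n_miss otherwise. Returns the
    running-sum value with the largest absolute deviation from zero and
    the leading-edge genes (set members up to the peak for a positive
    score, or after the trough for a negative score).
    """
    hits = [g in gene_set for g in ranked_genes]
    n_hits = sum(hits)
    n_miss = len(ranked_genes) - n_hits
    if n_hits == 0 or n_miss == 0:
        raise ValueError("ranked list must contain both set members and non-members")

    running = 0.0
    best = 0.0
    best_index = -1
    for i, hit in enumerate(hits):
        running += 1.0 / n_hits if hit else -1.0 / n_miss
        if abs(running) > abs(best):
            best, best_index = running, i

    if best > 0:
        leading_edge = [g for g in ranked_genes[: best_index + 1] if g in gene_set]
    else:
        leading_edge = [g for g in ranked_genes[best_index + 1:] if g in gene_set]
    return best, leading_edge


# Hypothetical DEGs ranked by logFC (high FCO vs low FCO), highest first.
ranked = ["G1", "G2", "G3", "G4", "G5", "G6"]
# Hypothetical fetal gene list.
fetal_genes = {"G1", "G2", "G5"}

es, leading = enrichment_score(ranked, fetal_genes)
```

A positive score with fetal genes concentrated near the top of the list would be consistent with fetal genes being over-represented among genes overexpressed in high-FCO samples; whether that is statistically meaningful requires a permutation or comparable null model.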
Non-Technical Research Use Statement:
Microglia (MG) are the principal immune cells of the central nervous system, constituting 10% of all cells in the brain. Aging leads to profound changes in microglial phenotypes and the lack of distinguishing markers for monocyte-derived versus resident MG has made it impossible to discern if normal aging leads to age-related ontogenetic shifts in MG populations. The ontogeny of fetal versus adult stem cell-derived populations can be traced to DNA methylation marks using the fetal cell origin (FCO) score. Using published data, we computed the FCO scores of isolated human MG from different brain regions and subjects of varying ages. The results showed the FCO scores in MGs were highly age-dependent. Microglia from older donors demonstrated significantly lower FCO scores, indicating an epigenetic shift or remodeling in the aging CNS population. Here, we hypothesize that aging is associated with a large-scale ontogenetic shift in the MG populations and that this shift will be accompanied by characteristic changes in DNA methylation signatures and associated transcriptomic features.
Investigator:
Saykin, Andrew
Institution:
Indiana University School of Medicine
Project Title:
Alzheimer's Disease Genomics: Systems Biology and Endophenotypes
Date of Approval:
September 17, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Alzheimer’s disease (AD) and related genomic data sets including sequencing, GWAS and phenotypic data will be combined with longitudinal clinical, demographic, cognitive, MRI, PET, CSF and blood endophenotype data, where available, to investigate the genetic architecture of Alzheimer’s disease and related disorders (ADRD) and brain aging. The overall goal is to gain a better understanding of fundamental disease mechanisms, genetic susceptibility and protective factors, and the relationship of genetic factors to disease heterogeneity, progression and different trajectories across biomarker profiles. Data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) will be combined with ADSP and other data sets to increase detection power and for replication across samples. Analyses will include conventional statistical association, multivariate profiling of endophenotypes, biological pathway and network approaches, longitudinal models and combinatorial machine learning. Deliverables will include reports of new prioritized lists of candidate genes and variants for further investigation in new samples, functional experiments and in model systems. The ultimate goal is discovery of novel potential diagnostic markers and therapeutic targets that will help provide the foundation for a precision medicine approach to AD/ADRD.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) and related genomic data sets will be combined with longitudinal clinical, demographic, cognitive, MRI, PET, CSF and blood biomarker data to investigate the genetic architecture of Alzheimer’s disease and related disorders (ADRD) and brain aging. The overall goal is to gain a better understanding of fundamental disease mechanisms, genetic susceptibility and protective factors, and the relationship of genetic factors to disease heterogeneity, progression and different trajectories across biomarker profiles. Data will be combined across studies to increase detection power and for replication. Analyses will include conventional statistical association and advanced analytic approaches including multivariate profiling, biological pathway and network analysis and machine learning. The ultimate goal is discovery of novel potential diagnostic and therapeutic markers that will help provide the foundation for a precision medicine approach to AD/ADRD.
Investigator:
Saykin, Andrew
Institution:
Indiana University School of Medicine
Project Title:
AI4AD (Artificial Intelligence for Alzheimer’s Disease): Ultrascale Machine Learning to Empower Discovery in Alzheimer’s Disease Biobanks
Date of Approval:
January 16, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to genomic, imaging and cognitive data, in order to 1) identify AD genotypes and endophenotypes that dissect AD’s heterogeneity; 2) relate said genotypes and endophenotypes with clinical progression in pre-dementia patients; 3) identify novel treatment targets for AD by analyzing whole genome and associated phenotypic data. The goals of this multisite initiative (Paul Thompson, USC; Christos Davatzikos, Li Shen, Penn; Andy Saykin, IU; Heng Huang, Pitt; Paul Crane, UW; Adam Brickman, Columbia; Tim Hohman, Vanderbilt; Gyungah Jun, BU; Duygu Tosun, UCSF; Alexander Zaranek, Curii) leverage the promise of machine learning (ML) to contribute to precision diagnostics, prognostication, and targeted and novel treatments. We will develop ML and deep learning methods to apply to large scale biobanks of whole genome sequences (WGS), neuroimaging, cognitive, and clinical data, aiming to discover new genomic features that influence biological processes of AD. We will apply methods of genome representation and tiling to WGS repositories to create inputs for AI methods. We will develop novel, interpretable, biological knowledge guided deep learning methods to discover genomic motifs associated with AD, AD risk, and biological processes of AD as defined by NIA-AA criteria. To quantify subtypes and disentangle biological processes of AD, we will apply computational methods to multimodal MRI and amyloid- and tau-sensitive PET to stratify and subtype patient groups; novel imaging genomics methods will detect genomic markers and pathways that modulate the developing pathology as detected in the images, and that predict future clinical decline or resilience.
We hypothesize that advanced deep learning methods combined with whole genome data will outperform traditional methods and GWAS for predicting AD onset and progression, and will assist with disease subtyping and discovering treatable targets in the genome. A team will rank and repurpose existing, and identify novel drugs and targets in the genome based on the discovered genetic motifs affecting AD.
Non-Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to extensive and rich genomic, imaging and cognitive data, in order to 1) identify genotypes and endophenotypes of AD that dissect the heterogeneity of the disease; 2) relate these genotypes and endophenotypes with clinical progression, in pre-dementia patients; 3) identify novel treatment targets for AD, by analyzing whole genome and associated phenotypic data at a previously impossible scale. Collectively, the goals of this highly collaborative multi-site initiative leverage the promise of machine learning to contribute to precision diagnostics, prognostication, and targeted and novel treatments.
Investigator:
Saykin, Andrew
Institution:
Indiana University School of Medicine
Project Title:
Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
July 21, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD, with the ultimate goal of generating harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Saykin, Andrew
Institution:
Indiana University School of Medicine
Project Title:
Centrally-linked Longitudinal pEripheral biomARkers of AD (CLEAR-AD)
Date of Approval:
January 20, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
This U19 aims to bridge these knowledge gaps for discovery and validation of Centrally-linked Longitudinal pEripheral biomARkers of AD (CLEAR-AD) in multi-ethnic populations. CLEAR-AD U19 is based on the premise that AD is a complex disorder in which many biological pathways are disrupted due to multi-omic perturbations, which can be detected in brain and reflected in blood. The specific aims of CLEAR-AD are: 1) To discover centrally-linked peripheral molecular signatures (CLPMS) of the complex and heterogeneous AD pathophysiology and its co-pathologies. 2) To identify longitudinal CLPMS that detect and predict dynamic neuroimaging, fluid biomarker, and clinical changes across the AD spectrum. 3) To characterize differences and similarities in CLPMS profiles across non-Hispanic White (NHW), African American (AA) and Latino American (LA) participants to uncover biomarker patterns in multi-ethnic groups. 4) To make these vast resources available to the scientific community to amplify and accelerate their impact. In this U19, we will leverage NIH-funded ADNI, MCSA and ADRC cohorts of >3,700 multi-ethnic participants to generate >20,000 multi-omics measures (Omics Core) that will be processed and integrated with >48,000 harmonized AD cognitive, neuroimaging and fluid endophenotypes (Analytic Core). Using these data, we will identify brain region and cell-type specific CLPMS, which reflect biological subtypes of AD and disease stage (Project 1). We will discover longitudinal changes in CLPMS that predict cognitive and A/T/N/V progression (Project 2). We will define longitudinal cognitive and A/T/N/V changes and CLPMS in URP that are either conserved with NHW or population-specific (Project 3).
This U19 will a) Identify the next generation of AD biomarkers with mechanistic insights; b) Establish a precision medicine approach for rigorous multi-omics biomarker discovery and validation in AD; c) Discover molecules that can serve as biomarkers and therapeutic targets; d) Enhance biomarker research in trial-ready multi-ethnic populations; and e) Generate and share a vast and harmonized resource of endophenotype and multi-omics data in NIH-funded cohorts.
Non-Technical Research Use Statement:
There is a clear and immediate need for the discovery of peripheral molecular signatures linked to central disease processes, core and co-pathologies in Alzheimer’s Disease (AD), that will serve as precision medicine blood-based biomarkers for diagnostic, prognostic, theragnostic and therapeutic purposes. AD is a complex disorder in which many biological pathways are disrupted due to multi-omic perturbations, which can be detected in brain and reflected in blood, i.e. centrally-linked peripheral molecular signatures (CLPMS). This U19 will leverage deeply phenotyped, longitudinal NIH-funded multi-ethnic cohorts and cross-disciplinary expertise for multi-omics data generation and its integration with harmonized AD endophenotypes, will share these data and utilize them in integrated U19 projects to discover CLPMS that will serve as the next generation of AD biomarkers.
Investigator:
Schafer, Nicholas
Institution:
Constantiam Biosciences Inc.
Project Title:
MAVEvidence Novel Variant Effect Predictors for Alzheimer
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The primary objective of this research is to utilize population data to validate predictors derived from functional assays for Alzheimer’s disease (AD) and Alzheimer’s disease-related dementias (ADRD) risk genes, specifically focusing on APP and SNCA. We aim to validate the functional relevance of these predictors by assessing their correlation with both AD prevalence and clinically meaningful endophenotypes that characterize disease progression. Study Design: This study will employ a multi-faceted design consisting of the following components: Curation of MAVE Data: We will systematically curate existing MAVE studies for APP and SNCA, integrating data from published literature to establish a comprehensive dataset. Validation of Functional Assays: We will conduct validation studies to quantify the discriminative power of MAVE scores in identifying pathogenic versus benign variants using known variants from ClinVar as a benchmark. Quantitative Risk Modeling: We will develop a survival analysis framework to model the age- and genotype-dependent penetrance of AD, leveraging MAVE data alongside population incidence data. Analysis Plan: The analysis will focus on evaluating the following phenotypic characteristics in association with genetic variants: Pathogenicity Discrimination: We will assess the ability of MAVE scores to distinguish between pathogenic and benign variants for APP and SNCA using Receiver Operating Characteristic (ROC) curves, aiming for an area under the curve (AUC) greater than 0.7. Odds of Pathogenicity: The OddsPath for each variant will be calculated using MAVE data in conjunction with ClinVar annotations to quantify the strength of functional evidence. AD Risk Estimation: We will develop a quantitative model that estimates the risk of developing AD based on genotype, validated against data from the Alzheimer’s Disease Sequencing Project and independent cohorts such as the UK Biobank.
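As a rough illustration of the two calibration quantities named in the analysis plan above, the sketch below computes a ROC AUC from assay scores and an odds of pathogenicity (OddsPath). Several things here are assumptions rather than details from the statement: that higher scores mean a more functionally abnormal readout, that a single score threshold defines "abnormal", and that OddsPath is computed at the assay level from the proportion of pathogenic controls before and after conditioning on an abnormal readout, which is one commonly used formulation. The scores and the function names `roc_auc` and `odds_path` are hypothetical.

```python
from typing import Sequence


def roc_auc(pathogenic_scores: Sequence[float], benign_scores: Sequence[float]) -> float:
    """AUC as the probability that a pathogenic variant outscores a benign one.

    Equivalent to the Mann-Whitney U statistic divided by the number of
    pathogenic/benign pairs; ties count as half a win.
    """
    if not pathogenic_scores or not benign_scores:
        raise ValueError("need at least one variant in each class")
    wins = 0.0
    for p in pathogenic_scores:
        for b in benign_scores:
            if p > b:
                wins += 1.0
            elif p == b:
                wins += 0.5
    return wins / (len(pathogenic_scores) * len(benign_scores))


def odds_path(pathogenic_scores: Sequence[float], benign_scores: Sequence[float],
              threshold: float) -> float:
    """Odds of pathogenicity for an abnormal readout (assumed formulation).

    prior     = fraction of control variants that are pathogenic
    posterior = fraction of abnormal-scoring controls that are pathogenic
    OddsPath  = posterior * (1 - prior) / ((1 - posterior) * prior)
    """
    if not pathogenic_scores or not benign_scores:
        raise ValueError("need at least one variant in each class")
    n_path = len(pathogenic_scores)
    prior = n_path / (n_path + len(benign_scores))
    abnormal_path = sum(1 for s in pathogenic_scores if s >= threshold)
    abnormal_benign = sum(1 for s in benign_scores if s >= threshold)
    abnormal_total = abnormal_path + abnormal_benign
    if abnormal_total == 0:
        raise ValueError("no variant scores at or above the threshold")
    posterior = abnormal_path / abnormal_total
    if posterior == 1.0:
        return float("inf")  # no benign control scored abnormal
    return (posterior * (1.0 - prior)) / ((1.0 - posterior) * prior)


# Hypothetical assay scores for control variants with known classifications.
pathogenic = [0.9, 0.8, 0.4]
benign = [0.5, 0.3, 0.2, 0.1]

auc = roc_auc(pathogenic, benign)
oddspath_abnormal = odds_path(pathogenic, benign, threshold=0.45)
```

With these toy values the AUC would clear the 0.7 target mentioned in the statement, but real calibration depends on the number and quality of control variants, and published guidance treats the infinite-odds case with a correction rather than reporting it as is.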
Non-Technical Research Use Statement:
We developed a tool, MAVEvidence, for use by variant scientists to interpret variants in AD/ADRD risk genes. We note that although this use case is not regulated by bodies such as the FDA, it is guided by the published ACMG/AMP framework, to which MAVEvidence adheres. We aim to collect function scores from MAVEs for APP and SNCA, collect clinical annotations from ClinVar, and evaluate the ability of each MAVE to distinguish known pathogenic and benign variants. To apply functional evidence within the ACMG/AMP framework, function scores must first be converted to evidence strengths. We will calibrate variant-level functional evidence by first calculating the odds of pathogenicity and then applying a Bayesian approach to convert to evidence strength. Finally, we will generate quantitative measures of risk by integrating MAVE data and population AD/ADRD incidence data into a survival model that predicts age- and genotype-dependent penetrance of Alzheimer’s disease.
Investigator:
Schellenberg, Gerard
Institution:
University of Pennsylvania
Project Title:
ADSP Data Analysis
Date of Approval:
July 8, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
As part of the Consortium for Alzheimer's Sequence Analysis (CASA; NIA grant U19AG047133), we plan to analyze whole exome and whole genome sequence data generated from subjects with Alzheimer's disease (AD) and elderly normal controls. These data will be generated by the National Human Genome Research Institute Large-Scale Sequence Program. The goal of the planned analyses is to identify genes that have alleles that protect against or increase susceptibility to AD. We will evaluate variants detected in the sequence data for association with AD to identify protective and susceptibility alleles using the whole exome and whole genome case-control data. We will also evaluate sequence data from multiplex AD families to identify variants associated with AD risk and protection, and evaluate variant co-segregation with AD. The family data will be whole genome data. The family-based data will be used to inform the case-control analysis and vice versa. We will also focus on structural variants (insertion-deletions, copy number variants, and chromosomal rearrangements). Evaluation of structural variants will involve both whole genome and whole exome data. Structural variants will be analyzed together with single nucleotide variants detected and analyzed in the case-control and family-based data.
Non-Technical Research Use Statement:
We are attempting to identify all the inherited elements that contribute to Alzheimer's disease risk. To do this we will analyze DNA sequence data from subjects with Alzheimer's disease and elderly subjects who are cognitively normal. The sequence data from these 2 groups will be compared to identify differences that contribute to the risk of developing Alzheimer's disease or that protect against Alzheimer's disease. These DNA differences can be at a single site in the genetic code, or can span multiple sites, changing the copy number of DNA sequences. Both types of genetic variants will be examined.
Investigator:
Schellenberg, Gerard
Institution:
University of Pennsylvania
Project Title:
Genetic Association Study of Alzheimer’s Disease with Whole-Genome and Whole-Exome Sequence Data
Date of Approval:
July 17, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objective of the proposed research: Recent studies have found that single nucleotide polymorphisms (SNPs) and copy number variations (CNVs) can both play significant roles in the missing heritability of Alzheimer's Disease (AD). In this project, we propose to conduct a comprehensive investigation of both variant types and understand their contributions to AD risk. Study design: We will use whole-genome (WGS) and whole-exome (WES) sequence data in the Alzheimer's Disease Sequencing Project (ADSP) and conduct case-control association analyses of SNPs and CNVs. Analysis plan: Using the ADSP sequence data, we will start with CNV detection and characterization of CNV sequence features (e.g., microhomology, non-template insertions, and segmental duplications) to understand potential mechanisms of CNV formation. Next, we will study the association of AD status with SNPs and CNVs (common and rare) using standard association methods and adjusting for population regressions to assure efficient modeling of joint SNP-CNV effects from common and rare variants. We will perform ethnic-specific and ethnic-combined association analyses. We will use principal-component-based methods to adjust for population stratification (PS), but also explore the efficacy of other PS adjustment methods. Finally, we will conduct biological annotation on identified risk variants. Collaborators: This team includes researchers from University of Pennsylvania (UPenn) and North Carolina State University (NCSU). UPenn includes Gerard Schellenberg (PI: Professor of Pathology and Laboratory Medicine), Li-San Wang (PI: Professor of Pathology and Laboratory Medicine), Wan-Ping Lee (PI: Research Assistant Professor of Pathology and Laboratory Medicine), Adam Naj (Assistant Professor of Biostatistics and Epidemiology) and Yuk Yee Leung (Research Assistant Professor of Pathology and Laboratory Medicine).
NCSU includes Jung-Ying Tzeng (PI: Professor of Statistics and Bioinformatics Research Center), Wenbin Lu (Professor of Statistics), Arnab Maity (Associate Professor of Statistics) and Jessie Jeng (Associate Professor of Statistics).
Non-Technical Research Use Statement:
Copy number variants (CNVs) are DNA regions that have gains (duplications) or losses (deletions). CNVs affect a considerable number of base pairs in the human genome. Unlike single-nucleotide polymorphisms (SNPs), which have been broadly studied in diseases, CNVs have not been intensively investigated. The large-scale Alzheimer’s Disease Sequencing Project (ADSP) provides a systematic way to capture nearly all genomic variations and to study the genetic basis of Alzheimer’s Disease (AD). In this project, using the data of affected and unaffected samples from ADSP, we propose to conduct a comprehensive investigation of both variant types (SNPs and CNVs) and study their contributions to AD risk and etiology. We will start with CNV genotyping, followed by conducting standard association analysis of AD with SNPs and CNVs. We will also develop and apply new analytical methods for efficient modeling of joint SNP-CNV effects from common and rare variants. Finally, we will conduct functional annotation on identified risk variants to uncover possible biological mechanisms.
Investigator:
Schellenberg, Gerard
Institution:
University of Pennsylvania
Project Title:
PSP and CBD Genetics
Date of Approval:
July 27, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We plan to analyze whole exome and whole genome sequence data generated from subjects with progressive supranuclear palsy (PSP), corticobasal degeneration (CBD), Alzheimer's disease (AD) and elderly normal controls. The goal is to detect mutations and variants that cause, contribute to risk, or protect against PSP and/or CBD. We want to compare PSP and CBD genotypes to those from AD and normal controls sequenced by the Alzheimer's Disease Sequence Project. We would like both whole genome and whole exome data from the Alzheimer's Disease Sequence Project for AD and normal controls. We would also like whole genome and whole exome data for PSP and CBD generated by the PSP and Tau consortiums. We will use these data to determine which mutations and variants are associated with PSP or CBD versus benign variants. All PSP and CBD subjects being sequenced are deceased. The requested data sets will have variants recalled as a batch and combined to evaluate allele frequencies of called variants. The AD and control variant frequencies will then be compared to allele frequencies from PSP and CBD subjects as described above. We will also compare structural variants (insertion-deletions, copy number variants, and chromosomal rearrangements) identified in PSP and CBD subjects to those found in AD and in cognitively normal controls in order to determine structural variants involved in PSP and CBD pathogenesis. All of the investigators that are listed will be using a joint called VCF generated from the requested data sets. PSP is a neurodegenerative disease closely related to Alzheimer's disease (AD). PSP, CBD and AD have neurofibrillary tangles as part of the signature neuropathology defining these disorders. PSP and CBD are considered Alzheimer’s Disease Related Disorders (ADRD).
Non-Technical Research Use Statement:
We are attempting to identify all the inherited elements that contribute to progressive supranuclear palsy (PSP) and corticobasal degeneration (CBD) risk. To do this we will analyze DNA sequence data from subjects with AD, PSP, CBD, and subjects who are cognitively normal. The sequence data from these groups will be compared to identify differences that contribute to the risk of developing PSP and CBD, or that protect against these diseases. These DNA differences can be at a single site in the genetic code, or can span multiple sites, changing the copy number of DNA sequences. Both types of genetic variants will be examined.
Investigator:
Schliep, Alexander
Institution:
University of Gothenburg
Project Title:
AD subtypes
Date of Approval:
August 10, 2021
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Alzheimer’s disease (AD) is a highly heterogeneous disease with diverse clinical manifestations and rates of progression. Identifying AD subtypes will improve our understanding of the underlying disease mechanisms, enable us to predict disease trajectory and develop new disease-modifying treatments. Objective. To identify distinct AD subtypes from WGS data of AD individuals. Analysis plan. We will use 3000 WGS samples derived from the ADSP Discovery Case-Control Based Extension Study. We will use the available SNVs and INDELs and infer structural variants (SVs) with our in-house multi-caller pipelines. Rare variants will be retained for further analysis. We will then split the dataset into training and test sets, and use the identified set of genetic variants (i.e. SNVs, INDELs and SVs) as input to a deep neural network (an autoencoder architecture) to learn an unsupervised latent representation of the data. AD subtypes will be identified within this reduced space and characterized using demographic and clinical data. We will then contrast each subtype with the control groups to identify subtype-relevant variants (i.e. putative subtype biomarkers), which will be used as input features to a gradient boosted tree model, to generate a subtype predictive model and subtype-specific features. Planned collaboration. Each member of the team will devote effort to specific areas of investigation; nevertheless, all the team members will discuss, through regular meetings, individual progress and potential challenges. In particular, Dr Coppola (Research Scientist, Department of Pathology, Yale University, USA), together with Dr Dean Palejev (Associate Professor, GATE Institute, Sofia University, Bulgaria), will be involved in the deep learning model generation and validation, and subtype identification; Dr Fredrik Johansson (Assistant Professor, Department of Computer Science & Engineering, Chalmers University of Technology,
Sweden) will work on the supervised machine learning model; Dr Alexander Schliep (Associate Professor, Department of Computer Science & Engineering, University of Gothenburg, Sweden) will work on the SV inference.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is a highly heterogeneous disease with diverse clinical manifestations and rate of progression. The heterogeneity of AD has complicated both clinical trial design and outcomes, and thus the need for better models of AD, and/or better strategies for selection of participants into specific clinical trials is evident. The identification of more homogeneous disease subgroups (i.e. AD subtypes) will improve our understanding of the underlying disease mechanisms, enable us to predict disease trajectory and develop new disease-modifying treatments. We will use a comprehensive set of genetic variants in combination with deep learning algorithms to identify AD subtypes. Subtypes will be characterized using clinical and demographic data. Finally, variants specific to each cluster will be identified and used to train a predictive machine-learning model to classify new individuals.
Investigator:
Schwaba, Ted
Institution:
Michigan State University (MSU)
Project Title:
A Genomic Window Into Lifespan Personality Development
Date of Approval:
July 15, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: Across the lifespan, a person's genetics are associated with their personality traits. We will capitalize on recent advancements in personality genomics to test whether and how common genetic variants relevant to personality (i.e., polygenic indices (PGIs) for personality traits) are associated with trajectories of phenotypic personality development, both with age and in the context of life events. By linking genetics to personality through environment, we will substantiate mechanisms by which lifespan development occurs. Study design: Observational; we will use existing data (the Health and Retirement Study; HRS) to estimate longitudinal correlations between genetic profiles (PGIs) and personality trait development over time. Analysis plan: 1) Using genomic information from HRS participants, we will apply polygenic index (PGI) weights from the forthcoming Revived Genomics of Personality Consortium (ReGPC) Genome-Wide Association Study (Schwaba et al., 2025) to assign each participant a PGI value for each of the Big Five personality traits (extraversion, agreeableness, conscientiousness, neuroticism, openness to experience). 2) Using multilevel models, we will estimate stable levels and change over time in the phenotypic Big Five personality traits (measured bi-yearly from 2006 onwards using the Midlife Development Inventory). Change will be operationalized as yearly development in the years before/after a stressful life event (measured with a life events questionnaire) and yearly age-graded development (measured in terms of change with age). 3) We will correlate PGI values with levels and change in the Big Five personality traits, to examine whether a person's personality-relevant genetics differentiate trajectories of development with age and in the context of life events.
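The three analysis steps above can be illustrated with a small sketch. It is a simplification under stated assumptions: a PGI is computed as a weighted sum of allele dosages, per-person ordinary least squares slopes stand in for the multilevel models named in the statement, and all function names and numbers are hypothetical.

```python
from math import sqrt


def polygenic_index(dosages, weights):
    """PGI as the sum of per-variant weights times allele dosages (0, 1, or 2)."""
    if len(dosages) != len(weights):
        raise ValueError("dosages and weights must have the same length")
    return sum(d * w for d, w in zip(dosages, weights))


def ols_slope(times, scores):
    """Per-person linear change per unit time (a stand-in for a multilevel model)."""
    mean_t = sum(times) / len(times)
    mean_s = sum(scores) / len(scores)
    num = sum((t - mean_t) * (s - mean_s) for t, s in zip(times, scores))
    den = sum((t - mean_t) ** 2 for t in times)
    return num / den


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


if __name__ == "__main__":
    weights = [0.10, -0.20, 0.30]                  # hypothetical GWAS weights
    people = [[0, 1, 2], [2, 0, 1], [1, 1, 0]]     # hypothetical dosages
    pgis = [polygenic_index(p, weights) for p in people]
    waves = [0, 2, 4]                              # years since first assessment
    traits = [[3.0, 3.2, 3.4], [3.1, 3.1, 3.2], [2.9, 2.8, 2.8]]
    slopes = [ols_slope(waves, t) for t in traits]
    print(pearson(pgis, slopes))
```

A multilevel model would additionally pool information across participants and model life-event timing, which this sketch does not attempt.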
Non-Technical Research Use Statement:
Across the lifespan, a person's genetics are associated with their personality traits. We will test whether and how genetic variants relevant to personality are associated with trajectories of personality trait development, both with age and in the context of life events. By linking genetics to personality through environment, we will substantiate mechanisms through which lifespan development occurs.
Investigator:
Seshadri, Sudha
Institution:
Glenn Biggs Institute for Alzheimer's and Neurodegenerative Diseases, University of Texas Health Sciences Center, San Antonio, TX
Project Title:
Therapeutic target discovery in ADSP data via comprehensive whole-genome analysis incorporating ethnic diversity and systems approaches
Date of Approval:
August 12, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objective: Utilize ADSP data sets to identify genes & specific genetic variants that confer risk for or protection from Alzheimer disease. Aim 1: Using combined WGS/WES across the ADSP Discovery, Disc-Ext, and FUS Phases, including single nucleotide variants, small insertion/deletions, and structural variants, we will: Aim 1a. Perform whole genome single variant and rare variant case/control association analyses of AD using ADSP and other available data; Aim 1b. Target protective variant identification via association analysis using selected controls within the ADSP data and performing meta-analysis across association results based on selected controls from non-ADSP data sets; Aim 1c. Perform endophenotype analyses including cognitive function measures, hippocampal volume and circulating beta-amyloid, using ADSP data from subjects for which these measures are available. Meta-analysis will be conducted across ADSP and non-ADSP analysis results. Aim 2: To leverage ethnically-diverse and admixed populations to identify AD variants we will: Aim 2a. Estimate and account for global and local ancestry in all analyses; Aim 2b. Perform admixture mapping in samples of admixed ancestry; and Aim 2c. Perform ethnicity-specific and trans-ethnic meta-analyses. Aim 3: To identify putative therapeutic targets through functional characterization of genes and networks via bioinformatics and integrative ‘omics analyses, we will: Aim 3a. Annotate variants with their functional consequences using bioinformatic tools and publicly available “omics” data; Aim 3b. Prioritize results, group variants with shared function, and identify key genes functionally related to AD via weighted association analyses and network approaches. Analyses will be performed in coordination with the following PIs. Coordination will involve sharing expertise, analysis plans or analysis results. No individual level data will be shared across institutions.
Philip De Jager, Columbia University; Eric Boerwinkle & Myriam Fornage, U of Texas Health Science Center, Houston; Sudha Seshadri, U of Texas, San Antonio; Ellen Wijsman, U of Washington; William Salerno, Baylor College of Medicine.
Non-Technical Research Use Statement:
This proposal seeks to analyze existing genetic sequencing data generated as part of the Alzheimer’s Disease Sequencing Project (ADSP) including the ADSP Follow-up Study (FUS) with the goal of identifying genes and specific changes within those genes that either confer risk for Alzheimer’s Disease or provide protection from Alzheimer’s Disease. Analytic challenges include analysis of whole genome sequencing data, appropriately accounting for population structure across European ancestry, Hispanic, and African American participants, and interpreting results in the context of other genomic data available.
Investigator:
Shah, Naisha
Institution:
J. Craig Venter Institute
Project Title:
Multimodal Analysis of AD
Date of Approval:
October 5, 2020
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
The principal goal of this study is to identify novel genetic signatures associated with subtypes of Alzheimer’s disease (AD). Specifically, we aim to (a) identify subtypes of AD using phenotypic variables including age-of-onset, sex, years of education, clinical measurements, and neurocognitive measurements, and (b) identify novel genetic signatures that include rare variants, APOE status and/or polygenic risk scores (PRS), which are associated with these subtypes. Current knowledge from literature and databases will be utilized to perform feature engineering such as calculation of PRS and genomic-region-based bins for rare variant burden. We will employ unsupervised learning such as community detection and clustering algorithms to identify subtypes of AD, and supervised learning such as decision tree and regularization algorithms to find genetic signatures that are associated with the subtypes. The identified genetic signatures will be evaluated using appropriate performance metrics for the predictive models used. As more phenotypic variables are made available, the models will be updated to refine the subtypes with better characterization. It is anticipated that the novel genetic signatures will yield insight into the etiologies of the heterogeneous Alzheimer’s disease, and therefore provide opportunities to develop personalized treatments. Initially, we will use the NIAGADS data to generate preliminary data for grant submissions. After receiving an award, the project would utilize the very valuable dataset fully.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is a heterogeneous condition and is highly heritable (58-79% heritability for late-onset and >90% heritability for early-onset). We have yet to identify a large proportion of the genetic variants that either increase or decrease risk for different subtypes of AD such as early-onset or late-onset. In the proposed study, we plan to identify genetic signatures for the different subtypes of AD. Unraveling the heterogeneity of AD and its associated genetic signatures is critical for implementation of precision medicine to combat such a devastating disease.
Investigator:
Sharp, Andrew
Institution:
Icahn School of Medicine at Mount Sinai
Project Title:
Investigating the role of tandem repeat variation in Alzheimer’s disease
Date of Approval:
March 31, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Variation in tandem repeats (TRs), particularly large expansions of triplet repeats (e.g., polyCAG), is known to cause a number of late-onset neurological diseases. Due to their repetitive and degenerate nature, variations in TRs are typically ignored by standard genome analysis pipelines. Furthermore, pathogenic repeat expansions typically span hundreds to thousands of bases, making variations in them difficult to detect in short-read data. However, recently a number of specialized algorithms have been developed which enable expansions of short TRs (motif sizes 1-12bp) to be detected in short-read sequencing data. Our lab has also developed approaches that allow the copy number of repeats with larger motifs (motif size ranging from 12bp-200kb) to be estimated based on read depth. We hypothesize that variation in TR regions contributes to risk of AD, and will test two hypotheses: 1. That rare pathogenic expansions of TRs (either in the “full” or “pre-mutation” range) occur at increased frequency in AD patients compared to controls. 2. That length variation in TRs of all sizes represents a class of common genetic variation that may alter an individual’s susceptibility to AD. In Aim 1, we will search for expansions of microsatellite repeats using tools such as ExpansionHunter, exSTRa and STRetch, which analyze WGS BAM files for signatures of expansion. We will look for loci with an excess of rare outlier genotypes in cases vs controls. If loci showing rare expansions in cases are identified, we would, if possible, request aliquots of DNA from the specific individuals to perform long-read sequencing to validate the presence of potentially pathogenic repeat expansions. In Aim 2, we will use read depth approaches to estimate copy number of large TRs. We will compare estimated copy numbers of these repetitive regions in cases vs. controls to identify TR loci that show significant associations of copy number with AD.
Analysis will incorporate technical and biological covariates, such as principal components of WGS read depth data, ethnicity from SNV data, gender, etc, and will utilize a multiple testing correction for genome-wide analysis.
Non-Technical Research Use Statement:
Some types of neurodegenerative diseases are known to be caused by a specific type of genetic mutation where a short piece of DNA becomes repeated hundreds of times. Termed “repeat expansions”, these types of mutation can be difficult to find using the standard methods scientists use to sequence DNA. We believe that some cases of Alzheimer’s disease may also be caused by repeat expansions. We will apply new analysis approaches that are specifically designed to find these repeat expansions in genome sequencing data, with the aim of finding novel types of genetic mutation that contribute to some cases of Alzheimer’s disease.
Investigator:
Shelton, Janie
Institution:
Bristol Myers Squibb
Project Title:
A longitudinal study of Alzheimer’s Disease and other dementing illnesses – KnightADRC GWAS
Date of Approval:
January 30, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Recently approved Alzheimer’s disease (AD) therapies, such as lecanemab (Leqembi) and donanemab (Kisunla), represent a significant advancement toward disease-modifying treatment. However, their impact on cognitive decline remains modest, and both are associated with potentially serious adverse events, including amyloid-related imaging abnormalities (ARIA). These limitations underscore the urgent need for additional therapeutic strategies to reduce disease burden. Genetic approaches offer a powerful avenue for drug target discovery, with evidence suggesting that genetically supported targets are at least twice as likely to progress successfully through clinical development to FDA approval (Nelson et al., 2015, Nat Genet; King et al., 2019, PLoS Genet; Minikel et al., 2024, Nature). To date, most genetic studies in AD have focused on identifying loci associated with disease risk. Large-scale genome-wide association studies (GWAS) have uncovered approximately 75 risk loci (Bellenguez et al., 2022, Nat Genet), providing valuable insights into disease etiology. However, therapeutic interventions are typically aimed at individuals already diagnosed with AD, making the genetics of disease progression a critical—yet underexplored—complementary approach for target discovery. Progression-focused genetic studies face challenges due to limited availability of longitudinal phenotypic data. To address this, meta-analysis of multiple GWAS datasets offers a practical strategy to increase statistical power and detect robust associations. We propose to incorporate summary statistics from the Knight Alzheimer Disease Research Center (Knight-ADRC) AD progression GWAS into a meta-analysis alongside several publicly available and proprietary datasets. Our objective is to identify novel genetic drivers of AD progression, prioritize new therapeutic targets, and assess the impact of existing pipeline candidates on disease trajectory.
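The meta-analysis of GWAS summary statistics mentioned above is often done with a fixed-effect, inverse-variance-weighted model. The sketch below shows that standard calculation for a single variant; the statement does not say which meta-analysis model will be used, so the choice of method, the function name, and the example numbers are all assumptions.

```python
from math import erfc, sqrt


def ivw_meta(betas, ses):
    """Fixed-effect inverse-variance-weighted meta-analysis for one variant.

    betas: per-study effect estimates for the same effect allele
    ses:   matching standard errors
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if len(betas) != len(ses) or not betas:
        raise ValueError("betas and ses must be non-empty and the same length")
    weights = [1.0 / se ** 2 for se in ses]        # weight = 1 / variance
    total = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total
    pooled_se = sqrt(1.0 / total)
    z = pooled_beta / pooled_se
    p = erfc(abs(z) / sqrt(2.0))                   # two-sided normal p-value
    return pooled_beta, pooled_se, z, p


if __name__ == "__main__":
    # Hypothetical effect sizes for one variant from three progression GWAS.
    beta, se, z, p = ivw_meta([0.12, 0.08, 0.15], [0.05, 0.04, 0.07])
    print(f"beta={beta:.4f} se={se:.4f} z={z:.2f} p={p:.2e}")
```

Before pooling, effect alleles must be aligned across studies; heterogeneity between cohorts may call for a random-effects model instead.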
Non-Technical Research Use Statement:
New Alzheimer’s treatments like lecanemab (Leqembi) and donanemab (Kisunla) are an important step forward in the search for ways to help patients, but these drugs have only moderate benefits and can come with serious side effects. Better therapies are still needed to reduce the impact of the disease. Genetics offers a powerful way to discover new drugs; studies show that treatments based on genetic findings are more likely to succeed. So far most genetic research has focused on the genes which increase the risk of developing Alzheimer’s, but understanding genes that drive how the disease progresses in Alzheimer’s patients may be even more beneficial. However, this type of data, which involves following participants over time, is limited; combining results from multiple smaller studies (a meta-analysis) can help uncover important patterns. We plan to add data from the Knight Alzheimer Disease Research Center to a larger analysis to find new genetic clues, identify better treatment targets, and evaluate how current and future drugs may slow disease progression.
Investigator:
Shen, Li
Institution:
University of Pennsylvania
Project Title:
AI4AD (Artificial Intelligence for Alzheimer’s Disease): Ultrascale Machine Learning to Empower Discovery in Alzheimer’s Disease Biobanks
Date of Approval:
November 21, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to genomic, imaging and cognitive data, in order to 1) identify AD genotypes and endophenotypes that dissect AD’s heterogeneity; 2) relate said genotypes and endophenotypes with clinical progression in pre-dementia patients; 3) identify novel treatment targets for AD by analyzing whole genome and associated phenotypic data. The goals of this multisite initiative (Paul Thompson, USC; Christos Davatzikos, Li Shen, Penn; Andy Saykin, IU; Heng Huang, Pitt; Paul Crane, UW; Adam Brickman, Columbia; Tim Hohman, Vanderbilt; Gyungah Jun, BU; Duygu Tosun, UCSF; Alexander Zaranek, Curii) leverage the promise of machine learning (ML) to contribute to precision diagnostics, prognostication, and targeted and novel treatments. We will develop ML and deep learning methods to apply to large scale biobanks of whole genome sequences (WGS), neuroimaging, cognitive, and clinical data, aiming to discover new genomic features that influence biological processes of AD. We will apply methods of genome representation and tiling to WGS repositories to create inputs for AI methods. We will develop novel, interpretable, biological knowledge guided deep learning methods to discover genomic motifs associated with AD, AD risk, and biological processes of AD as defined by NIA-AA criteria. To quantify subtypes and disentangle biological processes of AD, we will apply computational methods to multimodal MRI and amyloid- and tau-sensitive PET to stratify and subtype patient groups; novel imaging genomics methods will detect genomic markers and pathways that modulate the developing pathology as detected in the images, and that predict future clinical decline or resilience.
We hypothesize that advanced deep learning methods combined with whole genome data will outperform traditional methods and GWAS for predicting AD onset and progression, and will assist with disease subtyping and discovering treatable targets in the genome. A team will rank and repurpose existing, and identify novel drugs and targets in the genome based on the discovered genetic motifs affecting AD.
Non-Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to extensive and rich genomic, imaging and cognitive data, in order to 1) identify genotypes and endophenotypes of AD that dissect the heterogeneity of the disease; 2) relate these genotypes and endophenotypes with clinical progression, in pre-dementia patients; 3) identify novel treatment targets for AD, by analyzing whole genome and associated phenotypic data at a previously impossible scale. Collectively, the goals of this highly collaborative multi-site initiative leverage the promise of machine learning to contribute to precision diagnostics, prognostication, and targeted and novel treatments.
Investigator:
Shen, Li
Institution:
University of Pennsylvania
Project Title:
Artificial Intelligence Strategies for Alzheimer's Disease Research
Date of Approval:
May 18, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The goal of this project is to develop artificial intelligence (AI) approaches for extracting unforeseen patterns from clinical, genetic, genomic, and imaging data that could lead to ideas for new drug development or drug repurposing. Our proposed AI methods and software will be open-source, user-friendly, and freely available for all to use. Specifically, we will analyze ADSP data sets using three novel informatics methods to tailor our automated machine learning (AutoML) tool to the analysis of Alzheimer’s disease (AD) data. First, we will develop a Multi-Modal Interaction (M2I) feature selection algorithm for identifying genetic interactions that are predictive of AD (AIM 1). Second, we will develop a Knowledge-driven Multi-omics Integration (KMI) algorithm for combining omics features for AI analysis of AD (AIM 2). Third, we will develop a Multidimensional Brain Imaging Omics (MBIO) integration framework for the joint analysis of multisource large-scale data for predicting AD. Finally, we will integrate all three biomedical informatics methods into our open-source AutoML software package and apply it to the ADSP data sets. We expect our methods will reveal new biomarkers for AD that will open the door for better treatments and clinical decision support.
Non-Technical Research Use Statement:
The goal of this project is to develop artificial intelligence (AI) approaches for extracting unforeseen patterns from clinical, genetic, genomic, and imaging data that could lead to ideas for new drug development or drug repurposing. We will develop three biomedical informatics methods with focuses on genetics, genomics and imaging respectively. We will integrate these methods into our open-source AutoML software package, and apply it to the ADSP data sets. We expect our methods will reveal new biomarkers for AD that will open the door for better treatments and clinical decision support.
Investigator:
Shortt, Jonathan
Institution:
University of Colorado Anschutz Medical Campus
Project Title:
Understanding the role of genetic admixture in Alzheimer's disease risk in Latino populations
Date of Approval:
September 5, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We aim to evaluate the potential modification of AD risk and manifestation of AD phenotypes through varying patterns of admixture, local ancestry, and global ancestry proportions within and between Latino populations. We aim to develop an ancestry-informed risk model and report how consideration of local and global ancestry may influence estimated effects of known AD risk factors on AD risk and phenotypes. We will leverage this information to build improved PRS and compare local-ancestry-aware methods to other common methods for improving multi-ancestry portability of PRS. Specifically, our study aims to characterize: 1. The risk of AD conferred by the local ancestry of genetic risk factors in admixed populations. 2. Methods and a computational pipeline for establishing an ancestry-informed risk score for AD in admixed populations from the US and Latin America. 3. How patterns of admixture across a genome influence AD risk and modify associations with common co-morbidities or AD risk in response to social determinants of health and other non-genetic AD risk factors. 4. How local-ancestry-informed PRS compare to other common methods for PRS derivation in admixed populations (for example PRS-CSx, PolyPred-S, SBayesRC, and other to-be-published methods), including PRS methods which leverage ADRD-relevant functional annotation and QTL resources, such as for transcriptome-based risk scoring. Since we will be using multiple data sets and harmonization of those data might not be the most statistically appropriate route to answer the specific questions our project seeks to address, we will in some cases perform a meta-analysis. To complete a meta-analysis specific to each objective of this proposal, we will employ a meta-analysis framework that has utility in both multiethnic and ethnic-specific analysis.
Additionally, where appropriate, we will use linear mixed effects models and linear regression models to measure associations/correlations of the variables we will examine in our investigation (e.g. associations/correlations of longitudinal or cross-sectional global cognition with gene expression).
Non-Technical Research Use Statement:
AD disparities between populations are well documented and are influenced by both social determinants of health and the genetic architecture specific to ethnic groups. Our research aims to build a comprehensive model that elucidates how AD risk is modified in response to varying patterns of admixture within and between Latino populations. This study will report how ancestry-differentiated and ancestry-specific genetic loci (in terms of both variant frequency and effect size) modify AD risk, whether differential patterns of admixture influence AD risk in the context of global ancestry proportions, and whether the expressivity of AD phenotypes is modified by admixture patterns across the genome in Latino populations. Furthermore, this study will determine whether non-genetic risk factors, in combination with environmental risk factors that may be specific to sub-populations of Latinos, mediate risk for AD in response to differential patterns of admixture.
Investigator:
Shu, Hai
Institution:
New York University
Project Title:
SCH: Novel Multi-View Statistical Machine Learning for Alzheimer's Disease
Date of Approval:
June 15, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Alzheimer’s disease (AD) is the most common type of dementia, and no curative medications exist for it. Early detection of AD is thus essential for timely intervention and effective treatment development. Multi-view data provides a transformative approach to enhance the understanding, diagnosis, and prediction of AD. This research aims to develop three novel statistical machine learning methods for AD research using multi-view imaging, genomic, and clinical data, focusing on multi-view brain and genomic network analysis, significant differential feature testing, and diagnosis and survival analysis. The specific aims of this proposal include: 1. Develop a novel high-dimensional multi-view data decomposition based on uncorrelated common and distinctive latent factors (C&DLFs) to construct multi-view networks, with application to comparing brain and genomic networks across AD statuses; 2. Develop an optimal false discovery rate (FDR) control method based on a novel semi-parametric hidden Markov random field for high-dimensional spatial multiple testing, with application to identifying significant brain and genomic differences across AD statuses; 3. Develop a highly accurate deep-learning-based diagnosis and survival framework for high-dimensional tabular data, incorporating feature selection and view ablation to enhance cost-effectiveness and data accessibility in clinical practice; 4. Apply the three proposed methods to four large-scale AD-related datasets and disseminate the methods with an open, efficient software package. The three novel methods in our project will undergo rigorous theoretical and numerical analyses. We will combine the de-identified Alzheimer's Disease Research Centers Datasets stored in the NIAGADS database with these centers' de-identified imaging data stored in the NACC database to conduct this secondary data analysis project.
Non-Technical Research Use Statement:
The research goal is to develop three novel statistical machine learning methods for Alzheimer’s disease (AD) using multi-view imaging, genomic, and clinical data. These methods will focus on multi-view brain and genomic network analysis, significant differential feature testing, and diagnosis and survival analysis. The proposed project will enhance our understanding of neuro-genetic associations in AD and significantly contribute to biomarker discovery, early detection, and improved patient survival in AD.
Investigator:
Sinclair, Lindsey
Institution:
University of Bristol
Project Title:
Is depression a modifiable risk factor for Alzheimer's disease?
Date of Approval:
September 17, 2024
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: I aim to identify whether individuals with normal cognition and a high genetic risk of depression are at higher risk of developing atrophy in brain regions affected by Alzheimer’s disease than those at lower genetic risk. Study Design: Depression was identified by the Lancet Commission in 2017 as a potentially modifiable risk factor for Alzheimer’s disease. The extent to which this risk is actually modifiable is unknown, as are the mechanisms by which depression increases dementia risk. We wish to use polygenic risk scores (PRS) to look at whether someone’s total genetic risk for depression makes them more likely to have shrinkage of brain regions affected in AD and/or involved in mood over time. If individuals at higher genetic risk of depression have greater shrinkage of brain regions affected early in AD, e.g. the medial temporal lobe, this could help to explain part of the increase in risk. We will do this using NACC and ROSMAP data on people with normal cognition at the time that their MRI scans were performed. We will use logistic regression and mediation analyses to examine whether social contact may mediate some of the increased risk for dementia in those who are depressed. We would like to be able to include genetic risk for depression and AD in this analysis. Analysis Plan: I will use data from individuals in NACC and ROSMAP with normal cognition at the time of their MRI. I will examine whether the PRS is related to the risk of AD by comparing mean regional brain volumes for those in the highest and lowest quartile of PRS-D. Included individuals will need to have completed ≥1 depression rating scale and ≥1 MRI. In NACC depression will be defined as GDS ≥8 and/or NPI-depression severity ≥2 & distress ≥3. In ROSMAP depression will be determined using DSM-IV diagnoses & CESD ≥4. The PRS for depression will be calculated using R or PLINK using data from the most recent depression GWAS. I plan to use lassosum2, LDpred2 and PRS clumping & thresholding and to include a smaller GWAS with better phenotyping (Wray et al 2018) in a secondary analysis. The PRS for AD will also be calculated and included as a covariate in analyses.
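The quartile comparison in the analysis plan above can be illustrated with a minimal Python sketch. It covers only the final scoring step, assuming per-variant weights have already been derived (for example by one of the tools the statement names); it does not implement lassosum2, LDpred2, or clumping and thresholding. All weights, genotypes, and brain volumes are made-up toy values, and the function names are hypothetical.

```python
from statistics import mean

def polygenic_score(dosages, weights):
    """Weighted sum of effect-allele dosages (0, 1 or 2) for one person."""
    return sum(d * w for d, w in zip(dosages, weights))

def quartile_groups(scores):
    """Return (lowest-quartile indices, highest-quartile indices)."""
    if len(scores) < 4:
        raise ValueError("need at least four individuals to form quartiles")
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    k = len(scores) // 4
    return order[:k], order[-k:]

# Hypothetical per-variant weights (e.g. GWAS effect sizes), toy values only.
weights = [0.10, -0.05, 0.20]

# Hypothetical effect-allele dosages: one row per person, one column per variant.
genotypes = [
    [0, 0, 0],
    [1, 0, 0],
    [0, 1, 1],
    [2, 0, 1],
    [1, 1, 2],
    [2, 0, 2],
    [0, 1, 0],
    [1, 2, 1],
]

# Hypothetical regional brain volumes (arbitrary units), one per person.
volumes = [3.9, 3.8, 3.7, 3.5, 3.3, 3.2, 4.0, 3.6]

scores = [polygenic_score(g, weights) for g in genotypes]
low, high = quartile_groups(scores)

# Difference in mean volume: highest-score quartile minus lowest-score quartile.
volume_difference = mean(volumes[i] for i in high) - mean(volumes[i] for i in low)
print(f"high minus low quartile mean volume: {volume_difference:.2f}")
```

A real analysis would test this difference formally and adjust for covariates such as age, sex, and ancestry components; the sketch only shows how the two groups are formed.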
Non-Technical Research Use Statement:
The development of depression in mid to later life increases a person's risk of developing Alzheimer's disease. Although depression can be treated, at the moment it is not clear if this treatment may also prevent AD later in life. It is therefore important to know how depression itself increases this risk. We aim to find out 1. What changes in the brains of people who are depressed? 2. Could these changes increase dementia risk? Methods 1. We will use existing data for brain tissue from individuals without dementia. We will use this to find out which pathways differ in people with depression. We will study both areas involved in mood and areas affected early in AD. 2. We will examine whether genetic risk of depression is linked to brain shrinkage. 3. We will assess the relationship between depression and loneliness/social isolation and whether social isolation may be a mechanism by which depression increases risk.
Investigator:
Sinclair, Lindsey
Institution:
University of Bristol
Project Title:
Understanding genetic contributions to neuropsychiatric symptoms in dementia
Date of Approval:
April 16, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
As well as progressive cognitive decline, most patients with Alzheimer’s disease and other dementias experience non-cognitive symptoms during disease progression. Twelve non-cognitive symptoms are measured on the ‘Neuropsychiatric Inventory’ (NPI), including apathy, agitation, irritability, depression, anxiety, aberrant motor behaviours, disinhibition, euphoria, sleep disturbances, appetite changes, as well as delusions and hallucinations. Project aims: 1. Derive a statistical model of which NPI items most commonly occur together to identify NPI symptom profiles. 2. Perform a genome-wide association analysis (GWAS) on patients with the identified NPI symptom profiles in AD compared to all other AD patients. 3. Downstream analysis using the GWAS output for the NPI symptom profiles will include: identification of genes involved in specific biological pathways, heritability of the profiles, and the polygenic overlap of these profiles with other neurological disorders and dementias. Study Design: GWAS meta-analysis. We already have genetic and phenotypic data from GERAD, BDR and the HUNT study. Analysis: Principal component analysis, hierarchical clustering, and confirmatory factor analysis will be performed on the 12-item NPI to identify symptom profiles. Genome-wide association analysis (GWAS) between AD cases with the identified NPI symptom profiles vs all other AD cases will be performed. The exact case-control separation is undecided (i.e. dichotomous or continuous for the severity of the symptoms). However, the GWAS will control for confounders including sex, population stratification, and either age or time since disease diagnosis depending on data availability. Meta-analysis will be performed using METAL. Since all the participants in the GWAS are AD cases, we will assess and adjust for collider bias, as appropriate. Downstream GWAS analysis will include: Assessing the enrichment of SNPs in genes or biological pathways using MAGMA. Polygenic risk scores will be used to assess genetic overlap between the identified NPI profiles and other characteristics. Genome-wide Complex Trait Analysis (GCTA) will be used to estimate the heritability of the NPI symptom profiles.
Non-Technical Research Use Statement:
Dementia is characterized by progressive cognitive decline, but non-memory symptoms are also present in most people living with dementia. These can include psychological symptoms such as apathy and psychosis, as well as behavioural changes such as disinhibition. For those living with dementia, these symptoms can cause significant distress and reduced quality of life, as well as a greater need to access healthcare services and an increased chance of requiring residential care. The medications used to treat these symptoms have significant side effects including increased stroke risk. These symptoms have largely been studied individually, which does not reflect the experience of individuals with disease. The aim of this study is to investigate the genetics of Alzheimer’s Disease patients who present with the different subsets of non-cognitive symptoms which most commonly occur together. This approach has the potential to lead to more tailored treatments for patients with dementia.
Investigator:
Singleton, Andrew
Institution:
National Institute on Aging
Project Title:
Genetic Characterization of Movement Disorders and Dementias
Date of Approval:
January 28, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The goal of this project is to utilize standard genetics tools and ensemble/deep learning methods to predict/classify the etiological aspects of Alzheimer's disease and other neurodegenerative diseases based on genetic data and genomic data (including individual level data e.g. genotype and sequencing data, transcriptomic, and epigenomics data, and also by the use of summary statistics). Our primary phenotypes of interest include case:control status, age at onset, survival time (in terms of disease duration from diagnosis to loss to follow-up) and related biomarker data, although there may be other phenotypes of interest that are derived later based on available data.
Non-Technical Research Use Statement:
We are attempting to identify and predict risk of Alzheimer's disease and other neurodegenerative diseases based on genetic and genomic data using standard tools and advanced machine-learning methods.
Investigator:
Sirota, Marina
Institution:
UCSF
Project Title:
Elucidating Sex Differences in Alzheimer's Disease Using Genetics
Date of Approval:
January 30, 2024
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Alzheimer’s disease (AD) is a devastating multifactorial neurodegenerative disorder caused by interactions among multiple genetic and environmental factors. A major challenge of studying AD pathogenesis and developing and testing new drugs is the disease heterogeneity in both clinical phenotype and the underlying pathophysiology. Sex differences both play a significant role in disease risk and are a major source of disease heterogeneity in AD. Although the sex differences in the risk of AD, vulnerability to genetic load and severity of AD pathology burden have been well established, the molecular underpinnings and pathways that are differentially mediated in male and female AD patients are still poorly understood. The goal of this project is to analyze publicly available, large-scale genomic datasets of AD patients and age-matched controls to identify genomic regions that are associated with AD differentially in male and female patients and examine their interactive effects with apoE genotypes.
Non-Technical Research Use Statement:
We would like to leverage GWAS and other sequencing efforts in AD to identify sex specific markers associated with the disease.
Investigator:
Song, Qianqian
Institution:
University of Florida
Project Title:
Unraveling Genetic Variants and Risk Factors in Alzheimer's Disease
Date of Approval:
March 7, 2025
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Objectives of the proposed research: Our research aims to investigate the role of genetic variants in Alzheimer's Disease (AD). We seek to identify and understand genetic factors that may contribute differently to AD risk in males and females. We also aim to identify the genetic variants for a certain cohort of AD patients receiving drug treatment. Study design: We will utilize data from the Alzheimer's Disease Sequencing Project (ADSP) resource to perform rigorous genetic analyses. This includes genome-wide association studies (GWAS) to identify sex-specific and cohort-specific genetic variants associated with AD. We will also employ bioinformatics and functional genomics approaches to annotate and characterize these variants, exploring their potential impact on gene function and regulation. Analysis plan: We will collect the available genetic and phenotype data to perform association analyses (e.g., logistic regression) to identify genetic variants associated with AD within each sex/cohort. Multiple testing correction using appropriate methods (e.g., Bonferroni correction or false discovery rate control) will be performed. We will annotate identified genetic variants with functional information (e.g., functional impact on genes, pathways, regulatory elements). Enrichment analysis will be conducted to understand the biological processes affected by sex-specific or cohort-specific variants. Planned collaboration at other institutions: Name of collaborator: Adam Naj; Institution: University of Pennsylvania. This collaboration involves joint efforts to achieve shared goals, with specific objectives, tasks, and outcomes in mind. Such collaborations may involve the exchange of expertise, resources, or personnel to leverage collective strengths and achieve more impactful results than would be possible independently.
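The two multiple testing corrections named in the analysis plan above can be compared with a short Python sketch. The statement does not say which false discovery rate procedure would be used; the Benjamini-Hochberg step-up procedure is assumed here as one common choice. The p-values are made-up toy values and the function names are hypothetical.

```python
def bonferroni(pvalues, alpha=0.05):
    """Reject a hypothesis when its p-value is at most alpha / (number of tests)."""
    m = len(pvalues)
    return [p <= alpha / m for p in pvalues]

def benjamini_hochberg(pvalues, alpha=0.05):
    """Benjamini-Hochberg step-up procedure controlling the FDR at level alpha."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    # Find the largest rank k whose ordered p-value satisfies p_(k) <= (k / m) * alpha.
    cutoff = 0
    for rank, i in enumerate(order, start=1):
        if pvalues[i] <= rank / m * alpha:
            cutoff = rank
    # Reject every hypothesis ranked at or below that cutoff.
    rejected = [False] * m
    for i in order[:cutoff]:
        rejected[i] = True
    return rejected

# Hypothetical p-values from ten association tests (toy values only).
pvals = [0.001, 0.008, 0.039, 0.041, 0.042, 0.060, 0.074, 0.205, 0.212, 0.216]

print("Bonferroni rejections:", sum(bonferroni(pvals)))
print("Benjamini-Hochberg rejections:", sum(benjamini_hochberg(pvals)))
```

On these toy values Bonferroni rejects fewer tests than Benjamini-Hochberg, which reflects the usual trade-off: stricter control of any false positive versus control of the expected proportion of false positives among the rejections.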
Non-Technical Research Use Statement:
Our study aims to uncover if and how Alzheimer's Disease affects sex-specific and drug-treated sub-cohorts differently. We'll analyze genomics data to find specific genetic factors associated with the disease in each sub-cohort. Then, we'll explore how these factors work and their impact on Alzheimer's development. Our goal is to provide insights that could lead to more personalized approaches for Alzheimer's diagnosis and treatment, ultimately improving our understanding of this disease.
Investigator:
Stefansson, Hreinn
Institution:
deCODE genetics
Project Title:
Meta-analysis of AD
Date of Approval:
May 30, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The data will undergo meta-analysis, combining it with another extensive whole-genome sequenced Alzheimer's project encompassing participants from Iceland, Denmark, Norway, Sweden, and the UK. Our analysis will specifically concentrate on the identification of rare sequence variants with significant impact, utilizing burden analysis and investigating loci associated with risk through a recessive mode of inheritance. By pooling these datasets, we hope to gain further insights into the genetic architecture of Alzheimer's disease, shedding light on rare variants and potential recessive risk factors. Our main focus will be on genetics of Alzheimer's disease (G30* + F00*).
Non-Technical Research Use Statement:
The data will undergo meta-analysis, combining it with another extensive whole-genome sequenced Alzheimer's project encompassing participants from Iceland, Denmark, Norway, Sweden, and the UK. Our analysis will specifically concentrate on the identification of rare sequence variants with significant impact, utilizing burden analysis and investigating loci associated with risk through a recessive mode of inheritance. By pooling these datasets, we hope to gain further insights into the genetic architecture of Alzheimer's disease, shedding light on rare variants and potential recessive risk factors.
Investigator:
Sul, Jae Hoon
Institution:
Regeneron Pharmaceuticals
Project Title:
Impact of common and rare genetic variants in Alzheimer's Disease using whole-genome and whole-exome sequencing data
Date of Approval:
January 11, 2021
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
Alzheimer’s disease (AD) has a strong genetic component, and several studies have identified genetic variants that influence AD. A majority of those variants are common variants that appear frequently in a population, and studies have also found that those variants do not explain all of the genetic basis of AD. This finding has led genetic studies to investigate the effect of rare variants that may have a larger effect than that of common variants. To better understand the effect of rare variants on AD, we aim to analyze the ADSP whole-genome sequencing (WGS) and whole-exome datasets. We will determine whether rare variants in genes appear more frequently among AD patients than controls. Our lab has developed several statistical approaches for rare variant association testing (both case/control and family-based), and we will apply these methods to the ADSP dataset. Through this analysis, we will quantify the effect of rare variants on AD. We will also estimate polygenic risk scores (PRS) of individuals with AD and compare them to those without AD. We will check how much phenotypic variance of AD is explained by PRS.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) has a strong genetic component, and although several studies have found genetic variations associated with AD, they do not explain all of the genetic basis of AD. Those genetic variations are mostly common in the population, and recent studies have shown that rare genetic variations may also influence AD. In this study, we propose to identify rare genetic variations that are associated with AD by applying statistical approaches that combine the effects of multiple rare variants. Our lab has developed several methods to identify the effect of rare variants both among unrelated individuals and among family members. We will apply these methods to the ADSP dataset and find rare variants associated with AD. In addition to rare variants, we will also investigate the effect of common variants using a method called polygenic risk score. The analyses in this study will elucidate the impact of both common and rare variants in AD.
Investigator:
Swardfager, Walter
Institution:
Sunnybrook Research Institute
Project Title:
Cerebral small vessel disease and Alzheimer's disease susceptibility
Date of Approval:
December 3, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Introduction: Individuals with Alzheimer's disease (AD) often also present with cerebral small vessel disease (cSVD). However, molecular pathways underlying the relationship between cSVD and AD phenotypes remain unclear. Objective: We aim to identify genetic variants that contribute to the relationship between cSVD and AD biomarkers. Study Design: We will conduct a novel genome-wide interaction study to identify the contribution of specific genes in the relationship between cSVD and AD biomarkers. We will also apply a candidate gene approach to explore properties of specific identified genes and genes with known biological relevance to cerebrovascular health and AD, in the NACC dataset. The relationship between the genetic variations and relevant biological phenotypes (e.g. specific brain volumetric measures, performance on cognitive assessments, fluid biomarkers, neuropathology, and other co-occurring neurodegenerative diseases) will be explored. Significance: This study will allow us to investigate potential genetic predisposition which may increase the risk of developing AD biomarkers in individuals with extensive cSVD.
Non-Technical Research Use Statement:
Individuals with Alzheimer’s disease (AD) often also have brain small vessel disease (SVD), in which the small blood vessels in the brain are damaged. However, the relationship between AD and SVD is not fully understood. This project aims to identify the genetic contribution to this relationship by looking at the relationships between genetic variations and imaging and pathological markers of AD and SVD, as well as the impact of such genetic variations on gene and protein expression.
Investigator:
Sweet, Robert
Institution:
University of Pittsburgh
Project Title:
Prediction of Psychosis in Alzheimer Disease
Date of Approval:
March 14, 2024
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
• Objectives of the proposed research: To identify genetic correlates of psychotic symptoms, defined as the occurrence of delusions or hallucinations, in individuals with Alzheimer Disease (AD+Psychosis, AD+P) • Study design: Individuals who were analyzed in our genome-wide meta-analysis of psychosis in AD (https://www.medrxiv.org/content/10.1101/2020.08.07.20139261v1 and Mol Psychiatry, in revision) who have available whole exome or whole genome sequence data will be included. We will analyze the association of psychosis with the presence of missense, stop/gain/truncating, and canonical splice site mutations at all loci in which our GWAS identified a suggestive (p < 10⁻⁶) or a significant (p < 5 × 10⁻⁸) association with psychosis. • Analysis plan: Analyses as described above will be conducted in individuals of European ancestry using Fisher’s exact test and the SKAT family of tests as appropriate. Psychosis presence/absence will be as defined in https://www.medrxiv.org/content/10.1101/2020.08.07.20139261v1. Covariates (sex, CDR score, age) will be included as appropriate. • If applicable, a brief description of any planned collaboration with researchers at other institutions, including the name of the collaborator(s) and their institution(s). N/A
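As an illustration of the Fisher's exact test mentioned in the analysis plan above, the following Python sketch computes a two-sided p-value for a 2x2 table of variant carrier status against psychosis status. It is a minimal standard-library version written for this example, not the investigators' pipeline; the counts are made-up toy values and the function name is hypothetical. The SKAT tests and covariate adjustment are not shown.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table with the same
    margins that is no more probable than the observed table.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2
    denominator = comb(total, col1)

    def probability(x):
        # Probability that the top-left cell equals x given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denominator

    observed = probability(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Small relative tolerance so equally likely tables are always included.
    return sum(
        probability(x)
        for x in range(lowest, highest + 1)
        if probability(x) <= observed * (1 + 1e-9)
    )

# Hypothetical counts (toy values only):
#                 AD+P   AD without psychosis
# carriers          3        1
# non-carriers      1        3
p_value = fisher_exact_two_sided(3, 1, 1, 3)
print(f"two-sided p = {p_value:.4f}")
```

With counts this small the test has little power, which is one reason gene-level or region-level tests that aggregate rare variants are often used alongside it.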
Non-Technical Research Use Statement:
Individuals who develop psychotic symptoms such as delusions or hallucinations during Alzheimer disease (AD) have a more rapid deterioration and worse outcomes. We have found that the risk for developing psychosis during AD is influenced by genetic factors. In this proposed research plan, we build on our prior genome-wide association study (GWAS) of psychosis in the context of AD by asking if promising GWAS signal corresponds to association signal from whole-exome sequencing. To do so, we evaluate the correspondence between our GWAS association signals and those from whole-exome sequencing. We will also conduct an exploratory analysis of individuals with AD, with and without psychosis, for sequence variants that predispose to psychotic symptoms in Alzheimer disease.
Investigator:
Swindell, William
Institution:
University of Texas Southwestern Medical Center
Project Title:
Meta-analysis of gene expression differences in post-mortem cortex samples from ALS patients and control subjects
Date of Approval:
March 30, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Gene expression profiling has been used to understand transcriptional alterations that distinguish tissues from ALS patients and normal control individuals [1, 2], as well as to identify molecular subtypes and substages contributing to the heterogeneity observed among ALS patients [3]. This work will use a random-effects inverse-variance meta-analysis model to identify genes with robustly altered expression in post-mortem ALS tissues [2]. The model will generate differential expression summary statistics for all protein-coding genes having detectable expression within the CNS, identifying those transcriptional alterations that are most repeatable across a broad range of expression profiling platforms and technologies (i.e., microarray, bulk RNA-seq, snRNA-seq). These genes will then be further investigated to identify over-represented annotations, based upon multiple databases, such as Gene Ontology, Kyoto Encyclopedia of Genes and Genomes (KEGG) and Reactome pathways. A comparison will be made to genes located near previously identified ALS genetic risk loci, with the goal of identifying functional genomics changes that correspond to genetic alterations predisposing patients to disease. A mixed-effects meta-regression model will be used to identify moderating variables associated with between-study variation in effect size. Data will only be accessed by Dr. William Swindell in accordance with NIAGADS appropriate use policies, without further access by any other investigators. [1] Swindell WR. 2025. Meta-analysis of gene expression in bulk-processed post-mortem spinal cord from ALS patients and normal controls. NeuroSci 6(3): 65. [2] Swindell WR. 2024. Meta-analysis of differential gene expression in lower motor neurons isolated by laser capture microdissection from post-mortem ALS spinal cords. Frontiers in Genetics 15:1385114. [3] Swindell WR, Kruse CPS, List EO, Berryman DE, Kopchick JJ. 2019. ALS blood expression profiling identifies new biomarkers, patient subgroups, and evidence for neutrophilia and hypoxia. Journal of Translational Medicine 17(1):170.
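A random-effects inverse-variance meta-analysis of the kind described above can be sketched in Python for a single gene. The statement does not say how between-study variance is estimated; the DerSimonian-Laird estimator is assumed here as one widely used option. The effect sizes and variances are made-up toy values and the function name is hypothetical.

```python
from math import sqrt

def random_effects_meta(effects, variances):
    """Pool per-study effects with inverse-variance weights.

    Between-study variance (tau^2) uses the DerSimonian-Laird estimator,
    an assumption of this sketch. Returns (pooled effect, standard error, tau^2).
    """
    k = len(effects)
    fixed_weights = [1.0 / v for v in variances]
    weight_sum = sum(fixed_weights)
    fixed_estimate = sum(w * y for w, y in zip(fixed_weights, effects)) / weight_sum

    # Cochran's Q measures heterogeneity around the fixed-effect estimate.
    q = sum(w * (y - fixed_estimate) ** 2 for w, y in zip(fixed_weights, effects))
    scale = weight_sum - sum(w ** 2 for w in fixed_weights) / weight_sum
    tau2 = max(0.0, (q - (k - 1)) / scale) if scale > 0 else 0.0

    # Random-effects weights add tau^2 to each study's sampling variance.
    weights = [1.0 / (v + tau2) for v in variances]
    pooled = sum(w * y for w, y in zip(weights, effects)) / sum(weights)
    standard_error = sqrt(1.0 / sum(weights))
    return pooled, standard_error, tau2

# Hypothetical log fold-changes for one gene in three studies (toy values only),
# with their sampling variances.
log_fold_changes = [0.80, 0.35, 0.60]
sampling_variances = [0.04, 0.09, 0.02]

pooled, standard_error, tau2 = random_effects_meta(log_fold_changes, sampling_variances)
print(f"pooled effect {pooled:.3f}, SE {standard_error:.3f}, tau^2 {tau2:.3f}")
```

When the studies agree closely, tau^2 is estimated as zero and the result matches a fixed-effect analysis; larger disagreement widens the standard error and evens out the study weights.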
Non-Technical Research Use Statement:
ALS is a debilitating neurodegenerative disease with genetic and sporadic forms. Prognosis is poor, although there is substantial heterogeneity among patients and new treatments have been developed and approved for those with certain types of familial disease (e.g., SOD1 mutations). This study will use gene expression profiling to identify mRNA transcripts having expression that is consistently altered in post-mortem cortex samples from ALS patients. The approach will involve large-scale integration between multiple datasets that have been generated over decades and deposited within the public domain. The goal of this work is to highlight highly reproducible findings in post-mortem tissue samples from ALS patients, which can help focus investigational effort on genes and pathways most robustly altered in end-stage disease.
Investigator:
Tanzi, Rudolph
Institution:
Massachusetts General Hospital
Project Title:
ADSP extension
Date of Approval:
March 27, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Late-onset AD (LOAD) is caused by a complex polygenic and environmental background. Whole genome sequencing provides comprehensive coverage of the genome and has several advantages over exome sequencing and genotyping. We plan to use an aggregated collection of whole genome sequenced family-based and case-control datasets to address the following goals. 1) Identify variants (specifically rare) and regions associated with AD (and related or derived phenotypes) or showing an interaction pattern; 2) Functionally fine-map associated loci and identify the functional impact of associated variants in non-coding regions; 3) Validate identified variants in a 3D neural-glial culture model. We will utilize several datasets with whole genome sequencing data, including AD datasets from the National Institute of Mental Health (NIMH) AD family sample and the Alzheimer’s Disease Sequencing Project. We will use family-based association tests robust to population confounding and other approaches suitable for case-control studies. Novel analysis approaches will be developed and tested. Analysis results and derived data will be made available to the research community.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is the most common neurodegenerative disorder with a huge burden on the healthcare system and the 6th leading cause of death in the United States. Sequenced DNA from people will help us to better describe the genetic architecture of AD. We will utilize two types of genomic datasets: genomes from related individuals (family-based) and genomes from unrelated individuals (case/control). Identified functional variants will be validated in a 3D neural-glial culture model and enhance the biological understanding of AD.
Investigator:
Tanzi, Rudolph
Institution:
Massachusetts General Hospital
Project Title:
NIAGADS Metabolomics
Date of Approval:
March 31, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We propose to analyze and replicate MGWAS (Metabolome-Wide Association Studies) findings using blood samples from several population-based cohorts. This dataset will serve to replicate metabolite-SNP association signals from blood from several population-based cohorts. Additionally, we plan to conduct Mendelian Randomization (MR) and mediation analyses for pleiotropic SNPs (single nucleotide polymorphisms) identified in our study. Finally, we will utilize this dataset to test and fine-tune a multiomics machine learning prediction model for Alzheimer's Disease (AD). Specifically, this metabolomics dataset, obtained from both brain and cerebrospinal fluid (CSF), will be instrumental in validating our findings from blood metabolomic datasets. This validation step is crucial for establishing a systematic understanding of the shared genetic basis of metabolites and AD, potentially leading to early prediction and intervention strategies for AD. By integrating data from multiple biological sources, we aim to enhance the robustness and reliability of our findings. This dataset will serve as an additional validation resource, providing a solid foundation for advancing our knowledge of AD's genetic underpinnings and developing predictive models.
Non-Technical Research Use Statement:
Metabolites (i.e. quantified small molecules) can reflect what has been encoded by the genome and modified by environmental factors at a specific point in time. Such biomarkers could be particularly useful for diagnosis improvement, potential therapeutic targets, as well as additional insights into disease pathogenesis. We hypothesize that many metabolites are strong intermediate variables (mediators) of genetic effects and/or environmental effects on AD. Specifically, several metabolite quantitative trait loci (metQTLs) affect biomarkers across the human life course starting from childhood and indirectly leading to AD, while others are influenced by lifestyle and environment later in life. To understand AD mechanisms, we will systematically study SNP-metabolite associations in blood and integrate them with AD susceptibility loci. We will use the NG00131 cohort to validate our findings in metabolites from brain and CSF, as the tissues of interest for AD.
Investigator:
TCW, Julia
Institution:
Boston University School of Medicine
Project Title:
ADSP xQTL
Date of Approval:
May 16, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Population-based genetic association studies have identified over 30 risk loci for Alzheimer’s disease (AD). However, such approaches do not directly reveal the true causal variants or unfold the functional mechanisms of the risk variants. To bridge these gaps, we need to investigate the functional impacts of genetic variants on molecular traits (e.g., mRNA, proteins, and epigenetic modifications) in disease-relevant tissues (e.g., brain) and cell types. We hypothesize that many of the SNPs influence multiple clinical and molecular features. By integrating the genetic associations with functional quantitative trait loci (QTLs), we aim to investigate the potential cascading causal effect of genetic variations in multiple layers of omics data in AD. In this proposal, we plan to perform multi-omic quantitative trait locus (xQTL) analyses of RNA-seq, proteomics, and DNA methylation data from the large number of postmortem brain tissues available from the AMP-AD project. This study will be part of the NIH/NIA Alzheimer's Disease Sequencing Project (ADSP) Functional Genomics xQTL Consortium, a joint effort to generate a reference map of Alzheimer's-related quantitative trait loci (QTLs). Specifically, our team will be responsible for the xQTL analysis in the Mount Sinai Brain Bank (MSBB) cohort, which contains whole-genome sequencing (WGS), RNA-seq gene expression, proteomics, and DNA methylation data from over 300 AD and control brains. We will follow the unified xQTL calling pipelines developed by the xQTL Consortium to predict QTLs and conduct the subsequent fine mapping, causal inference, and functional annotation integration. We will validate the identified xQTLs in independent samples from the ROSMAP cohort. Through the AMP-AD portal, we already have access to the MSBB and ROSMAP sample metadata, RNA-seq raw read files, proteomics and DNA methylation data.
However, the WGS genotype data in the AMP-AD portal were called against the hg19 human genome build, which is inconsistent with the hg38-based genotype data used by other studies in the xQTL Consortium. Thus, we are applying for access to the MSBB and ROSMAP hg38-based WGS genotype data available in NIAGADS.
Non-Technical Research Use Statement:
Numerous genetic loci have been associated with the onset and progression of Alzheimer's disease (AD). However, the widely used genome-wide association study (GWAS)-based approaches do not directly reveal the true causal variants or unfold the functional mechanisms of the risk variants. To bridge these gaps, we plan to study the functional impacts of genetic variants on molecular traits (e.g., mRNA, proteins, and epigenetic modifications) in postmortem brain tissues of large AD cohorts, including Mount Sinai Brain Bank and Religious Orders Study/Memory and Aging Project (ROSMAP). By integrating the genetic association signals with the functional quantitative trait loci, we aim to investigate the potential cascading causal effect of genetic variations in multiple layers of omics data in AD.
Investigator:
Thompson, Paul
Institution:
University of Southern California (USC)
Project Title:
AI4AD
Date of Approval:
October 8, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to genomic, imaging and cognitive data, in order to 1) identify AD genotypes and endophenotypes that dissect AD’s heterogeneity; 2) relate said genotypes and endophenotypes with clinical progression in pre-dementia patients; 3) identify novel treatment targets for AD by analyzing whole genome and associated phenotypic data. The goals of this multisite initiative (Paul Thompson, USC; Christos Davatzikos, Li Shen, Penn; Andy Saykin, IU; Heng Huang, Pitt; Paul Crane, UW; Adam Brickman, Columbia; Tim Hohman, Vanderbilt; Gyungah Jun, BU; Duygu Tosun, UCSF; Alexander Zaranek, Curii) leverage the promise of machine learning (ML) to contribute to precision diagnostics, prognostication, and targeted and novel treatments. We will develop ML and deep learning methods to apply to large-scale biobanks of whole genome sequences (WGS), neuroimaging, cognitive, and clinical data, aiming to discover new genomic features that influence biological processes of AD. We will apply methods of genome representation and tiling to WGS repositories to create inputs for AI methods. We will develop novel, interpretable, biological knowledge guided deep learning methods to discover genomic motifs associated with AD, AD risk, and biological processes of AD as defined by NIA-AA criteria. To quantify subtypes and disentangle biological processes of AD, we will apply computational methods to multimodal MRI and amyloid- and tau-sensitive PET to stratify and subtype patient groups; novel imaging genomics methods will detect genomic markers and pathways that modulate the developing pathology as detected in the images, and that predict future clinical decline or resilience.
We hypothesize that advanced deep learning methods combined with whole genome data will outperform traditional methods and GWAS for predicting AD onset and progression, and will assist with disease subtyping and discovering treatable targets in the genome. A team will rank and repurpose existing drugs, and identify novel drugs and targets in the genome, based on the discovered genetic motifs affecting AD.
Non-Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to extensive and rich genomic, imaging and cognitive data, in order to 1) identify genotypes and endophenotypes of AD that dissect the heterogeneity of the disease; 2) relate these genotypes and endophenotypes with clinical progression, in pre-dementia patients; 3) identify novel treatment targets for AD, by analyzing whole genome and associated phenotypic data at a previously impossible scale. Collectively, the goals of this highly collaborative multi-site initiative leverage the promise of machine learning to contribute to precision diagnostics, prognostication, and targeted and novel treatments.
Investigator:
Thompson, Paul
Institution:
University of Southern California (USC)
Project Title:
Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
November 22, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD, with the ultimate goal of generating harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Thyagarajan, Bharat
Institution:
University of Minnesota
Project Title:
Omics-based Machine Learning Model to Predict AD dementia
Date of Approval:
January 22, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Alzheimer’s disease (AD) dementia is a heterogeneous neurodegenerative disease among older adults. Early detection of AD dementia remains challenging due to heterogeneity in disease onset and progression. Our goal is to develop a genetic variant-based variational autoencoder (VAE) model to predict AD dementia. We will use GWAS data collected in the Health and Retirement Study (HRS). HRS has genotyped saliva DNA samples collected since 2006 at multiple time points during field visits, yielding a total of 19,004 unique participants. We will use the most recent version of genotype data from 2006 to 2015 for those samples that have both epigenetic and transcriptomic data available. The genotyping was performed by the NIH Center for Inherited Disease Research, using the Illumina HumanOmni2.5-4v1/8v1 array, and genotyping QC analysis was performed at the University of Michigan using HumanOmni2.5-4v1 H for SNP annotation. We will use the quality and minor allele frequency (MAF) filters specified in the HRS QC report for genotypic data to filter out poor-quality SNPs. We will use cognition measures collected in the HRS 2016 survey to classify participants into 'Dementia' and 'Normal' using the Langa-Weir classification algorithm. We will employ two main feature selection processes: 1. Based on the association with dementia, the top 50% of associated SNPs will be selected as input to the VAE model, and low-frequency SNPs will be filtered out. 2. We will also train a VAE model with a more comprehensive list of SNPs. We will regularize the model by incorporating biological knowledge as constraints, using the gene-gene interaction network from REACTOME/STRING. We will also evaluate the biological interpretability of latent features that are representative of input genetic variants. We will evaluate the distribution of weights of all encoded features to select the strongly positive and strongly negative features, defined as those more than 2 SD above or below the mean weight.
These selected features will be used as input for pathway analysis to identify pathways associated with AD. The candidate genes identified can be used to develop blood-based biomarkers for early identification of AD.
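The 2 SD selection rule described above can be illustrated with a minimal sketch. It assumes the decoder weights have already been reduced to one number per latent feature; the function name, the feature names, and the example values are hypothetical and are not taken from the investigators' pipeline.

```python
from statistics import mean, stdev


def select_extreme_features(weights, n_sd=2.0):
    """Return (positive, negative) feature names whose weight lies more
    than n_sd sample standard deviations above or below the mean weight."""
    values = list(weights.values())
    mu = mean(values)
    sd = stdev(values)  # sample standard deviation (n - 1 denominator)
    upper = mu + n_sd * sd
    lower = mu - n_sd * sd
    positive = sorted(name for name, w in weights.items() if w > upper)
    negative = sorted(name for name, w in weights.items() if w < lower)
    return positive, negative


# Hypothetical weights: 20 uninformative features plus two extreme ones.
example_weights = {f"latent_{i}": 0.0 for i in range(20)}
example_weights["latent_hi"] = 5.0
example_weights["latent_lo"] = -5.0

positive, negative = select_extreme_features(example_weights)
print(positive, negative)
```

Whether the threshold should use the sample or population standard deviation, and whether it is applied per latent feature or across all weights, is not stated in the request; the sketch picks one option for illustration only.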
Non-Technical Research Use Statement:
We will develop a genetic variant-based variational autoencoder (VAE) model to predict dementia. We will employ various feature selection processes based on the complexity of data to evaluate the VAE model performance and to identify a representative list of genes. In addition, we will evaluate the biological interpretability of latent features obtained from the VAE encoder layer by extracting their decoder weights, which capture the contribution of each input feature to the learned latent feature. This will also allow us to evaluate if the VAE model has learned novel features known to be associated with AD dementia. We will then utilize the learned features to identify the biological pathways associated with AD dementia.
Investigator:
Torkamani, Ali
Institution:
The Scripps Research Institute
Project Title:
Genetic Dissection of Cognitive Resilience
Date of Approval:
September 3, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Our goal is an improved understanding of the genetic contributors to cognitive resilience in neurodegenerative disease (ND) and normal aging, and the development of machine-learning (ML) models for the prediction of expected cognition and its decline. Our specific aims are: I: Identify novel genetic factors for cognitive resilience through the development of genetically informed ML models of cognition. We will define a novel measure of residual cognition using ML to identify novel genetic associations for cognitive resilience. To achieve this aim, we will: (1) Characterize and compare the degree of association between polygenic risk scores (PRS) and measured risk and resilience factors with risk of diagnosis, cognition, and cognitive decline across NDs. (2) Develop integrated, multi-PRS predictive models of cognition in normal aging, which will be further refined to account for neuropathology in AD, PD, and ADRDs. Comparison of these models will highlight differences in factors driving cognition across ND conditions. (3) Conduct genome-wide and rare variant association studies of cognitive resilience, as quantified by a novel continuous residual cognition measure defined in (2). II: Identify and characterize genetic signatures of cognitive resilience in healthy aging. Similarly, we will characterize if and how ND, cognitive reserve, and brain structure PRS are associated with cognition in healthy aging, in cohorts including ADSP and Wellderly. To identify novel genetic contributors, we will identify individuals with outlying genetic signatures of risk of cognitive decline, and perform association studies and analyses as in I.3. III: Genetically informed dissection of the causal contributions of cognitive / brain reserve in cognitive resilience. We will dissect the causal relationship between cognitive resilience, educational attainment, and neuroimaging features.
Using polygenic Mendelian randomization analysis, with educational attainment and brain structure PRS as instrumental variables and measured educational attainment and brain imaging as exposures, we can dissect the relationship between cognitive reserve, brain reserve, and cognitive decline in ND.
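For readers unfamiliar with instrumental-variable estimation, the simplest single-instrument estimator, the Wald ratio, can be sketched as follows. This is an illustrative assumption and not necessarily the estimator the investigators will use: the function name and the example effect sizes are hypothetical, and the standard error is a first-order approximation that ignores uncertainty in the instrument-exposure effect.

```python
def wald_ratio(beta_instrument_outcome, se_instrument_outcome,
               beta_instrument_exposure):
    """Estimate the causal effect of an exposure on an outcome from a
    single instrument (for example, a PRS).

    estimate = (instrument -> outcome effect) / (instrument -> exposure effect)
    """
    if beta_instrument_exposure == 0:
        raise ValueError("instrument has no effect on the exposure")
    estimate = beta_instrument_outcome / beta_instrument_exposure
    # First-order standard error; treats the denominator as known.
    std_error = se_instrument_outcome / abs(beta_instrument_exposure)
    return estimate, std_error


# Hypothetical numbers: the PRS raises the exposure by 0.3 units and
# lowers the outcome by 0.06 units (SE 0.02) per PRS standard deviation.
estimate, std_error = wald_ratio(-0.06, 0.02, 0.3)
print(round(estimate, 3), round(std_error, 4))
```

The ratio is only interpretable as a causal effect when the instrument affects the outcome exclusively through the exposure, which is the core assumption of Mendelian randomization.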
Non-Technical Research Use Statement:
Cognitive decline occurs during normal brain aging and during neurodegeneration. Understanding shared genetic factors that protect from cognitive decline in aging and neurodegeneration may help inform strategies to stop its decline. The focus of this proposal is to identify those genetic factors that protect against cognitive decline. In particular, we aim to identify novel genetic factors involved in cognitive resilience through associative, predictive, and causal analysis strategies. The overarching goal is an improved understanding of the genetic contributors to cognitive resilience in neurodegenerative disease and normal aging, and the development of machine-learning (ML) models for the prediction of expected cognition and its decline, which will have downstream utility in clinical trial design, clinical decision-making, and have the potential to reveal determinants of therapeutic response in neurodegenerative diseases.
Investigator:
Tosun, Duygu
Institution:
San Francisco VA Health Care System
Project Title:
AI4AD (Artificial Intelligence for Alzheimer’s Disease): Ultrascale Machine Learning to Empower Discovery in Alzheimer’s Disease Biobanks
Date of Approval:
August 11, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to genomic, imaging and cognitive data, in order to 1) identify AD genotypes and endophenotypes that dissect AD’s heterogeneity; 2) relate said genotypes and endophenotypes with clinical progression in pre-dementia patients; 3) identify novel treatment targets for AD by analyzing whole genome and associated phenotypic data. The goals of this multisite initiative (Paul Thompson, USC; Christos Davatzikos, Li Shen, Penn; Andy Saykin, IU; Heng Huang, Pitt; Paul Crane, UW; Adam Brickman, Columbia; Tim Hohman, Vanderbilt; Gyungah Jun, BU; Duygu Tosun, UCSF; Alexander Zaranek, Curii) leverage the promise of machine learning (ML) to contribute to precision diagnostics, prognostication, and targeted and novel treatments. We will develop ML and deep learning methods to apply to large-scale biobanks of whole genome sequences (WGS), neuroimaging, cognitive, and clinical data, aiming to discover new genomic features that influence biological processes of AD. We will apply methods of genome representation and ‘tiling’ to WGS repositories to create inputs for AI methods. We will develop novel, interpretable, biological knowledge guided deep learning methods to discover genomic motifs associated with AD, AD risk, and biological processes of AD as defined by NIA-AA criteria. To quantify subtypes and disentangle biological processes of AD, we will apply computational methods to multimodal MRI and amyloid- and tau-sensitive PET to stratify and subtype patient groups; novel imaging genomics methods will detect genomic markers and pathways that modulate the developing pathology as detected in the images, and that predict future clinical decline or resilience.
We hypothesize that advanced deep learning methods combined with whole genome data will outperform traditional methods and GWAS for predicting AD onset and progression, and will assist with disease subtyping and discovering treatable targets in the genome. A team will rank and repurpose existing drugs, and identify novel drugs and targets in the genome, based on the discovered genetic motifs affecting AD.
Non-Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to extensive and rich genomic, imaging and cognitive data, in order to 1) identify genotypes and endophenotypes of AD that dissect the heterogeneity of the disease; 2) relate these genotypes and endophenotypes with clinical progression, in pre-dementia patients; 3) identify novel treatment targets for AD, by analyzing whole genome and associated phenotypic data at a previously impossible scale. Collectively, the goals of this highly collaborative multi-site initiative leverage the promise of machine learning to contribute to precision diagnostics, prognostication, and targeted and novel treatments.
Investigator:
Tosun, Duygu
Institution:
San Francisco VA Health Care System
Project Title:
Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC)
Date of Approval:
February 6, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The ADSP-PHC was established to harmonize the rich endophenotype data across cohort studies to enable modern genomic analyses of ADRD, with the ultimate goal of generating harmonized data that will become a “legacy” dataset perpetually curated and shared through NIAGADS. The ADSP-PHC will facilitate phenotypic data harmonization for ADSP participants with genetic and genomic data. This effort represents a multi-disciplinary approach leveraging interdisciplinary expertise in large-scale genetic and genomic studies, clinical neuroscience, neuroimaging, psychometrics, and bioinformatics. This study will utilize a team structure consisting of two coordinating centers to oversee activities of all harmonization teams and to oversee coordination with other ADSP workgroups and initiatives, a Storage and Informatics Core that will oversee the coordination between LONI and NIAGADS for all data storage, compliance, and dissemination, a CHARGE Coordination Core to oversee the alignment of data and protocols with CHARGE workgroups, an Integration & Analytics Core that will enable data integration across phenotypes to facilitate downstream machine learning applications, and eight Domain-Specific Harmonization Teams tasked with harmonization in their area of expertise. The endophenotypes that will be harmonized by this project include Cognition, Fluid Biomarkers, Amyloid PET, Structural MRI, White Matter Hyperintensities, White Matter Integrity, Vascular Risk Factors, and Neuropathology.
Site PIs on this project include: Jesse Mez (Boston University), Adam Brickman (Columbia University), Andy Saykin (Indiana University), Elizabeth Mormino (Stanford University), Pauline Maillard (UC Davis), Duygu Tosun-Turgut (UC San Francisco), Christos Davatzikos (University of Pennsylvania), Arthur Toga (USC), Mohamad Habes (University of Texas Health Science Center at San Antonio), Michael Cuccaro (University of Miami), Paul Crane (University of Washington), Bennett Landman (Vanderbilt University), Timothy Hohman (Vanderbilt University Medical Center), and Carlos Cruchaga (Washington University in St. Louis).
Non-Technical Research Use Statement:
The growing availability of endophenotypic data in cohort studies of Alzheimer’s disease and related dementias (ADRD) provides an exciting opportunity to further characterize the genetic architecture of this devastating disease. However, there is a pressing need to develop and apply advanced harmonization approaches to align ADRD endophenotypes across cohorts. The goal of this coordinated national initiative – the AD Sequencing Project Phenotype Harmonization Consortium, or “ADSP-PHC” – is to provide a centralized database of robust endophenotypes for large-scale genomic analyses that will accelerate the identification of novel targets for therapeutic intervention in ADRD.
Investigator:
Tucker-Drob, Elliot
Institution:
University of Texas at Austin
Project Title:
Genetics of Multisystem Aging
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The present study aims to address how gene-environment interactions across lifestyle, socioeconomic and health-related factors predict levels and rates of decline across diverse domains of health, including multiple measures of cognitive function, functional ability, chronic diseases, and wellbeing. First, we will assess and characterize individual patterns of change across health domains. Second, we will search for common and specific sources of senescent change over time across health domains. Third, we will assess how genetic liabilities for medical traits and environmental exposures determine broad and specific sources of variation in intra- and inter-individual change across health domains. We will use phenotypic data from all the waves of the Health and Retirement Study (HRS) (1992–2024) covering the following domains: 1) demographic and socioeconomic variables, 2) lifestyle behaviors, 3) cognitive function, 4) functional ability, 5) chronic diseases, and 6) wellbeing. We will use genetic data from the HRS to compute polygenic risk scores (PRS) across socioeconomic and health domains, using Genomic Structural Equation Modelling (Genomic SEM) to integrate the most recent and largest GWAS summary data available from published meta-analyses and biobanks. To analyze the data, we will first fit univariate latent growth curve models (LGCMs) for each individual indicator embedded in health domains 3 to 5. We will specify up to three latent factors: a) an intercept factor, representing levels, b) a linear change factor, and c) a quadratic change factor. Secondly, we will fit a series of multivariate LGCMs. We will start by fitting an unconstrained associative LGCM to estimate correlations among the factors of levels and slopes across health indicators, identifying clusters of health indicators with shared underlying change processes.
Thirdly, we will expand the multivariate LGCMs to include additive and non-additive exogenous covariates, covering genetic and environmental exposures and their interaction.
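The three latent factors named above (intercept, linear change, quadratic change) correspond to a standard quadratic growth curve specification. The notation below is an assumption chosen for illustration and does not come from the request: $y_{ti}$ is the indicator for person $i$ at wave $t$, $\lambda_t$ is the time score for wave $t$, and $\eta_{0i}$, $\eta_{1i}$, $\eta_{2i}$ are the person-specific intercept, linear, and quadratic factors.

```latex
\begin{aligned}
y_{ti} &= \eta_{0i} + \eta_{1i}\,\lambda_t + \eta_{2i}\,\lambda_t^{2} + \varepsilon_{ti},\\
\eta_{ki} &= \alpha_k + \zeta_{ki}, \qquad k = 0, 1, 2,
\end{aligned}
```

Here $\alpha_k$ is the mean of factor $k$ and $\zeta_{ki}$ is the individual deviation from that mean. In the associative multivariate model, the correlations of interest are those among the $\zeta_{ki}$ terms across different health indicators.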
Non-Technical Research Use Statement:
The present study aims to uncover how health-related genetic factors and environmental exposures interact, shaping heterogeneous trajectories of health in old age. We investigate pathways of aging across diverse domains of health, including multiple measures of cognitive function, functional ability, chronic diseases, and well-being, identifying common and specific sources of senescent change across health-related outcomes. A better understanding of genetic and environmental determinants of health trajectories is key to promoting health and well-being in old age.
Investigator:
Tzeng, Jung-Ying
Institution:
Department of Statistics and Bioinformatics Research Center, North Carolina State University
Project Title:
Genetic Association Study of Alzheimer’s Disease with Whole-Genome and Whole-Exome Sequence Data
Date of Approval:
July 27, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objective of the proposed research: Recent studies have found that single nucleotide polymorphisms (SNPs) and copy number variations (CNVs) can both play significant roles in the missing heritability of Alzheimer’s Disease (AD). In this project, we propose to conduct a comprehensive investigation of both variant types and understand their contributions to AD risk. Study design: We will use the whole-genome (WGS) and whole-exome (WES) sequence data in the Alzheimer's Disease Sequencing Project (ADSP) and conduct case-control association analyses of SNPs and CNVs. Analysis plan: Using the ADSP sequence data, we will start with CNV detection and characterization of CNV sequence features (e.g., microhomology, non-template insertions, and segmental duplications) to understand potential mechanisms of CNV formation. Next, we will study the association of AD status with SNPs and CNVs (common and rare) using standard association methods and adjusting for population structure (PS) and ages of onset. We will also develop and apply new methods using kernel and regularized regressions to ensure efficient modeling of joint SNP-CNV effects from common and rare variants. We will perform ethnic-specific and ethnic-combined association analyses. We will use principal-component-based methods to adjust for PS, but also explore the efficacy of other PS adjustment methods. Finally, we will conduct biological annotation of identified risk variants. Collaborators: The team includes researchers from University of Pennsylvania (UPenn) and North Carolina State University (NCSU). UPenn researchers include Gerard Schellenberg (PI: Professor of Pathology and Laboratory Medicine), Li-San Wang (PI: Professor of Pathology and Laboratory Medicine), Wan-Ping Lee (PI: Research Assistant Professor of Pathology and Laboratory Medicine), Adam Naj (Assistant Professor of Biostatistics and Epidemiology) and Yuk Yee Leung (Research Assistant Professor of Pathology and Laboratory Medicine).
Non-Technical Research Use Statement:
Copy number variants (CNVs) are DNA regions that have gains (duplications) or losses (deletions). CNVs affect a considerable number of base pairs in the human genome. Unlike single-nucleotide polymorphisms (SNPs), which have been broadly studied in diseases, CNVs have not been intensively studied. The large-scale Alzheimer’s Disease Sequencing Project (ADSP) provides a systematic way to capture nearly all genomic variations and to study the genetic basis of Alzheimer’s Disease (AD). In this project, using the data of affected and unaffected samples from ADSP, we propose to conduct a comprehensive investigation of both variant types (SNPs and CNVs) and study their contributions to AD risk and etiology. We will start with CNV genotyping, followed by conducting standard association analysis of AD with SNPs and CNVs. We will also develop and apply new analytical methods for efficient modeling of joint SNP-CNV effects from common and rare variants. Finally, we will conduct functional annotation of identified risk variants to uncover possible biological mechanisms.
Investigator:
Valdmanis, Paul
Institution:
University of Washington
Project Title:
Quantification of Noncoding Variant Burden in DNA De-Identified Samples and Data from Patients with Alzheimer's Disease Versus Controls.
Date of Approval:
November 7, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Alternative splicing patterns are notoriously complex and diverse in the brain. Moreover, appropriate regulation of brain-specific splicing is lost during normal aging, which can be exacerbated by trauma or oxidative stress. We have evaluated RNA sequencing data from brain samples from patients with Alzheimer’s disease (AD) and age-matched controls. Our analysis has revealed several alternative splice products and intronic repeat sequences that are enriched in patients with AD in genes implicated in disease (APP, PSEN1 and PSEN2). Our primary objective is to integrate our RNA sequencing results with whole genome sequence data to identify intronic variants that, in combination with alternative splicing products, may predispose to AD. We will study AD cases and controls and perform a rare variant burden analysis across defined genomic regions. Our analysis plan is as follows: we will extract from CRAM files the genomic regions corresponding to intronic regions that can influence alternative splicing. We will quantify the presence of small insertions, deletions and variants in cases and controls and use splicing prediction software to identify the potential contribution to disease. We will determine the burden of these nucleotide changes in cases, controls and large sequencing databases (e.g. gnomAD) and perform t-tests and Chi-square tests to quantify whether an enrichment of intronic variants is present in patients with AD.
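As a minimal sketch of the Chi-square comparison mentioned above, the following computes a Pearson chi-square statistic for a 2x2 table of variant carriers versus non-carriers in cases and controls. The function name and the counts are hypothetical, no continuity correction is applied, and the p-value uses the closed form for one degree of freedom; the investigators' actual burden test may differ.

```python
import math


def chi_square_2x2(case_carriers, case_noncarriers,
                   control_carriers, control_noncarriers):
    """Pearson chi-square test (1 degree of freedom, no continuity
    correction) for carrier status versus case/control status."""
    observed = [[case_carriers, case_noncarriers],
                [control_carriers, control_noncarriers]]
    row_totals = [sum(row) for row in observed]
    col_totals = [observed[0][j] + observed[1][j] for j in range(2)]
    total = sum(row_totals)
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / total
            if expected == 0:
                raise ValueError("expected count of zero; test undefined")
            chi2 += (observed[i][j] - expected) ** 2 / expected
    # Upper-tail probability of a chi-square variable with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical counts: 30 of 100 cases and 15 of 100 controls carry
# at least one intronic variant in the region of interest.
chi2, p_value = chi_square_2x2(30, 70, 15, 85)
print(round(chi2, 3), round(p_value, 4))
```

With rare variants the expected cell counts are often small, in which case an exact test is usually preferred over the chi-square approximation.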
Non-Technical Research Use Statement:
The genetic information that can be ascertained from large scale sequencing projects can enable novel discoveries for genes that can contribute to disease. The primary objective of many of these projects is to detect nucleotide changes that alter the protein encoded by the host gene. However, the same sequencing information can be used to identify non-coding elements of the genome that can contribute to disease, including variants that can influence splicing – the appropriate assembly of exons that are spliced together to form a gene. As we age, certain elements that preserve tight regulation of exon choice lose effectiveness, particularly when faced with injury or various stressors. We wish to detect variants that influence alternative splicing products in the context of relaxed regulation of exon choice and integrate findings with RNA sequencing databases to identify methods to preserve appropriate splicing.
Investigator:
Vanner, Robert
Institution:
University Health Network
Project Title:
CHIP in the aging brain
Date of Approval:
September 5, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The proposed project seeks to uncover the effects of clonal hematopoiesis (CH) and inflammation on brain aging in health and disease. CH is a common, age-related phenomenon wherein somatic mutations in hematopoietic stem cells undergo clonal expansion and drive systemic inflammation, potentially influencing the onset and progression of neurological disease. The goal of this study is to evaluate the effects of clonal hematopoiesis (CH) on brain aging across healthy and diseased states, with an emphasis on understanding how CH and aging propagate inflammation in the brain to modulate disease progression. Using genomic sequencing data from peripheral blood and brain tissue samples from The Religious Orders Study and Memory and Aging Project (ROSMAP) Study, we will apply validated pipelines developed by our group to identify CH (PMID: 36652671) in the cohort. We will then correlate CH status with inflammatory marker expression based on differential expression and pathway analysis using RNA-seq and proteomics data. Additionally, we will leverage neuroimaging data and clinical phenotypes including disease onset and severity to determine the effect of CH on inflammation-mediated pathologies of the brain. This project will be conducted in collaboration with Dr. Michael Rauh at Queen’s University / Dr. Robert Vanner at University Health Network. Drs. Rauh and Vanner jointly supervise PhD student Marco Buttigieg who will be carrying out the research. Our combined expertise in clonal hematopoiesis, aging, and inflammation leaves us well positioned to further our understanding of brain health with aging.
Non-Technical Research Use Statement:
As we age, a common condition called clonal hematopoiesis (CH) can develop when there are mutations in the stem cells that produce our immune (white blood) cells. These mutations are passed on to new immune cells, causing them to produce too much inflammation. This inflammation raises the risk of chronic diseases like heart disease and cancer. Some studies have shown that CH and its related inflammation can have both harmful and protective effects on brain aging and neurodegenerative diseases. Our research is studying the role of CH and inflammation in brain health and aging. We will use genetic data from blood and brain samples to determine whether or not each study participant has CH. We will then measure how inflammation and markers of brain aging differ between people with and without CH.
Investigator:
Vassar, Robert
Institution:
Northwestern University
Project Title:
Genome-wide analysis and functional assessment of rare variants in Alzheimer's disease
Date of Approval:
March 6, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The first objective is to perform an exome-wide burden analysis of variants in 10,088 ADSP cases and controls. We will count all alleles per individual across frequencies (AF < 5%, 1%, 0.1%, singleton, ultra-rare variants), functional annotations (protein-altering variants, nonsynonymous, loss-of-function, synonymous and noncoding) and damaging predictions (CADD score > 12.37 considered damaging). We will also stratify cases according to age of onset (early onset, late onset). Then we will run a logistic regression modeling the number of alleles per individual against disease status, including correction for relevant covariates, such as age, sex, population structure (PCA) and sequencing coverage if applicable. We will correct associations for multiple testing using the Bonferroni method to detect significant results. The second objective is to perform a gene-set burden analysis of the most enriched variants from the previous goal, using gene lists of highly constrained genes according to gnomAD (pLI > 0.9), the Molecular Signatures Database Hallmark and C2 curated gene sets, and highly expressed genes from 54 specific GTEx tissues, in order to identify enriched molecular pathways, biological processes and tissue-specific expression patterns. Here we will use the SKAT-O software to perform the variant enrichment on each gene set with the same covariates used in the first objective. We will use 10,000 permutations and a family-wise error rate (FWER < 0.05) as correction for multiple testing to select the most enriched gene sets and tissues. The third objective will be to run a gene-wise burden test and perform a protein-protein interaction network analysis along with enrichment analysis in brain single-cell expression data, in order to prioritize significant candidate genes. Here we will map variants to single genes and use SKAT-O in a similar way to the previous objective.
Then we will take all genes with uncorrected P < 0.05 and run a protein-protein interaction network analysis with WebGestaltR, using the network-topology analysis and random walk algorithm, and Gene Ontology enrichment of the resulting network using BIOGRID. We will use STRING in order to get network interaction significance.
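The per-individual allele counting described in the first objective could be sketched roughly as follows. This is only an illustration under assumed inputs: the record layout (`af`, `cadd`, `genotypes`), the function name and the sample values are hypothetical, and the CADD cutoff of 12.37 is the one quoted in the statement.

```python
from collections import Counter


def burden_per_individual(variants, max_af=0.01, min_cadd=12.37):
    """Sum alternate-allele counts per individual over variants passing filters.

    Each variant is a dict with (assumed) keys:
      "af"        - allele frequency
      "cadd"      - CADD score
      "genotypes" - mapping of sample ID to alternate-allele count (0, 1 or 2)
    """
    burden = Counter()
    for variant in variants:
        if variant["af"] < max_af and variant["cadd"] > min_cadd:
            for sample, alt_count in variant["genotypes"].items():
                burden[sample] += alt_count
    return dict(burden)


# Hypothetical variants, for illustration only.
variants = [
    {"af": 0.0005, "cadd": 25.1, "genotypes": {"S1": 1, "S2": 0}},
    {"af": 0.2, "cadd": 30.0, "genotypes": {"S1": 2, "S2": 1}},    # too common
    {"af": 0.004, "cadd": 8.0, "genotypes": {"S1": 0, "S2": 1}},   # below CADD cutoff
    {"af": 0.002, "cadd": 15.0, "genotypes": {"S1": 0, "S2": 2}},
]
counts = burden_per_individual(variants)
print(counts)
```

The resulting counts would then serve as the predictor in the logistic regression against disease status, alongside the covariates listed in the statement.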
Non-Technical Research Use Statement:
We will assess the load of rare variants in the 10,088 ADSP Discovery Case Control WES samples across the whole-exome, biological pathways and genes in a “hypothesis-free” approach, leveraging state-of-the art variant annotation tools and databases. We aim to detect the most enriched type of variants in cases that increase risk, including different ages of onset. This approach will help us to increase our power to reveal novel biological pathways and genes associated with AD, expanding the understanding of rare variation and their implication on disease risk.
Investigator:
Viard, Mathias
Institution:
LEIDOS BIOMEDICAL RESEARCH, INC.
Project Title:
Genetic basis for resistance or susceptibility to Alzheimer disease conferred by the HLA and KIR loci
Date of Approval:
April 1, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The purpose of our study is to use the great resources of these datasets to study the involvement of HLA and KIR in the development of Alzheimer disease. Using the HLA regions of chromosome 6 and KIR regions of chromosome 19, we can impute HLA and KIR, although KIR requires WES or WGS within chromosome 19 rather than genotyping due to the abundance of structural variants along with the SNP polymorphisms for this locus. Once we call HLA and KIR for the different participants, we will, beyond allelic analyses, derive from them broader immune properties that will allow us to group individuals and study the association of those properties with Alzheimer disease. For case/control comparisons, we will use logistic regression. We will also look into progression to disease and death using Cox regression to see whether parameters that we can associate with onset of disease also correlate with the progression of the disease, and to understand which parameters of the immune system are associated with one step, or the other, or both. All the results will be conditioned on the apolipoprotein E genotype of the individuals. The use of different datasets will allow us to get enough numbers to evaluate the reproducibility of the results, which we will adjust for differences in population background through the use of principal components. The interpretation of the results will be dependent on the associations we can identify. HLA and KIR are at the center of the innate and acquired immune system, so we will be able to evaluate which arm of the immune system is involved in the disease. For example, we showed that HIV disease can associate with compound HLA class I (HLA-I)-KIR genotypes, allele-specific HLA-I binding to the inhibitory receptor LILRB2 as well as HLA-C expression levels.
In addition, allele-specific HLA-A expression levels have been suggested to modulate NK cell activity and HIV control through modulation of cell surface expression of HLA-E, which bind peptides derived from HLA-I signal sequences. In addition, HLA-I alleles have been characterized for their tapasin dependence level, which influences the breadth of peptide repertoire and can modulate immune responses.
Non-Technical Research Use Statement:
Alzheimer’s disease is influenced by many factors, including how the immune system works. This study examines whether natural differences in immune-related genes help explain who develops Alzheimer’s disease and how the disease changes over time. We focus on two groups of genes that are important for immune responses, HLA and KIR. Using large genetic datasets, we will identify shared immune patterns across individuals, rather than focusing on single genetic changes. We will compare people with and without Alzheimer’s disease to see whether certain immune patterns are linked to disease risk. We will also examine whether these immune features are associated with how quickly the disease progresses and with survival after diagnosis. All analyses will take into account known genetic risk factors for Alzheimer’s disease. By using multiple datasets from different populations, we can test whether results are consistent and reliable. This research aims to improve understanding of how the immune system may contribute to Alzheimer’s disease and inform future prevention and treatment efforts.
Investigator:
Wainberg, Michael
Institution:
Sinai Health System
Project Title:
Uncovering the causal genetic variants, genes and cell types underlying brain disorders
Date of Approval:
February 3, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We propose a multifaceted approach to elucidate and interpret genetic risk factors for Alzheimer's disease. First, we propose to perform a whole-genome sequencing meta-analysis of the Alzheimer's Disease Sequencing Project with the UK Biobank and All of Us to associate rare coding and non-coding variants with Alzheimer's disease and related dementias. We will explore a variety of case definitions in the UK Biobank and All of Us, including those based on ICD codes from electronic medical records (inpatient, primary care and/or death), self-report of Alzheimer's disease or Alzheimer's disease and related dementias, and/or family history of Alzheimer's disease or Alzheimer's disease and related dementias. We will perform single-variant, coding-variant burden, and non-coding variant burden tests using the REGENIE genome-wide association study toolkit. Second, we propose to develop statistical and machine learning models that can effectively infer (“fine-map”) the causal gene(s), variant(s), and cell type(s) underlying each association we find, as well as associations from existing genome-wide association studies and other Alzheimer's- and aging-related cohorts found in NIAGADS. In particular, we propose to improve causal gene identification by incorporating knowledge of gene function as a complement to functional genomics. For instance, we plan to develop improved methods for inferring biological networks, particularly from single-cell data, and integrate these networks with the results of the non-coding associations from our first aim to fine-map causal genes. To fine-map causal variants and cell types, we plan to integrate the associations from our first aim with single-nucleus chromatin accessibility data from postmortem brain cohorts to simultaneously infer which variant(s) are causal for each discovered locus and which cell type(s) they act through.
Non-Technical Research Use Statement:
We have a comprehensive plan to understand and explain the genetic factors that contribute to Alzheimer's disease. Our approach involves two main steps. First, we'll analyze genetic information from large research databases to identify rare genetic changes associated with Alzheimer's and related memory disorders. We'll look at both specific changes in genes and other parts of the genetic code. We'll use data from different studies and combine them to get a clearer picture. Second, we'll create advanced computer models that can help us figure out which specific genes, genetic changes, and cell types are responsible for these associations. This will help us pinpoint the most important factors contributing to Alzheimer's disease. We'll also analyze data from previous studies to build a more complete understanding of these genetic links.
Investigator:
Wang, Gao
Institution:
Columbia University
Project Title:
The FunGen-xQTL project
Date of Approval:
June 3, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The FunGen-xQTL project is a collaborative effort across the FunGen-AD, the Accelerating Medicines Partnership Alzheimer’s Disease (AMP AD), the NIH Center for Alzheimer’s and Related Dementias (CARD), and the ADSP aiming for systematic discovery and detailed characterization of molecular quantitative trait loci (QTL) in AD-related cell and tissue types, to determine the effect of genetic variation on AD mediated via molecular and cellular programs. The goal of this project is to generate a reference map of AD-related QTLs to determine the effect of genetic variation on molecular traits. This project builds on existing datasets from multiple omics layers, including bulk and single-cell transcriptomics, epigenomics, proteomics, lipidomics, and metabolomics, and from multiple tissues including brain, cerebrospinal fluid, and plasma. The multi-omics approach will enable mapping the propagation of functional consequences of each variant, which will be instrumental to identifying the causal gene(s) in each locus as well as novel biomarkers and therapeutic targets for AD/ADRD. The xQTL reference map will be made available to both the AD and general scientific communities. Collaboration with other researchers: The FunGen-xQTL working group consists of investigators from different institutions who will: • Catalog and gather genetic, genomic, and multi-omics data that have been generated by its members and other relevant datasets that are publicly available. • Process and harmonize these data for integration and analysis. • Identify analytical methods for QTL mapping and integration across multi-omics data types. • Conduct analyses. • Develop manuscripts.
Non-Technical Research Use Statement:
Genome-wide association studies (GWAS) have so far identified more than 80 loci associated with Alzheimer’s disease (AD). However, progress towards understanding the disease mechanisms has been limited due to the difficulty in assigning molecular functions to these loci. The FunGen-xQTL project is aiming for systematic discovery and detailed characterization of molecular quantitative trait loci (QTL) in AD-related cell and tissue types, to determine the effect of genetic variation on AD mediated via molecular and cellular programs. QTL are genetic regions that influence the phenotypic variation of a molecular (transcriptomic, proteomic, lipidomic among others) trait. The ultimate goal of the FunGen-xQTL working group is to develop and identify analytical methods for QTL mapping and integration across multi-omics data which will be instrumental to identifying the causal gene(s) in each locus as well as novel biomarkers and therapeutic targets for AD.
Investigator:
Wang, Li-San
Institution:
University of Pennsylvania
Project Title:
ADSP Data Processing
Date of Approval:
February 15, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
NIAGADS is the data coordinating center for ADSP. This request will allow us to access genotype and phenotype data for all ADSP samples and perform data processing and quality assurance, before distributing to the scientific community. Currently a data deposition plan is being developed by ADSP: 1. Plans for aggregating phenotype, GWAS, and exome chip genotype data are in place, and NIAGADS will work with data contributors to organize and review files before data are distributed to study investigators. 2. As suggested by dbGaP, NIAGADS will work with dbGaP/SRA and the three NHGRI large-scale sequencing centers to develop a plan for reviewing incoming sequencing data. This will be done in parallel with basic quality assurance procedures by dbGaP/SRA before data are promoted to archival status and ready for analysis. NIA is in discussion with other similar whole-genome and whole-exome sequencing projects. We plan to harmonize these additional datasets with the ADSP WGS/WES data so the community can combine these datasets for analysis. All associated phenotypes are minimized and there is minimal risk to the participants.
Non-Technical Research Use Statement:
NIAGADS is the data coordinating center for ADSP. This request will allow us to access genotype and phenotype data for all ADSP samples and perform data processing and quality assurance, before distributing to the scientific community.
Investigator:
Wang, Lily
Institution:
University of Miami
Project Title:
DNA methylation associated with Alzheimer’s disease and cognitive outcomes in the Health and Retirement Study
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: The goal of this study is to perform statistical and bioinformatics analyses and identify DNA methylation changes associated with cognitive data and dementia outcomes collected by the Health and Retirement Study. Our specific aims are: (1) Identify blood DNAm associated with aging, AD diagnosis and longitudinal cognitive outcomes; (2) Validate DNAm-based prediction models for identifying subjects with high risk for dementia. Study design: The Health and Retirement Study (HRS) is a longitudinal panel study. Analysis plan: To identify blood DNAm associated with LOAD diagnosis and longitudinal cognitive outcomes, we will fit a mixed effects model with LOAD diagnosis (or longitudinal outcome) as the dependent variable and CpG methylation as the main independent variable, along with age, sex, batch type, and estimated blood cell type proportions as covariates. In addition, we will apply our DNAm-based prediction models for LOAD to the Health and Retirement Study data and evaluate the prediction model by computing AUC (area under the ROC curve). Phenotypic characteristics: We will evaluate the association of DNA methylation variants with dementia diagnosis and chronological age, adjusting for the covariates race/ethnicity, APOE genotype, baseline MMSE at time of blood draw, age, sex, education, and smoking history.
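For readers unfamiliar with the AUC metric mentioned above, the following minimal sketch computes it directly from its probabilistic definition: the chance that a randomly chosen case receives a higher predicted score than a randomly chosen control, with ties counted as one half. The function name, labels and scores are hypothetical and are not drawn from the study.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of cases and controls.

    labels: iterable of 1 (case) or 0 (control)
    scores: predicted risk scores, in the same order as labels
    """
    case_scores = [s for y, s in zip(labels, scores) if y == 1]
    control_scores = [s for y, s in zip(labels, scores) if y == 0]
    if not case_scores or not control_scores:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5  # ties contribute half a win
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical labels and model scores, for illustration only.
labels = [1, 1, 0, 0, 1, 0]
scores = [0.9, 0.6, 0.4, 0.7, 0.8, 0.2]
auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")
```

The pairwise loop is quadratic in sample size; for a cohort-sized dataset a rank-based implementation from a statistics library would be the practical choice.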
Non-Technical Research Use Statement:
This study aims to explore how DNA methylation changes are associated with dementia outcomes in older adults using data from the Health and Retirement Study. We will identify specific DNA methylation patterns associated with aging and Alzheimer's dementia, while also validating prediction models to identify individuals at high risk for dementia.
Investigator:
Wang, Qi
Institution:
ARIZONA STATE UNIVERSITY-TEMPE CAMPUS
Project Title:
Systematic investigation of genetic contributions of sex chromosomes to Alzheimer's disease
Date of Approval:
March 31, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Sex chromosomes have long been excluded from genetic and genomic research on human diseases, partially due to the technical challenges of analyzing the data from large-scale molecular profiling methods, such as next-generation sequencing (NGS). Yet their contributions are significantly implicated in the sex dimorphism of Alzheimer’s disease (AD). In this project, the team will apply a sex-chromosome-complement-aware alignment approach to re-align large amounts of NGS data from large clinical cohorts of AD, in order to systematically investigate the genetic contributions of sex chromosomes to AD. Variants will be called based on the realignments, and both X and Y chromosomes will be specifically investigated for the variants’ associations with the harmonized AD phenotypes from ADSP-PHC. The team will also consider the contributions of the homologous X-Y gene pairs to gene expression and perform gene network analysis to evaluate how expression affects sex differences in AD. The reprocessed data will be disseminated to the public for broad use to benefit the scientific community. Taken together, the work will lay the foundation for determining the genetic contributions of sex chromosomes to AD and pave the way for future in-depth study of these genomic features and their implication in sexual dimorphism in AD.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) affects millions of people in the US, especially women, yet the molecular mechanism behind the sex differences remains poorly understood. This study aims to specifically study the genetic contributions of the sex chromosomes to the observed sex disparities in AD, which will build a benchmark of the molecular mechanism driving sex differences in AD and greatly facilitate untangling the genetic contributions from other factors in its etiology.
Investigator:
Wang, Qi
Institution:
ARIZONA STATE UNIVERSITY-TEMPE CAMPUS
Project Title:
The effects of MGAM inhibitors on cognitive decline in Alzheimer's Disease cohorts
Date of Approval:
August 15, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The National Alzheimer's Coordinating Center (NACC) has coordinated all the Alzheimer's Disease Research Centers (ADRCs) across the US to build one of the world’s largest and most comprehensive datasets on Alzheimer’s disease and related dementias (AD/ADRD). To utilize these datasets for precision drug repurposing, we are requesting the genetic data profiled by the 15 rounds of ADC genomic arrays, since they cover a large proportion of the NACC subjects whose genetic data, such as APOE genotype, are not otherwise available. We propose the following study: 1. Harmonize the genetic data across the 15 rounds of microarrays. Since these arrays came from different platforms and profiled different subjects, we plan to harmonize the genomic data to the reference genome GRCh38 for all available subjects. Together with other subjects whose genetic data are available from ADSP WGS, we aim to impute the missing APOE genotypes in these subjects, which will make APOE4 status available to all subsequent studies. 2. Focus on the study of the TAS2R38 variants and their relationship with AD risk. Using the harmonized genetic data, we are investigating the relationship of TAS2R38 supertaster variants with the cognitive function of subjects within the ADC cohorts. Our preliminary data show that TAS2R38 supertasters are associated with a lower risk of AD with advancing age by mediating the variant-MGAM eQTL. With the longitudinal clinical and other biomarker data available through NACC, we are validating and further investigating the relationship. 3. Since NACC also provides medication data for all subjects, we are investigating whether inhibition of MGAM expression could remediate cognitive decline in aging subjects. FDA-approved MGAM inhibitors, including acarbose and miglitol, will be investigated for precision drug repurposing for AD by comparing groups of subjects who do and do not take these drugs.
Non-Technical Research Use Statement:
The ADC have profiled genetic data from >10,000 subjects using different microarray platforms. We are requesting the data to harmonize them for the purpose of precision drug repurposing, taking into account individual differences in genetic risk and leveraging the longitudinal data from NACC.
Investigator:
Ware, Erin
Institution:
University of Michigan
Project Title:
Alzheimer's disease polygenic scores and cognition
Date of Approval:
September 2, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Our goal is to investigate the roles of cumulative genetic variation, disparity-related factors, and their interactions on late-onset Alzheimer’s disease (LOAD) and dementia phenotypes, extending precision public health to environmental susceptibility across ancestries. LOAD is the leading terminal form of dementia affecting a growing number of aging U.S. adults. As LOAD risk is disproportionately high among minorities, women, rural inhabitants, and people with lower education, disparities in LOAD risk represent a critical knowledge gap. Novel approaches characterizing the multifaceted etiology of LOAD disparities are needed to identify the genetic underpinnings, biological pathways, and potentially modifiable environmental factors that lead to sustained LOAD disparities. We propose whole genome estimations of polygenic risk of cognition, LOAD, and LOAD risk factors to be examined for their effect on dementia phenotypes among individuals >70, independently and in concert; potential interactions between PGS and factors with disparities in LOAD; and application of our methods in European and African ancestry groups (Fig. 1). AIM 1. Determine the cumulative genetic risk of LOAD by estimating the effect of cognitive polygenic scores on dementia phenotypes in individuals of European and African ancestry. AIM 2. Determine the association between polygenic scores for a) behavioral, b) physiological, and c) social/psychosocial domains and dementia phenotypes in individuals of European and African ancestry. We will consider Mendelian Randomization approaches for this aim. AIM 3. For the relationships between polygenic scores and dementia phenotypes (AIMS 1 and 2), test for effect modification by LOAD disparity-related factors (sex, educational attainment, urban/rural), in individuals of European and African ancestry.
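A polygenic score of the kind referred to above is, in its simplest form, a weighted sum of effect-allele dosages. The sketch below shows that calculation only; the variant IDs, weights and function name are hypothetical, and real analyses typically add steps such as clumping, ancestry-specific weights and standardization that are not shown here.

```python
def polygenic_score(dosages, weights):
    """Weighted sum of effect-allele dosages.

    dosages: mapping of variant ID to effect-allele dosage (0 to 2) for one person
    weights: mapping of variant ID to per-allele effect size (e.g. a GWAS beta)
    Only variants present in both mappings contribute to the score.
    """
    shared = dosages.keys() & weights.keys()
    return sum(weights[v] * dosages[v] for v in shared)


# Hypothetical weights and genotype dosages, for illustration only.
weights = {"rs_a": 0.12, "rs_b": -0.05, "rs_c": 0.30}
person = {"rs_a": 2, "rs_b": 1, "rs_d": 1}  # rs_d has no weight and is ignored
score = polygenic_score(person, weights)
print(f"PGS = {score:.2f}")
```

Silently skipping variants without a weight keeps the sketch short; in practice the overlap between the weight file and the genotype data should be reported, since low overlap degrades the score.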
Non-Technical Research Use Statement:
The overall purpose of this proposal is to establish the relevance of polygenic risk in susceptibility to dementia, particularly among groups at increased risk of disease, including women, minorities, rural inhabitants, and those with low educational attainment. Because an individual’s susceptibility to dementia is likely a combination of genetics and environmental risk factors, we will jointly test the effects of cumulative genetic risk and dementia risk factors in our analysis. The proposal provides an opportunity to identify a genetic etiologic component in vulnerable groups that could lead to mechanistic understanding or targeted interventions to substantially benefit public health in the US.
Investigator:
Ware, Erin
Institution:
University of Michigan
Project Title:
DNA Methylation, Genetics, and Modifiable Risk Factors of Dementia in a Nationally Representative, Multi-Ethnic Cohort
Date of Approval:
August 15, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Our goal is to determine the joint epigenetic and environmental contributions to ADRD risk that underlie these health disparities. Using existing epigenetic and genetic data, well-characterized dementia phenotypes, and diverse risk factor data, we will analyze a population representative, multi-ethnic aging sample from the Health and Retirement Study (HRS). We aim to (1) test the associations between DNA methylation and dementia phenotypes (prevalent, 8-year incident), stratified by race/ethnicity and test for effect modification by ADRD disparity-related factors (educational attainment, sex, urban/rural); (2) identify associations between longitudinal measures of modifiable risk factors for ADRD and DNA methylation, stratified by race/ethnicity and test for effect modification or mediation by ADRD disparity-related factors; and finally, (3) identify genetic polymorphisms controlling DNA methylation and whether these are enriched in dementia outcomes to evaluate the role of DNA methylation in disease development. This study will likely impact the field of Alzheimer’s research and contribute to public health because it will a) establish the relevance of DNA methylation on ADRD in multiple race/ethnicities; b) elucidate important biological epigenetic mechanisms; c) determine the combined and individual epigenetic-environment interplay contributions to ADRD; and d) consider the effects of sex, educational attainment, race/ethnicity, younger age groups, and urban/rural status in the same study where comparisons of relative contribution to risk can be made. Here, we have the opportunity to simultaneously and substantially improve our understanding of the genetic and environmental etiologic contributions to health disparities in ADRD.
Non-Technical Research Use Statement:
The overall purpose of this proposal is to identify modifiable risk factors for Alzheimer’s disease and related dementias that influence DNA methylation and dementia status among groups at increased risk for dementia, including women, minorities, rural inhabitants, and those with low educational attainment. Results from this proposal may provide an opportunity to identify epigenetic components that contribute to the prevalence and risk of dementia that could lead to a mechanistic understanding or targeted interventions that may substantially decrease the burden of Alzheimer’s disease and related dementias in the US population.
Investigator:
Wedow, Robbee
Institution:
Purdue University
Project Title:
Unpacking the Emergence of Dementia Etiology Across the Life Course
Date of Approval:
August 29, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Moderate to severe impairments in cognitive functioning are a primary hallmark of Alzheimer's disease (AD) and Alzheimer's Disease Related Dementias (ADRD), a class of disorders affecting ~30% of the population by age 90. Currently, scientists hypothesize that the AD/ADRD disease process begins decades prior to the low functioning observed at the time of diagnosis. The ideal study design to gain insight into liability in prodromal and preclinical stages of AD/ADRD would involve collecting data on a wide range of measures from a large group of participants across the entirety of the life course. However, this data collection strategy includes major pragmatic barriers. Longitudinal study designs that might identify prospective risk factors of later life disease onset carry high participant and financial costs and take decades to produce conclusive results. Because of these limitations, much of the literature has been left to speculate in a piecemeal fashion on what characterizes the AD/ADRD prodromal period. However, research into these prodromal and preclinical periods holds significant promise for improving prevention and intervention efforts by identifying at-risk individuals and those who are at an earlier, and likely more intervenable, stage of disease. We will analyze the links between genotypes and phenotypes to investigate the onset times of risk factors for AD/ADRD across the life course using data from the HRS. Our insights will focus on pinpointing specific periods when these outcomes manifest as individuals age. We hypothesize that genetic data and structural equation modeling can help identify the specific times when Alzheimer's risk factors emerge as individuals age throughout their lives. Our team proposes to leverage the Genomic Structural Equation Modeling (Genomic SEM) framework to identify genetic risk pathways to AD/ADRD across the life course using existing data from large epidemiological studies that index different age ranges. 
Results from this study will add previously unseen levels of precision to our understanding of when genetic risk for AD/ADRD emerges across the life course and which specific risk factors index its onset.
Non-Technical Research Use Statement:
We will analyze the links between genotypes and phenotypes to investigate the onset times of risk factors for Alzheimer's and Dementia (AD) across the life course using data from the Health and Retirement Study (HRS). Our insights will focus on pinpointing specific periods when these outcomes manifest as individuals age. We hypothesize that genetic data and structural equation modeling can help identify the specific times when Alzheimer's risk factors emerge as individuals age throughout their lives.
Investigator:
Wijsman, Ellen
Institution:
University of Washington
Project Title:
Therapeutic target discovery in ADSP data via comprehensive whole-genome analysis incorporating ethnic diversity and systems approaches
Date of Approval:
October 21, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Objective: Utilize ADSP data sets to identify genes & specific genetic variants that confer risk for or protection from Alzheimer disease. Aim 1: Using combined WGS/WES data across the ADSP Discovery, Disc-Ext, and FUS Phases, including single nucleotide variants, small insertions/deletions, and structural variants, we will: Aim 1a. Perform whole genome single variant and rare variant case/control association analyses of AD using ADSP and other available data; Aim 1b. Target protective variant identification via association analysis using selected controls within the ADSP data and performing meta-analysis across association results based on selected controls from non-ADSP data sets; Aim 1c. Perform endophenotype analyses including cognitive function measures, hippocampal volume and circulating beta-amyloid, using ADSP data in subjects for which these measures are available. Meta-analysis will be conducted across ADSP and non-ADSP analysis results. Aim 2: To leverage ethnically-diverse and admixed populations to identify AD variants we will: Aim 2a. Estimate and account for global and local ancestry in all analyses; Aim 2b. Perform admixture mapping in samples of admixed ancestry; and Aim 2c. Perform ethnicity-specific and trans-ethnic meta-analyses. Aim 3: To identify putative therapeutic targets through functional characterization of genes and networks via bioinformatics and integrative ‘omics analyses, we will: Aim 3a. Annotate variants with their functional consequences using bioinformatic tools and publicly available “omics” data; Aim 3b. Prioritize results, group variants with shared function, and identify key genes functionally related to AD via weighted association analyses and network approaches. Analyses will be performed in coordination with the following PIs. Coordination will involve sharing expertise, analysis plans or analysis results. No individual level data will be shared across institutions.
Philip De Jager, Columbia University; Eric Boerwinkle & Myriam Fornage, U of Texas Health Science Center, Houston; Sudha Seshadri, U of Texas, San Antonio; Ellen Wijsman, U of Washington; William Salerno, Baylor College of Medicine.
Non-Technical Research Use Statement:
This proposal seeks to analyze existing genetic sequencing data generated as part of the Alzheimer’s Disease Sequencing Project (ADSP) including the ADSP Follow-up Study (FUS) with the goal of identifying genes and specific changes within those genes that either confer risk for Alzheimer’s Disease or provide protection from Alzheimer’s Disease. Analytic challenges include analysis of whole genome sequencing data, appropriately accounting for population structure across European ancestry, Hispanic, and African American participants, and interpreting results in the context of other genomic data available.
Investigator:
Wingo, Thomas
Institution:
University of California Davis
Project Title:
Identifying Alzheimer's Disease Genetic Risk Factors By Integrated Genomic and Proteomic Analysis
Date of Approval:
October 2, 2023
Request status:
Closed
Research use statements:
Show statements
Technical Research Use Statement:
We aim to uncover new genetic risk variants for Alzheimer’s disease (AD) through an integrated analysis of proteomics and genetic sequencing performed at Emory University. Results of these analyses will be used to weight analysis of whole-genome sequencing (WGS), whole-genome genotyping (WGG), and whole-exome sequencing (WES) data from dbGaP and ADSP. We plan to publish our findings, so they are shared with the scientific community. Outcomes that will be tested include: (1) clinical disease status, (2) pathologic characterization (e.g., measures of beta-amyloid, tau, etc.), and (3) cognitive decline. For sequencing data, we will perform joint calling from samples previously mapped by ADSP using PECaller with default settings. Variant annotation will be performed using Bystro and quality control will follow Wingo et al., 2017. For rare variants, we will use burden- and variance-based tests to estimate association between genetic variants and each outcome for every gene in the genome. External weights from proteomic analyses will be optionally used, as well as measures of genomic conservation for each site. For common variants, we plan to test for differences in allele frequencies using maximum likelihood tests. For all analyses, we plan to control for population structure by deriving principal components from the underlying sequencing or genotyping data.
Non-Technical Research Use Statement:
Our aim is to identify genetic variants that are associated with Alzheimer's Disease (AD) either using genomic data (from dbGaP or from Emory University) or brain protein sequencing data (from Emory University) as a starting point. Each center’s data will be analyzed separately, and we will determine whether the findings are consistent among the centers. Additionally, we will use protein data from brain or cerebrospinal fluid of individuals with or without AD to guide the analysis of the genomic data to identify genetic variants that influence AD risk. Our overarching aim is to use genetic discoveries to identify mechanisms of AD pathogenesis and to create more meaningful models of the disease.
Investigator:
Wingo, Thomas
Institution:
University of California Davis
Project Title:
Identifying Alzheimer's Disease Genetic Risk Factors By Integrated Genomic and Proteomic Analysis
Date of Approval:
January 21, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We aim to uncover new genetic risk variants for Alzheimer’s disease (AD), AD-related dementia (ADRD), and behavioral and psychiatric symptoms (BPS) associated with AD/ADRD. We expect to use whole-genome sequencing (WGS), whole-genome genotyping (WGG), and whole-exome sequencing (WES) data. Additionally, we will use the results of brain proteomic analysis to nominate genes and pathways for AD, ADRD, and dementia BPS. We plan to publish our findings to share them with the scientific community. Outcomes that will be tested include: (1) clinical disease status, (2) pathologic characterization (e.g., measures of beta-amyloid, tau, etc.), (3) cognitive decline, (4) BPSD, and (5) outcomes related to AD/ADRD severity. For sequencing data, we will extract raw sequencing reads from CRAM/BAM (or equivalent encrypted files) and re-map those to the hg38 build of the human genome using PEMapper. Base calling will be performed using PECaller with default settings. Variant annotation will use Bystro and quality control will follow approaches to assess completeness and account for ancestry as is customary in our lab. For rare variants, we will use a variety of kernel-based approaches and, for common variants, standard statistical modeling. For all analyses, we plan to control for population structure by deriving principal components from the underlying sequencing or genotyping data.
Non-Technical Research Use Statement:
Our aim is to identify genetic variants that are associated with Alzheimer's Disease (AD) to uncover new genetic associations. We will examine the role of important risk factors for AD (e.g., age and sex) in our analyses. Separately, we will perform integration of genetic findings for AD with information about how genetic variants influence or are associated with gene expression in the brain, cerebrospinal fluid, or blood to uncover new pathways of disease. Our overarching aim is to use genetic discoveries to identify mechanisms of AD pathogenesis to help nominate new treatment targets.
Investigator:
Won, Hong-Hee
Institution:
Sungkyunkwan University
Project Title:
Identification of genetic factors for Alzheimer's disease using large multi-omics analyses
Date of Approval:
November 14, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Understanding the neurobiological mechanisms of Alzheimer's disease is crucial for developing potential treatments. However, most research currently focuses on European populations. It is anticipated that by conducting analyses on diverse populations, such as Asians and Africans, new signals not previously identified in existing research may be uncovered. In this study, we aim to discover new signals through large-scale Alzheimer's disease Whole Genome Sequencing (WGS) analysis across different populations. First, we plan to perform single variant and gene burden association analyses specific to each ethnic group to observe how signals may vary among them. We will identify associated risk factors for AD from independent genomics datasets including the Alzheimer's Disease Sequencing Project (ADSP) and other trans-ethnic genomics datasets. We will employ the REGENIE program, widely used in large-scale genomics datasets. Our phenotypes of interest include Alzheimer's disease status and related variables. Second, we will conduct causal variant discovery through fine-mapping analysis. We will use various brain tissue expression Quantitative Trait Locus (eQTL) datasets like MetaBrain to identify genes influenced by genetic variants. Additionally, we plan to investigate enhancer or candidate Cis-Regulatory Element (CRE) regions using single-cell level ATAC sequencing data or Hi-C data to determine whether AD-associated variants exist within these regions. Through this research, we aim to uncover differences in GWAS signals across different ethnic groups and identify causal variants by leveraging a variety of data sources.
Non-Technical Research Use Statement:
Alzheimer's disease is known to have a significant genetic influence, and this genetic influence varies across different ethnic groups. In this study, we aim to identify which genetic variations contribute to Alzheimer's disease on an ethnic-specific basis. Identifying ethnic-specific genetic variations could assist in developing tailored treatments for each ethnic group. Additionally, we intend to explore various datasets to understand how certain genetic variations influence particular genes, ultimately contributing to the development of Alzheimer's disease. Through these analyses, we aim to enhance our understanding of the mechanisms underlying Alzheimer's disease.
Investigator:
Wu, Gang
Institution:
St Jude Children's Research Hospital
Project Title:
Evaluation and development of rare variant analysis methods for novel disease or trait-related gene or region discovery using whole exome sequencing data
Date of Approval:
January 30, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Our aim is to evaluate, utilize, and develop the most powerful rare variant analysis methods for the discovery of novel cancer predisposition genes. Advances in sequencing technology and refinement of the reference sequence have brought cost-effective sequencing platforms and better accuracy; however, they have also generated very heterogeneous sequencing data sets, for example through different reference builds and different mapping software or software versions. Objective: 1) evaluate/develop methods for building a pipeline to harmonize different available cohorts and reduce false positives due to sequencing artifacts or batch effects; 2) evaluate/develop rare variant methods for analyzing large-scale sequencing data, for example, identifying which tests are robust and powerful, how to define regions or SNP sets, and how to incorporate variant information. Study design: because large sample size is critical to achieve enough power, we will combine data sets including TCGA, ADSP, as well as our in-house sequencing data such as pediatric cancer sequence data or Amyotrophic lateral sclerosis (ALS) sequencing data. We need to deal with potential batch effects across the different datasets. We will then perform case-control based association tests, e.g., using one trait versus other traits to evaluate the enrichment of rare variants in each particular trait. Other public control data sets, such as 1000 Genomes or gnomAD, might also help filter potential pathogenic variants.
Non-Technical Research Use Statement:
For sequenced rare variants, it is very important to develop powerful methods for identifying trait-specific predisposition genes or variants, which can incorporate as much prior information as possible and control potential batch effects due to different processing platforms. We will use the TCGA, ADSP and our in-house pediatric and ALS sequencing datasets for evaluation and development of methods for identifying predisposition variants of each trait, for example, cancer risk genes or neurodegenerative risk genes.
Investigator:
Xavier, Rose Mary
Institution:
UNC Chapel Hill
Project Title:
Sleep Disturbance and Cognitive Function in Alzheimer’s Disease: The Shared Genetic Basis
Date of Approval:
July 16, 2025
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
Sleep disturbances (SD) are linked to cognitive function (CF) and Alzheimer’s disease (AD), but the genetic mechanisms, especially in non-European populations, are underexplored. With the goal of furthering the understanding of the genetic architecture of AD and promoting the development of early prediction of AD, this research will use large-scale data to investigate the genetic basis underlying SD and CF in AD. Objectives: 1. Identify unique and shared genetic basis of SD and CF in AD. 2. Examine the associations between genetic liability and measurement of SD and CF in AD progression. Study Design: Objective 1 will be a genome-wide association study (GWAS). Objective 2 will be a polygenic risk score (PRS) study. Analysis Plan: We will combine the genotype data from NIAGADS with phenotype data from the NACC to conduct the proposed projects. Specifically, we propose to use the Alzheimer’s Disease Research Centers (ADRC) GWAS Datasets ADC1-15 to maximize the sample that matches with NACC phenotypes (SD, CF, and AD). Analyses will follow the established Ricopili protocol (https://sites.google.com/a/broadinstitute.org/ricopili/). First, we will conduct quality control (QC) checks on genotype data following standard protocols. Second, we will impute genetic variants that were not genotyped, using the Michigan Imputation Server with the 1000 Genomes Phase 3 reference data and following standard protocols. For objective 1, we will conduct the phenotype-specific GWAS on SD, CF, and AD. We will then run cross-phenotype meta-analysis to examine the shared genetic basis of sleep disturbance and cognitive function in AD. For objective 2, we will construct PRSs for SD, CF and AD using the PRS-CSx approach. Linear and logistic regressions will be used to examine the associations between PRS and measurements of SD, CF, and AD. Cox proportional hazard regression will be used to examine the association between SD PRS and longitudinal CF changes in AD progression.
Outcomes, such as summary statistics of GWAS and constructed PRSs, will be submitted to NIAGADS.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is a neurodegenerative disease characterized by progressive memory loss and cognitive deterioration. It affects approximately 34 million people worldwide, yet reliable early prediction methods remain elusive. Prior research has implicated the impact of sleep disturbance on cognitive decline and AD pathophysiology. However, few studies have explored the genetic correlation between sleep disturbance and cognitive function in the context of AD, especially among non-European populations. To address these research gaps, the proposed research will employ bioinformatic and computational techniques to analyze large-scale databases to further understand the unique and shared genetic variants that contribute to sleep disturbance and cognitive function in AD. This exploration of the genetic correlation between sleep disturbance and cognitive function in AD will inform future research to improve early detection of AD risk in individuals with pre-clinical symptoms and prediction of cognitive deterioration through AD development and progression.
Investigator:
Xiao, Peng
Institution:
University of Nebraska Medical Center
Project Title:
Uncovering the genetic basis of Alzheimer's Diseases by integrating GWAS with multiomics approaches across different ethnicities
Date of Approval:
April 16, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: In this study we aim to develop a system-level understanding of Alzheimer's disease (AD) by integrating GWAS with robust multiomics datasets across diverse ethnic groups, and to harmonize the results to include associated genes and pathways, in order to understand underlying disease mechanisms and inform our understanding of the biological continuum of the disease. We will investigate AD-associated variants as quantitative trait loci for epigenetic, transcriptomic, and proteomic layers to explore how variants in genes perturb pathways leading to AD. Study design: We will comprehensively examine the genetic architecture of Alzheimer's disease using GWAS data for different races (Caucasians, Latinos, Asians and Africans) from publicly available datasets. We request access to as many datasets as are available in NIAGADS and other repositories such as the EADB consortium and IGAP, and we have also requested access to datasets for Asian cohorts, identified through our literature search, from the corresponding authors. We will perform meta-analysis at two levels for GWAS datasets and find genome-wide significant (GWS) loci. Mendelian randomization analyses will be adapted to a multi-omics setting through analyzing QTLs and GWS loci. We will perform correlation and enrichment analysis for significant findings from different omics layers. Analysis Plan: Genome-wide meta-analysis across different races to identify new loci and functional pathways influencing AD. Find the genes most likely to be responsible for the association signal with AD at each locus by applying the Mendelian randomization (MR) method. We plan to use the MR method to combine multiomics data (GWAS, eQTL, mQTL, aQTL and pQTL).
Non-Technical Research Use Statement:
To our knowledge, our findings will reveal crosstalk between epigenetic, genomic, and transcriptomic determinants of AD pathogenesis and define catalogues of candidate genes. In addition, rare or population-specific common variants can be identified; genes with underlying genetic support for an association with AD are likely to encode successful drug targets in clinical development. This should further lead to patient stratification.
Investigator:
Xu, Hongyan
Institution:
Augusta University
Project Title:
Haplotype association with Alzheimer’s disease
Date of Approval:
October 9, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
High-throughput sequencing technologies enable us to easily genotype dozens of single nucleotide polymorphisms (SNPs) within any gene of interest. Such genome-wide SNP data are rapidly growing in disease association studies. The association analysis includes single-SNP and haplotype-based disease associations. Haplotype-based association analysis has several advantages over single-SNP association analysis. We propose to develop novel statistical methods for haplotype-based association using the whole-genome sequencing data from the Alzheimer’s Disease Sequencing Project (ADSP) sponsored by the National Institute on Aging. The phenotype is the case-control status. We will use current programs for haplotyping, then use the haplotypes for genetic association studies. We will develop new methods for modeling recombination interference.
Non-Technical Research Use Statement:
High-throughput sequencing technologies enable us to easily genotype dozens of single nucleotide polymorphisms (SNPs) within any gene of interest. Such genome-wide SNP data are rapidly growing in disease association studies. The association analysis includes single-SNP and haplotype-based disease associations. Haplotype-based association analysis has several advantages over single-SNP association analysis. We propose to develop novel statistical methods for haplotype-based association using the whole-genome sequencing data from the Alzheimer’s Disease Sequencing Project (ADSP) sponsored by the National Institute on Aging.
Investigator:
Yan, Jingwen
Institution:
Indiana University Indianapolis
Project Title:
Progression profiling of blood and CSF proteomics in aging and Alzheimer's disease
Date of Approval:
March 31, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Alzheimer’s disease (AD) progression is inherently a continuous biological process, yet it has traditionally been characterized using discrete stages. While these categorical frameworks have proven clinically useful, they may obscure the subtle molecular changes in the earliest phases of the disease. This project seeks to address this limitation by developing novel computational methods to model the continuous trajectory of Alzheimer’s disease progression. Going beyond rigid diagnostic boundaries, our approach aims to capture the dynamic and progressive nature of the disease. These methods will be applied to high-dimensional protein data from both blood and cerebrospinal fluid (CSF), enabling us to track how protein phenotypes evolve over time in relation to disease advancement. The primary goal of the project is to detect when and how specific protein signatures evolve along the AD continuum. Special emphasis will be placed on identifying subtle, early-stage blood molecular changes that precede the emergence of hallmark CSF biomarkers. This could provide critical insights into the preclinical phase of AD, where therapeutic interventions may be most effective.
Non-Technical Research Use Statement:
Alzheimer’s disease develops gradually over time, but it is often described in a few stages like early or late disease. While these stages help doctors understand and diagnose the disease, they may miss the small, early changes happening in the body. This project aims to develop new tools to better track how Alzheimer’s progresses in a continuous manner. By studying proteins in both blood and spinal fluid, the project will look for patterns that change as the disease gets worse. A key goal is to find early signs from blood that could help detect the disease sooner and open the door to earlier treatment.
Investigator:
Yang, Jingjing
Institution:
Emory University
Project Title:
Novel statistical methods for integrating transcriptomic and proteomic data in GWAS
Date of Approval:
December 2, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The objective of the proposed project is to derive novel statistical methods to integrate multi-omics data and pathology data in genome-wide association studies (GWAS) for studying complex phenotypes, with the goal of prioritizing genetic variants and identifying causal genes. First, we will develop novel statistical methods to integrate summary-level omics data and pathology data of diverse populations with GWAS data to prioritize risk genes. Second, we will apply our tools to publicly available xQTL data and the ADSP GWAS data. Third, we will also use the ADSP GWAS summary data to conduct causal analysis of other aging-related phenotypes and AD dementia. We will first develop novel statistical methods to integrate summary-level xQTL data of multiple populations with GWAS data to test gene associations with complex human diseases. We are interested in studying all complex phenotypes that were profiled for the ADSP samples, especially Alzheimer’s disease (AD) and AD-related complex phenotypes. In particular, our lab has access to the ROS/MAP multi-omics data shared by the Rush Alzheimer’s Disease Center (http://www.radc.rush.edu/), and to GTEx data. All samples in the ROS/MAP study are well-characterized with extensive complex phenotypes profiled, including clinical diagnosis of AD, AD-related complex phenotypes, and psychological phenotypes. GTEx provides transcriptomic data of multiple human tissues. We will leverage multiple omics data profiled from the ROS/MAP study and transcriptomics data profiled from GTEx to learn SNP-omics relations, and then integrate such learned relationships with ADSP data to identify risk genes of complex diseases. We will also validate our findings by using omics and pathology data in the requested data sets.
The purpose of using ADSP data is to increase sample size for testing our derived methods for functional genetic association studies of complex phenotypes, studying the genetic etiology of AD and AD-related phenotypes, and validating our findings by using the omics data from the Rush Alzheimer's Disease Center. We are not limited to studying AD only. We are flexible to study any complex phenotypes that are profiled for ADSP samples.
Non-Technical Research Use Statement:
This proposed project is to develop novel statistical methods to integrate summary-level multi-omics data, such as transcriptomics, proteomics, and epigenetics, and pathology data in genome-wide association studies (GWAS) of complex phenotypes, with the goal of identifying causal genes. i) We will develop novel statistical methods for integrating summary-level omics data and pathology data with GWAS data. ii) We will apply our tools to publicly available summary-level omics data, omics data from the ROS/MAP study, and ADSP GWAS data for studying AD and AD-related phenotypes. iii) We will conduct causal inference to test the causal relationship between AD and other aging-related phenotypes. We propose to test our proposed methods on the applied genomic analysis data to study complex phenotypes that are profiled for ADSP, including AD, AD-related pathology traits, and related psychological disorders.
Investigator:
Yesavage, Jerome
Institution:
Stanford University
Project Title:
Identifying Variable Number Tandem Repeats Associated with Alzheimer Disease in Diverse Populations
Date of Approval:
December 18, 2019
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
The goal of this study is to examine the presence of Variable Number Tandem Repeats (VNTRs) in an Alzheimer’s disease (AD) population. Within a genome exist short sequences of repeating DNA. While the repeated sequence (usually >6 bases in length) is usually conserved within a population, the number of times the sequence is repeated in any given individual varies. These genetic variants are known as VNTRs and the number of these repeats can be considered a polymorphism, with individuals or families having a different number of repeats than those seen in the general population. In the past VNTRs have been an overlooked polymorphic component of the genome even though a number of VNTRs have been shown to be associated with neurological disorders and brain functions. We aim to understand not only the presence of specific VNTRs associated with Alzheimer’s disease but also how different phenotypes influence the relationship of these VNTRs to AD; this may inform precise genetic profiles which can be applied to a diverse population. As the associative genetics of AD is known not to be conserved across races, it is important to assess not only the dataset as a whole but also the association of the ethno-racial phenotypes to inform these precise genetic profiles. If we can use VNTRs to predict this debilitating disorder, it opens up avenues to apply treatments earlier and impact one of the most prevalent social and economic burdens on our society. Whole genome sequence data will be analyzed using VNTRseek against a set of reference tandem repeats generated from the tandem repeats database. This software determines the presence of a particular repeat. Each output will contain the number of repeats for each VNTR for each genome, and this will be filtered for repeats that vary from the reference to determine the presence of a VNTR. This will be used to determine the allelic frequency of specific polymorphic repeats, comparing controls to AD.
We will also further stratify the analysis by gender and by the different ancestral phenotypes in the ADSP to assess for specific alleles that may be more strongly associated in one ethno-racial group than another.
Non-Technical Research Use Statement:
Within a genome exist short sequences of repeating DNA. While the repeated sequence (usually >6 bases in length) is usually conserved within a population, the number of times the sequence is repeated in any given individual varies. These variants are known as Variable Number Tandem Repeats (VNTRs) and the number of these repeats can be considered a polymorphism. In the past VNTRs have been an overlooked polymorphic component of the genome even though a number of VNTRs have been shown to be associated with neurological disorders and brain functions. This study will examine the association of VNTRs in an Alzheimer’s disease (AD) population using a specialized program, VNTRseek, to explore the presence of these VNTRs in the whole genome sequences from AD cases and controls. We believe analyzing the distribution of VNTRs in a large and diverse AD population may yield new associative genetic alleles which may not only assist in the prediction of AD development but also identify new cellular pathways of interest in understanding the pathophysiology of the disorder.
Investigator:
Yesavage, Jerome
Institution:
Stanford University
Project Title:
Identifying Variable Number Tandem Repeats Associated with Alzheimer Disease in Diverse Populations
Date of Approval:
February 12, 2024
Request status:
Expired
Research use statements:
Show statements
Technical Research Use Statement:
This goal of this study is to examine the presence of Variable Number Tandem Repeats (VNTRs) in Alzheimer’s disease (AD) population. Within a genome exists short sequences of repeating DNA. While the repeated sequence (usually >6 bases in length) is usually conserved within a population, the number of times the sequence is repeated in any given individual varies. These genetic variants are known as VNTRs and the number of these repeats can be considered a polymorphism, with individuals or families having a different number of repeats than those seen in the general population. In the past VNTRs have been an overlooked polymorphic component of the genome even though a number of VNTRs have been shown to be associated with neurological disorders and brain functions. We aim to understand not only the presence of specific VNTRs associated with Alzheimer’s disease but also how different phenotypes influences the relationship of these VNTRs to AD, this may inform precise genetic profiles which can be applied to a diverse population. As the associative genetics of AD is known not to be conserved across races it is important to not only assess the dataset as a whole but also the association of the ethno-racial phenotypes to inform these precise genetic profiles. If we can use VNTRs to predict this debilitating disorder it opens up avenues to apply treatments earlier and impact one of the most prevalent social and economic burdens on our society. Whole genome sequence data will be analyzed using VNTRseek against a set of reference tandem repeats generated from the tandem repeats database. This software determines the presence of a particular repeat. Each output will contain the number of repeats for each VNTR for each genome and this will be filtered for repeats that have a variation from the reference to determine the presence of a VNTR. This will be used to determine allelic frequency of specific polymorphic repeats comparing controls to AD. 
We will also stratify the analysis by gender and by the different ancestral phenotypes in the ADSP to assess for specific alleles that may be more strongly associated in one ethno-racial group than another.
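The filtering and allele-frequency step described in this statement can be sketched as follows. This is a minimal illustration in Python, not VNTRseek's actual output format: the record layout (sample, status, locus, reference copy number, observed copy numbers) and all values are hypothetical.

```python
from collections import defaultdict

# Hypothetical per-genome calls:
# (sample_id, status, locus, reference copy number, observed copy numbers on both alleles)
calls = [
    ("s1", "AD", "TR1", 5, (5, 7)),
    ("s2", "AD", "TR1", 5, (7, 7)),
    ("s3", "control", "TR1", 5, (5, 5)),
    ("s4", "control", "TR1", 5, (5, 7)),
]

def non_reference_allele_frequency(calls):
    """Return {locus: {status: fraction of alleles whose copy number differs from the reference}}."""
    variant = defaultdict(lambda: defaultdict(int))
    total = defaultdict(lambda: defaultdict(int))
    for _sample, status, locus, ref_copies, copies in calls:
        for c in copies:
            total[locus][status] += 1
            if c != ref_copies:  # keep only alleles that vary from the reference
                variant[locus][status] += 1
    return {
        locus: {status: variant[locus][status] / n for status, n in by_status.items()}
        for locus, by_status in total.items()
    }

freqs = non_reference_allele_frequency(calls)
```

The same function could be run separately per gender or ancestry group by filtering `calls` before the call.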
Non-Technical Research Use Statement:
Within a genome exist short sequences of repeating DNA. While the repeated sequence (usually >6 bases in length) is usually conserved within a population, the number of times the sequence is repeated in any given individual varies. These variants are known as Variable Number Tandem Repeats (VNTRs) and the number of these repeats can be considered a polymorphism. In the past VNTRs have been an overlooked polymorphic component of the genome even though a number of VNTRs have been shown to be associated with neurological disorders and brain functions. This study will examine the association of VNTRs in an Alzheimer’s disease (AD) population using a specialized program, VNTRseek, to explore the presence of these VNTRs in the whole genome sequences from AD cases and controls. We believe analyzing the distribution of VNTRs in a large and diverse AD population may yield new associative genetic alleles which may not only assist in the prediction of AD development but also identify new cellular pathways of interest in understanding the pathophysiology of the disorder.
Investigator:
Yokoyama, Jennifer
Institution:
University of California, San Francisco
Project Title:
Rare variation contributing to Alzheimer's disease risk
Date of Approval:
February 3, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Identification and characterization of genetic modifiers of risk for Alzheimer’s disease (AD) is paramount to development of a deeper understanding of AD pathogenesis as well as the identification of biomarkers and drug targets. The proposed research seeks to identify novel rare variants that could modulate an individual’s risk for developing sporadic late-onset or early-onset AD, and validate rare variants identified and characterized by our group that contribute to AD risk. In particular, we will focus on characterizing the contributions of changes in repetitive sequences within the coding regions of genes to AD risk. This study will also combine datasets from this project with whole genome or exome sequencing data generated by the University of California, San Francisco from individuals diagnosed with atypical or early-onset (< 65 years of age at diagnosis) AD to identify genetic risk factors unique to these less common forms of AD.
Non-Technical Research Use Statement:
We will use ADSP data in conjunction with existing data from our research center to characterize genetic variation that influences a person’s risk for developing Alzheimer’s disease. After establishing a set of candidate variants, we will functionally characterize their biological effect using cell and biochemical assays. Identification and characterization of Alzheimer’s disease risk modifiers will not only enhance our understanding of disease pathogenesis, but may also facilitate identification of therapeutic targets and biomarkers for preventing Alzheimer’s disease.
Investigator:
Yokoyama, Jennifer
Institution:
University of California, San Francisco
Project Title:
Trans-ethnic meta-analysis and fine-mapping of Alzheimer’s Disease loci
Date of Approval:
August 14, 2023
Request status:
Closed
Research use statements:
Technical Research Use Statement:
Alzheimer’s disease (AD) is a leading cause of death and disability worldwide. Despite the tremendous burden of the disease and arduous research efforts in the field, there are currently no disease-modifying treatments. Better elucidation of the genetic etiology of AD is needed to drive further drug discovery. Although the largest genome-wide association study (GWAS) of AD included over one million individuals of European ancestry and identified 38 associated loci, previous studies have highlighted the disproportionate disease risk and differing genetic architecture for AD across global populations. We hypothesize that a trans-ancestry approach would improve the generalizability of GWAS results, increase statistical power for locus discovery, and improve fine-mapping resolution to identify putative causal variants. Using a method that accounts for ancestral heterogeneity, we leveraged published and de novo GWAS from individuals of European, East Asian, African American, South Asian, and Caribbean Hispanic ancestry and performed the largest trans-ancestry meta-analysis of AD to date. This method has allowed us to identify a novel AD locus near the Lymphocyte Cytosolic Protein 1 (LCP1) gene and prioritize an intronic SNP, rs2146890, as likely driving the observed association. Our findings further support the involvement of immune-related pathways in AD pathogenesis and highlight the importance of multi-ancestry representation in genetic studies. We propose utilizing the LASI-DAD GWAS and Imputation dataset to further extend these results by completing additional trans-ancestry meta-analyses.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is a leading cause of death and disability worldwide. Although the largest genome-wide association study (GWAS) of AD included over one million individuals of European ancestry and identified 38 associated loci, previous studies have highlighted the disproportionate disease risk and differing genetic architecture for AD across global populations. Our findings have supported the involvement of immune-related pathways in AD pathogenesis and highlight the importance of multi-ancestry representation in genetic studies. We will use the LASI-DAD GWAS and Imputation data in conjunction with existing open datasets from individuals of European, East Asian, African American, South Asian, and Caribbean Hispanic ancestry to perform the largest trans-ancestry meta-analysis of Alzheimer’s disease to date.
Investigator:
Yokoyama, Jennifer
Institution:
University of California, San Francisco
Project Title:
Developing a novel tau-based polygenic score (TPGS)
Date of Approval:
June 5, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
The objective of this study is to develop a novel tau polygenic score (TPGS). We hypothesize that variation in genes associated with tauopathies (genes related to tau metabolism or differentially expressed in comparative studies of iPSC-derived neurons, astrocytes, and microglia generated from carriers of MAPT pathogenic variants vs. isogenic controls) is associated with greater risk of sporadic primary and secondary tauopathy. Study Design: We will utilize summary statistics from large genome-wide association studies of tauopathies, selecting relevant variants that flank genes implicated by functional studies. Data requested from NG00169 will be analyzed in parallel with other publicly available GWAS summary statistics as well as internal data generated at UCSF. We do not anticipate that combining these datasets will increase the risk of participant re-identification, because the datasets derive from non-overlapping cohorts. Analysis Plan: We will assess multiple methods for generating the TPGS, and test whether the resulting scores are associated with primary (FTLD-tau, PSP) and secondary (AD) tauopathy risk and endophenotypes of tauopathy (e.g., tau pathological burden, measures of tau in CSF). First, we will classify genetic variants using machine learning methods. We hypothesize that machine learning can be used to improve biologically driven predictions for diagnoses, pathology, and biomarkers. We will also leverage summary statistics from NG00169 to generate polygenic scores from a preselected, MAPT-related marker list. We will apply these scores to our internal whole genome sequencing discovery dataset. The TPGS will be further validated in a separate internal cohort. Collaboration: Data obtained through this application will not be shared with researchers at other institutions.
Non-Technical Research Use Statement:
Tauopathies are a class of neurodegenerative diseases characterized by the abnormal accumulation of the tau protein. Tauopathies, including Alzheimer’s disease (AD), frontotemporal dementia (FTD), and progressive supranuclear palsy (PSP), are influenced by the combination of genetic variants that are present within an individual. Polygenic scores (PGS) can be used to estimate an individual’s genetic susceptibility for developing a trait. In this study, we will be generating a new tau polygenic score (TPGS) to predict the likelihood that a person will develop traits associated with tauopathies, such as increased tau in blood plasma. We will use summary statistics generated from genome-wide association studies in order to calculate the risk that each genetic variant carries. Then, we will calculate individual risk scores in a cohort of UCSF research participants in order to validate the scoring algorithm.
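As a rough illustration of the scoring step described above, a polygenic score is commonly computed as a weighted sum of effect-allele counts, with weights taken from GWAS summary statistics. The sketch below assumes that basic additive form; the statement does not specify the scoring method, and the variant IDs, weights, and genotypes are hypothetical.

```python
# Hypothetical GWAS summary statistics: variant ID -> effect size (beta) for the effect allele
weights = {"rsA": 0.25, "rsB": -0.5, "rsC": 1.0}

# Hypothetical genotypes: count of effect alleles (0, 1, or 2) per variant for each person
genotypes = {
    "person1": {"rsA": 2, "rsB": 0, "rsC": 1},
    "person2": {"rsA": 0, "rsB": 2},  # rsC not genotyped for this person
}

def polygenic_score(dosages, weights):
    """Sum of beta * dosage over variants present in both inputs; missing variants are skipped."""
    return sum(beta * dosages[v] for v, beta in weights.items() if v in dosages)

scores = {person: polygenic_score(d, weights) for person, d in genotypes.items()}
```

Skipping missing variants is one simple design choice; imputing the expected dosage from allele frequency is a common alternative.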
Investigator:
Yokoyama, Jennifer
Institution:
University of California, San Francisco
Project Title:
Investigating the immunogenetic mechanisms of Alzheimer's disease
Date of Approval:
September 5, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objectives: The primary aim of this study is to compare how the abundance and gene expression of immune cells in the brains of Alzheimer's disease (AD) patients and healthy individuals differ by genotype at immune-related genes, using the SEA-AD dataset. Secondly, in both SEA-AD and ADSP datasets, we aim to associate SNPs in immune-related genes with clinical and pathological measures. Study Design: This study will combine the publicly available single-cell sequencing data and WGS data from the SEA-AD dataset, comprising samples from both AD patients and healthy individuals. We will also assess WGS and cognitive data from the ADSP study. We will stratify patients by diagnosis (AD, control). Immune-associated genotypes will be extracted from whole genome sequencing data; gene expression will be quantified from bulk and single-cell transcriptomic data available in the SEA-AD cohort. Analysis Plan: Using the immune cell data from the SEA-AD datasets, we will identify and characterize different immune cell populations present in the brain samples. Gene expression profiles of immune cells will be analyzed to identify differentially expressed genes (DEGs) between genotypes. This analysis will be conducted using methods such as DESeq2 or edgeR, with adjustments for multiple testing. DEGs will be functionally annotated and subjected to pathway and functional enrichment analysis to elucidate biological processes and pathways associated with immune cell function. We will compare the abundance of immune cell types between individuals with different genotypes using statistical tests such as t-tests or Wilcoxon rank-sum tests and adjust for cofactors through the use of generalized linear models. To assess associations between genetic variants and cognitive outcomes, we will leverage data from ADSP and SEA-AD cohorts, employing linear mixed-effects regression models that control for demographic (race, sex, age, education) and clinical (APOE status) characteristics.
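The statement mentions adjusting for multiple testing without naming a procedure. One common choice for differential expression results is the Benjamini-Hochberg false discovery rate adjustment; the sketch below assumes that procedure purely for illustration, and the p-values are made up.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices from smallest to largest p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Hypothetical per-gene p-values from a differential expression test
pvals = [0.01, 0.04, 0.03, 0.5]
qvals = benjamini_hochberg(pvals)
```

Genes whose adjusted value falls below a chosen threshold (often 0.05) would then be reported as differentially expressed.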
Non-Technical Research Use Statement:
This study aims to understand how immune cells in the brain differ between Alzheimer's disease (AD) patients and healthy individuals, focusing on genetic differences in immune-related genes. Additionally, we will analyze single-cell sequencing and whole genome sequencing (WGS) data to compare immune cell abundance, gene expression, and genetic variation. We will also investigate how specific genetic variations (SNPs) in immune-related genes are linked to clinical and pathological features of AD.
Investigator:
Younkin, Steven
Institution:
Mayo Clinic
Project Title:
Evaluating polygenic architecture of Alzheimer’s disease using next gen sequencing data
Date of Approval:
August 11, 2022
Request status:
Expired
Research use statements:
Technical Research Use Statement:
Research objective: To examine the polygenic architecture of AD using genetic variants identified in the ADSP data set. We plan to evaluate genetic variation originating from genes, pathways and other functional units for association with AD. Study design: Using whole genome and whole exome sequence, we will perform subject and variant QC using in house pipelines. We will then search for small sets of functionally related variants that show significant, replicable association with AD. Analysis plan: Our analysis plan will include in depth quality control of samples/subjects for the following metrics: sequencing coverage, sample call rate, missing chromosomes, sample contamination, sex check, relatedness, population substructure and a check of APOE genotypes. This will be followed by variant quality control and subsequent single variant association analysis adjusting for variation arising from population substructure, sequencing centers, sex, and APOE genotype. Using approaches suitable for analyzing sets of genes and/or variants (e.g. polygenic score analysis, SKAT-O), we will explore the polygenic architecture of AD of all available datasets by searching for sets of functionally related genes and/or variants which show significant, replicable association with AD.
Non-Technical Research Use Statement:
To identify novel AD genes which can be studied to develop new therapeutic approaches to AD, we will extend conventional analysis of single genetic variants by using methods capable of jointly analyzing all of the genetic variation in individual genes and functionally related sets of genes. These methods should enable us to identify novel, functionally related sets of genes which alter risk of AD.
Investigator:
Yu, Haiyuan
Institution:
Cornell University
Project Title:
Methods development in detecting rare non-coding variants in enhancer regions in Alzheimer's disease
Date of Approval:
January 30, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
The project aims to develop a new method to detect rare non-coding variants in enhancers in Alzheimer's disease. Briefly, we limit variant discovery to potential enhancer hotspots identified from a combination of chromatin marks and eRNAs. We will study the effects of newly discovered non-coding variants using mutations from the whole-genome sequencing data, and examine their associations with phenotypes. The requested datasets will be used to establish and validate our method.
Non-Technical Research Use Statement:
Alzheimer's disease is potentially caused by rare mutations in the human genome. Detecting these rare variants has proven difficult. Our major goal is to develop a new approach to identify these rare mutations and determine their contribution to the risk of Alzheimer's disease.
Investigator:
Zaranek, Alexander (Sasha) Wait
Institution:
Curii Corporation
Project Title:
AI4AD (Artificial Intelligence for Alzheimer’s Disease): Ultrascale Machine Learning to Empower Discovery in Alzheimer’s Disease Biobanks
Date of Approval:
February 1, 2023
Request status:
Expired
Research use statements:
Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to genomic, imaging and cognitive data, in order to 1) identify AD genotypes and endophenotypes that dissect AD’s heterogeneity; 2) relate said genotypes and endophenotypes with clinical progression in pre-dementia patients; 3) identify novel treatment targets for AD by analyzing whole genome and associated phenotypic data. The goals of this multisite initiative (Paul Thompson, USC; Christos Davatzikos, Li Shen, Penn; Andy Saykin, IU; Heng Huang, Pitt; Paul Crane, UW; Adam Brickman, Columbia; Tim Hohman, Vanderbilt; Gyungah Jun, BU; Duygu Tosun, UCSF; Alexander Zaranek, Curii) leverage the promise of machine learning (ML) to contribute to precision diagnostics, prognostication, and targeted and novel treatments. We will develop ML and deep learning methods to apply to large scale biobanks of whole genome sequences (WGS), neuroimaging, cognitive, and clinical data, aiming to discover new genomic features that influence biological processes of AD. We will apply methods of genome representation and tiling to WGS repositories to create inputs for AI methods. We will develop novel, interpretable, biological knowledge guided deep learning methods to discover genomic motifs associated with AD, AD risk, and biological processes of AD as defined by NIA-AA criteria. To quantify subtypes and disentangle biological processes of AD, we will apply computational methods to multimodal MRI and amyloid- and tau-sensitive PET to stratify and subtype patient groups; novel imaging genomics methods will detect genomic markers and pathways that modulate the developing pathology as detected in the images, and that predict future clinical decline or resilience.
We hypothesize that advanced deep learning methods combined with whole genome data will outperform traditional methods and GWAS for predicting AD onset and progression, and will assist with disease subtyping and discovering treatable targets in the genome. A team will rank and repurpose existing drugs, and identify novel drugs and targets in the genome, based on the discovered genetic motifs affecting AD.
Non-Technical Research Use Statement:
The AI4AD (Artificial Intelligence for Alzheimer’s Disease) Initiative aims to create and develop advanced AI methods, and apply them to extensive and rich genomic, imaging and cognitive data, in order to 1) identify genotypes and endophenotypes of AD that dissect the heterogeneity of the disease; 2) relate these genotypes and endophenotypes with clinical progression, in pre-dementia patients; 3) identify novel treatment targets for AD, by analyzing whole genome and associated phenotypic data at a previously impossible scale. Collectively, the goals of this highly collaborative multi-site initiative leverage the promise of machine learning to contribute to precision diagnostics, prognostication, and targeted and novel treatments.
Investigator:
Zhan, Huixin
Institution:
New Mexico Institute of Mining and Technology
Project Title:
AI-Driven Analysis of Genetic and Transcriptomic Data in Alzheimer’s Disease
Date of Approval:
March 30, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objectives: This study aims to improve understanding of the genetic and molecular mechanisms underlying Alzheimer’s disease (AD) by applying advanced computational and deep learning models to existing genomic and transcriptomic datasets. Specifically, we seek to identify and characterize genetic variants associated with AD risk, progression, and related phenotypes, contributing to precision medicine approaches for neurodegenerative disorders. Study Design: This project involves secondary analysis of de-identified, controlled-access datasets from NIAGADS (NG00067, NG00116, NG00174, NG00027, NG00075). No new data will be collected. The data will be securely downloaded and analyzed on institutional servers at New Mexico Tech under an approved IRB and Data Use Agreement. Analysis Plan: We will integrate genomic, transcriptomic, and phenotypic data to develop and evaluate machine learning models—such as large language model–based architectures and disease-specific neural networks—to predict variant pathogenicity and gene-level associations. Phenotypic characteristics evaluated will include Alzheimer’s disease diagnosis, cognitive performance measures, neuropathological burden, and biomarker profiles (e.g., amyloid and tau levels). Statistical and model-based analyses will assess associations between genetic variants and these phenotypes, with results reported in aggregate, non-identifiable form. Collaborations (if applicable): N/A
Non-Technical Research Use Statement:
This project uses advanced artificial intelligence and statistical tools to study the genetic and molecular factors that contribute to Alzheimer’s disease. By analyzing existing, de-identified research data from the National Institute on Aging’s NIAGADS repository, we aim to identify genetic variants and biological pathways linked to disease risk and progression. The study will combine information from DNA and gene-expression data to build computer models that can better predict how certain genetic changes affect brain health. Our ultimate goal is to improve scientific understanding of Alzheimer’s disease and support future efforts in early detection and personalized treatment.
Investigator:
Zhang, Haiyang
Institution:
Vivid Genomics
Project Title:
Validation and optimization of Alzheimer's Disease phenotypes prediction using machine learning enabled polygenic risk models
Date of Approval:
March 28, 2023
Request status:
Expired
Research use statements:
Technical Research Use Statement:
Analyzing postmortem phenotype and genomics data from ~1100 human brain samples with machine learning, Vivid Genomics, Inc., has developed prototype genetic biomarker assays that predict the presence of amyloid plaques, Lewy body pathology, cerebral amyloid angiopathy, and rate of cognitive decline. The objective is to increase subject numbers with similar data available through NIAGADS and NACC, along with datasets from several individual academic centers, to further optimize and validate our assays for neurodegenerative/cerebrovascular lesion types including tau, TDP-43, hippocampal sclerosis and microinfarcts, and for predicting rate of cognitive decline. NIAGADS datasets requested are NG00067 (including the newly released version 9 of this dataset), NG00119, NG00117 and NG00127; data use limitations from these do not exclude our proposed usage. We are targeting >3000 subjects in total to be used for the validation of our models. We will focus on SNP selection and test the effects of different analysis strategies: 1) changing SNP p-value cutoffs; 2) using LD-filtered representative SNPs with full genome coverage; 3) testing the value of stratifying by APOE genotype; 4) determining whether it is better to add other covariates including age and sex. A fraction of the genetic data (~30%) will be withheld for validation. Optimization is defined as an area under the curve (AUC) of 80% and positive predictive value (PPV) of 80%, as well as R² > 0.75 for all assays. Values within 10% of these targets will be considered a successful validation. Through these assays, this project will benefit those suffering from Alzheimer’s disease and other neurodegenerative disorders by increasing clinical trial efficiency through more precise subject selection and/or stratification.
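The validation criteria above can be made concrete with a small sketch that computes AUC and PPV on a held-out set and checks them against the stated targets. This is an illustration only: the labels and scores are hypothetical, the 0.5 decision threshold is an assumption, and "within 10%" is read here as a relative shortfall of at most 10% of the target, which the statement does not specify.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison (ties count as half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def ppv(labels, scores, threshold):
    """Positive predictive value: true positives / all predicted positives."""
    predicted = [y for y, s in zip(labels, scores) if s >= threshold]
    return sum(predicted) / len(predicted)

def meets_target(value, target, tolerance=0.10):
    """Assumption: 'within 10%' means value >= target * (1 - tolerance)."""
    return value >= target * (1 - tolerance)

# Hypothetical held-out labels (1 = pathology present) and model scores
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]

holdout_auc = auc(labels, scores)
holdout_ppv = ppv(labels, scores, threshold=0.5)
validated = meets_target(holdout_auc, 0.80) and meets_target(holdout_ppv, 0.80)
```

With these toy values the AUC clears its target but the PPV does not, so `validated` is false; an R² check for the continuous cognitive-decline assay would follow the same pattern.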
Non-Technical Research Use Statement:
Vivid Genomics is dedicated to developing genetic tests, typically done from DNA obtained from blood, that will predict, for any given older person, the likelihood that they have, or might develop when they become old, the characteristic brain changes of Alzheimer’s disease as well as other brain changes that affect thinking in older people. These changes include amyloid or senile plaques, tangles or tau, amyloid angiopathy, Lewy bodies, TDP-43 pathology, hippocampal sclerosis and brain infarcts (strokes). The objective of this study is to improve upon initial tests developed by Vivid, and to also develop genetic tests to predict the rate at which older people’s thinking ability decreases over time. To do this, Vivid Genomics requests human subject DNA analysis data stored at NIAGADS. Through these new genetic tests, Vivid hopes to benefit those suffering from Alzheimer’s disease and other brain diseases of aging by allowing better selection of subjects for clinical trials of these diseases, which would increase the chances of clinical trials finding useful new treatments.
Investigator:
Zhao, Dongjiao
Institution:
Messiah University
Project Title:
Gene Expression and Biological Pathway Associations with Clinical Phenotypes in Alzheimer’s Disease
Date of Approval:
June 26, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
This project will examine how genetic variants, gene expression profiles, transcriptomic patterns, APOE status where available, and biological pathways are associated with Alzheimer’s disease and related cognitive, functional, and neuropathological phenotypes. The goal is to identify genomic and pathway-level factors that may help explain cognitive decline, daily functional changes, neuropathological burden, and vulnerability or resilience in Alzheimer’s disease and related neurodegenerative or cognitive aging conditions. This study is a secondary analysis of existing coded/de-identified controlled-access data obtained through NIAGADS DSS. No new participants will be recruited, no participant contact will occur, and no new biospecimens or data will be collected. The requested datasets include Alzheimer’s disease sequencing data, oldest-old clinical and pathological data, microglia single-nuclei RNA-seq data, multi-brain-region mRNA-seq data from sporadic ALS, and genomic data related to cognitive ability. All analyses will comply with NIAGADS data use limitations, NIH Genomic Data Sharing Policy, NIA requirements, and Messiah University institutional oversight. Analyses will evaluate associations between genetic or gene-level features and approved phenotypes, including AD diagnosis, case-control status, cognitive performance, general cognitive ability, functional status, daily activity measures, neuropathological findings, microglia-related expression profiles, brain-region-specific expression patterns, and disease stage where available. Methods may include quality control, phenotype harmonization, regression-based association testing, differential gene expression analysis, variant-to-gene annotation, gene-level aggregation, pathway enrichment, and aggregate or polygenic measures if appropriate. Models will adjust for relevant covariates such as age, sex, ancestry, relatedness, brain region, cell type, disease stage, batch effects, and technical factors. 
Only aggregate or summary-level results will be reported.
Non-Technical Research Use Statement:
This project will use existing de-identified research data to study why some people develop Alzheimer’s disease or cognitive decline while others remain more resilient. The study will look at genetic information, gene activity, and biological pathways that may be related to memory, thinking ability, daily function, and brain changes seen in Alzheimer’s disease and related conditions. No new participants will be recruited, and no new samples will be collected. The project will only use approved research data from NIAGADS. The goal is to better understand how inherited genetic differences and changes in gene expression may contribute to Alzheimer’s disease, cognitive decline, and functional changes in daily life. This research may help identify biological patterns that could support future studies on earlier detection, risk prediction, and better understanding of neurodegenerative diseases. All data used in this project will be coded or de-identified. The study will not attempt to identify any individual participant, and results will only be reported in summary form.
Investigator:
Zhao, Jinying
Institution:
University of Florida
Project Title:
Identifying novel biomarkers for human complex diseases using an integrated multi-omics approach
Date of Approval:
November 21, 2023
Request status:
Closed
Research use statements:
Technical Research Use Statement:
GWAS, WES and WGS have identified many genes associated with Alzheimer’s Dementia (AD) and its related traits. However, the genes identified thus far collectively explain only a small proportion of disease heritability, suggesting that more genes remain to be identified. Moreover, there is a clear gender and ethnic disparity in AD susceptibility, but little research has been done to identify gender- and ethnic-specific variants associated with AD. Of the many challenges in deciphering AD pathology, the lack of efficient and powerful statistical methods for genetic association mapping and causal inference represents a major bottleneck. To tackle this challenge, we have developed a set of novel statistical and bioinformatics approaches for genetic association mapping and multi-omics causation inference in large-scale ethnicity-specific epidemiological studies. The goal of this project is to leverage the multi-omics and clinical data archived by the ADSP, ADNI, ADGC as well as other AD-related data repositories to identify novel genes and molecular markers for AD. Specifically, we will (1) validate our novel methods for identifying novel risk and protective genomic variants and multi-omics causal pathways of AD; (2) identify novel ethnicity- and gender-specific genes and molecular causal pathways of AD. We will share our results, statistical methods and computational software with the scientific community.
Non-Technical Research Use Statement:
Although many genes have been associated with Alzheimer’s Dementia (AD), these genes altogether explain only a small fraction of disease etiology, suggesting more genes remain to be identified. Of the many challenges in deciphering AD pathology, the lack of powerful statistical methods represents a major bottleneck. To tackle this challenge, we have developed a set of novel statistical and bioinformatics approaches for genetic association mapping and multi-omics causation inference in large-scale ethnicity-specific epidemiological studies. The goal of this project is to leverage the rich genetic and other omic data along with clinical data archived by the ADSP, ADNI, ADGC as well as other AD-related data repositories to identify novel genes and molecular markers for AD. Such results will enhance our understanding of AD pathogenesis and may also yield biomarkers for early diagnosis and therapeutic targets.
Investigator:
Zhao, Yize
Institution:
Yale University
Project Title:
Integrative analysis for patient-centered outcomes and time-to-event data in Alzheimer’s disease
Date of Approval:
March 30, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objectives: To develop and validate analytical models that integrate NIAGADS-hosted genetic data with NACC longitudinal clinical and biomarker profiles to characterize the progression of the continuum of Alzheimer’s disease from preclinical stages. The goals are to reconstruct individualized biomarker dynamics under sparse and irregular sampling, relate these dynamics to the timing of clinical hallmarks, and identify genetic variants associated with quantitative trajectory-based phenotypes. Study design: The project focuses on new statistical and ML model development, as well as on retrospective observational analyses of de-identified NACC participants with linked genotype data and repeated assessments. Methods will be developed and internally validated using the observed NACC follow-up schedules and available longitudinal history. Analysis plan: We will build statistical and machine learning models tailored to irregular longitudinal data to estimate person-specific trajectories from available biomarkers, including multi-modal imaging biomarkers, and from clinical and cognitive measures. From fitted trajectories, we will derive composite trajectory traits that summarize individualized marker dynamics and their relationship to event timing, including onset and progression to MCI and Alzheimer’s disease when available. We will test associations between genetic variants and these quantitative phenotypes using appropriate covariate adjustment for sex and genetic ancestry components, apply multiple testing control, and assess stability with resampling-based inference where appropriate. We will develop dynamic prediction models that update risk over time using subject-specific landmark times defined by each participant’s assessment schedule, and evaluate discrimination and calibration by cross-validation. We will also develop time-varying composite measures and examine how they modify predicted risk trajectories.
Non-Technical Research Use Statement:
We will develop new analytical methods to learn how Alzheimer’s disease develops before symptoms appear. Using de-identified NACC health and biomarker data linked to NIAGADS genetics, we will build models that reconstruct how key biomarkers change over time even when measurements are infrequent, irregular, or missing. These models will produce individualized trajectory scores and other quantitative summaries, and we will study how genetic differences relate to those early patterns. The long-term aim is to improve understanding of early disease biology and support earlier identification of vulnerability.
Investigator:
Zhao, Zhongming
Institution:
University of Texas Health Science Center at Houston
Project Title:
AIM-AI: an Actionable, Integrated and Multiscale genetic map of Alzheimer's disease via deep learning
Date of Approval:
June 1, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Objectives: The objective of our study is to advance our understanding of the genetic basis of Alzheimer’s Disease (AD) through the analysis of comprehensive genomic datasets such as Whole Exome Sequencing (WES), Whole Genome Sequencing (WGS), single-nuclei RNA sequencing, and Genome-Wide Association Studies (GWAS), as well as the related phenotype. We aim to identify genetic variants that are integral to the development and progression of AD. Study Design: Our approach involves a detailed multi-omics analysis focusing on both coding and non-coding regions within these datasets. We will develop new analytical variables from existing data, ensuring that our research adheres to the established data use limitations and contributes meaningfully to the field of genetic research in AD. Analysis Plan: The plan centers on investigating the correlation between genetic variants and AD, exploring how these variants influence the disease at a genetic level. We will employ cutting-edge computational methods to analyze interactions between these genetic markers and their potential role in AD pathogenesis. The integration of data from multiple sources will be carefully executed to maintain compliance with data use agreements, emphasizing the scientific exploration of AD.
Non-Technical Research Use Statement:
Our research is dedicated to unraveling the genetic components of Alzheimer’s Disease. By analyzing genetic sequences and variations through various genomic datasets, we seek to deepen the scientific understanding of how these genetic elements contribute to AD. The outcomes of this study will be shared with the public, enhancing general knowledge of Alzheimer’s Disease and supporting the global research community in its ongoing efforts to decode this complex condition.
Investigator:
Zhi, Degui
Institution:
University of Texas Health Science Center at Houston
Project Title:
Genetics of deep-learning-derived neuroimaging endophenotypes for Alzheimer's Disease
Date of Approval:
February 6, 2025
Request status:
Expired
Research use statements:
Technical Research Use Statement:
Alzheimer’s disease (AD) affects 5.6 million Americans over the age of 65 and exacts tremendous and increasing demands on patients, caregivers, and healthcare resources. Our current understanding of the biology and pathophysiology of AD is still limited, hindering advances in the development of therapeutic and preventive strategies. Existing genetic studies of AD have had some success, but they explain only a fraction of the overall disease risk, suggesting opportunities for additional discoveries. The proposed project will leverage existing neuroimaging and genetic data resources from the UK Biobank, the Alzheimer’s Disease Sequencing Project (ADSP), the Alzheimer’s Disease Neuroimaging Initiative (ADNI), and the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium, and will be conducted by a multidisciplinary team of investigators. We will derive AD endophenotypes from neuroimaging data in the UK Biobank using deep learning (DL). We will identify novel genetic loci associated with DL-derived imaging endophenotypes and optimize the co-heritability of these endophenotypes with AD-related phenotypes using UK Biobank genetic data. We will leverage resources and collaborations with AD consortia and the power of DL-derived neuroimaging endophenotypes to identify novel genes for Alzheimer’s disease and AD-related traits. We will also develop DL-based neuroimaging harmonization and imputation methods and distribute implementation software to the research community. We expect to discover new genes relevant to AD, which may lead to an understanding of the molecular basis of AD and to potential new treatments.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) exacts a tremendous burden on patients, caregivers, and healthcare resources. Our current understanding of the biology of AD is still limited, hindering advances in the development of treatment and prevention. Existing genetic studies of AD have had some success, but more studies are needed. The proposed project will leverage existing neuroimaging and genetic data resources from the UK Biobank, the Alzheimer’s Disease Sequencing Project (ADSP) and other consortia and will be conducted by a multidisciplinary team of investigators. We will derive new AD-relevant intermediate phenotypes from neuroimaging data using deep learning (DL), an AI approach. We will identify novel genetic loci associated with these phenotypes. We will also develop imaging harmonization and imputation methods and distribute implementation software to the research community. We expect to discover new genes relevant to AD, which may lead to an understanding of the molecular basis of AD and to potential new treatments.
Investigator:
Zhou, Weichen
Institution:
University of Michigan
Project Title:
Explore the functional impact of transposable elements in Alzheimer’s disease and related dementias
Date of Approval:
September 4, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
Explore somatic transposable elements and their Alzheimer's disease-related patterns using genomic and phenotypic data from large cohorts: To explore the impact of transposable elements in Alzheimer's disease, we propose to conduct a systematic survey of the available large cohorts. The ADSP dataset in NIAGADS (Accession No. NG00067) includes 16,906 whole-genome sequences and 20,504 whole-exome sequences for case-control and family-based studies of Alzheimer's disease from diverse populations, which makes it an excellent resource for this project. With the support of the Michigan Alzheimer's Disease Center, we will request access to NIAGADS. To detect somatic transposable elements in the ADSP dataset, we will employ established computational pipelines to resolve the transposable elements in the sequencing data: MELT and xTEA for WGS, and SCRAMble for WES. Parameters in these tools, for instance the calling threshold of supporting reads, will be adjusted to accommodate the detection of somatic transposable elements present in cells at low frequency. To exclude potential germline transposable elements, we will leverage a master set of polymorphic transposable elements from diverse populations, based on our previous projects at the Human Genome Structural Variation Consortium, together with the case-control information provided by ADSP. We aim to summarize a spectrum of somatic transposable elements that are relevant to Alzheimer's disease, along with various clinical and phenotypic information. To build Alzheimer's disease-related genetic patterns, we will use Mutect2 (GATK) and Strelka2 to discover SNVs from WGS and WES data and link them with transposable elements on the same haplotype. After obtaining this set of patterns, we will collect phenotypic information from the ADSP dataset to conduct family-based association analysis and gene-burden analysis.
RegulomeDB will be used to annotate the non-coding functional impact and regulatory effects of these Alzheimer's disease-related patterns.
Non-Technical Research Use Statement:
This project seeks to explore the connection between somatic transposable elements in the human genome and Alzheimer’s disease and related dementias. It will leverage large-scale datasets to extensively explore genome-wide transposable elements and then stratify the Alzheimer’s disease-relevant ones using the rich clinical information from the cohorts. Further analysis pipelines will be built on the results of the proposed project to investigate the functional impact of these transposable elements on Alzheimer’s disease, which would improve the understanding of the genetic causes of Alzheimer’s disease and related dementias.
Investigator:
Zhou, Wenyu
Institution:
Teal Omics, Inc.
Project Title:
Understanding CNS diseases through organ specific aging biomarkers
Date of Approval:
February 27, 2025
Request status:
Expired
Research use statements:
Technical Research Use Statement:
Can blood-based protein biomarkers of aging predict functional decline in organ health and in systemic health? We have developed organ-specific aging models derived from plasma proteins, and previously found that the blood proteome can be used to monitor organ health and aging in smaller cohorts (Oh et al., Nature 624, 164–172 (2023)). We aim to test this hypothesis at scale with diversified datasets. Aim 1. Validate our approach to modeling aging and brain health with NIAGADS cohorts. Approach: we will use protein biomarker data from cohorts to train machine learning models of organ aging. We will employ machine learning best practices for data curation, data augmentation, data normalization, and training/testing to evaluate whether we can reproducibly estimate brain aging in diversified healthy cohorts and disease cohorts. Aim 2. Test the effects of brain aging on future disease risk and functional decline. We will focus particularly on the aging brain to investigate the relationship between organ aging and cognitive decline. Approach: we will evaluate the relationships between our model's aging predictions and aging, diseases, and clinical phenotypes, such as mortality and lab and physical assessments. This is primarily done by first calculating the difference between a sample’s chronological age and its predicted organ age. This organ age residual is then tested for association with phenotypes of interest, such as Alzheimer’s disease status or future risk of cognitive decline.
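The age-difference step described above can be sketched in a few lines. This is a minimal illustration under stated assumptions: the sign convention (predicted minus chronological age), the function names, and the toy values are all hypothetical, and the group comparison shown is a simple difference in means rather than the association test the investigators will actually use.

```python
from statistics import mean


def age_gaps(chronological, predicted):
    """Per-sample organ age gap.

    Assumption: the gap is defined as predicted organ age minus
    chronological age, so a positive value means the organ looks older
    than the person. The request does not state the sign convention.
    """
    return [p - c for c, p in zip(chronological, predicted)]


def mean_gap_difference(gaps, is_case):
    """Mean age gap among cases minus mean age gap among controls."""
    cases = [g for g, flag in zip(gaps, is_case) if flag]
    controls = [g for g, flag in zip(gaps, is_case) if not flag]
    return mean(cases) - mean(controls)


if __name__ == "__main__":
    # Toy values for illustration only; these are not real data.
    chronological = [70, 72, 68, 75]
    predicted = [74, 71, 73, 75]
    is_case = [True, False, True, False]

    gaps = age_gaps(chronological, predicted)
    print(gaps)
    print(mean_gap_difference(gaps, is_case))
```

In practice the gap would usually be entered into a regression model with covariates, but the quantity being tested is the per-sample difference computed here.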
Non-Technical Research Use Statement:
As we age, our risk of getting sick increases, but did you know that everyone ages differently? Some people's bodies deteriorate faster than others, and we don't fully understand why. By studying how our bodies change with age, we aim to identify which organs are aging the most and find ways to improve quality of life tailored to each individual's needs. We want to understand how people age differently, both as individuals and as a population on organ specific levels. By analyzing molecules in blood, we aim to:
- Identify how aging affects people uniquely
- Link aging patterns to diseases and lifespan
- Use advanced computer models to connect blood changes to organ function
Our research can lead to:
- Personalized patient care
- New discoveries for medicines
- Improved understanding of aging and age-related diseases
We'll use a large, diverse dataset to validate our approach and gain valuable insights into the aging process. By unlocking these secrets, we can work towards a healthier future for everyone.
Investigator:
Zhu, Hongtu
Institution:
Department of Biostatistics, The University of North Carolina at Chapel Hill
Project Title:
Development of a structured knowledge graph for AD for better prediction, diagnosis and treatment
Date of Approval:
June 25, 2026
Request status:
Approved
Research use statements:
Technical Research Use Statement:
We will use the data to build an AD-related omics database, both by applying state-of-the-art methods and by developing advanced deep learning-based methods for harmonization and imputation of multi-omics data, facilitating systems biology studies to gain a deeper understanding of Alzheimer's disease (AD). We will also use the data to identify genetic biomarkers with causal effects on behavioral deficits in AD and use these biomarkers to help predict AD. SNP data will first be screened by GWAS, and significant SNPs will be selected as genetic biomarkers according to their p-values. Causal effects will then be estimated to evaluate the contribution of each genetic biomarker. For AD prediction, statistical, machine learning, and deep learning approaches will be explored and compared, which may include, but are not limited to, PCA decomposition, ridge regression and elastic net, boosting algorithms such as XGBoost and LightGBM, and deep learning models such as the deep factorization machine. In the longer term, we want to develop a structured knowledge graph, based on the literature and expert knowledge, for better prediction, diagnosis and treatment of AD.
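The p-value screening step described above might look like the following sketch. The function name, the SNP identifiers, and the p-values are hypothetical, and the default cutoff of 5e-8 (a commonly used genome-wide significance threshold) is an assumption, since the request does not state which threshold will be applied.

```python
def select_snps(pvalues_by_snp, threshold=5e-8):
    """Return the SNP identifiers whose GWAS p-value is below the threshold.

    Assumption: 5e-8 is used as a conventional genome-wide cutoff; the
    request only says that SNPs are selected "according to their p-values".
    Results are ordered from most to least significant.
    """
    selected = [snp for snp, p in pvalues_by_snp.items() if p < threshold]
    return sorted(selected, key=lambda snp: pvalues_by_snp[snp])


if __name__ == "__main__":
    # Hypothetical identifiers and p-values, for illustration only.
    gwas_results = {"snp_a": 1e-9, "snp_b": 0.03, "snp_c": 4e-8}
    print(select_snps(gwas_results))
```

The selected identifiers would then define the feature columns passed to the downstream causal effect estimation and prediction models.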
Non-Technical Research Use Statement:
Leveraging our newly developed causal inference method, we aim to identify genetic causal pathways for Alzheimer's disease (AD) from genomic data collected from multiple populations. The identified features will be used to predict cognitive and behavioral scores among patients with AD or mild cognitive impairment (MCI). Finally, we want to develop a knowledge graph for better prediction, diagnosis and treatment of AD.
Investigator:
Zody, Michael
Institution:
New York Genome Center
Project Title:
Characterizing complex structural variation in Alzheimer's Disease
Date of Approval:
August 15, 2025
Request status:
Approved
Research use statements:
Technical Research Use Statement:
We will run multiple structural variation (SV) callers, including but not limited to Absinthe, Canvas, Manta, MELT, and PanGenie, on the entire Alzheimer’s Disease Sequencing Project (ADSP) WGS data set. We will combine these data into a single coherent SV call set, which we will genotype on the entire ADSP WGS data set. We will use these data, together with SNV and indel calls, to generate a phased integrated SNV/indel/SV genotype panel for ADSP. We will then use this panel to impute the presence of SVs in data from Alzheimer’s Disease Genetics Consortium (ADGC) genotyping on samples not included in ADSP, and also in external phenotyped population sequencing projects including but not necessarily limited to UK Biobank. The resulting directly genotyped and imputed SVs will be used in association studies with AD status and related phenotypes. We will perform this work in collaboration with Badri Vardarajan at Columbia University. We will collaborate with Gerard Schellenberg, Li-San Wang, Wan-Ping Lee and Yuk Yee (Fanny) Leung in GCAD at the University of Pennsylvania to share individual SV callsets we have generated distinct from theirs and to build a joint SV callset. NYGC will contract the services of Wayne Clarke from Outlier Informatics for this project. Dr. Clarke will be responsible for the cloud implementation of NYGC's SV calling pipelines and for optimizing and running data analysis on AWS and GCP as well as locally on NYGC hardware. In this capacity, Dr. Clarke will use NYGC accounts and compute infrastructure to access NIAGADS data in the cloud or downloaded to NYGC's on-premises compute infrastructure.
Non-Technical Research Use Statement:
Structural variation refers to alterations in DNA that change the copy number or ordering of large blocks of DNA. Historically, these have been difficult to detect accurately and thus their impact on disease risk has often been ignored. We will apply the latest methods to discover and characterize structural variants in Alzheimer’s Disease (AD) cases and controls, and then use genetic association techniques to determine whether these events are likely to be correlated with increased or decreased risk of developing AD or any of the specific features of AD. Identifying additional genetic risk factors may improve diagnosis of AD and may increase our understanding of the biological mechanisms leading to AD and possible therapeutics.