NG00102 - Genomic atlas of the proteome from brain, CSF and plasma prioritizes proteins implicated in neurological disorders

The pQTL summary statistics are available in the “Open Access Dataset” tab.
To access the proteomic data, please log into DSS and submit an application.
Within the application, add this dataset (accession NG00102) in the “Choose a Dataset” section.
Once approved, you will be able to log in and access the data within the DARM portal.

Description

Understanding the tissue-specific genetic controls of protein levels is essential to uncover mechanisms of post-transcriptional gene regulation. We previously generated a genomic atlas of protein levels in three tissues relevant to neurological disorders (brain, cerebrospinal fluid and plasma) by profiling thousands of proteins from participants with and without Alzheimer’s disease. We now enhanced this work by analyzing more proteins (1,300 versus 1,079) and an almost twofold increase in high-quality imputed genetic variants (8.4 million versus 4.4 million) by using TOPMed reference panel. We identified 38 genomic regions associated with 43 proteins in brain, 150 regions associated with 247 proteins in cerebrospinal fluid, and 95 regions associated with 145 proteins in plasma. Compared to our previous study, this study newly identified 12 loci in brain, 30 loci in cerebrospinal fluid, and 22 loci in plasma. cis-pQTLs were more likely to be tissue shared, but trans-pQTLs tended to be tissue specific. Between 48.0% and 76.6% of pQTLs did not co-localize with expression, splicing, DNA methylation or histone acetylation QTLs. Using Mendelian randomization, we nominated proteins implicated in neurological diseases, including Alzheimer’s disease, Parkinson’s disease and stroke. This first multi-tissue study will be instrumental to map signals from genome-wide association studies onto functional genes, to discover pathways and to identify drug targets for neurological diseases.

This dataset is part of the Knight ADRC Collection. Other datasets in this collection can be found at: https://www.niagads.org/knight-adrc-collection

Sample Summary per Data Type

Sample Set	Accession	Data Type	Number of Samples
Genomic atlas of the proteome from brain, CSF and plasma prioritizes proteins implicated in neurological disorders	snd10048	Proteomic	1157

Available Filesets

Name	Accession	Latest Release	Description
KGAD Proteomics: pQTL summary statistics and protein annotations (open access)	fsa000065	NG00102.v1	pQTL summary statistics and protein annotations
KGAD Proteomics: Proteomics, protein annotations, and QC documents (application needed)	fsa000066	NG00102.v1	Proteomics, protein annotations, and QC documents

View the File Manifest for a full list of files released in this dataset.

Provided in this dataset is a set of multi-tissue proteomic data that underwent a process of quality control measures by the Cruchaga Lab at Washington University in St. Louis, as well as pQTL summary statistics. From 1157 subjects, 1300 protein analytes were measured for 328 brain samples, 869 protein analytes were measured for 770 CSF samples, and 953 protein analytes were measured for 500 plasma samples on the SomaLogic SomaScan 1.3K platform at the Washington University Neurogenomics and Informatics Center.

Sample Set	Accession Number	Number of Subjects	Number of Samples
Genomic atlas of the proteome from brain, CSF and plasma prioritizes proteins implicated in neurological disorders	snd10048	1,157	1,598

Consent Level	Number of Subjects
DS-ADRD-IRB-PUB	1151
HMB-IRB-PUB	6

Acknowledgment statement for any data distributed by NIAGADS:

Data for this study were prepared, archived, and distributed by the National Institute on Aging Alzheimer’s Disease Data Storage Site (NIAGADS) at the University of Pennsylvania (U24-AG041689), funded by the National Institute on Aging.

Use the study-specific acknowledgement statements below (as applicable):

For investigators using any data from this dataset:

Please cite/reference the use of NIAGADS data by including the accession NG00102.

For investigators using Charles F. and Joanne Knight Alzheimer’s Disease Research Center (sa000008) data:

This work was supported by grants from the National Institutes of Health (R01AG044546, P01AG003991, RF1AG053303, R01AG058501, U01AG058922, RF1AG058501 and R01AG057777). The recruitment and clinical characterization of research participants at Washington University were supported by NIH P50 AG05681, P01 AG03991, and P01 AG026276. This work was supported by access to equipment made possible by the Hope Center for Neurological Disorders, and the Departments of Neurology and Psychiatry at Washington University School of Medicine.

We thank the contributors who collected samples used in this study, as well as patients and their families, whose help and participation made this work possible. This work was supported by access to equipment made possible by the Hope Center for Neurological Disorders, and the Departments of Neurology and Psychiatry at Washington University School of Medicine.

See below for additional dataset specific acknowledgments:

For use of the ADSP-PHC harmonized phenotypes deposited within dataset, ng00067, use the following statement:

The Memory and Aging Project at the Knight-ADRC (Knight-ADRC), supported by NIH grants R01AG064614, R01AG044546, RF1AG053303, RF1AG058501, U01AG058922 and R01AG064877 to Carlos Cruchaga. The recruitment and clinical characterization of research participants at Washington University was supported by NIH grants P30AG066444, P01AG03991, and P01AG026276. Data collection and sharing for this project was supported by NIH grants RF1AG054080, P30AG066462, R01AG064614 and U01AG052410. This work was supported by access to equipment made possible by the Hope Center for Neurological Disorders, the Neurogenomics and Informatics Center (NGI: https://neurogenomics.wustl.edu/) and the Departments of Neurology and Psychiatry at Washington University School of Medicine.

For use of ng00050 and ng00052, use the following statement:
This work was supported by Pfizer and grants from the National Institutes of Health (R01-AG044546, P01-AG003991), and the Alzheimer's Association (NIRG-11–200110). This research was conducted while Carlos Cruchaga was a recipient of a New Investigator Award in Alzheimer's disease from the American Federation for Aging Research. Carlos Cruchaga is a recipient of a BrightFocus Foundation Alzheimer's Disease Research Grant (A2013359S). The recruitment and clinical characterization of research participants at Washington University were supported by NIHP50 AG05681, P01 AG03991, and P01 AG026276. Some of the samples used in this study were genotyped by the ADGC and GERAD. ADGC is supported by grants from the NIH (#U01AG032984) and GERAD from the Wellcome Trust (GR082604MA) and the Medical Research Council (G0300429). Data collection and sharing for this project was funded by the Alzheimer's Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: Alzheimer's Association; Alzheimer's Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen Idec; Bristol-Myers Squibb Company; Eisai; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd. and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC; Johnson & Johnson Pharmaceutical Research & Development LLC; Medpace; Merck; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Synarc Inc.; and Takeda Pharmaceutical Company. The Canadian Institutes of Rev December 5, 2013 Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer's Disease Cooperative Study at the University of California, San Diego. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.

Total number of approved DARs: 12

Investigator:
Belloy, Michael
Institution:
Washington University in St Louis
Project Title:
Elucidating sex-specific risk for Alzheimer's disease through state-of-the-art genetics and multi-omics
Date of Approval:
January 6, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
• Objectives: In this project, we seek to holistically investigate the genetic and molecular drivers of sex dimorphism in Alzheimer’s disease across ancestries. • Study design: This study integrates large-scale population genetics with multi-omics and endophenotype analyses. We are integrating all data available from ADGC and ADSP, together with other data from AMP-AD and biobanks such as UKB, FinnGen, and MVP to conduct large-scale multi-ancestry GWAS, rare-variant gene aggregation analyses, QTL studies, PWAS, TWAS, etc. We also particularly focus on X chromosome association studies. The study design also interrogates interactions with ancestry, hormone exposures, and with APOE*4, as well as comparisons to non-stratified GWAS/XWAS of Alzheimer’s disease. Further, we will also employ genetic correlation analyses, mendelian randomization, colocalization, and pleiotropy analyses, to interrogate overlap with other complex traits to better understand the mechanisms underlying sex dimorphism in Alzheimer’s disease. • Analysis plan, including the phenotypic characteristics that will be evaluated in association with genetic variants: Our phenotypes will include Alzheimer’s disease risk, conversion risk, various endophenotypes (including amyloid/tau biomarkers, brain imaging metrics, etc.) as well as molecular traits. As noted above, we will conduct large-scale multi-ancestry GWAS, XWAS, rare-variant gene aggregation analyses, QTL studies, PWAS, TWAS, etc. Specific aims include interrogating these question and analyses on (1) the autosomes, (2) the X chromosome, and (3) leveraging sex stratified QTL studies to drive discovery of risk genes.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) manifests itself differently across men and women, but the genetic and molecular factors that drive this remain elusive. AD is the most common cause of dementia and till today remains largely untreatable. It is thus crucial to study the genetics of AD in a sex-specific manner, as this will help the field gain important insights into disease pathophysiology, identify novel sex-specific risk factors relevant to personalized genetic medicine, and uncover potential new AD drug targets that may benefit both sexes. This project uses large-scale genomics and multi-omics to elucidate novel sex agnostic and sex-specific AD risk genes. We will interrogate sex dimorphism for AD risk on the autosomes and the sex chromosomes. We similarly interrogate sex dimorphism in the genetic regulation of gene expression and protein levels, which we will integrate with genetic risk for Alzheimer’s disease to further discovery risk genes. Throughout, we will also interrogate how sex-specific risk for AD interactions with hormone exposures, ancestry, and the APOE*4 risk allele.
Investigator:
Cruchaga, Carlos
Institution:
Washington University School of Medicine
Project Title:
The Familial Alzheimer Sequencing (FASe) Project
Date of Approval:
March 18, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The goal of this study is to identify new genes and mutations that cause or increase risk for Alzheimer disease (AD), as well as protective factors. Individuals and families were selected from the Knight-ADRC (Washington University) and the NIA-LOAD study. Only families with at least three first-degree affected individuals were included. Families with pathogenic variants in the known AD or FTD genes, or in which APOE4 segregated with disease were excluded. At least two cases and one control were selected per family. Cases had an age at onset (AAO) after 65 yo and controls had a larger age at last assessment than the latest AAO within the family. Whole exome (WES) and whole genome sequencing (WGS) was generated for 1,235 individuals (285 families) that together with data from our collaborators and the ADSP family-based cohort (3,449 individuals and 757 families) will provide enough statistical power to identify new genes for AD. Dr. Tanzi (Harvard Medical School) will provide WGS from 400 families from the NIMH Alzheimer disease genetics initiative study. We will perform single variant and gene-based analyses to identify genes and variants that increase risk for disease in AD families. Single variant analysis will consist of a combination of association and segregation analyses. We will run family-based gene-based methods to identify genes that show and overall enrichment of variants in AD cases. We will also look for protective and modifier variants. To do this we will identify families loaded with AD cases, that also include individuals with a high burden of known risk variants but that do not develop the disease (escapees). We will use the sequence data and the family structure to identify variants that segregate with the escapee phenotype. The most promising variants and genes will be replicated in independent datasets (ADSP case-control, ADNI, Knight-ADRC, NIA-LOAD ). We will perform single variant and gene-based analyses to replicate the initial findings, and survival analysis to replicate the protective variants. We will select the most promising variants/genes for functional studies
Non-Technical Research Use Statement:
Family-based approaches led to the identification of disease-causing Alzheimer’s Disease (AD) variants in the genes encoding APP, PSEN1 and PSEN2. The identification of these genes led to the A?-cascade hypothesis and to the development of drugs that target this pathway. Recently, we have identified rare coding variants in TREM2, ABCA7, PLD3 and SORL1 with large effect sizes for risk for AD, confirming that rare coding variants play a role in the etiology of AD. In this proposal, we will identify rare risk and protective alleles using sequence data from families densely affected by AD. We hypothesize that these families are enriched for genetic risk factors. We already have sequence data from 695 families (2,462 individuals), that combined with the ADSP and the NIMH dataset will lead to a dataset of more than 1,042 families (4,684 individuals). Our preliminary results support the flexibility of this approach and strongly suggest that protective and risk variants with large effect size will be found, which will lead to a better understanding of the biology of the disease.
Investigator:
Ertekin-Taner, Nilufer
Institution:
Mayo Clinic
Project Title:
CLEAR-AD
Date of Approval:
January 6, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
This U19 aims to bridge these knowledge gaps for discovery and validation of Centrally-linked Longitudinal pEripheral biomARkers of AD (CLEAR-AD) in multi-ethnic populations. CLEAR-AD U19 is based on the premise that AD is a complex disorder in which many biological pathways are disrupted due to multi-omic perturbations, which can be detected in brain and reflected in blood. The specific aims of CLEAR-AD are: 1) To discover CLPMS of the complex and heterogeneous AD pathophysiology and its co-pathologies. 2) To identify longitudinal CLPMS that detect and predict dynamic neuroimaging, fluid biomarker, and clinical changes across AD spectrum. 3) To characterize differences and similarities in CLPMS profiles across NHW, African American (AA) and Latino American (LA) participants to uncover biomarker patterns in multi-ethnic groups. 4) To make these vast resources available to the scientific community to amplify and accelerate its impact. In this U19, we will leverage NIH-funded ADNI, MCSA and ADRC cohorts of >3,700 multi-ethnic participants to generate >20,000 multi-omics measures (Omics Core) that will be processed and integrated with >48,000 harmonized AD cognitive, neuroimaging and fluid endophenotypes (Analytic Core). Using these data, we will identify brain region and cell-type specific CLPMS, which reflect biological subtypes of AD and disease stage (Project 1). We will discover longitudinal changes in CLPMS that predict cognitive and A/T/N/V progression (Project 2). We will define longitudinal cognitive and A/T/N/V changes and CLPMS in URP that are either conserved with NHW or population-specific (Project 3). This U19 will a) Identify the next generation of AD biomarkers with mechanistic insights; b) Establish a precision medicine approach for rigorous multi-omics biomarker discovery and validation in AD; c) Discover molecules that can serve as biomarkers and therapeutic targets; d) Enhance biomarker research in trial-ready multi-ethnic populations; and e) Generate and share a vast and harmonized resource of endophenotype and multi-omics data in NIH-funded cohorts.
Non-Technical Research Use Statement:
There is a clear and immediate need for the discovery of peripheral molecular signatures linked to central disease processes, core and co-pathologies in Alzheimer’s Disease (AD), that will serve as precision medicine blood-based biomarkers for diagnostic, prognostic, theragnostic and therapeutic purposes. AD is a complex disorder in which many biological pathways are disrupted due to multi-omic perturbations, which can be detected in brain and reflected in blood, i.e. centrally-linked peripheral molecular signatures (CLPMS). This U19 will leverage deeply phenotyped, longitudinal NIH-funded multi-ethnic cohorts and cross-disciplinary expertise for multi-omics data generation and its integration with harmonized AD endophenotypes, will share these data and utilize them in integrated U19 projects to discover CLPMS that will serve as the next generation of AD biomarkers.
Investigator:
Ezzati, Ali
Institution:
University of California, Irvine
Project Title:
Advanced Modeling in Alzheimer's Cohorts and Trials
Date of Approval:
January 28, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We aim to develop robust, calibrated, and reliable predictive models by utilizing machine learning techniques on a rich dataset of imaging, omics, and phenotypic data. Our approach is twofold:1. We will employ supervised machine learning methods to construct predictive models for defined outcomes such as clinical disease stages, meaningful cognitive decline, and imaging and neuropathologic markers (e.g., amyloid or tau positivity). We acknowledge that a major challenge in implementing predictive models is their integration into real-world settings. Therefore, once validated, we will convert these models into web applications or simplify them into risk scores to improve their usability and practical application. 2. Given the significant heterogeneity in Alzheimer's disease and existing gaps in our mechanistic understanding, we will utilize semi-supervised machine learning to subtype the disease. This method will incorporate genetics, biofluid biomarkers, imaging, and neuropsychological tests. Our methodology offers a significant advantage over the unsupervised machine and deep learning techniques commonly used in the field. By incorporating "partial labeling" and incorporating knowledge of disease states, our semi-supervised approach enables more clinically relevant disease subtyping and results.For this project, we are collaborating with experts from other institutions, including Dr. Christos Davatzikos from the University of Pennsylvania, who is an expert in neuroimaging and machine learning, and Dr. Richard Lipton from Albert Einstein College of Medicine, known for his expertise in epidemiology and the design of trials and cohorts.
Non-Technical Research Use Statement:
Despite the extensive research on Alzheimer's and other dementias, the success rate of randomized clinical trials has been disappointingly low. This is primarily due to the significant clinical variations among patients, which limits our ability to predict their outcomes accurately. To address this challenge, we aim to develop a predictive framework specifically focusing on Alzheimer's and related dementias. We will use several cohorts and will employ machine learning and advanced statistical modeling on various types of data, including demographics, neuropsychological measures, biofluid and imaging biomarkers from existing international cohorts and concluded clinical trials. The framework will enable us to predict disease progression which will facilitate early intervention and personalized care for individuals at risk of developing dementia. Our project seeks to harness the power of machine learning and advanced analytics to improve our understanding of Alzheimer's Disease, enhance prediction accuracy, and ultimately contribute to more effective treatments and interventions.
Investigator:
Goate, Alison
Institution:
Icahn School of Medicine at Mount Sinai
Project Title:
Study of Alzheimer's disease and other dementias (e.g. frontotemporal dementia) and related phenotypes
Date of Approval:
March 4, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Alzheimer's disease (AD) is the most common form of dementia but has no effective prevention or treatment. Developing a comprehensive picture of the genetic architecture of AD including a network level functional assessment of risk/resilience genes is essential to develop novel therapeutic targets. The overarching goals of this study are to use genetic and genomic approaches to: 1) identify genes and variants that are involved in the development of AD and related disorders; 2) identify functional networks enriched for AD or related disorder risk and protective loci; 3) determine how cellular function and physiology is impacted by these genetic factors in disease-relevant cell types and animal models. This study will use publicly available whole genome/exome sequence data generated by the Alzheimer’s Disease Sequencing Project (ADSP) and genome-wide association study (GWAS) data from the International Genomics of Alzheimer’s Project (IGAP) and others. We will apply a suite of case-control and family approaches to investigate genetic association with dichotomous and continuous disease traits. This study will not only further our understanding of the genetic architecture of AD but also provide key information regarding the molecular mechanisms, setting the stage for novel therapeutic development.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is the only disease among the top ten killers in the U.S. without a disease modifying therapy. Genetic studies provide a powerful means to identify genes and pathways that are causally linked to disease etiology. We propose to use genomic and functional approaches to identify genes that alter the risk of AD and investigate how these genes disrupt cellular pathways leading to disease.
Investigator:
Greicius, Michael
Institution:
Stanford University School of Medicine
Project Title:
Examining Genetic Associations in Neurodegenerative Diseases
Date of Approval:
December 19, 2024
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We are studying the effects of rare (minor allele frequency
Non-Technical Research Use Statement:
Current genetic understanding of Alzheimer’s Disease (AD) does not fully explain its heritability. The APOE4 allele is a well-established risk factor for the development of Alzheimer’s Disease (AD). However, some individuals who carry APOE4 remain cognitively healthy until advanced ages. Additionally, the cause of mixed dementia pathology development in individuals remains largely unexplained. We aim to identify genetic factors associated with these “protected” and mixed pathology phenotypes.
Investigator:
Hohman, Timothy
Institution:
Vanderbilt University Medical Center
Project Title:
Genetic Drivers of Resilience to Alzheimer's Disease
Date of Approval:
February 4, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
“Asymptomatic” Alzheimer’s disease (AD) is a phenomenon in which 30% of individuals over age 65 meet criteria for autopsy-confirmed pathological AD (beta-amyloid plaques and tau aggregation) but do not clinically manifest cognitive impairment.1-3 The resilience that underlies asymptomatic AD is marked by both protection from neurodegeneration (brain resilience)4 and preserved cognition (cognitive resilience).Our central hypothesis is that genetic effects allow a subset of individuals to endure extensive AD neuropathology without marked brain atrophy or cognitive impairment. We are uniquely positioned to identify resilience genes by leveraging the Resilience from Alzheimer’s Disease (RAD) database, a local resource in which we have harmonized a validated quantitative phenotype of resilience across 8 large AD cohort studies.Our strong interdisciplinary team represents international leaders in genetics, neuroscience, neuropsychology, neuropathology, and psychometrics who will leverage the infrastructure and rich resources of the AD Genetics Consortium, IGAP, ADSP, and our recently established and harmonzed continuous metric of resilience to fulfill the following aims:Aim 1. Identify and replicate common genetic variants that predict cognitive resilience (preserved cognition) and brain resilience (protection from brain atrophy) in the presence of AD pathology. We hypothesize that common genetic variation will explain variance in resilience above and beyond known predictors like education. Replication analyses will leverage age of onset data from IGAP to demonstrate that resilience loci predict a later age of AD onset.Aim 2. Identify and replicate rare and low-frequency genetic variants that predict cognitive and brain resilience. Rare and low-frequency variants with large effects have been identified in AD case/control studies, providing new insight into the genetic architecture of AD.Aim 3: Identify sex-specific genetic drivers of cognitive and brain resilience to AD pathology. Our preliminary results highlight sex differences in the downstream consequences of AD neuropathology, including sex-specific genetic markers of resilience.
Non-Technical Research Use Statement:
As the population ages, late-onset Alzheimer’s disease (AD) is becoming an increasingly important public health issue. Clinical trials targeted a reducing AD progression have demonstrated that patients continue to decline despite therapeutic intervention. Thus, there is a pressing need for new treatments aimed at novel therapeutic targets. A shift in focus from risk to resilience has tremendous potential to have a major public health impact by highlighting mechanisms that naturally counteract the damaging effects of AD neuropathology. The goal of the present project is to characterize genetic factors that protect the brain from the downstream consequences of AD neuropathology. We will identify both rare and common genetic variants using a robust metric of resilience developed and validated by our research team. The identification of such genetic effects will provide novel targets for therapeutic intervention in AD.
Investigator:
Kamboh, M. Ilyas
Institution:
University of Pittsburgh
Project Title:
Genetics of Alzheimer's Disease and Endophenotypes
Date of Approval:
January 7, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: We are requesting access to the NIAGADS datasets to augment our ongoing studies on the genetics of Alzheimer’s disease (AD) and AD-related endophenotypes being carried out by Kamboh and his group since 1995. We are doing GWAS using array genotypes, whole-exome sequencing and whole-genome sequencing on datasets derived from University of Pittsburgh ADRC and ancillary population-based longitudinal studies on dementia and biomarkers. Different available phenotypes include AD and non-AD dementia, age-at-set, disease progression and survival, neuroimaging, cognitive decline, plasma biomarkers for the core ATN and non-ATN pathologies. We also plan to expand on gene-gene interaction and sex-stratified analyses which require the actual genotype data. The NIAGADS datasets will be used for replication and meta-analysis, and for gene-gene interaction and sex-stratified analyses. Study Design: A case-control design will incorporate a diverse cohort of individuals with AD and age-matched controls. For quantitative traits (neuroimaging and plasma biomarkers, cognitive performance measures, indicators of disease progression), linear regression analyses will be performed to identify genetic loci. To ensure the findings are robust and inclusive, participants from diverse demographic backgrounds will be included, enabling the exploration of potential genetic variations across populations. Analysis Plan: We will conduct GWAS and targeted analyses on candidate genes on different AD and AD-related phenotypes. Primary phenotypic variables include AD disease status, age-at-onset, last age for controls, APOE genotype, cognitive decline trajectories, sex, and race. Analyses will evaluate the influence of specific genetic variants on disease risk, cognitive performance, and biomarker levels, considering both individual and interactive effects of the APOE genotype. Results will be adjusted for potential confounders, such as demographic factors, to ensure valid associations. Detail analytical methods are described in our published papers for case-control (PMID: 32651314;35694926), quantitative traits (PMID: 30361487;37666928), and cognitive decline (PMID: 37089073; 30954325).
Non-Technical Research Use Statement:
Our research group at the University of Pittsburgh (Pitt), has been working on the genetics of Alzheimer’s disease (AD) and AD-related endophenotypes for almost three decades, on data derived largely from the University of Pittsburgh Alzheimer’s Disease Research Center and ancillary dementia studies. We are requesting access to the NIAGADS genotype and phenotype datasets to augment our sample size to increase power to detect novel genetic associations with AD and related endophenotypes.
Investigator:
Lee, Kun Ho
Institution:
Chosun University
Project Title:
Alzheimer's disease(AD) subtype analysis using genome sequencing data
Date of Approval:
January 21, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives of the Proposed Research Alzheimer’s disease (AD) is a common degenerative disease, causing irreversible dementia. Early diagnosis is difficult due to a long asymptomatic period and requires invasive, expensive procedures. A screening method to classify high-risk groups for early AD diagnosis is needed. Study Design Early AD risk prediction can use genomic variants like the Polygenic Risk Score (PRS), which predicts high-risk groups but shows performance differences due to genetic heterogeneity and ethnic specificity. To address this, ethnicity-specific analysis is considered and validated with different ethnic datasets. This study aims to develop Korea-specific PRS models for early AD risk prediction using genomic data from a Korean cohort and the ADSP. Trans-ethnic genomic data will be created by combining GARD and ADSP data, including African American (AA), non-Hispanic Whites (NHW), and East Asian (EA) data. Cross-validation (CV) analysis will divide data into training and test sets. Genomic variants' importance (e.g., p-values, BLUP) will be calculated, and selected variants applied to PRS. PRS models will be evaluated using CV-divided test data to select the best model. Trans-ethnic and ethnicity-specific PRS models will be validated using reserved validation data. Analysis Plan The proposal aims to identify ethnicity differences in genomic prediction built with Caucasian-centric GWA SNVs and improve the model for trans-ethnic groups, particularly East Asians. A Bayesian machine learning approach transfers genetic risk model knowledge from the NHW dataset to other ethnic groups for better accuracy. Genotype datasets from all ancestry groups are used together. Instead of trans-ethnic meta-analysis, the approach by Gim et al. is adopted. Each ethnic group dataset is divided for cross-validation. Training datasets are analyzed to evaluate p-values and BLUP of SNVs. Summary statistics are used to build the prediction model and apply nested-CV for model selection. The best model for each ethnic group is tested using the test dataset. Data is analyzed similarly by learning from ethnic-specific variants and building a prediction model with the new method.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) is the leading cause of dementia and is irreversible once symptoms appear. A long asymptomatic period of AD complicates early diagnosis requiring invasive and costly procedures like CSF extraction or PET scans. Therefore, a screening method to identify high-risk groups for early AD diagnosis is necessary.One approach uses the Polygenic Risk Score (PRS), which calculation is based on multiple genomic variants associated with AD. However, PRS predictions vary significantly (60-80%) due to genetic heterogeneity and ethnic specificity. Thus, data from multiple ethnicities must be analyzed. Although Asia accounts for over 50% of global dementia cases, most large-scale AD cohorts are predominantly White, lacking studies on Asians.This study aims to develop trans-ethnic and ethnicity-specific PRS models for early AD risk prediction using genomic data from the GARD cohort, centered on Koreans, and the ADSP, which includes various European ethnicities. It investigates AD’s genetic heterogeneity due to ethnic differences and proposes methods to adjust for variability.
Investigator:
Pan, Wei
Institution:
University of Minnesota
Project Title:
Powerful and novel statistical methods to detect genetic variants associated with or putative causal to Alzheimer’s disease
Date of Approval:
March 25, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We have been developing more powerful statistical methods to detect common variant (CV)- or rare variant (RV)-complex trait associations and/or putative causal relationships for GWAS and DNA sequencing data. Here we propose applying our new methods, along with other suitable existing methods, to the existing ADSP sequencing data and other AD GWAS data provided by NIA, hence requesting approval for accessing the ADSP sequencing and other related GWAS/genetic data. We have the following two specific Aims: Aim1. Association testing under genetic heterogeneity: For complex traits, genetic heterogeneity, especially of RVs, is ubiquitous as well acknowledged in the literature, however there is barely any existing methodology to explicitly account for genetic heterogeneity in association analysis of RVs based on a single sample/cohort. We propose using secondary and other omic data, such as transcriptomic or metabolomic data, to stratify the given sample, then apply a weighted test to the resulting strata, explicitly accounting for genetic heterogeneity that causal RVs may be different (with varying effect sizes) across unknown and hidden subpopulations. Some preliminary analyses have conﬁrmed power gains of the proposed approach over the standard analysis. Aim 2. Meta analysis of RV tests: Although it has been well appreciated that it is necessary to account for varying association effect sizes and directions in meta analysis of RVs for multi-ethnic cohorts, existing tests are not highly adaptive to varying association patterns across the cohorts and across the RVs, leading to power loss. We propose a highly adaptive test based on a family of SPU tests, which cover many existing meta-analysis tests as special cases. Our preliminary results demonstrated possibly substantial power gains.
Non-Technical Research Use Statement:
We propose applying our newly developed statistical analysis methods, along with other suitable existing methods, to the existing ADSP sequencing data and other AD GWAS data to detect common or rare genetic variants associated with Alzheimer’s disease (AD). The novelty and power of our new methods are in two aspects: first, we consider and account for possible genetic heterogeneity with several subcategories of AD; second, we apply powerful meta-analysis methods to combine the association analyses across multiple subcategories of AD. The proposed research is feasible, promising and potentially signiﬁcant to AD research. In addition, our proposed analyses of the existing large amount of ADSP sequencing data and other AD GWAS data with our developed new methods are novel, powerful and cost-effective.
Investigator:
Zhao, Zhongming
Institution:
University of Texas Health Science Center at Houston
Project Title:
AIM-AI: an Actionable, Integrated and Multiscale genetic map of Alzheimer's disease via deep learning
Date of Approval:
March 27, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: The objective of our study is to advance our understanding of the genetic basis of Alzheimer’s Disease (AD) through the analysis of comprehensive genomic datasets such as Whole Exome Sequencing (WES), Whole Genome Sequencing (WGS), single-nuclei RNA sequencing, and Genome-Wide Association Studies (GWAS), as well as the related phenotype. We aim to identify genetic variants that are integral to the development and progression of AD.Study Design: Our approach involves a detailed multi-omics analysis focusing on both coding and non-coding regions within these datasets. We will develop new analytical variables from existing data, ensuring that our research adheres to the established data use limitations and contributes meaningfully to the field of genetic research in AD.Analysis Plan: The plan centers on investigating the correlation between genetic variants and AD, exploring how these variants influence the disease at a genetic level. We will employ cutting-edge computational methods to analyze interactions between these genetic markers and their potential role in AD pathogenesis. The integration of data from multiple sources will be carefully executed to maintain compliance with data use agreements, emphasizing the scientific exploration of AD.
Non-Technical Research Use Statement:
Our research is dedicated to unraveling the genetic components of Alzheimer’s Disease. By analyzing genetic sequences and variations through various genomic datasets, we seek to deepen the scientific understanding of how these genetic elements contribute to AD. The outcomes of this study will be shared with the public, enhancing general knowledge of Alzheimer’s Disease and supporting the global research community in its ongoing efforts to decode this complex condition.
Investigator:
Zhou, Wenyu
Institution:
Teal Omics. Inc
Project Title:
Understanding CNS diseases through organ specific aging biomarkers
Date of Approval:
February 27, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Can blood-based protein biomarkers of aging predict functional decline in organ health and in systemic health? We have developed organ-specific aging models derived from plasma proteins, and previously found that the blood proteome can be used to monitor organ health and aging in smaller cohorts (Oh, et., al., Nature 624, 164–172 (2023)). We aim to test this hypothesis at scale with diversified datasets. Aim 1. Validate our approach to modeling aging and brain health with NIAGADS cohorts. Approach: we will use protein biomarker data from cohorts to train machine learning models of organ aging. We will employ machine learning best practice for data curation, data augmentation, data normalization, and training/testing, to evaluate whether we can reproducibly estimate brain aging in diversified healthy cohorts and disease cohorts. Aim 2. Test the effects of brain aging on future disease risk and functional decline. We will focus particularly on the aging brain to investigate the relationship between organ aging and cognitive decline. Approach: we will evaluate the relationships between our model aging predictions and aging, diseases, and clinical phenotypes, such as mortality and lab and physical assessments. This is primarily done by first calculating the difference between a sample’s chronological age and its predicted organ age. This organ age residual is then tested for association with phenotypes of interest, such as Alzheimer’s Disease status or future risk of cognitive decline.
Non-Technical Research Use Statement:
As we age, our risk of getting sick increases, but did you know that everyone ages differently? Some people's bodies deteriorate faster than others, and we don't fully understand why. By studying how our bodies change with age, we aim to identify which organs are aging the most and find ways to improve quality of life tailored to each individual's needs. We want to understand how people age differently, both as individuals and as a population on organ specific levels. By analyzing molecules in blood, we aim to: - Identify how aging affects people uniquely - Link aging patterns to diseases and lifespan - Use advanced computer models to connect blood changes to organ function Our research can lead to: - Personalized patient care - New discoveries for medicines - Improved understanding of aging and age-related diseases We'll use a large, diverse dataset to validate our approach and gain valuable insights into the aging process. By unlocking these secrets, we can work towards a healthier future for everyone.

Total number of samples: 1,157

Female 632 54.6 %

Male 525 45.4 %

AD
Control	785	67.8%
Case	649	56.1%
Other	161	13.9%
Unknown	3	0.3%

NG00102 – Genomic atlas of the proteome from brain, CSF and plasma prioritizes proteins implicated in neurological disorders

Overview

Description

Sample Summary per Data Type

Available Filesets

Sample information

Related Studies

Cohorts

Consent Levels

Acknowledgement

Acknowledgment statement for any data distributed by NIAGADS:

For investigators using any data from this dataset:

For investigators using Charles F. and Joanne Knight Alzheimer’s Disease Research Center (sa000008) data:

Related Publications

Approved Users

Total number of samples: 1,157