NG00154 - Dementia in Parkinson's Disease (PDD)

To access this data, please log into DSS and submit an application.
Within the application, add this dataset (accession NG00154) in the “Choose a Dataset” section.
Once approved, you will be able to log in and access the data within the DARM portal.

Description

One of the most common Alzheimer’s Disease Related Dementias (ADRD) is Lewy body dementia. This umbrella term comprises two clinically distinct ADRDs: Parkinson’s disease dementia (PD-dementia) and dementia with Lewy bodies (DLB). There has been a growing interest in the genetic bases of DLB, however, PD-dementia has not yet been studied using large-scale genetic analyses. Nevertheless, it has been shown that dysfunction of the endolysosomal pathway plays a key role in Alzheimer’s disease and ADRDs. This involvement, however, is not fully understood and the exact failure points for each disease have yet to be identified. Thus, there is a gap in knowledge regarding the specific molecular mechanisms underlying each disease.

This dataset includes Genotyping SNP Array data individually in IDAT files, together in a processed VCF, and includes phenotype data for the 292 PD samples from USA and Portugal.

Sample Summary per Data Type

Sample Set	Accession	Data Type	Number of Samples
Dementia in Parkinson's Disease (PDD)	snd10049	Genotyping SNP array	292

Available Filesets

Name	Accession	Latest Release	Description
PDD Genotype and Phenotype files	fsa000067	NG00154.v1	Phenotype, IDAT and VCF files for Genotyping SNP Array.

View the File Manifest for a full list of files released in this dataset.

This dataset contains data from 292 subjects that were genotyped using the Infinium™ Global Screening Array-24 v3.0 BeadChip. In addition to a processed VCF file with all 292 samples, the dataset also includes a green and a red raw IDAT file for each subject (total of 584 IDAT files).

Sample Set	Accession Number	Number of Subjects	Number of Samples
Dementia in Parkinson's Disease (PDD)	snd10049	292	292

Consent Level	Number of Subjects
DS-ND-IRB-PUB	292

Total number of approved DARs: 8

Investigator:
Belloy, Michael
Institution:
Washington University in St Louis
Project Title:
Elucidating sex-specific risk for Alzheimer's disease through state-of-the-art genetics and multi-omics
Date of Approval:
March 31, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
• Objectives: In this project, we seek to holistically investigate the genetic and molecular drivers of sex dimorphism in Alzheimer’s disease across ancestries. • Study design: This study integrates large-scale population genetics with multi-omics and endophenotype analyses. We are integrating all data available from ADGC and ADSP, together with other data from AMP-AD and biobanks such as UKB, FinnGen, and MVP to conduct large-scale multi-ancestry GWAS, rare-variant gene aggregation analyses, QTL studies, PWAS, TWAS, etc. We also particularly focus on X chromosome association studies. The study design also interrogates interactions with ancestry, hormone exposures, and with APOE*4, as well as comparisons to non-stratified GWAS/XWAS of Alzheimer’s disease. Further, we will also employ genetic correlation analyses, mendelian randomization, colocalization, and pleiotropy analyses, to interrogate overlap with other complex traits to better understand the mechanisms underlying sex dimorphism in Alzheimer’s disease. • Analysis plan, including the phenotypic characteristics that will be evaluated in association with genetic variants: Our phenotypes will include Alzheimer’s disease risk, conversion risk, various endophenotypes (including amyloid/tau biomarkers, brain imaging metrics, etc.) as well as molecular traits. As noted above, we will conduct large-scale multi-ancestry GWAS, XWAS, rare-variant gene aggregation analyses, QTL studies, PWAS, TWAS, etc. Specific aims include interrogating these question and analyses on (1) the autosomes, (2) the X chromosome, and (3) leveraging sex stratified QTL studies to drive discovery of risk genes.
Non-Technical Research Use Statement:
Alzheimer’s disease (AD) manifests itself differently across men and women, but the genetic and molecular factors that drive this remain elusive. AD is the most common cause of dementia and till today remains largely untreatable. It is thus crucial to study the genetics of AD in a sex-specific manner, as this will help the field gain important insights into disease pathophysiology, identify novel sex-specific risk factors relevant to personalized genetic medicine, and uncover potential new AD drug targets that may benefit both sexes. This project uses large-scale genomics and multi-omics to elucidate novel sex agnostic and sex-specific AD risk genes. We will interrogate sex dimorphism for AD risk on the autosomes and the sex chromosomes. We similarly interrogate sex dimorphism in the genetic regulation of gene expression and protein levels, which we will integrate with genetic risk for Alzheimer’s disease to further discovery risk genes. Throughout, we will also interrogate how sex-specific risk for AD interactions with hormone exposures, ancestry, and the APOE*4 risk allele.
Investigator:
Cruchaga, Carlos
Institution:
Washington University School of Medicine
Project Title:
The Familial Alzheimer Sequencing (FASe) Project
Date of Approval:
January 21, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The goal of this study is to identify new genes and mutations that cause or increase risk for Alzheimer disease (AD), as well as protective factors. Individuals and families were selected from the Knight-ADRC (Washington University) and the NIA-LOAD study. Only families with at least three first-degree affected individuals were included. Families with pathogenic variants in the known AD or FTD genes, or in which APOE4 segregated with disease were excluded. At least two cases and one control were selected per family. Cases had an age at onset (AAO) after 65 yo and controls had a larger age at last assessment than the latest AAO within the family. Whole exome (WES) and whole genome sequencing (WGS) was generated for 1,235 individuals (285 families) that together with data from our collaborators and the ADSP family-based cohort (3,449 individuals and 757 families) will provide enough statistical power to identify new genes for AD. Dr. Tanzi (Harvard Medical School) will provide WGS from 400 families from the NIMH Alzheimer disease genetics initiative study. We will perform single variant and gene-based analyses to identify genes and variants that increase risk for disease in AD families. Single variant analysis will consist of a combination of association and segregation analyses. We will run family-based gene-based methods to identify genes that show and overall enrichment of variants in AD cases. We will also look for protective and modifier variants. To do this we will identify families loaded with AD cases, that also include individuals with a high burden of known risk variants but that do not develop the disease (escapees). We will use the sequence data and the family structure to identify variants that segregate with the escapee phenotype. The most promising variants and genes will be replicated in independent datasets (ADSP case-control, ADNI, Knight-ADRC, NIA-LOAD ). We will perform single variant and gene-based analyses to replicate the initial findings, and survival analysis to replicate the protective variants. We will select the most promising variants/genes for functional studies
Non-Technical Research Use Statement:
Family-based approaches led to the identification of disease-causing Alzheimer’s Disease (AD) variants in the genes encoding APP, PSEN1 and PSEN2. The identification of these genes led to the A?-cascade hypothesis and to the development of drugs that target this pathway. Recently, we have identified rare coding variants in TREM2, ABCA7, PLD3 and SORL1 with large effect sizes for risk for AD, confirming that rare coding variants play a role in the etiology of AD. In this proposal, we will identify rare risk and protective alleles using sequence data from families densely affected by AD. We hypothesize that these families are enriched for genetic risk factors. We already have sequence data from 695 families (2,462 individuals), that combined with the ADSP and the NIMH dataset will lead to a dataset of more than 1,042 families (4,684 individuals). Our preliminary results support the flexibility of this approach and strongly suggest that protective and risk variants with large effect size will be found, which will lead to a better understanding of the biology of the disease.
Investigator:
Fernandez, Victoria
Institution:
ACE Alzheimer Center
Project Title:
GADIR
Date of Approval:
February 10, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The objective of this study is to contribute to our understanding of neurodegenerative diseases by examining the genetic contributors of major dementia neuropathological hallmarks (amyloid-β deposition, tau pathology, TDP-43, hippocampal sclerosis, Lewy body pathology, and cerebrovascular disease, among others. We will generate the largest Iberian database(N=3500) of neuropathologically curated brains (Aim 1) with a subset of those (N≈350) undergoing deep digital phenotyping (Aim 3). We will generate an associated genetic map (Aim 2) order to elucidate how common and rare genetic variants contribute to specific pathologies. We additionally aim to determine how polygenic risk scores (PRS) and pathway-specific PRS correspond to single and mixed neuropathological profiles, and to clarify the genetic architecture driving co-pathologies that frequently complicate clinical diagnosis. Eventually, we will replicate and fine-map our findings (Aim 4) leveraging available datasets at NIAGADS and other public repositories.Our analysis plan includes genome-wide association testing of ordinal, binary, and quantitative neuropathological traits; rare-variant burden analyses for coding and non-coding regions; PRS and pathway-PRS modeling across multiple dementia-related diseases; unsupervised clustering to identify variant sets defining specific endophenotypes; and pathway and network analyses to interpret significant signals. Colocalization and functional annotation approaches will integrate genomic findings with transcriptomic and proteomic resources.Data obtained from NIAGADS will be used to strengthen replication, broaden meta-analytic power, validate associations across independent neuropathology cohorts, and support functional interpretation using available genetic, expression, and multi-omic datasets. All analyses will use de-identified data in compliance with ethical and data-sharing standards.
Non-Technical Research Use Statement:
Dementia is an immensely challenging and prevalent condition, deeply impacting the lives of over 55 million individuals worldwide. While Alzheimer's disease stands as the most commonly recognized form of dementia, there exist other conditions that present comparable symptoms but distinct underlying pathological characteristics. To provide more effective support to patients and their families, we need to better understand the genetic causes associated to each of these brain pathologies, and to develop advanced tools for early classification and diagnosis. This grant proposal aims to tackle these challenges by establishing the largest Iberian (Spanish and Portuguese) database of dementia neuropathological cases, marked by a modernized and standardized neuropathological classification alongside comprehensive genomic data. Our goal is to delve further into the genetic architecture underpinning these pathological features and to refine existing risk assessment tools for more accurate diagnoses.
Investigator:
Greicius, Michael
Institution:
Stanford University School of Medicine
Project Title:
Examining Genetic Associations in Neurodegenerative Diseases
Date of Approval:
March 31, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
We are studying the effects of rare (minor allele frequency < 5%) genetic variants on the risk of developing late-onset Alzheimer’s Disease (AD). We are interested in variants that have a protective effect in subjects who are at an increased genetic risk, or variants that lead to multiple dementias. Our aim is to identify any genetic variants that are present in the “case” group but not the “AD control” groups for both types of variants. The raw data we receive will be annotated to identify SNP locations and frequencies using existing databases such as 1,000 Genomes. We will filter the data based on genetic models such as compounded heterozygosity, recessive and dominant models to identify different types of variants.
Non-Technical Research Use Statement:
Current genetic understanding of Alzheimer’s Disease (AD) does not fully explain its heritability. The APOE4 allele is a well-established risk factor for the development of Alzheimer’s Disease (AD). However, some individuals who carry APOE4 remain cognitively healthy until advanced ages. Additionally, the cause of mixed dementia pathology development in individuals remains largely unexplained. We aim to identify genetic factors associated with these “protected” and mixed pathology phenotypes.
Investigator:
Kamboh, M. Ilyas
Institution:
University of Pittsburgh
Project Title:
Genetics of Alzheimer's Disease and Endophenotypes
Date of Approval:
March 31, 2026
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objectives: We are requesting access to the NIAGADS datasets to augment our ongoing studies on the genetics of Alzheimer’s disease (AD) and AD-related endophenotypes being carried out by Kamboh and his group since 1995. We are doing GWAS using array genotypes, whole-exome sequencing and whole-genome sequencing on datasets derived from University of Pittsburgh ADRC and ancillary population-based longitudinal studies on dementia and biomarkers. Different available phenotypes include AD and non-AD dementia, age-at-set, disease progression and survival, neuroimaging, cognitive decline, plasma biomarkers for the core ATN and non-ATN pathologies. We also plan to expand on gene-gene interaction and sex-stratified analyses which require the actual genotype data. The NIAGADS datasets will be used for replication and meta-analysis, and for gene-gene interaction and sex-stratified analyses. Study Design: A case-control design will incorporate a diverse cohort of individuals with AD and age-matched controls. For quantitative traits (neuroimaging and plasma biomarkers, cognitive performance measures, indicators of disease progression), linear regression analyses will be performed to identify genetic loci. To ensure the findings are robust and inclusive, participants from diverse demographic backgrounds will be included, enabling the exploration of potential genetic variations across populations. Analysis Plan: We will conduct GWAS and targeted analyses on candidate genes on different AD and AD-related phenotypes. Primary phenotypic variables include AD disease status, age-at-onset, last age for controls, APOE genotype, cognitive decline trajectories, sex, and race. Analyses will evaluate the influence of specific genetic variants on disease risk, cognitive performance, and biomarker levels, considering both individual and interactive effects of the APOE genotype. Results will be adjusted for potential confounders, such as demographic factors, to ensure valid associations. Detail analytical methods are described in our published papers for case-control (PMID: 32651314;35694926), quantitative traits (PMID: 30361487;37666928), and cognitive decline (PMID: 37089073; 30954325).
Non-Technical Research Use Statement:
Our research group at the University of Pittsburgh (Pitt), has been working on the genetics of Alzheimer’s disease (AD) and AD-related endophenotypes for almost three decades, on data derived largely from the University of Pittsburgh Alzheimer’s Disease Research Center and ancillary dementia studies. We are requesting access to the NIAGADS genotype and phenotype datasets to augment our sample size to increase power to detect novel genetic associations with AD and related endophenotypes.
Investigator:
Konermann, Silvana
Institution:
Arc institute
Project Title:
Modeling Alzheimer’s disease risk and associated molecular phenotypes
Date of Approval:
August 8, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The objective of the proposed research is to determine the relationship between Alzheimer’s disease (AD) genetic risk and associated molecular phenotypes. Genotype data will be used to compute a polygenic risk score (PRS) for disease-affected and control (non-disease-affected) participants. Statistical regression and mediation analyses will be used to model variation of molecular phenotypes with respect to PRS and, where available, pathology stage or cognitive impairment. Molecular phenotypes to be analyzed include bulk/single-cell/single-nucleus transcriptome, epigenome, proteome, metabolome, lipidome, amyloid, and tau. Molecular phenotypes of participants, including controls, will be matched with molecular phenotypes of in vitro cellular models, informing the design of in vitro perturbation experiments that recapitulate the genetic drivers of AD risk.
Non-Technical Research Use Statement:
Our goal is to determine the relationship between human genetic profiles associated with Alzheimer’s disease (AD) risk and specific measurable characteristics of human cells. Using multiple statistical analysis methods, we will build quantitative models that describe how those characteristics vary as a function of AD genetic risk. The models we build will help us design in vitro cellular systems that reflect different levels of AD risk, enabling experiments that inform new strategies for treating or preventing AD.
Investigator:
Lee, Jonghun
Institution:
TAKEDA PHARMACEUTICAL COMPANY LTD
Project Title:
Identification of genetic risks and potential target for stratified Alzheimer's disease patient groups
Date of Approval:
October 24, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
The goal of analyzing ADSP umbrella cohort data is identifying variants, genes and pathways associated to Alzheimer’s disease (AD), and stratifying patients by genetic risks. Following describes procedure.1) Identification and validation of genetic risks The whole genome and whole exome sequencing data will be analyzed to identify genetic variants or genes associated to phenotypes in case-control cohort, such as AD status and Braak stages. Several methods will be applied, such as VEP [William McLaren et al, 2016], LOFTEE [Karczewski, 2015] and PEXT scoring [Beryl B.C et al., 2020] for variant annotation and SAIGE-GENE [Wei Zhou et al., 2020], and KGWAS [Kexin Huang 2024] for the association test. The association will be tested for other endophenotypes such as cognitive scores and brain volumes that available in subset of the cohort. Replication and meta-analysis will be conducted on UK biobank and Tohoku medical megabank organization (ToMMo) cohort data. The ToMMo data consists of Japanese cohort so that we can analyze the effect of the variants among multi ethnic groups.2) Patient stratification in ADSP cohort Leveraging the increased sample size, we will stratify the cohort by genetic risks such as ApoE types, or phenotypes such as Braak stages, and compare the effect size of variants or genes among the patient groups. In addition, the genetic risk score (GRS) will be calculated using LDpred2 [Florian Prive, 2020], RapidoPGS [Guillermo Reales, 2020], and PRSice2 [Choi, S.W., 2020], and validated in independent cohorts and compared to available clinical endophenotypes. Then we will search the effect of the GRS to extensive phenotypes in UK biobank and ToMMo. Last, the NG00130 proteome will be used for unsupervised classification of patients. Overlapping the protein signature-based groups with genetic signals, we’ll find casual pathologic pathways and targets for each subgroups.
Non-Technical Research Use Statement:
The aim of our study is identifying variants or genes potentially causal of the Alzheimer’s disease in whole or subset of patients. To be specific, WES and WGS data will be analyzed to investigate common and rare variants associated with disease status and intermediate phenotypes. In addition, the patients will be stratified and sub-grouped by their genetic and proteomic signatures. Last, we will incorporate other large biobanks such as UK biobank or ToMMo to investigate the genetic effects to extensive phenotypes potentially linked to symptoms appearing in sub patient groups.
Investigator:
Seshadri, Sudha
Institution:
Glenn Biggs Institute for Alzheimer's and Neurodegenerative Diseases, University of Texas Health Sciences Center, San Antonio, TX
Project Title:
Therapeutic target discovery in ADSP data via comprehensive whole-genome analysis incorporating ethnic diversity and systems approaches
Date of Approval:
August 12, 2025
Request status:
Approved
Research use statements:
Show statements
Technical Research Use Statement:
Objective: Utilize ADSP data sets to identify genes & specific genetic variants that confer risk for or protection from Alzheimer disease. Aim 1: Using combined WGS/WES across the ADSP Discovery, Disc-Ext, and FUS Phases, including single nucleotide variants, small insertion/deletions, and structural variants. We will: Aim 1a. Perform whole genome single variant and rare variant case/control association analyses of AD using ADSP and other available data; Aim 1b. Target protective variant identification via association analysis using selected controls within the ADSP data and performing meta analysis across association results based on selected controls from non-ADSP data sets. Aim 1c. Perform endophenotype analyses including cognitive function measures, hippocampal volume and circulation beta-amyloid ADSP data in subjects for which these measures are available. Meta analysis will be conducted across ADSP and non-ADSP analysis results. Aim 2: To leverage ethnically-diverse and admixed populations to identify AD variants we will: Aim 2a. Estimate and account for global and local ancestry in all analyses; Aim 2b. Perform admixture mapping in samples of admixed ancestry; and Aim 2c. Perform ethnicity-specific and trans-ethnic meta-analyses. Aim 3: To identify putative therapeutic targets through functional characterization of genes and networks via bioinformatics, integrative ‘omics analyses. We will: Aim 3a. Annotate variants with their functional consequences using bioinformatic tools and publicly available “omics” data. Aim 3b. Prioritize results, group variants with shared function, and identify key genes functionally related to AD via weighted association analyses and network approaches. Analyses will be performed in coordination with the following PIs. Coordination will involve sharing expertise, analysis plans or analysis results. No individual level data will be shared across institutions. Philip De Jager, Columbia University; Eric Boerwinkle & Myriam Fornage, U of Texas Health Science Center, Houston; Sudha Seshadri, U of Texas, San Antonio; Ellen Wijsman, U of Washington. William Salerno, Baylor College of Medicine
Non-Technical Research Use Statement:
This proposal seeks to analyze existing genetic sequencing data generated as part of the Alzheimer’s Disease Sequencing Project (ADSP) including the ADSP Follow-up Study (FUS) with the goal of identifying genes and specific changes within those genes that either confer risk for Alzheimer’s Disease or provide protection from Alzheimer’s Disease. Analytic challenges include analysis of whole genome sequencing data, appropriately accounting for population structure across European ancestry, Hispanic, and African American participants, and interpreting results in the context of other genomic data available.

Total number of samples: 292

Female 43 14.7 %

Male 155 53.1 %

Sex not reported: 94 (32.2%)

NG00154 – Dementia in Parkinson’s Disease (PDD)

Overview

Description

Sample Summary per Data Type

Available Filesets

Sample Information

Related Studies

Cohorts

Consent Levels

Acknowledgement

Acknowledgment statement for any data distributed by NIAGADS:

For investigators using any data from this dataset:

For investigators using Dementia in Parkinson's Disease (sa000038) data:

Approved Users

Total number of samples: 292