Overview
To access this data, please log into DSS and submit an application.
Within the application, add this dataset (accession NG00154) in the “Choose a Dataset” section.
Once approved, you will be able to log in and access the data within the DARM portal.
Description
One of the most common Alzheimer’s Disease Related Dementias (ADRD) is Lewy body dementia. This umbrella term comprises two clinically distinct ADRDs: Parkinson’s disease dementia (PD-dementia) and dementia with Lewy bodies (DLB). There has been a growing interest in the genetic bases of DLB, however, PD-dementia has not yet been studied using large-scale genetic analyses. Nevertheless, it has been shown that dysfunction of the endolysosomal pathway plays a key role in Alzheimer’s disease and ADRDs. This involvement, however, is not fully understood and the exact failure points for each disease have yet to be identified. Thus, there is a gap in knowledge regarding the specific molecular mechanisms underlying each disease.
This dataset includes Genotyping SNP Array data individually in IDAT files, together in a processed VCF, and includes phenotype data for the 292 PD samples from USA and Portugal.
Sample Summary per Data Type
Sample Set | Accession | Data Type | Number of Samples |
---|---|---|---|
Dementia in Parkinson's Disease (PDD) | snd10049 | Genotyping SNP array | 292 |
Available Filesets
Name | Accession | Latest Release | Description |
---|---|---|---|
PDD Genotype and Phenotype files | fsa000067 | NG00154.v1 | Phenotype, IDAT and VCF files for Genotyping SNP Array. |
View the File Manifest for a full list of files released in this dataset.
Sample information
This dataset contains data from 292 subjects that were genotyped using the Infinium™ Global Screening Array-24 v3.0 BeadChip. In addition to a processed VCF file with all 292 samples, the dataset also includes a green and a red raw IDAT file for each subject (total of 584 IDAT files).
Sample Set | Accession Number | Number of Subjects | Number of Samples |
---|---|---|---|
Dementia in Parkinson's Disease (PDD) | snd10049 | 292 | 292 |
Related Studies
Consent Levels
Consent Level | Number of Subjects |
---|---|
DS-ND-IRB-PUB | 292 |
Visit the Data Use Limitations page for definitions of the consent levels above.
Approved Users
- Investigator:Cruchaga, CarlosInstitution:Washington University School of MedicineProject Title:The Familial Alzheimer Sequencing (FASe) ProjectDate of Approval:May 9, 2024Request status:ApprovedResearch use statements:Show statementsTechnical Research Use Statement:The goal of this study is to identify new genes and mutations that cause or increase risk for Alzheimer disease (AD), as well as protective factors. Individuals and families were selected from the Knight-ADRC (Washington University) and the NIA-LOAD study. Only families with at least three first-degree affected individuals were included. Families with pathogenic variants in the known AD or FTD genes, or in which APOE4 segregated with disease were excluded. At least two cases and one control were selected per family. Cases had an age at onset (AAO) after 65 yo and controls had a larger age at last assessment than the latest AAO within the family. Whole exome (WES) and whole genome sequencing (WGS) was generated for 1,235 individuals (285 families) that together with data from our collaborators and the ADSP family-based cohort (3,449 individuals and 757 families) will provide enough statistical power to identify new genes for AD. Dr. Tanzi (Harvard Medical School) will provide WGS from 400 families from the NIMH Alzheimer disease genetics initiative study. We will perform single variant and gene-based analyses to identify genes and variants that increase risk for disease in AD families. Single variant analysis will consist of a combination of association and segregation analyses. We will run family-based gene-based methods to identify genes that show and overall enrichment of variants in AD cases. We will also look for protective and modifier variants. To do this we will identify families loaded with AD cases, that also include individuals with a high burden of known risk variants but that do not develop the disease (escapees). We will use the sequence data and the family structure to identify variants that segregate with the escapee phenotype. The most promising variants and genes will be replicated in independent datasets (ADSP case-control, ADNI, Knight-ADRC, NIA-LOAD ). We will perform single variant and gene-based analyses to replicate the initial findings, and survival analysis to replicate the protective variants. We will select the most promising variants/genes for functional studiesNon-Technical Research Use Statement:Family-based approaches led to the identification of disease-causing Alzheimer’s Disease (AD) variants in the genes encoding APP, PSEN1 and PSEN2. The identification of these genes led to the A?-cascade hypothesis and to the development of drugs that target this pathway. Recently, we have identified rare coding variants in TREM2, ABCA7, PLD3 and SORL1 with large effect sizes for risk for AD, confirming that rare coding variants play a role in the etiology of AD. In this proposal, we will identify rare risk and protective alleles using sequence data from families densely affected by AD. We hypothesize that these families are enriched for genetic risk factors. We already have sequence data from 695 families (2,462 individuals), that combined with the ADSP and the NIMH dataset will lead to a dataset of more than 1,042 families (4,684 individuals). Our preliminary results support the flexibility of this approach and strongly suggest that protective and risk variants with large effect size will be found, which will lead to a better understanding of the biology of the disease.
- Investigator:Lee, JonghunInstitution:TAKEDA PHARMACEUTICAL COMPANY LTDProject Title:Identification of genetic risks and potential target for stratified Alzheimer's disease patient groupsDate of Approval:June 12, 2024Request status:ApprovedResearch use statements:Show statementsTechnical Research Use Statement:The goal of analyzing ADSP umbrella cohort data is identifying variants, genes and pathways associated to Alzheimer’s disease (AD), and stratifying patients by genetic risks. Following describes procedure.1) Identification and validation of genetic risks The whole genome and whole exome sequencing data will be analyzed to identify genetic variants or genes associated to phenotypes in case-control cohort, such as AD status and Braak stages. Several latest methods will be applied, such as VEP [William McLaren et al, 2016], LOFTEE [Karczewski, 2015] and PEXT scoring [Beryl B.C et al., 2020] for variant annotation and SAIGE-GENE [Wei Zhou et al., 2020] for the association test. The association will be tested for other endophenotypes such as cognitive scores and brain volumes that available in subset of the cohort. Replication and meta-analysis will be conducted on UK biobank and Tohoku medical megabank organization (ToMMo) cohort data. The ToMMo data consists of Japanese cohort so that we can analyze the effect of the variants among multi ethnic groups.2) Patient stratification in ADSP cohort Leveraging the increased sample size, we will stratify the cohort by genetic risks such as ApoE types, or phenotypes such as Braak stages, and compare the effect size of variants or genes among the patient groups. In addition, the genetic risk score (GRS) will be calculated using LDpred2 [Florian Prive, 2020], RapidoPGS [Guillermo Reales, 2020], and PRSice2 [Choi, S.W., 2020], and validated in independent cohorts and compared to available clinical endophenotypes. Last, we will search the effect of the GRS to extensive phenotypes in UK biobank and ToMMo.3) Identification of variants associated with pathologies and disease progression To further characterize patients by genetic risk, we will conduct GWAS and EWAS on pathology measurements, models of co-pathology, comorbidity with other neurological diseases, and disease progression. NG00127 and NG00154 will be used for this purpose.Non-Technical Research Use Statement:The aim of our study is identifying variants or genes potentially causal of the Alzheimer’s disease in whole or subset of patients. To be specific, WES and WGS data will be analyzed to investigate common and rare variants associated with disease status and intermediate phenotypes. In addition, the patients will be stratified and sub-grouped by their genetic or phenotypic characteristics. Last, we will incorporate other large biobanks such as UK biobank or ToMMo to investigate the genetic effects to extensive phenotypes potentially linked to symptoms appearing in sub patient groups.
Acknowledgement
Acknowledgment statement for any data distributed by NIAGADS:
Data for this study were prepared, archived, and distributed by the National Institute on Aging Alzheimer’s Disease Data Storage Site (NIAGADS) at the University of Pennsylvania (U24-AG041689), funded by the National Institute on Aging.
Use the study-specific acknowledgement statements below (as applicable):
For investigators using any data from this dataset:
Please cite/reference the use of NIAGADS data by including the accession NG00154.
For investigators using Dementia in Parkinson's Disease (sa000038) data:
The American cohort of samples for this project was collected by Dr. Cyrus Zabetian and Dora Yearout at VA Puget Sound. The Portuguese cohort was collected by Dr. Ana Morgadinho. Drs. Rita Guerreiro and Jose Bras as well as Kimberly Paquette, Kaitlyn Westra, and Andrew Pyman were involved in preparing the data. This work was supported by grant funding from NIH R56 AG070857. The author(s) thank the Van Andel Institute Genomics Core (RRID:SCR_022913) (Grand Rapids, MI) for providing whole-genome genotyping facilities and services.