Data Available
To access this data, please log into DSS and submit an application.
Within the application, add this dataset (accession NG00106) in the “Choose a Dataset” section.
Once approved, you will be able to log in and access the data within the DARM portal.
Description
In 2018, 960 respondents from LASI-DAD who consented to the blood sample collection have been genotyped using Illumina Infinium genotyping platforms. The datasets being submitted include the original genotype assayed by the genotyping platforms, imputed data to the 1000G reference panel, as well as imputed data to the TOPMed reference panel. The first dataset is the genotype data assayed by the Illumina Infinium Global Screening Array-24 v2.0 BeadChip. It contains 1008 scans derived from 993 unique subjects (including 960 LASI-DAD subjects and 33 1000G control subjects) and is in PLINK format. The second dataset contains the imputed data to the 1000G reference panel (phase 3 v5) and is in vcf format. It contains 960 unique LASI-DAD subjects. The third dataset contains the imputed data to the TOPMed reference panel (r2) and is in vcf format. It contains 960 unique LASI-DAD subjects.
Sample Summary per Data Type
Sample Set | Accession | Data Type | Number of Samples |
---|---|---|---|
LASI-DAD GWAS GSA | snd10023 | GWAS | n = 1008 |
Available Filesets
Name | Accession | Latest Release | Description |
---|---|---|---|
LASI-DAD – GWAS Data | fsa000011 | NG00106.v1 | GWAS Data |
LASI-DAD - 1000G Imputed Data | fsa000012 | NG00106.v1 | 1000G Imputed Data |
LASI-DAD - TOPMed Imputed Data | fsa000013 | NG00106.v1 | TopMed Imputed Data |
LASI-DAD Association Results, Phenotypes, etc. | fsa000014 | NG00106.v1 | Association Results, Phenotypes, etc. |
View the File Manifest for a full list of files released in this dataset.
Sample information
The dataset contains 1008 scans derived from 993 unique subjects (including 960 LASI-DAD subjects and 33 1000G control subjects) and is in PLINK format. The DNA samples were genotyped at MedGenome Inc. using the Illumina Infinium Global Screening Array-24 v2.0 BeadChip.
Sample Set | Accession Number | Number of Subjects | Number of Samples |
---|---|---|---|
LASI-DAD GWAS GSA | snd10023 | 993 | 1,008 |
Related Studies
Consent Levels
Consent Level | Number of Subjects |
---|---|
GRU-IRB-PUB | n = 993 |
Visit the Data Use Limitations page for definitions of the consent levels above.
Approved Users
- Investigator:Chen, JingchunInstitution:University of Nevada, Las VegasProject Title:Classification of Alzheimer’s disease with Genetic Data and Artificial IntelligenceDate of Approval:May 8, 2024Request status:ApprovedResearch use statements:Show statementsTechnical Research Use Statement:Alzheimer's disease(AD) is the most common cause of dementia, accounting for 60% to 80% of cases that affect over six million people in the United States. The disease gradually progresses from mild cognitive impairment(MCI) to dementia, which takes more than a decade. Identifying individuals who have a high risk of AD earlier is essential for AD prevention and intervention. As the heritability of AD is high(up to 79%), genetic data should be powerful to identify individuals at high risk. Indeed, polygenic risk score (PRS), designed to estimate individual genetic liability by integrating large GWAS summary statistics and individual genotype data, has been shown to be promising for AD risk prediction(AUCs up to 84%). However, the prediction accuracy using a single PRS is still not sufficient for MCI and AD classification in clinical practice. We hypothesize that convolution neural network(CNN) models can improve the classification of AD and MCI by multiple integrating PRSs from multiple traits, multi-omics data (genotyping data, scRNA-seq), clinical data, and imaging data. The objective is to develop advanced AI algorithms and build data-driven models for disease risk assessment, earlier identifying individuals with high risk for MCI and AD. Our long-term goal is to develop and validate a prediction model that can be translated into clinical practice. Our CNN model has recently shown an improved performance for AD with PRSs from multiple traits(AUC 92.4%). We want to extend our approach to predicting AD and MCI in different ethnic groups and validate the results with independent datasets. To this end, we would like to apply for multi-omics data in NG00067.v9 from https://dss.niagads.org/datasets/ng00067/. With an extensive experience in genetic studies on complex disorders and disease modeling, we are confident that we will achieve the specified goals and promote the integration of genetic data with AI algorithms, facilitating data-driven, personalized care of AD. We expect to finish this study within 2 years with publication and grant application. We have IRB approval and will follow the rules for data sharing and acknowledgment.Non-Technical Research Use Statement:Alzheimer’s disease (AD), the most common form of dementia, that usually develops from mild cognitive impairment to dementia. There is currently no treatment to slow the progression of this disorder. But earlier identification of the individuals with higher risk maybe critical to prevent the disease. We propose a new approach to create models for classification of AD and MCI with artificial intelligence and genetic data. This study will have a significant value in personalized medicine for AD risk assessment, classification, and earlier intervention.We don’t have the planned collaboration with researchers outside Cleveland Clinic in the current analytic plans.
- Investigator:Cruchaga, CarlosInstitution:Washington University School of MedicineProject Title:The Familial Alzheimer Sequencing (FASe) ProjectDate of Approval:May 9, 2024Request status:ApprovedResearch use statements:Show statementsTechnical Research Use Statement:The goal of this study is to identify new genes and mutations that cause or increase risk for Alzheimer disease (AD), as well as protective factors. Individuals and families were selected from the Knight-ADRC (Washington University) and the NIA-LOAD study. Only families with at least three first-degree affected individuals were included. Families with pathogenic variants in the known AD or FTD genes, or in which APOE4 segregated with disease were excluded. At least two cases and one control were selected per family. Cases had an age at onset (AAO) after 65 yo and controls had a larger age at last assessment than the latest AAO within the family. Whole exome (WES) and whole genome sequencing (WGS) was generated for 1,235 individuals (285 families) that together with data from our collaborators and the ADSP family-based cohort (3,449 individuals and 757 families) will provide enough statistical power to identify new genes for AD. Dr. Tanzi (Harvard Medical School) will provide WGS from 400 families from the NIMH Alzheimer disease genetics initiative study. We will perform single variant and gene-based analyses to identify genes and variants that increase risk for disease in AD families. Single variant analysis will consist of a combination of association and segregation analyses. We will run family-based gene-based methods to identify genes that show and overall enrichment of variants in AD cases. We will also look for protective and modifier variants. To do this we will identify families loaded with AD cases, that also include individuals with a high burden of known risk variants but that do not develop the disease (escapees). We will use the sequence data and the family structure to identify variants that segregate with the escapee phenotype. The most promising variants and genes will be replicated in independent datasets (ADSP case-control, ADNI, Knight-ADRC, NIA-LOAD ). We will perform single variant and gene-based analyses to replicate the initial findings, and survival analysis to replicate the protective variants. We will select the most promising variants/genes for functional studiesNon-Technical Research Use Statement:Family-based approaches led to the identification of disease-causing Alzheimer’s Disease (AD) variants in the genes encoding APP, PSEN1 and PSEN2. The identification of these genes led to the A?-cascade hypothesis and to the development of drugs that target this pathway. Recently, we have identified rare coding variants in TREM2, ABCA7, PLD3 and SORL1 with large effect sizes for risk for AD, confirming that rare coding variants play a role in the etiology of AD. In this proposal, we will identify rare risk and protective alleles using sequence data from families densely affected by AD. We hypothesize that these families are enriched for genetic risk factors. We already have sequence data from 695 families (2,462 individuals), that combined with the ADSP and the NIMH dataset will lead to a dataset of more than 1,042 families (4,684 individuals). Our preliminary results support the flexibility of this approach and strongly suggest that protective and risk variants with large effect size will be found, which will lead to a better understanding of the biology of the disease.
- Investigator:Farrer, LindsayInstitution:Boston UniversityProject Title:ADSP Data AnalysisDate of Approval:January 22, 2024Request status:ApprovedResearch use statements:Show statementsTechnical Research Use Statement:As part of the Collaborative for Alzheimer's Disease genetics REsearch (CADRE: NIA grant U01-AG058654), we plan to analyze whole exome and whole genome sequence data generated from subjects with Alzheimer's disease (AD) and elderly normal controls. These data will be generated by the National Human Genome Institute Large-Scale Sequence Program. The goal of the planned analyses is to identify genes that have alleles that protect against or increase susceptibility to AD. We will evaluate variants detected in the sequence data for association with AD to identify protective and susceptibility alleles using the whole exome case-control data. We will also evaluate sequence data from multiplex AD families to identify variants associated with AD risk and protection, and evaluate variant co-segregation with AD. The family data will be whole genome data. The family-based data will be used to inform the cases control analysis and visa versa. We also will focus on structural variants (insertion-deletions, copy number variants, and chromosomal rearrangements). Evaluation of structural variants will involve both whole genome and whole exome data. Structural variants will be analyzed with single nucelotide variants detected and analyzed in the case-control and family-based data.Non-Technical Research Use Statement:We are attempting to identify all the inherited elements that contribute to Alzheimer's disease risk. To do this we will analyze DNA sequence data from subjects with Alzheimer's disease and elderly subjects who are cognitively normal. The sequence data from these 2 groups will be compared to identify differences that contribute to the risk of developing Alzheimer's disease of that protect against Alzheimer's disease. These DNA differences can be at a single site in the genetic code, or can span multiple sites, changing the copy number of DNA sequences. Both types of genetic variants will be examined.
- Investigator:Masters, ColinInstitution:The Florey Institute, The University of MelbourneProject Title:The Australian Imaging Biomarkers and Lifestyle (AIBL) Flagship Study of Ageing: Detecting and Preventing Alzheimer’s disease: Towards Lifestyle Interventions-Somatic mutation in Alzheimer's DiseaseDate of Approval:May 15, 2024Request status:ApprovedResearch use statements:Show statementsTechnical Research Use Statement:Project Title: The Australian Imaging Biomarkers and Lifestyle (AIBL) Flagship Study of Ageing: Detecting and Preventing Alzheimer’s disease: Towards Lifestyle Interventions - Somatic mutation in Alzheimer's Disease (sub-project)Objectives -- Somatic Mutation in AD is a project to identify non-congenitally acquired genetic risks associated with disease onset of sporadic Alzheimer’s disease (AD). Somatic mutation can be any form of alteration in DNA that occur after conception. As opposed to congenital, it’s generally not hereditary unless the germ cells are involved. These alterations can (but do not always) cause disease. We aim to identify somatic variants that contribute to sporadic AD. We believe that the detection of somatic mutations can overcome the flaws of the large genome-wide multiple testing and increase the signal-to-noise ratio to pinpoint the rare genetic determinants that were largely neglected by current genetic association studies.Study design -- We have collected 20 paired human brain microglial DNAs (treated as “tumour”) and whole blood DNAs (treated as “normal”) to call somatic mutations by a tumour-normal mode using a software, MuTect2 (Broad Institute). The sequence has been obtained from the whole genome. Hundreds of rare genetic variants have been identified to connect with AD.Analysis plan -- We’d like to validate our results using datasets like NG00067, NG00105 and NG00106. However, it’s ideal if we could access the alignment data (i.e., BAM files) as well. Because technically somatic calling is not simply a difference between normal (germline) and reference; but also calls for tumour against normal (germline) alongside alignment. MuTect2 is developed to identify somatic mutations. It works with or without matching normal. Once we get access to the alignment data, we will reprocess all samples using the MuTect2 without matching the normal pipeline. We'll call somatic mutations using those datasets and validate the rare genetic determinants that contribute to sporadic AD.Non-Technical Research Use Statement:Somatic Mutation in Alzheimer's disease is a project to identify non-congenitally acquired genetic risks associated disease onset of a sporadic Alzheimer’s disease (AD). We believe that detection of somatic mutations can pinpoint the rare genetic determinants that were largely neglected by current genetic association studies. In our pilot study, we have identified hundreds of rare genetic mutations that are strongly associated with AD. We'd like to validate our results using an independent cohort. We plan to reprocess NIH datasets using our own pipeline. But we would need to access the raw data rather than the processed data. This research will greatly accelerate the research on the molecular genetics of AD.
- Investigator:Pendergrass, RionInstitution:GenentechProject Title:Genetic Analyses Using Data from the Alzheimer’s Disease Sequencing Project (ADSP) and related studiesDate of Approval:October 3, 2024Request status:ApprovedResearch use statements:Show statementsTechnical Research Use Statement:The purpose of our study is to identify novel genetic factors associated with Alzheimer’s Disease, corticobasal degeneration (CBD) and progressive supranuclear palsy (PSP). This includes identifying genetic factors associated with the risk of these conditions, as well as genetic risk factors associated with age-at-onset (AAO) for these conditions. We will also evaluate genetic associations with sub-phenotypes individuals have within these broad disease categories, such as their Braak staging results which provide insights into the level of severity of Alzheimer’s. Thus we are requesting access to the set of genomic Whole Exome and Whole Genome Sequences (WES and WGS) have just been released through the National Institute on Aging Genetics of Alzheimer’s Disease Data Storage Site (DSS NIAGADS). The findings from our genetic association testing have the potential for identification of new therapeutic targets for Alzheimer's Disease, CBD, and PSP. The findings from our studies also have the potential for identification of genetic and phenotypic biomarkers that will be beneficial for subsetting patients in new ways. We will use standard genetic epidemiological methods to handle the WGS and WES data. We will also analyze cell type-specific expression differences in AD to identify biomarkers and disease pathways using standard gene expression analysis methods currently in use. We will also use other multi-omic and other genetic data that has now become available to further understand genetic association results we have found in AD.All data will remain anonymized and securely stored, and only those listed on our application and their staff will have access to these data. We will not share any of the individual level data outside of Genentech nor beyond the researchers on our application. We will adhere to all data use agreement stipulations through the DSS NIAGADS. We have a secure computational environment called Rosalind within Genentech where we will use these data. We have IT security staff that constantly monitor all our research computing, assuring safety and privacy of all of our stored data. We will not collaborate with researchers at other institutions.Non-Technical Research Use Statement:Genetic variation and gene expression data allows us to understand more of the genetic contribution to risk and protection from diseases such as Alzheimer’s and dementia. This information also allows us to identify important biological contributors to disease for developing effective treatment strategies, and identifying groups of individuals that would benefit most from new treatments. Our exploration of this relationship between genotype, disease traits, gene expression, and outcomes, through these datasets will allow us to pursue important new findings for disease treatment.
- Investigator:Pericak-Vance, MargaretInstitution:University of MiamiProject Title:Collaboration on Alzheimer Disease ResearchDate of Approval:October 24, 2024Request status:ApprovedResearch use statements:Show statementsTechnical Research Use Statement:We plan to analyze GWAS, whole exome and whole genome sequence data generated from subjects with Alzheimer's disease (AD) and elderly normal controls. The goal of the planned analyses is to identify genes that have alleles that protect against or increase susceptibility to AD. We will evaluate variants detected in the sequence data for association with AD to identify protective and susceptibility alleles using the whole exome and whole genome case-control data. We will also evaluate sequence data from multiplex AD families to identify variants associated with AD risk and protection, and evaluate variant co-segregation with AD. The family data will be whole genome data. The family-based data will be used to inform the cases control analysis and visa versa. We also will focus on structural variants (insertion-deletions, copy number variants, and chromosomal rearrangements). Evaluation of structural variants will involve both whole genome and whole exome data. Structural variants will be analyzed with single nucelotide variants detected and analyzed in the case-control and family-based dataNon-Technical Research Use Statement:We are attempting to identify all the inherited elements that contribute to Alzheimer's disease risk. To do this we will analyze DNA sequence data from subjects with Alzheimer's disease and elderly subjects who are cognitively normal. The sequence data from these 2 groups will be compared to identify differences that contribute to the risk of developing Alzheimer's disease of that protect against Alzheimer's disease. These DNA differences can be at a single site in the genetic code, or can span multiple sites, changing the copy number of DNA sequences. Both types of genetic variants will be examined.
- Investigator:Wainberg, MichaelInstitution:Sinai Health SystemProject Title:Uncovering the causal genetic variants, genes and cell types underlying brain disordersDate of Approval:September 5, 2024Request status:ApprovedResearch use statements:Show statementsTechnical Research Use Statement:We propose a multifaceted approach to elucidate and interpret genetic risk factors for Alzheimer's disease. First, we propose to perform a whole-genome sequencing meta-analysis of the Alzheimer's Disease Sequencing Project with the UK Biobank and All of Us to associate rare coding and non-coding variants with Alzheimer's disease and related dementias. We will explore a variety of case definitions in the UK Biobank and All of Us, including those based on ICD codes from electronic medical records (inpatient, primary care and/or death), self-report of Alzheimer's disease or Alzheimer's disease and related dementias, and/or family history of Alzheimer's disease or Alzheimer's disease and related dementias. We will perform single-variant, coding-variant burden, and non-coding variant burden tests using the REGENIE genome-wide association study toolkit.Second, we propose to develop statistical and machine learning models that can effectively infer (“fine-map”) the causal gene(s), variant(s), and cell type(s) underlying each association we find, as well as associations from existing genome-wide association studies and other Alzheimer's- and aging-related cohorts found in NIAGADS. In particular, we propose to improve causal gene identification by incorporating knowledge of gene function as a complement to functional genomics. For instance, we plan to develop improved methods for inferring biological networks, particularly from single-cell data, and integrate these networks with the results of the non-coding associations from our first aim to fine-map causal genes. To fine-map causal variants and cell types, we plan to integrate the associations from our first aim with single-nucleus chromatin accessibility data from postmortem brain cohorts to simultaneously infer which variant(s) are causal for each discovered locus and which cell type(s) they act through.Non-Technical Research Use Statement:We have a comprehensive plan to understand and explain the genetic factors that contribute to Alzheimer's disease. Our approach involves two main steps.First, we'll analyze genetic information from large research databases to identify rare genetic changes associated with Alzheimer's and related memory disorders. We'll look at both specific changes in genes and other parts of the genetic code. We'll use data from different studies and combine them to get a clearer picture.Second, we'll create advanced computer models that can help us figure out which specific genes, genetic changes, and cell types are responsible for these associations. This will help us pinpoint the most important factors contributing to Alzheimer's disease. We'll also analyze data from previous studies to build a more complete understanding of these genetic links.
- Investigator:Xiao, PengInstitution:University of Nebrask Medical CenterProject Title:Uncovering the genetic basis of Alzheimer's Diseases by integrating GWAS with multiomics approaches across different ethnicitiesDate of Approval:August 9, 2024Request status:ApprovedResearch use statements:Show statementsTechnical Research Use Statement:ObjectivesIn this study we aim to understand the system level understanding of Alzheimer's disease (AD) by integrating GWAS with robust multiomics datasets across diverse ethnic groups and harmonization of the results to include associated genes and pathways to understand underlying disease mechanisms and to inform our understanding of biological continuum of the diseases. Investigating AD associated variants as quantitative trait loci for epigenetic, transcriptomic, and proteomic layers to explore how variants in genes perturb pathways leading to AD.Study designWe will comprehensively examine the genetic architecture of Alzheimer's diseases based on different races (Caucasians, Latinos, Asians and Africans) GWAS data from publicly available datasets. We request access to as many datasets available in NIAGADS and other repositories like EADB-consortium, IGAP and we also have requested access to datasets through our literature search from corresponding authors for Asian cohorts. We will perform meta-analysis at two levels for GWAS datasets and find genome wide significant loci (GWS). Mendelian randomization analyses will be adapted to multi-omics setting through analyzing QTLs and GWS. We will perform correlation and enrichment analysis for significant findings from different omics layers.Analysis PlanGenome wide meta-analysis across different races to identify new loci and functional pathways influencing AD. Find genes most likely to be responsible for association signal with AD at each loci by applying mendelian randomization (MR) method. We plan to use MR method to combine multiomics (GWAS, eqtl, mqtl, aqtl and pqtl).Non-Technical Research Use Statement:To our knowledge our findings reveal crosstalk between epigenetic, genomic, and transcriptomic determinants of AD pathogenesis and define catalogues of candidate genes. In addition, rare or population specific common variants can be identified thus genes with underlying genetic support for an association with AD are likely to encode successful drug targets in clinical development. This should further lead to patient stratification.
- Investigator:Yang, JingjingInstitution:Emory UniversityProject Title:Novel Bayesian methods for integrating transcriptomic data in GWASDate of Approval:June 12, 2024Request status:ApprovedResearch use statements:Show statementsTechnical Research Use Statement:The proposed project aims to derive novel statistical methods to integrate multi-omics data with genome-wide association studies (GWAS) summary data for studying complex phenotypes. First, we will develop novel statistical methods to integrate transcriptomic, proteomic, and DNA methylation molecular trait data with GWAS data, for the goal of identifying risk genes with interpretable biological functions. We will develop novel statistical methods to learn molecular QTL information by using external individual-level transcriptomics data sets such as GTEx V8 and summary-level molecular QTL datasets, and then integrate this information with the whole genome sequence data from ADSP to prioritize risk genes associated with AD-related phenotypes. We are interested in studying all AD-related phenotypes profiled for the ADSP samples, especially Alzheimer’s disease (AD) and AD-related complex phenotypes. Especially, our lab has access to the ROS/MAP multi-omics data shared by the Rush Alzheimer’s Disease Center (http://www.radc.rush.edu/). All samples in the ROS/MAP study are well-characterized with extensive complex phenotypes profiled, including clinical diagnosis of AD, AD-related complex phenotypes, and psychological phenotypes. We will combine the whole genome sequence data from both ADSP and ROS/MAP samples to increase the total sample size in our study, thus improving the study power.Second, we will investigate integrating multi-omics data and polygenic risk scores, with clinical variables to derive biomarkers for Alzheimer's disease. We will also use both ROS/MAP and ADSP data to improve the study power.We plan to use ADSP data as a validation data set to test our derived methods. We are not limited to studying AD only. We are flexible to study any complex phenotypes that are profiled for both ADSP and ROS/MAP samples.Non-Technical Research Use Statement:This proposed project is to develop novel statistical methods to integrate multi-omics data such as summary-level molecular QTL data with GWAS summary data to study complex phenotypes, prioritizing potential causal genes. i) We will develop novel statistical methods to leverage summary-level molecular QTL data of multiple cohorts and ancestries. ii) We will apply our developed methods to study AD-related phenotypes. iii) We will derive risk prediction models using multiple modalities of omics, clinical, and image data, for predicting AD dementia. We propose to apply our proposed methods to the applied genomic analysis data and ROS/MAP multi-omics data to study AD-related phenotypes that are profiled for both ADSP and ROS/MAP samples, including AD, AD-related pathology traits, and related psychological disorders.
- Investigator:Yokoyama, JenniferInstitution:University of California, San FranciscoProject Title:Trans-ethnic meta-analysis and fine-mapping of Alzheimer’s Disease lociDate of Approval:August 14, 2023Request status:ClosedResearch use statements:Show statementsTechnical Research Use Statement:Alzheimer’s disease (AD) is a leading cause of death and disability worldwide. Despite the tremendous burden of the disease and arduous research efforts in the field, there are currently no disease-modifying treatments. Better elucidation of the genetic etiology of AD is needed to drive further drug discovery. Although the largest genome-wide association study (GWAS) of AD included over one million individuals of European ancestry and identified 38 associated loci, previous studies have highlighted the disproportionate disease risk and differing genetic architecture for AD across global populations. We hypothesize that a trans-ancestry approach would improve the generalizability of GWAS results, increase statistical power for locus discovery, and improve fine-mapping resolution to identify putative causal variants. Using a method that accounts for ancestral heterogeneity, we plan to leverage published and de-novo GWAS from individuals of European, East Asian, African American, South Asian, and Caribbean Hispanic ancestry and performed the largest trans-ancestry meta-analysis of AD to date. This method has allowed us to identify a novel AD locus near the Lymphocyte Cytosolic Protein 1 (LCP1) gene and prioritize an intronic SNP, rs2146890, as likely driving the observed association. Our findings further support the involvement of immune-related pathways in AD pathogenesis and highlight the importance of multi-ancestry representation in genetic studies. We propose utilizing the LASI-DAD GWAS and Imputation dataset to further extend these results by completing additional trans-ancestry meta-analyses.Non-Technical Research Use Statement:Alzheimer’s disease (AD) is a leading cause of death and disability worldwide. Although the largest genome-wide association study (GWAS) of AD included over one million individuals of European ancestry and identified 38 associated loci, previous studies have highlighted the disproportionate disease risk and differing genetic architecture for AD across global populations. Our findings have supported the involvement of immune-related pathways in AD pathogenesis and highlight the importance of multi-ancestry representation in genetic studies. We will use the LASI-DAD GWAS and Imputation data in conjunction with existing open datasets from individuals of European, East Asian, African American, South Asian, and Caribbean Hispanic ancestry to perform the largest trans-ancestry meta-analysis of Alzheimer’s disease to date.
- Investigator:Zhao, JinyingInstitution:University of FloridaProject Title:Identifying novel biomarkers for human complex diseases using an integrated multi-omics approachDate of Approval:November 21, 2023Request status:ApprovedResearch use statements:Show statementsTechnical Research Use Statement:GWAS, WES and WGS have identified many genes associated with Alzheimer’s Dementia (AD) and its related traits. However, the identified genes thus far collectively explain only a small proportion of disease heritability, suggesting that more genes remained to be identified. Moreover, there is a clear gender and ethnic disparity for AD susceptibility, but little research has been done to identify gender- and ethnic-specific variants associated with AD. Of the many challenges for deciphering AD pathology, lacking of efficient and power statistical methods for genetic association mapping and causal inference represents a major bottleneck. To tackle this challenge, we have developed a set of novel statistical and bioinformatics approaches for genetic association mapping and multi-omics causation inference in large-scale ethnicity-specific epidemiological studies. The goal of this project is to leverage the multi-omics and clinical data archived by the ADSP, ADNI, ADGC as well as other AD-related data repositories to identify novel genes and molecular markers for AD. Specifically, we will (1) validate our novel methods for identifying novel risk and protective genomic variants and multi-omics causal pathways of AD; (2) identify novel ethnicity- and gender-specific genes and molecular causal pathways of AD. We will share our results, statistical methods and computational software with the scientific community.Non-Technical Research Use Statement:Although many genes have been associated with Alzheimer’s Dementia (AD), these genes altogether explain only a small fraction of disease etiology, suggesting more genes remained to be identified. Of the many challenges for deciphering AD pathology, lacking of power statistical methods represents a major bottleneck. To tackle this challenge, we have developed a set of novel statistical and bioinformatics approaches for genetic association mapping and multi-omics causation inference in large-scale ethnicity-specific epidemiological studies. The goal of this project is to leverage the rich genetic and other omic data along with clinical data archived by the ADSP, ADNI, ADGC as well as other AD-related data repositories to identify novel genes and molecular markers for AD. Such results will enhance our understanding of AD pathogenesis and may also serve as biomarkers for early diagnosis and therapeutic targets.
- Investigator:Zhi, DeguiInstitution:University of Texas Health Science Center at HoustonProject Title:Genetics of deep-learning-derived neuroimaging endophenotypes for Alzheimer's DiseaseDate of Approval:October 2, 2023Request status:ExpiredResearch use statements:Show statementsTechnical Research Use Statement:Alzheimer’s disease (AD) affects 5.6 million Americans over the age of 65 and exacts tremendous and increasing demands on patients, caregivers, and healthcare resources. Our current understanding of the biology and pathophysiology of AD is still limited, hindering advances in the development of therapeutic and preventive strategies. Existing genetic studies of AD have some success but these explain only a fraction of the overall disease risk, suggesting opportunities for additional discoveries. The proposed project will leverage existing neuroimaging and genetic data resources from the UK Biobank, the Alzheimer’s Disease Sequencing Project (ADSP), the Alzheimer’s Disease Neuroimaging Initiative (ADNI), and the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium, and will be conducted by a multidisciplinary team of investigators. We will derive AD endophenotypes from neuroimaging data in the UK Biobank using deep learning (DL). We will identify novel genetic loci associated with DL-derived imaging endophenotypes and optimize the co-heritability of these endophenotypes with AD-related phenotypes using UK Biobank genetic data. We will leverage resources and collaborations with AD Consortia and the power of DL-derived neuroimaging endophenotypes to identify novel genes for Alzheimer’s Disease and AD-related traits. Also, we will develop DL-based neuroimaging harmonization and imputation methods and distribute implementation software to the research community. We expect to discover new genes relevant to AD which may leads to understanding of molecular basis of AD and potential new treatment.Non-Technical Research Use Statement:Alzheimer’s disease (AD) exacts a tremendous burden on patients, caregivers, and healthcare resources. Our current understanding of the biology of AD is still limited, hindering advances in the development of treatment and prevention. Existing genetic studies of AD have some success but more studies are needed. The proposed project will leverage existing neuroimaging and genetic data resources from the UK Biobank, the Alzheimer’s Disease Sequencing Project (ADSP) and other consortia and will be conducted by a multidisciplinary team of investigators. We will derive new AD relevant intermediate phenotypes from neuroimaging data using deep learning (DL), an AI approach. We will identify novel genetic loci associated with these phenotypes. Also, we will develop imaging harmonization and imputation methods and distribute implementation software to the research community. We expect to discover new genes relevant to AD which may leads to understanding of molecular basis of AD and potential new treatment.
Acknowledgement
Acknowledgment statement for any data distributed by NIAGADS:
Data for this study were prepared, archived, and distributed by the National Institute on Aging Alzheimer’s Disease Data Storage Site (NIAGADS) at the University of Pennsylvania (U24-AG041689), funded by the National Institute on Aging.
Use the study-specific acknowledgement statements below (as applicable):
For investigators using any data from this dataset:
Please cite/reference the use of NIAGADS data by including the accession NG00106.
For investigators using The Diagnostic Assessment of Dementia for the Longitudinal Aging Study of India (LASI-DAD) (sa000019) data:
In text: "The Longitudinal Aging Study in India, Diagnostic Assessment of Dementia data is sponsored by the National Institute on Aging (grant numbers R01AG051125 and U01AG065958) and is conducted by the University of Southern California."
In references: "The Longitudinal Aging Study in India, Diagnostic Assessment of Dementia Study. Produced and distributed by the University of Southern California with funding from the National Institute on Aging (grant numbers R01AG051125 and U01AG065958), Los Angles, CA."
Related Publications
- Lee J, Dey AB. Introduction to LASI-DAD: The Longitudinal Aging Study in India-Diagnostic Assessment of Dementia. J Am Geriatr Soc. 2020 Aug;68 Suppl 3(Suppl 3):S3-S4. doi: 10.1111/jgs.16740. PMID: 32815600; PMCID: PMC7513796. PubMed link
- Lee J, Khobragade PY, Banerjee J, Chien S, Angrisani M, Perianayagam A, Bloom DE, Dey AB. Design and Methodology of the Longitudinal Aging Study in India-Diagnostic Assessment of Dementia (LASI-DAD). J Am Geriatr Soc. 2020 Aug;68 Suppl 3(Suppl 3):S5-S10. doi: 10.1111/jgs.16737. PMID: 32815602; PMCID: PMC7503220. PubMed link
- Smith JA, Zhao W, Yu M, Rumfelt KE, Moorjani P, Ganna A, Dey AB, Lee J, Kardia SLR. Association Between Episodic Memory and Genetic Risk Factors for Alzheimer’s Disease in South Asians from the Longitudinal Aging Study in India-Diagnostic Assessment of Dementia (LASI-DAD). J Am Geriatr Soc. 2020 Aug;68 Suppl 3(Suppl 3):S45-S53. doi: 10.1111/jgs.16735. PMID: 32815605; PMCID: PMC7507858. PubMed link