Description

This dataset includes Compressed Sequence Alignment files (CRAMs) mapped to GRCh38 and GATK-called gVCFs from the ADSP and ADNI studies. These data were called by the Genome Center for Alzheimer’s Disease (GCAD) using VCPA 1.0, a functionally equivalent CCDG/TOPMed pipeline. GCAD processed a total of 4789 whole genomes, including, 876 ADSP Family Discovery and Discovery Extension samples, 3104 ADSP Case Control Extension samples, and 809 ADNI samples. The next data release will include the ADSP quality control checked GATK joint called VCF containing all 4789 whole genomes.

Sample Summary per Data Type

WGS CRAMsWGS gVCFsGATK Called Genotypes
ADSP Discovery
snd10000
n = 580n = 580n = NA
ADSP Extension
snd10001
n = 3400n = 3400n = NA
ADNI-WGS-1
snd10002
n = 809n = 809n = NA

Available Filesets

NameAccessionVersion/DateDescription/What’s New
ADSP Discovery WGSCRAMs/GATK gVCFsnd10000VCPA1.0/2018.07.30Mapped to GRCh38
ADSP Extension WGSCRAMs/ GATK gVCFssnd10001 VCPA1.0/2018.07.30Mapped to GRCh38
ADNI-WGS-1 CRAMs/GATK gVCFssnd10002 VCPA1.0/2018.07.30Mapped to GRCh38
ADSP/ADNI QC MetricsNA VCPA1.0/2018.07.30Sequencing Data Quality Control Metrics
ADSP/ADNIPhenotypes/Pedigreesdnd000012018.07.30Phenotypes and Pedigree structures for all whole-genome sequenced subjects

View the File Manifest for a full list of files released in this dataset.