Description

This dataset includes sequencing data from samples sequenced by the Alzheimer’s Disease Sequencing Project (ADSP) and other AD and related dementias studies. Samples are processed using a common workflow called VCPA (Variant Calling Pipeline and data management tool), a functionally equivalent CCDG/TOPMed pipeline.

Data Releases:

  1. The first release (July 30, 2018) included CRAMs, gVCFs, and phenotypes for 4,789 whole genomes. These data were called by GCAD using the VCPA1.0 pipeline.
  2. The second release (October 30, 2018) included an ADSP quality-controlled, project-level VCF for the 4,789 whole genomes previously released.
  3. The third release (February 18, 2020) included CRAMs, gVCFs, and phenotypes for 19,922 whole exomes. These data were called by GCAD using the VCPA1.1 pipeline.

Sample Summary per Data Type

Sample Set             | Accession | CRAMs     | gVCFs     | GATK Called Genotypes
ADSP Discovery - WGS   | snd10000  | n = 580   | n = 580   | n = 580
ADSP Discovery - WES   | snd10000  | n = 10088 | n = 10088 | NA
ADSP Extension - WGS   | snd10001  | n = 3400  | n = 3400  | n = 3400
ADNI-WGS-1             | snd10002  | n = 809   | n = 809   | n = 809
ADGC AA - WES          | snd10003  | n = 3144  | n = 3144  | NA
FASe Families - WES    | snd10004  | n = 1100  | n = 1100  | NA
Brkanac Families - WES | snd10005  | n = 75    | n = 75    | NA
Miami Families - WES   | snd10006  | n = 108   | n = 108   | NA
Columbia WHICAP - WES  | snd10007  | n = 3861  | n = 3861  | NA
Knight ADRC - WES      | snd10008  | n = 650   | n = 650   | NA
CBD - WES              | snd10009  | n = 346   | n = 346   | NA
PSP - WES              | snd10010  | n = 550   | n = 550   | NA
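
The per-set counts in the sample summary can be checked against the totals quoted in the release notes (4,789 whole genomes and 19,922 whole exomes). The short Python sketch below sums the counts by data type. The dictionary names are illustrative only, and the values are copied by hand from the summary rather than read from any released file.

```python
# Sample counts copied by hand from "Sample Summary per Data Type".
# The variable names are illustrative and not part of the dataset.
wgs_counts = {
    "ADSP Discovery - WGS": 580,
    "ADSP Extension - WGS": 3400,
    "ADNI-WGS-1": 809,
}
wes_counts = {
    "ADSP Discovery - WES": 10088,
    "ADGC AA - WES": 3144,
    "FASe Families - WES": 1100,
    "Brkanac Families - WES": 75,
    "Miami Families - WES": 108,
    "Columbia WHICAP - WES": 3861,
    "Knight ADRC - WES": 650,
    "CBD - WES": 346,
    "PSP - WES": 550,
}

# Sum each group so the totals can be compared with the release notes.
wgs_total = sum(wgs_counts.values())
wes_total = sum(wes_counts.values())

print(f"WGS samples: {wgs_total}")  # WGS samples: 4789
print(f"WES samples: {wes_total}")  # WES samples: 19922
```

Both sums agree with the release descriptions: the three WGS sample sets account for the 4,789 whole genomes, and the nine WES sample sets account for the 19,922 whole exomes.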

Available Filesets

Name                  | Accession | Version/Date       | Description/What’s New
WGS CRAMs/GATK gVCFs  | fsa000001 | VCPA1.0/2018.07.30 | Mapped to GRCh38
WGS QC Metrics        | fsa000001 | VCPA1.0/2018.07.30 | Sequencing Data Quality Control Metrics
Phenotypes/Pedigrees  | fsa000002 | 2020.01.15         | Phenotypes and Pedigree structures for all sequenced subjects
WGS Project Level VCF | fsa000003 | 2018.09.17         | ADSP quality control checked GATK joint called VCF containing 4789 whole-genomes.
WES CRAMs/GATK gVCFs  | fsa000004 | VCPA1.1/2020.01.15 | Mapped to GRCh38

View the File Manifest for a full list of files released in this dataset.