The contents of the Database of Genomic Variants can be downloaded as tab delimited text files. The first line of each file is the column description. Each variant is provided with an accession which is a stable identifier and will remain constant. Studies where CNV coordinates are based on a newer assembly are not mapped back to older builds. The NCBI Build 36 (hg18) download file will therefore contain less data than the GRCh37 (hg19).
We have provided three categories of files for users to download. The first set of files, contained in the "DGV variants" section, represents the data that is displayed in our primary DGV structural variants track. The second set of files, in the "Supporting Variants" section, are the sample level and supporting variants that are displayed in our Supporting Variants track. The last section is the "Filtered Variants" files. These are variants that have been removed from the database following our curation process.
Note that for the DGV Variants and Supporting Variants sections, a new file has been made available under the "Other Mappings" column. These files contain those variants that do not map to the reference assembly but instead to an alternate sequence/contig assembly.
DGV Gold Standard Variants
|Release Date||GRCh 37 (hg19)||GRCh 38 (hg38)|
Old versions (note that each file contains both DGV Variants and Supporting Variants)
|Release Date||Build 36 (hg18)||GRCh 37 (hg19)||Other Mappings|
A Copy Number Variation Map of the Human Genome (Nature Reviews Genetics, 2015)
Studies included in DGV and links to the original data
The table below outlines the studies that have been used as sources for the data and the number of regions included from each study. Here you can also download the original dataset for all studies.
|Display Name||Pyear||Regions||Pubmed Id||Reference Assembly||Link||Curation Comments|