Id: | hg38/cnv_collections/gnomAD.v4.1_Genome_SV |
Type: | cnv_collection |
Version: | 0 |
Summary: |
gnomAD v4.1 Genome SV collection |
Description: |
The gnomAD v4 Genome Structural Variant (SV) Collection is a comprehensive resource cataloging structural variations across human genomes, derived from whole-genome sequencing data. This dataset represents one of the largest publicly available SV collections, encompassing deletions, duplications, insertions, inversions, and translocations (including complex rearrangements labeled as BND). By integrating data from diverse populations, gnomAD v4 aims to provide a detailed view of SV diversity, frequency, and distribution in the human genome, enabling researchers to better understand the role of structural variations in health and disease. For more information: https://gnomad.broadinstitute.org/news/2023-11-v4-structural-variants/ Collins et al, A structural variation reference for medical and population genetics, Nature 2020 Downloaded on 01/07/2025 from https://storage.googleapis.com/gcp-public-data--gnomad/release/4.1/genome_sv/gnomad.v4.1.sv.sites.vcf.gz DataPrep.py converts vcf file to GRR format (only taking deletion and duplications, dropping records where two ends of an SV are on different chromosomes) and creates genomic_resource.yaml file. |
Labels: |
ID | Type | Default annotation | Description | Histogram | Range |
---|---|---|---|---|---|
deletion_duplication | str |
deletion_duplication |
duplication or deletion
|
![]() |
deletion, duplication |
cnv_name | str |
cnv_name |
Handy name to refer to the CNV.
|
No histogram: Empty histogram for cnv_name in a region: Too many unique values 101 for categorical histogram. |
NO DOMAIN |
SVLEN | int |
SVLEN |
SV length
|
![]() |
[50.000, 222470487.000] |
AN | int |
AN |
Total number of alleles genotyped (biallelic sites only).
|
![]() |
[1.000, 126092.000] |
AC | int |
AC |
Number of non-reference alleles observed (biallelic sites only).
|
![]() |
[1.000, 126092.000] |
AF | float |
AF |
Allele frequency (biallelic sites only).
|
![]() |
[0.000, 1.000] |
AN_afr | int |
AN_afr |
Total number of African-American alleles genotyped (biallelic sites only).
|
![]() |
[0.000, 33816.000] |
AN_amr | int |
AN_amr |
Total number of Latino alleles genotyped (biallelic sites only).
|
![]() |
[0.000, 12594.000] |
AN_asj | int |
AN_asj |
Total number of Ashkenazi Jewish alleles genotyped (biallelic sites only).
|
![]() |
[0.000, 3180.000] |
AN_eas | int |
AN_eas |
Total number of East Asian alleles genotyped (biallelic sites only).
|
![]() |
[0.000, 4054.000] |
AN_fin | int |
AN_fin |
Total number of Finnish alleles genotyped (biallelic sites only).
|
![]() |
[0.000, 6476.000] |
AN_mid | int |
AN_mid |
Total number of Middle Eastern alleles genotyped (biallelic sites only).
|
![]() |
[0.000, 64.000] |
AN_nfe | int |
AN_nfe |
Total number of Non-Finnish European alleles genotyped (biallelic sites only).
|
![]() |
[0.000, 59088.000] |
AN_sas | int |
AN_sas |
Total number of South Asian alleles genotyped (biallelic sites only).
|
![]() |
[0.000, 4406.000] |
AC_afr | int |
AC_afr |
Number of non-reference African-American alleles observed (biallelic sites only).
|
![]() |
[0.000, 33816.000] |
AC_amr | int |
AC_amr |
Number of non-reference Latino alleles observed (biallelic sites only).
|
![]() |
[0.000, 12594.000] |
AC_asj | int |
AC_asj |
Number of non-reference Ashkenazi Jewish alleles observed (biallelic sites only).
|
![]() |
[0.000, 3180.000] |
AC_eas | int |
AC_eas |
Number of non-reference East Asian alleles observed (biallelic sites only).
|
![]() |
[0.000, 4054.000] |
AC_fin | int |
AC_fin |
Number of non-reference Finnish alleles observed (biallelic sites only).
|
![]() |
[0.000, 6476.000] |
AC_mid | int |
AC_mid |
Number of non-reference Middle Eastern alleles observed (biallelic sites only).
|
![]() |
[0.000, 64.000] |
AC_nfe | int |
AC_nfe |
Number of non-reference Non-Finnish European alleles observed (biallelic sites only).
|
![]() |
[0.000, 59088.000] |
AC_sas | int |
AC_sas |
Number of non-reference South Asian alleles observed (biallelic sites only).
|
![]() |
[0.000, 4406.000] |
AF_afr | float |
AF_afr |
African-American allele frequency (biallelic sites only).
|
![]() |
[0.000, 1.000] |
AF_amr | float |
AF_amr |
Latino allele frequency (biallelic sites only).
|
![]() |
[0.000, 1.000] |
AF_asj | float |
AF_asj |
Ashkenazi Jewish allele frequency (biallelic sites only).
|
![]() |
[0.000, 1.000] |
AF_eas | float |
AF_eas |
East Asian allele frequency (biallelic sites only).
|
![]() |
[0.000, 1.000] |
AF_fin | float |
AF_fin |
Finnish allele frequency (biallelic sites only).
|
![]() |
[0.000, 1.000] |
AF_mid | float |
AF_mid |
Middle Eastern allele frequency (biallelic sites only).
|
![]() |
[0.000, 1.000] |
AF_nfe | float |
AF_nfe |
Non-Finnish European allele frequency (biallelic sites only).
|
![]() |
[0.000, 1.000] |
AF_sas | float |
AF_sas |
South Asian allele frequency (biallelic sites only).
|
![]() |
[0.000, 1.000] |
Filename | Size | md5 |
---|---|---|
DataPrep.py | 7.58 KB | ba2d4e268414810cba7614ed25289307 |
genomic_resource.yaml | 5.23 KB | 5abba66d9ecb3d395989f028e5ab07d6 |
gnomad.v4.1.sv.sites.tsv | 328.98 MB | 96ad8177e46fa9d5bbcef48068516b93 |
gnomad.v4.1.sv.sites.tsv.dvc | 109.0 B | c9c7278a49a6239a7faa2f44ce24d497 |
gnomad.v4.1.sv.sites.vcf | 11.88 GB | 59ee22c4615e584d3d663a2af37b9e84 |
gnomad.v4.1.sv.sites.vcf.dvc | 111.0 B | e4f9060b1f79dcf581f1712b797f1fcd |
statistics/ |