Biomina/MedGen VariantDB: Annotation and filtering of variants detected using next-generation sequencing (tutorial)

Documentation : Annotations Explained

Quality Information
Stretch		Is variant located in stretch of repetitive sequence, as indicated by GATK recent versions. Annotations Stretch_Unit and Stretch_Length give more information about the stretch
Stretch_Unit		Unit of the stretch (e.g. GCC or T)
Stretch_Length		Length of the stretch on the present alleles.
AllelicRatio		Fraction of total reads called as alternative allele
Phred_Polymorphism		The Phred scaled probability of Probability that REF/ALT polymorphism exists at this site given sequencing data. Because the Phred scale is -10 * log(1-p), a value of 10 indicates a 1 in 10 chance of error, while a 100 indicates a 1 in 10^10 chance. The GATK values can grow very large when lots of NGS data is used to call.
Phred_Genotype		The Genotype Quality, as a Phred-scaled confidence at the true genotype is the one provided in GT. In diploid case, if GT is 0/1, then GQ is really L(0/1) / (L(0/0) + L(0/1) + L(1/1)), where L is the likelihood of the NGS sequencing data under the model of that the sample is 0/0, 0/1/, or 1/1.
Genotype		Observed genotype, homozygous or heterozygous
Ref_Allele_Depth		Number of reads passing GATK quality threshold, with the reference allele
Alt_Allele_Depth		Number of reads passing GATK quality threshold, with the alternative allele
Quality_By_Depth		Variant confidence (given as (AB+BB)/AA from the PLs) / unfiltered depth. (PL : phred-scaled likelyhood of the genotype). Low scores are indicative of false positive calls and artifacts. Note that QualByDepth requires sequencing reads associated with the samples with polymorphic genotypes.
Mapping_Quality		Root Mean Square of the mapping quality of the reads across all samples.
Base_Quality_Rank_Sum		The u-based z-approximation from the Mann-Whitney Rank Sum Test for base qualities (ref bases vs. bases of the alternate allele). Note that the base quality rank sum test can not be calculated for homozygous sites.
Mapping_Quality_Rank_Sum		The u-based z-approximation from the Mann-Whitney Rank Sum Test for mapping qualities (reads with ref bases vs. those with the alternate allele) Note that the mapping quality rank sum test can not be calculated for homozygous sites.
Read_Position_Rank_Sum		The u-based z-approximation from the Mann-Whitney Rank Sum Test for the distance from the end of the read for reads with the alternate allele; if the alternate allele is only seen near the ends of reads this is indicative of error. Note that the read position rank sum test can not be calculated for homozygous sites.
Strand_Bias		How much evidence is there for Strand Bias (the variation being seen on only the forward or only the reverse strand) in the reads? Higher SB values denote more bias (and therefore are more likely to indicate false positive calls).
Fisher_Strand_Bias		Phred-scaled p-value using Fisher's Exact Test to detect strand bias (the variation being seen on only the forward or only the reverse strand) in the reads. More bias is indicative of false positive calls. Note that the fisher strand test may not be calculated for certain complex indel cases or for multi-allelic sites.
VQSLOD		VQSLOD is the log odds ratio of being a true variant versus being false under the trained Gaussian mixture model when Variant Recalibration is applied.
DeltaPL		This field provides the likelihoods of the given genotypes (here, 0/0, 0/1, and 1/1). These are normalized, Phred-scaled likelihoods for each of the 0/0, 0/1, and 1/1, without priors. To be concrete, for the heterozygous case, this is L(data given that the true genotype is 0/1). The most likely genotype (given in the GT field) is scaled so that it's P = 1.0 (0 when Phred-scaled), and the other likelihoods reflect their Phred-scaled likelihoods relative to this most likely genotype
Tranches_Filter		Value in the VCF Filter Column. Mainly used by GATK Variant Recalibration.
RBQ		Average quality of reference-supporting bases (qual1)
ABQ		Average quality of variant-supporting bases (qual2)
PhredFisherP		Pred-scaled P-value from Fisher's Exact Test
Genomic_SuperDups_Score		Similarity of the SegDup region to the alternate location region. See UCSC table description for details.
Genomic_SuperDups_Location		Alternate location of the SegDup region. See UCSC table description for details.

Population_Frequency Information
Occurence_All_Samples_Any.Alternate		Absolute Occurence of the variant as an alternate call in all your samples
1000g2012apr_all	Deprecated	alternative allele frequency data in 1000 Genomes Project for ALL populations
1000g2012apr_afr	Deprecated	alternative allele frequency data in 1000 Genomes Project for African population
1000g2012apr_asn	Deprecated	alternative allele frequency data in 1000 Genomes Project for Asian population
1000g2012apr_amr	Deprecated	alternative allele frequency data in 1000 Genomes Project for admixed American population
1000g2012apr_eur	Deprecated	alternative allele frequency data in 1000 Genomes Project for European population
1000g2014oct_all	Deprecated	alternative allele frequency data in 1000 Genomes Project for ALL populations. Based on 201409 collection v5 (based on 201305 alignment) but including chrX and chrY data finally. ANNOVAR specific version with better indel matching.
1000g2014oct_afr	Deprecated	alternative allele frequency data in 1000 Genomes Project for African population. Based on 201409 collection v5 (based on 201305 alignment) but including chrX and chrY data finally. ANNOVAR specific version with better indel matching.
1000g2014oct_eas	Deprecated	alternative allele frequency data in 1000 Genomes Project for East-Asian population. Based on 201409 collection v5 (based on 201305 alignment) but including chrX and chrY data finally. ANNOVAR specific version with better indel matching.
1000g2014oct_sas	Deprecated	alternative allele frequency data in 1000 Genomes Project for South-Asian population. Based on 201409 collection v5 (based on 201305 alignment) but including chrX and chrY data finally. ANNOVAR specific version with better indel matching.
1000g2014oct_amr	Deprecated	alternative allele frequency data in 1000 Genomes Project for admixed American population. Based on 201409 collection v5 (based on 201305 alignment) but including chrX and chrY data finally. ANNOVAR specific version with better indel matching.
1000g2014oct_eur	Deprecated	alternative allele frequency data in 1000 Genomes Project for European population. Based on 201409 collection v5 (based on 201305 alignment) but including chrX and chrY data finally. ANNOVAR specific version with better indel matching.
esp5400_all	Deprecated	alternative allele frequency in all subjects in the NHLBI-ESP project with 5400 exomes
esp5400_ea	Deprecated	alternative allele frequency in European Americans in the NHLBI-ESP project with 5400 exomes
esp5400_aa	Deprecated	alternative allele frequency in African Americans in the NHLBI-ESP project with 5400 exomes
esp6500_all	Deprecated	alternative allele frequency in all subjects in the NHLBI-ESP project with 6500 exomes, including the indel calls and the chrY calls
esp6500_ea	Deprecated	alternative allele frequency in European Americans in the NHLBI-ESP project with 6500 exomes, including the indel calls and the chrY calls
esp6500_aa	Deprecated	alternative allele frequency in African Americans in the NHLBI-ESP project with 6500 exomes, including the indel calls and the chrY calls
ExAC_v02	Deprecated	Alternative allele frequency in ALL populations in the ExAC 65.000 exomes database. Note that these are not all healthy individuals. Although values are similar, there are discrepancies between ANNOVAR ALL_population frequencies (used here) and the data available online. This is due to ANNOVAR using raw allele counts, versus the ExAC Browser using adjusted allele counts (DP >= 10 & GQ >= 20)
ExAC_v03_AF_ALL		Alternative allele frequency in ALL populations in the ExAC 65.000 exomes database. Note that these are not all healthy individuals. Although values are similar, there are discrepancies between ANNOVAR ALL_population frequencies (used here) and the data available online. This is due to ANNOVAR using raw allele counts, versus the ExAC Browser using adjusted allele counts (DP >= 10 & GQ >= 20)
ExAC_v03_AN_ALL		Total Chromosome Count in ALL populations in the ExAC 65.000 exomes database for this variant. Only chromosomes used in variant calling are counted, discarding non-covered samples.
ExAC_v03_AF_AFR		Alternative allele frequency in AFR population in the ExAC 65.000 exomes database. Note that these are not all healthy individuals. Although values are similar, there are discrepancies between ANNOVAR AFR_population frequencies (used here) and the data available online. This is due to ANNOVAR using raw allele counts, versus the ExAC Browser using adjusted allele counts (DP >= 10 & GQ >= 20)
ExAC_v03_AN_AFR		Total Chromosome Count in AFR population in the ExAC 65.000 exomes database for this variant. Only chromosomes used in variant calling are counted, discarding non-covered samples.
ExAC_v03_AF_AMR		Alternative allele frequency in AMR population in the ExAC 65.000 exomes database. Note that these are not all healthy individuals. Although values are similar, there are discrepancies between ANNOVAR AMR_population frequencies (used here) and the data available online. This is due to ANNOVAR using raw allele counts, versus the ExAC Browser using adjusted allele counts (DP >= 10 & GQ >= 20)
ExAC_v03_AN_AMR		Total Chromosome Count in AMR population in the ExAC 65.000 exomes database for this variant. Only chromosomes used in variant calling are counted, discarding non-covered samples.
ExAC_v03_AF_EAS		Alternative allele frequency in EAS population in the ExAC 65.000 exomes database. Note that these are not all healthy individuals. Although values are similar, there are discrepancies between ANNOVAR EAS_population frequencies (used here) and the data available online. This is due to ANNOVAR using raw allele counts, versus the ExAC Browser using adjusted allele counts (DP >= 10 & GQ >= 20)
ExAC_v03_AN_EAS		Total Chromosome Count in EAS population in the ExAC 65.000 exomes database for this variant. Only chromosomes used in variant calling are counted, discarding non-covered samples.
ExAC_v03_AF_FIN		Alternative allele frequency in FIN population in the ExAC 65.000 exomes database. Note that these are not all healthy individuals. Although values are similar, there are discrepancies between ANNOVAR FIN_population frequencies (used here) and the data available online. This is due to ANNOVAR using raw allele counts, versus the ExAC Browser using adjusted allele counts (DP >= 10 & GQ >= 20)
ExAC_v03_AN_FIN		Total Chromosome Count in FIN population in the ExAC 65.000 exomes database for this variant. Only chromosomes used in variant calling are counted, discarding non-covered samples.
ExAC_v03_AF_NFE		Alternative allele frequency in NFE population in the ExAC 65.000 exomes database. Note that these are not all healthy individuals. Although values are similar, there are discrepancies between ANNOVAR NFE_population frequencies (used here) and the data available online. This is due to ANNOVAR using raw allele counts, versus the ExAC Browser using adjusted allele counts (DP >= 10 & GQ >= 20)
ExAC_v03_AN_NFE		Total Chromosome Count in NFE population in the ExAC 65.000 exomes database for this variant. Only chromosomes used in variant calling are counted, discarding non-covered samples.
ExAC_v03_AF_OTH		Alternative allele frequency in OTH population in the ExAC 65.000 exomes database. Note that these are not all healthy individuals. Although values are similar, there are discrepancies between ANNOVAR OTH_population frequencies (used here) and the data available online. This is due to ANNOVAR using raw allele counts, versus the ExAC Browser using adjusted allele counts (DP >= 10 & GQ >= 20)
ExAC_v03_AN_OTH		Total Chromosome Count in OTH population in the ExAC 65.000 exomes database for this variant. Only chromosomes used in variant calling are counted, discarding non-covered samples.
ExAC_v03_AF_SAS		Alternative allele frequency in SAS population in the ExAC 65.000 exomes database. Note that these are not all healthy individuals. Although values are similar, there are discrepancies between ANNOVAR SAS_population frequencies (used here) and the data available online. This is due to ANNOVAR using raw allele counts, versus the ExAC Browser using adjusted allele counts (DP >= 10 & GQ >= 20)
ExAC_v03_AN_SAS		Total Chromosome Count in SAS population in the ExAC 65.000 exomes database for this variant. Only chromosomes used in variant calling are counted, discarding non-covered samples.
Kaviar_150923_AF	Deprecated	Alternate allele frequency in kaviar database, release 2015-09-23.
Kaviar_150923_AN	Deprecated	Total Chromosome Count in kaviar database, release 2015-09-23. Only chromosomes used in variant calling are counted, discarding non-covered samples.
snp130_rsID	Deprecated	SNP-IDs for variants present in dbSNP.v130. No allele Frequencies
snp135_rsID	Deprecated	SNP-IDs for variants present in dbSNP.v135.
snp135_MAF	Deprecated	Minor Allele Frequency for variants present in dbSNP.v135.
snp135_NrChr	Deprecated	Number of Chromosomes in dbSNP to base the Minor Allele Frequency on.
snp135_Clinical	Deprecated	Are snps marked as clinically associated? 1 for true, zero for false.
snp137_rsID	Deprecated	SNP-IDs for variants present in dbSNP.v137.
snp137_MAF	Deprecated	Minor Allele Frequency for variants present in dbSNP.v137.
snp137_NrChr	Deprecated	Number of Chromosomes in dbSNP to base the Minor Allele Frequency on.
snp137_Clinical	Deprecated	Are snps marked as clinically associated? 1 for true, zero for false.
snp138_rsID		SNP-IDs for variants present in dbSNP.v138. ANNOVAR specific version is used for better indel matching
snp138_MAF		Minor Allele Frequency for variants present in dbSNP.v138. ANNOVAR specific version is used for better indel matching
snp138_NrChr		Number of Chromosomes in dbSNP to base the Minor Allele Frequency on.
snp138_Clinical		Are snps marked as clinically associated? 1 for true, zero for false.
snp142_rsID		SNP-IDs for variants present in dbSNP.v142. ANNOVAR specific version is used for better indel matching
snp142_MAF		Minor Allele Frequency for variants present in dbSNP.v142. ANNOVAR specific version is used for better indel matching
snp142_NrChr		Number of Chromosomes in dbSNP to base the Minor Allele Frequency on.
snp142_Clinical		Are snps marked as clinically associated? 1 for true, zero for false.
gADg_2.1_ALL_AF		MAF over all gnomAD Genome samples, release 2.1
gADg_2.1_ALL_AN		Number of chromosomes over all gnomAD Genome samples, release 2.1
gADg_2.1_ALL_Hom		Number of homozygous samples over all gnomAD Genome samples, release 2.1
gADg_2.1_Female_AF		MAF over all female gnomAD Genome samples, release 2.1
gADg_2.1_Female_AN		Number of chromosomes over all female gnomAD Genome samples, release 2.1
gADg_2.1_Female_Hom		Number of homozygous female samples over all gnomAD Genome samples, release 2.1
gADg_2.1_Male_AF		MAF over all male gnomAD Genome samples, release 2.1
gADg_2.1_Male_AN		Number of chromosomes over all male gnomAD Genome samples, release 2.1
gADg_2.1_Male_Hom		Number of homozygous male samples over all gnomAD Genome samples, release 2.1
gADg_2.1_NFE_AF		MAF over NFE gnomAD Genome samples, release 2.1
gADg_2.1_NFE_AN		Number of chromosomes over NFE gnomAD Genome samples, release 2.1
gADg_2.1_NFE_Hom		Number of homozygous samples over NFE gnomAD Genome samples, release 2.1
gADg_2.1_NFE_AF_Female		MAF over Female NFE gnomAD Genome samples, release 2.1
gADg_2.1_NFE_AN_Female		Number of chromosomes over Female NFE gnomAD Genome samples, release 2.1
gADg_2.1_NFE_Hom_Female		Number of homozygous samples over Female NFE gnomAD Genome samples, release 2.1
gADg_2.1_NFE_AF_Male		MAF over male NFE gnomAD Genome samples, release 2.1
gADg_2.1_NFE_AN_Male		Number of chromosomes over male NFE gnomAD Genome samples, release 2.1
gADg_2.1_NFE_Hom_Male		Number of homozygous samples over male NFE gnomAD Genome samples, release 2.1
gADg_2.1_NFE_NWE_AF		MAF over NFE_NWE gnomAD Genome samples, release 2.1
gADg_2.1_NFE_NWE_AN		Number of chromosomes over NFE_NWE gnomAD Genome samples, release 2.1
gADg_2.1_NFE_NWE_Hom		Number of homozygous samples over NFE_NWE gnomAD Genome samples, release 2.1
gADg_2.1_FIN_AF		MAF over FIN gnomAD Genome samples, release 2.1
gADg_2.1_FIN_AN		Number of chromosomes over FIN gnomAD Genome samples, release 2.1
gADg_2.1_FIN_Hom		Number of homozygous samples over FIN gnomAD Genome samples, release 2.1
gADg_2.1_FIN_AF_Female		MAF over Female FIN gnomAD Genome samples, release 2.1
gADg_2.1_FIN_AN_Female		Number of chromosomes over Female FIN gnomAD Genome samples, release 2.1
gADg_2.1_FIN_Hom_Female		Number of homozygous samples over Female FIN gnomAD Genome samples, release 2.1
gADg_2.1_FIN_AF_Male		MAF over male FIN gnomAD Genome samples, release 2.1
gADg_2.1_FIN_AN_Male		Number of chromosomes over male FIN gnomAD Genome samples, release 2.1
gADg_2.1_FIN_Hom_Male		Number of homozygous samples over male FIN gnomAD Genome samples, release 2.1
gADg_2.1_AFR_AF		MAF over AFR gnomAD Genome samples, release 2.1
gADg_2.1_AFR_AN		Number of chromosomes over AFR gnomAD Genome samples, release 2.1
gADg_2.1_AFR_Hom		Number of homozygous samples over AFR gnomAD Genome samples, release 2.1
gADg_2.1_AFR_AF_Female		MAF over Female AFR gnomAD Genome samples, release 2.1
gADg_2.1_AFR_AN_Female		Number of chromosomes over Female AFR gnomAD Genome samples, release 2.1
gADg_2.1_AFR_Hom_Female		Number of homozygous samples over Female AFR gnomAD Genome samples, release 2.1
gADg_2.1_AFR_AF_Male		MAF over male AFR gnomAD Genome samples, release 2.1
gADg_2.1_AFR_AN_Male		Number of chromosomes over male AFR gnomAD Genome samples, release 2.1
gADg_2.1_AFR_Hom_Male		Number of homozygous samples over male AFR gnomAD Genome samples, release 2.1
gADg_2.1_EAS_AF		MAF over EAS gnomAD Genome samples, release 2.1
gADg_2.1_EAS_AN		Number of chromosomes over EAS gnomAD Genome samples, release 2.1
gADg_2.1_EAS_Hom		Number of homozygous samples over EAS gnomAD Genome samples, release 2.1
gADg_2.1_EAS_AF_Female		MAF over Female EAS gnomAD Genome samples, release 2.1
gADg_2.1_EAS_AN_Female		Number of chromosomes over Female EAS gnomAD Genome samples, release 2.1
gADg_2.1_EAS_Hom_Female		Number of homozygous samples over Female EAS gnomAD Genome samples, release 2.1
gADg_2.1_EAS_AF_Male		MAF over male EAS gnomAD Genome samples, release 2.1
gADg_2.1_EAS_AN_Male		Number of chromosomes over male EAS gnomAD Genome samples, release 2.1
gADg_2.1_EAS_Hom_Male		Number of homozygous samples over male EAS gnomAD Genome samples, release 2.1
gADg_2.1_AMR_AF		MAF over AMR gnomAD Genome samples, release 2.1
gADg_2.1_AMR_AN		Number of chromosomes over AMR gnomAD Genome samples, release 2.1
gADg_2.1_AMR_Hom		Number of homozygous samples over AMR gnomAD Genome samples, release 2.1
gADg_2.1_AMR_AF_Female		MAF over Female AMR gnomAD Genome samples, release 2.1
gADg_2.1_AMR_AN_Female		Number of chromosomes over Female AMR gnomAD Genome samples, release 2.1
gADg_2.1_AMR_Hom_Female		Number of homozygous samples over Female AMR gnomAD Genome samples, release 2.1
gADg_2.1_AMR_AF_Male		MAF over male AMR gnomAD Genome samples, release 2.1
gADg_2.1_AMR_AN_Male		Number of chromosomes over male AMR gnomAD Genome samples, release 2.1
gADg_2.1_AMR_Hom_Male		Number of homozygous samples over male AMR gnomAD Genome samples, release 2.1
gADg_2.1_ASJ_AF		MAF over ASJ gnomAD Genome samples, release 2.1
gADg_2.1_ASJ_AN		Number of chromosomes over ASJ gnomAD Genome samples, release 2.1
gADg_2.1_ASJ_Hom		Number of homozygous samples over ASJ gnomAD Genome samples, release 2.1
gADg_2.1_ASJ_AF_Female		MAF over Female ASJ gnomAD Genome samples, release 2.1
gADg_2.1_ASJ_AN_Female		Number of chromosomes over Female ASJ gnomAD Genome samples, release 2.1
gADg_2.1_ASJ_Hom_Female		Number of homozygous samples over Female ASJ gnomAD Genome samples, release 2.1
gADg_2.1_ASJ_AF_Male		MAF over male ASJ gnomAD Genome samples, release 2.1
gADg_2.1_ASJ_AN_Male		Number of chromosomes over male ASJ gnomAD Genome samples, release 2.1
gADg_2.1_ASJ_Hom_Male		Number of homozygous samples over male ASJ gnomAD Genome samples, release 2.1
gADg_2.1_OTH_AF		MAF over OTH gnomAD Genome samples, release 2.1
gADg_2.1_OTH_AN		Number of chromosomes over OTH gnomAD Genome samples, release 2.1
gADg_2.1_OTH_Hom		Number of homozygous samples over OTH gnomAD Genome samples, release 2.1
gADg_2.1_OTH_AF_Female		MAF over Female OTH gnomAD Genome samples, release 2.1
gADg_2.1_OTH_AN_Female		Number of chromosomes over Female OTH gnomAD Genome samples, release 2.1
gADg_2.1_OTH_Hom_Female		Number of homozygous samples over Female OTH gnomAD Genome samples, release 2.1
gADg_2.1_OTH_AF_Male		MAF over male OTH gnomAD Genome samples, release 2.1
gADg_2.1_OTH_AN_Male		Number of chromosomes over male OTH gnomAD Genome samples, release 2.1
gADg_2.1_OTH_Hom_Male		Number of homozygous samples over male OTH gnomAD Genome samples, release 2.1
gADg_2.1_contr_AF		MAF over controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AN		Number of chromosomes over controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_Hom		Number of homozygous samples over controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AF_female		MAF over Female controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AN_female		Number of chromosomes over Female controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_Hom_female		Number of homozygous samples over Female controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AF_male		MAF over male controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AN_male		Number of chromosomes over male controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_Hom_male		Number of homozygous samples over male controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AF_nfe		MAF over NFE controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AN_nfe		Number of chromosomes over NFE controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_Hom_nfe		Number of homozygous samples over NFE controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AF_nfe_female		MAF over Female NFE controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AN_nfe_female		Number of chromosomes over Female NFE controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_Hom_nfe_female		Number of homozygous samples over Female NFE controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AF_nfe_male		MAF over male NFE controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AN_nfe_male		Number of chromosomes over male NFE controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_Hom_nfe_male		Number of homozygous samples over male NFE controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AF_nfe_nwe		MAF over NFE_NWE controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_AN_nfe_nwe		Number of chromosomes over NFEE_NWE controls only gnomAD Genome samples, release 2.1
gADg_2.1_contr_Hom_nfe_nwe		Number of homozygous samples over NFEE_NWE controls only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AF		MAF over non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AN		Number of chromosomes over non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_Hom		Number of homozygous samples over non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AF_female		MAF over Female non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AN_female		Number of chromosomes over Female non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_Hom_female		Number of homozygous samples over Female non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AF_male		MAF over male non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AN_male		Number of chromosomes over male non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_Hom_male		Number of homozygous samples over male non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AF_nfe		MAF over NFE non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AN_nfe		Number of chromosomes over NFE non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_Hom_nfe		Number of homozygous samples over NFE non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AF_nfe_female		MAF over Female NFE non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AN_nfe_female		Number of chromosomes over Female NFE non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_Hom_nfe_female		Number of homozygous samples over Female NFE non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AF_nfe_male		MAF over male NFE non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AN_nfe_male		Number of chromosomes over male NFE non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_Hom_nfe_male		Number of homozygous samples over male NFE non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AF_nfe_nwe		MAF over NFE_NWE non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_AN_nfe_nwe		Number of chromosomes over NFEE_NWE non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_ntpmed_Hom_nfe_nwe		Number of homozygous samples over NFEE_NWE non_topmed only gnomAD Genome samples, release 2.1
gADg_2.1_n.neuro_AF		MAF over non_neuro only gnomAD Genome samples, release 2.1
gADg_2.1_n.neuro_AN		Number of chromosomes over non_neuro only gnomAD Genome samples, release 2.1
gADg_2.1_n.neuro_Hom		Number of homozygous samples over non_neuro only gnomAD Genome samples, release 2.1
gADg_2.1_n.neuro_AF_female		MAF over Female non_neuro only gnomAD Genome samples, release 2.1
gADg_2.1_n.neuro_AN_female		Number of chromosomes over Female non_neuro only gnomAD Genome samples, release 2.1
gADg_2.1_n.neuro_Hom_female		Number of homozygous samples over Female non_neuro only gnomAD Genome samples, release 2.1
gADg_2.1_n.neuro_AF_male		MAF over male non_neuro only gnomAD Genome samples, release 2.1
gADg_2.1_n.neuro_AN_male		Number of chromosomes over male non_neuro only gnomAD Genome samples, release 2.1
gADg_2.1_n.neuro_Hom_male		Number of homozygous samples over male non_neuro only gnomAD Genome samples, release 2.1
gADe_2.1_ALL_AF		MAF over all gnomAD Exome samples, release 2.1
gADe_2.1_ALL_AN		Number of chromosomes over all gnomAD Exome samples, release 2.1
gADe_2.1_ALL_Hom		Number of homozygous samples over all gnomAD Exome samples, release 2.1
gADe_2.1_Female_AF		MAF over all female gnomAD Exome samples, release 2.1
gADe_2.1_Female_AN		Number of chromosomes over all female gnomAD Exome samples, release 2.1
gADe_2.1_Female_Hom		Number of homozygous female samples over all gnomAD Exome samples, release 2.1
gADe_2.1_Male_AF		MAF over all male gnomAD Exome samples, release 2.1
gADe_2.1_Male_AN		Number of chromosomes over all male gnomAD Exome samples, release 2.1
gADe_2.1_Male_Hom		Number of homozygous male samples over all gnomAD Exome samples, release 2.1
gADe_2.1_AFR_AF		MAF over AFR gnomAD Exome samples, release 2.1
gADe_2.1_AFR_AN		Number of chromosomes over AFR gnomAD Exome samples, release 2.1
gADe_2.1_AFR_Hom		Number of homozygous samples over AFR gnomAD Exome samples, release 2.1
gADe_2.1_AFR_AF_Female		MAF over Female AFR gnomAD Exome samples, release 2.1
gADe_2.1_AFR_AN_Female		Number of chromosomes over Female AFR gnomAD Exome samples, release 2.1
gADe_2.1_AFR_Hom_Female		Number of homozygous samples over Female AFR gnomAD Exome samples, release 2.1
gADe_2.1_AFR_AF_Male		MAF over male AFR gnomAD Exome samples, release 2.1
gADe_2.1_AFR_AN_Male		Number of chromosomes over male AFR gnomAD Exome samples, release 2.1
gADe_2.1_AFR_Hom_Male		Number of homozygous samples over male AFR gnomAD Exome samples, release 2.1
gADe_2.1_EAS_AF		MAF over EAS gnomAD Exome samples, release 2.1
gADe_2.1_EAS_AN		Number of chromosomes over EAS gnomAD Exome samples, release 2.1
gADe_2.1_EAS_Hom		Number of homozygous samples over EAS gnomAD Exome samples, release 2.1
gADe_2.1_EAS_AF_Female		MAF over Female EAS gnomAD Exome samples, release 2.1
gADe_2.1_EAS_AN_Female		Number of chromosomes over Female EAS gnomAD Exome samples, release 2.1
gADe_2.1_EAS_Hom_Female		Number of homozygous samples over Female EAS gnomAD Exome samples, release 2.1
gADe_2.1_EAS_AF_Male		MAF over male EAS gnomAD Exome samples, release 2.1
gADe_2.1_EAS_AN_Male		Number of chromosomes over male EAS gnomAD Exome samples, release 2.1
gADe_2.1_EAS_Hom_Male		Number of homozygous samples over male EAS gnomAD Exome samples, release 2.1
gADe_2.1_NFE_AF		MAF over NFE gnomAD Exome samples, release 2.1
gADe_2.1_NFE_AN		Number of chromosomes over NFE gnomAD Exome samples, release 2.1
gADe_2.1_NFE_Hom		Number of homozygous samples over NFE gnomAD Exome samples, release 2.1
gADe_2.1_NFE_AF_Female		MAF over Female NFE gnomAD Exome samples, release 2.1
gADe_2.1_NFE_AN_Female		Number of chromosomes over Female NFE gnomAD Exome samples, release 2.1
gADe_2.1_NFE_Hom_Female		Number of homozygous samples over Female NFE gnomAD Exome samples, release 2.1
gADe_2.1_NFE_AF_Male		MAF over male NFE gnomAD Exome samples, release 2.1
gADe_2.1_NFE_AN_Male		Number of chromosomes over male NFE gnomAD Exome samples, release 2.1
gADe_2.1_NFE_Hom_Male		Number of homozygous samples over male NFE gnomAD Exome samples, release 2.1

Gene_Information Information
Ensembl_VariantType
Ensembl_GeneLocation
Ensembl_cPointAA
Ensembl_cPointNT
Ensembl_GeneID
Ensembl_TranscriptID
Ensembl_Exon
RefSeq_VariantType
RefSeq_GeneLocation
RefSeq_cPointAA
RefSeq_cPointNT
RefSeq_Symbol
RefSeq_Transcript
RefSeq_Exon
RefSeq_Protein_Length_Difference
RefGene_VariantType
RefGene_GeneLocation
RefGene_cPointAA
RefGene_cPointNT
RefGene_Symbol
RefGene_Transcript
RefGene_Exon
RefGene_Protein_Length_Difference
UCSC_VariantType
UCSC_GeneLocation
UCSC_cPointAA
UCSC_cPointNT
UCSC_Symbol
UCSC_Transcript
UCSC_Exon
Effect		Effect of the Variant on protein (Non-Synonymous, frameshift, UTR, ...)
Effect_Impact		Impact class : high, moderate, modifier, unknown
Functional_Class		More detailed effect on coding variant: missensen, silent, ...
Gene_Symbol
Gene_Exon		Exon or Intron number. Numbering starts from 1.
Codon_Change		For exonic variants: The affected codon sequences for WT and MUT. For intronic/downstream variants : distance to the nearest exon.
Amino_Acid_Change		p.Point notation of protein alteration.
Gene_Transcript		Transcript reference for the listed data.
Gene_Coding		Is the affected gene coding or non-coding
Transcript_BioType		Type of the gene (protein_coding,miRNA,pseudogene,...)

Splice_Prediction Information
scsnv11_ADA		Ensemble splice alteration prediction, using Adaptive boosting, based on PWM, MaxEntScan, NNSplice and HSF. pmid: 25416802. Score represents probability of altered spicing (0-1), proposed cutoff is 0.6
scsnv11_RF		Ensemble splice alteration prediction, using random forest classification, based on PWM, MaxEntScan, NNSplice and HSF. pmid: 25416802. Score represents probability of altered spicing (0-1), proposed cutoff is 0.6
spidex_dPSI	License: deepgenomics	Deep Learning prediction of alternate splicing. deltaPSI is the maximal predicted difference in splicing over 12 tissues. This annotation source is only available for non-profic, academic users. Please send a signed copy of the EULA to activate it.
spidex_Zscore	License: deepgenomics	Deep Learning prediction of alternate splicing. Zscore is a ranking of the observed value among all predictions. Assuming normal distribution, this rank can be represented as a Zscore. Lower tail represents reduced splicing, upper tail represents a new splice site. This annotation source is only available for non-profic, academic users. Please send a signed copy of the EULA to activate it.

Pathogenicity_Prediction Information
ljb_LRT	Deprecated	Rescaled (0-1) likelihood ratio test of codon constrained. Higher score for more constrained codons. See dbNSFP paper for details (pmid: 21520341)
ljb_MutTast	Deprecated	Mutation Taster has four categories: (A)utomatic and (D)isease causing, for which the score is p-value for a true prediction. (N)on-deleterious and (P)olymorphism known, for which the score is 1 - p-value for a true prediction. The higher the score, the more likely the variant is deleterious. See dbNSFP paper for details (pmid: 21520341)
ljb_PhyloP	Deprecated	Rescaled (0-1) PhyloP score. Higher Score for more conserved sites. Prediction is Rescaled Score higher than 0.95 for a conserved site. See dbNSFP paper for details (pmid: 21520341)
ljb_PolyPhen2	Deprecated	Polyphen2 Scores. (D)amaging if over 0.85, (P)ossibly damaging if over 0.15, (B)enign otherwise. See dbNSFP paper for details (pmid: 21520341)
ljb_Sift	Deprecated	1 - sift scores. Damaging if over 0.95. See dbNSFP paper for details (pmid: 21520341)
ljb_GERP	Deprecated	Conservation Scores. Exact method to be added. High score is indicative of constrained site. (See Davidov et al, 2010, plos computational biology)
CADD_raw	Deprecated	CADD C-Scores. Integrated pathogenicity predication score. Raw Scores, see publication for details.
CADD_phred	Deprecated	CADD C-Scores in Phred Scale. E.g.: Scores above 20 are in the 1% top scoring variants, scores above 30 are in the 0.1% top scoring variants.
ljb26_SIFT	Deprecated	ToDo
ljb26_pp2_hdiv	Deprecated	ToDo
ljb26_pp2_hvar	Deprecated	ToDo
ljb26_LRT	Deprecated	ToDo
ljb26_MutationTaster	Deprecated	ToDo
ljb26_MutationAssessor	Deprecated	ToDo
ljb26_FATHMM	Deprecated	ToDo
ljb26_RadialSVM	Deprecated	ToDo
ljb26_LR	Deprecated	ToDo
ljb26_VEST3	Deprecated	ToDo
ljb26_CADD_phred	Deprecated	ToDo
ljb26_GERP_RS	Deprecated	ToDo
ljb26_PhyloP46	Deprecated	ToDo
ljb26_PhyloP100	Deprecated	ToDo
ljb26_SiPhy	Deprecated	ToDo
dbnsfp30a_SIFT		ToDo
dbnsfp30a_pp2_hdiv		ToDo
dbnsfp30a_pp2_hvar		ToDo
dbnsfp30a_LRT		ToDo
dbnsfp30a_MutationTaster		ToDo
dbnsfp30a_MutationAssessor		ToDo
dbnsfp30a_FATHMM		ToDo
dbnsfp30a_PROVEAN		ToDo
dbnsfp30a_MetaSVM		ToDo
dbnsfp30a_MetaLR		ToDo
dbnsfp30a_VEST3		ToDo
dbnsfp30a_CADD_phred		ToDo
dbnsfp30a_DANN_Score		ToDo
dbnsfp30a_GERP_RS		ToDo
dbnsfp30a_fathmm_mkl		ToDo
dbnsfp30a_fitCons		Integrated FitCons Score (confidence)
dbnsfp30a_PhyloP7		ToDo
dbnsfp30a_PhyloP20		ToDo
dbnsfp30a_phastCons7		ToDo
dbnsfp30a_phastCons20		ToDo
dbnsfp30a_SiPhy		ToDo
CADDv1.4_phred		CADD Phred Scores, for CADD v1.4. These scores are calculated genome wide for both SNVs and InDels.
MutationTaster		Mutation Taster has four categories: (A)utomatic and (D)isease causing, for which the score is p-value for a true prediction. (N)on-deleterious and (P)olymorphism known, for which the score is 1 - p-value for a true prediction. The higher the score, the more likely the variant is deleterious. See Mutation Taster Website for details. These values are queried using the MTQE interface, and should be more complete than Annovar scores. Missing values reflect intronic and exotic alleles and should be checked manually
SIFT_Score		1 - SIFT scores, similar to ANNOVAR scale. Damaging if over 0.95.
SIFT_Effect		SIFT Effect prediction. (T)olerated, (D)eleterious are native SIFT predictions. If no prediction was possible, a dot (.) is returned. Frameshift and nonsense mutations are annotated with an asterisk (*).
PROVEAN_Score		PROVEAN Score. A novel prediction from the authors of SIFT. They use a cut-off of -2.5 as threshold for damaging mutations.
PROVEAN_Effect		PROVEAN Effect prediction. (N)eutral, (D)amaging are native PROVEAN predictions. If no prediction was possible, a dot (.) is returned. Frameshift and nonsense mutations are annotated with an asterisk (*).
SIFT_Protein		SIFT/PROVEAN predictions are transcript specific. Ensembl protein IDs are used to indicate transcript
SIFT_Position		SIFT/PROVEAN predictions are transcript specific. This annotation indicates the altered AA position in the transcript.
SIFT_Type		Alteration type predicted by SIFT/PROVEAN
SIFT_AA_Change		Amino Acid change predicted by SIFT/PROVEAN
Grantham_Score		Grantham score for the AA change predicted by SIFT/PROVEAN

Oncology Information
COSMICv70_ID
COSMICv70_Tissue
COSMICv70_Occurence
Genotype_Ratio		Allelic Ratio, as called by GATK Genotyper. This is 0/0.5/1 for diploid samples, but can be informative for samples of high ploidy.
Ploidy		Sample Ploidy used during genotyping. By default, this is two. Use this value to interpret the Genotype Ratio.
Somatic_State		Somatic State of the variant. Only informative for MuTect/VarScan imports. States are 0:reference, 1:germline, 2:somatic, 3:LOH, 4:post-translational, 5:unknown.

Clinical Information
SNV_Link		Link to the ClinVar page. ClinVar is designed to provide a freely accessible, public archive of reports of the relationships among human variations and phenotypes, with supporting evidence (Single Nucleotide Variant Entries).
SNV_Match		ClinVar Match Type. ClinVar holds both single nucleotide variants and copy number variations. Only SNV variants can have exact matches. Exact matches are defined as same position, same reference and alternate alleles. All other matches, such as SNV inside a ClinVar CNV, or indel spanning a ClinVar SNV, are labeled as overlapping variants (Single Nucleotide Variant Entries)
SNV_AA_Match		ClinVar Match Type at Amino Acid level. ClinVar holds both single nucleotide variants and copy number variations. Only SNV variants can have exact matches. Exact matches are defined variants leading to the same amino-acid change. All other matches, such as SNV inside a ClinVar CNV, or indel spanning a ClinVar SNV, are labeled as overlapping variants (Single Nucleotide Variant Entries)
SNV_Gene		GeneSymbol listed as the affected gene in ClinVar (Single Nucleotide Variant Entries).
SNV_NM_id		RefSeq Transcript listed as the affected gene in ClinVar. For many variants, this information was not provided. In case of multiple transcripts, these values are very prone to errors (Single Nucleotide Variant Entries)!
SNV_NP_id		RefSeq Protein listed as the affected protein in ClinVar. This might be ambigous or incorrect in case of a ClinVar CNV matching a sample SNV. For many variants, this information was not provided. In case of multiple transcripts, these values are very prone to errors!
SNV_Effect		Effect on Protein as listed in ClinVar. For many variants, this information was not provided. In case of multiple transcripts, these values are very prone to errors (Single Nucleotide Variant Entries)!
SNV_Last_Update		Time Stamp of the latest update to this variant in ClinVar (Single Nucleotide Variant Entries).
SNV_Disease		Disease associated to the variant (Single Nucleotide Variant Entries).
SNV_Class		Classification with regard to pathogenicity (Single Nucleotide Variant Entries).
SNV_Class_Comment		Comments about classification with regard to pathogenicity (Single Nucleotide Variant Entries).
SNV_XRef_Allele		Variant-oriented links to external databases (Single Nucleotide Variant Entries).
SNV_XRef_Gene		Gene-oriented links to external databases (Single Nucleotide Variant Entries).
SNV_XRef_Disease		Disease-oriented links to external databases. Benign variants are labeled as 'AllHighlyPenetrant' (Single Nucleotide Variant Entries).
SNV_PubMed		Link to Pubmed with articles related to the variant (Single Nucleotide Variant Entries).

Gene_Ontology Information
GO_ID		GO_ID for terms associated to a gene, based on gene2go from ncbi. SNP_to_GeneID is taken from snpEff annotations for Effects that are NOT downstream,intergenic,none,upstream.
GO_Term		GO Terms associated to a gene, based on gene2go from ncbi. SNP_to_GeneID is taken from snpEff annotations for Effects that are NOT downstream,intergenic,none,upstream..
GO_Term_Type		GO Term type: Biological Process, cellular Component, molecular function
GO_Obsolete		0/1 : Within GO, terms are sometimes replaced by other terms. Old terms are kept as reference, but should no longer be used. This annotation indicates if a term has been replaced.
GO_First_Level_Parent_IDs		Gene Ontology ID, one-level up the hierarchy. This is slow!
GO_First_Level_Parent_Names		Gene Ontology ID, one-level up the hierarchy. This is slow!

User_Defined Information
Panel_Definition		If Variant affects a gene in a gene panel (based on ncbiGene), list the name of the panel
Panel_Description		If Variant affects a gene in a gene panel, list the name of the panel
Panel_Gene		Gene Panel entry, hit by the variant. Taken from ANNOVAR_refgene
Panel_Gene_Comment		Gene specific comments from the GenePanel entry. User Provided info

Custom_VCF_Fields Information

VariantDB

Import options

Configuration

Manage access

Generate PDF

Use our BETA server

Platform Settings

Gene Panels

Manage Variant Classifiers

Checkbox Lists

Usergroup Settings

Documentation : Annotations Explained