Frequency Aware Zygosity

This algorithm computes the zygosity of a sample by combining the sample's genotype with allele frequency information from a variant frequency catalog such as ExAC or 1000 Genomes. The zygosity is categorized as follows:

  • Homozygous Major: Two major alleles called (Wild Type)
  • Homozygous Minor: Two minor alleles called
  • Heterozygous: Two different alleles called
  • Hemizygous Major: A single major allele called on a sex chromosome (Wild Type)
  • Hemizygous Minor: A single minor allele called on a sex chromosome
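
The categories above can be sketched in Python as follows. This is a minimal illustration, not the product's implementation: the function name classify_zygosity, the VCF-style GT string input, and the major_allele index parameter are all assumptions made for the example. It also assumes that a single-allele (haploid) call indicates a hemizygous site and that any two different alleles count as heterozygous, even when neither is the major allele.

```python
from typing import Optional


def classify_zygosity(gt: str, major_allele: int) -> Optional[str]:
    """Classify a VCF-style GT string (e.g. "0/1", "1|1", "1").

    major_allele is the index of the most common allele at the site
    (0 = reference, 1 = first alternate, ...). Hypothetical helper.
    """
    # Treat phased ("|") and unphased ("/") separators the same way.
    alleles = gt.replace("|", "/").split("/")

    # A missing or empty call cannot be classified.
    if any(a in (".", "") for a in alleles):
        return None

    calls = [int(a) for a in alleles]

    # Assumption: a single called allele means a hemizygous site.
    if len(calls) == 1:
        if calls[0] == major_allele:
            return "Hemizygous Major"
        return "Hemizygous Minor"

    # Only diploid calls are handled beyond this point.
    if len(calls) != 2:
        return None

    if calls[0] != calls[1]:
        return "Heterozygous"
    if calls[0] == major_allele:
        return "Homozygous Major"
    return "Homozygous Minor"
```

For example, when the first alternate allele is the major allele, classify_zygosity("0/0", 1) returns "Homozygous Minor", whereas a zygosity computed without frequency information would report the same genotype as homozygous reference.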

Frequency information matters when computing zygosity because, at certain positions, the reference sequence carries the less common allele. The alternate allele frequency at such a position is often greater than 90%, so a sample that matches the reference there actually carries the minor allele. We denote the most common allele at a site as the major allele.

Note

At a site with multiple alternate alleles, the major allele could have a frequency less than 50% (e.g., Ref = 35%, Alt1 = 20%, Alt2 = 45%).
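
To illustrate how a major allele might be picked at such a site, here is a minimal Python sketch. The function name major_allele_index is hypothetical, and the sketch assumes the catalog supplies one frequency per alternate allele, with the reference frequency taken as whatever remains after subtracting their sum from 1. Ties are resolved in favor of the lower allele index, which is also an assumption.

```python
from typing import Sequence


def major_allele_index(alt_freqs: Sequence[float]) -> int:
    """Return the index of the most common allele at a site.

    Index 0 is the reference allele; index i (i >= 1) is the i-th
    alternate allele. Hypothetical helper for illustration only.
    """
    # Assumption: the reference frequency is the remainder after
    # all alternate allele frequencies are accounted for.
    ref_freq = max(0.0, 1.0 - sum(alt_freqs))
    freqs = [ref_freq, *alt_freqs]

    # max() returns the first maximal element, so ties go to the
    # lower index (the reference allele first).
    return max(range(len(freqs)), key=freqs.__getitem__)
```

With the frequencies from the note, major_allele_index([0.20, 0.45]) selects Alt2 (index 2) even though its frequency is below 50%.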

Requirements

Requires a genotype (GT) sample-level field.

Variants must first be annotated using a variant source that provides an Alternate Allele Frequency field.

Output

Creates a categorical field denoting the frequency-aware zygosity for each sample.