After assuring the quality of the data, association testing can be performed.
Open the Filtered Data for Association Testing spreadsheet and make sure the Phenotype 1 Binary column header is still set as the dependent variable.
Choose Analysis >Genotype Association Tests.
Make sure the Additive Model: (dd) -> (Dd) -> (DD) radio button is selected and check only Correlation/Trend test and Exact form of Cochran-Armitage test under Test Statistic or Method.
Under Multiple Testing Correction, make sure Bonferroni Adjustment is checked and uncheck the other options.
Check Output data for P-P/Q-Q Plots and click Run.
Upon completion a new spreadsheet is created, Association Tests (Additive Model). This spreadsheet displays several association statistics for each SNP (Figure 7-1).
The results from the Exact Cochran-Armitage Test should be examined in the case when a SNP has a significant p-value but the counts in the contingency table of Case Status by Number of Minor Alleles has at least one count less than 5. In this case the assumptions of the Correlation/Trend test are violated.
In this study the most significant marker: SNP_A-2070191 (Corr/Trend p-value = 2.729e-7) has the following contingency table:
| dd | Dd | DD | Total | |
| Case | 100 | 112 | 19 | 231 |
| Control | 148 | 71 | 5 | 224 |
| Total | 248 | 183 | 24 | 455 |
This results in an Exact Armitage P-Value of 2.416e-7. There is little difference because all cell counts are 5 or higher.
Q-Q plots are generated by plotting the expected chi-squared values against the observed chi-squared values.
This will generate a straight line with a slope of 1 and y-intercept of 0. You should have a Q-Q plot that looks like Figure 7-2.
Similarly, you might also plot a P-P plot by using the expected -log10 P on the X axis and -log10 P on the Y.
Notice the full-domain view now has chromosome bands and the X-axis is represented by chromosome and physical position (Figure 7-3).
There are many ways to zoom in the genome browser: double-clicking on a chromosome in the full domain view (upper band), double-clicking a cytoband or gene in the Annotation Tracks pane (lower band), manually selecting a chromosome and/or position in the Graph Attributes tab (at the top above the Full Domain Band), or “rubber band ” zooming in the plot view itself.
Zooming displays the karyogram view of chromosome 6. More information about SNPs are available with different annotation tracks.
This displays the marker name, its p-value, chromosome, and position in the Data Console (bottom-left pane), along with additional links to online resources.
Note
You can increase the size of the Annotation Tracks window by dragging the top of the pane up (Figure 7-5 with additional zooming).
Manhattan plots are popular images for publication purposes as they color-code by chromosome making it easy to see where significant markers reside.
This will split the graph into 22 different colors, one for each chromosome (Figure 7-6). You can change the color of each chromosome by selecting its respective node in the color tab.
You can save all displayed plots in the plot view to a number of popular image formats.
This will bring up a preview window (Figure 7-7). Here you can manipulate various image parameters.
You have now performed a cursory genome-wide association study on a case/control phenotype. For more challenging analyses, try running association tests and regression on the other phenotypes. If you click on the first node in the Project Navigator, SNP_GWAS_Tutorial, you will get more information on what can be found with each phenotype.