" 'Where is the missing heritability?' is a question asked frequently in genetic research. The difficulty seems to come down to the common disease/common variant hypothesis not holding up." » Read more
Updated: November 2009
Level: Beginner
Modules: HelixTree, WGA, Regression
The following tutorials are designed to systematically introduce you to a few of the new features and enhancements in SVS 7, particularly focusing on SNP association. They are not meant to replicate all the workflows you might use in a complete analysis, but instead touch on a sampling of the more typical scenarios you may come across in your own studies.
Golden Helix offers free webinar-based training and a host of analytic services covering every aspect of genetic association studies including study design, quality assurance, and data analysis.
Now that you have uncovered the top hits, you can use them in a full vs. reduced model regression analysis to see how much they add to the explanation of case/control status beyond what other associated phenotypic variables already explain.
At this point you are only interested in the top hits that passed a whole genome chi-squared test, so you need to subset them out from the original filtered spreadsheet.
For this tutorial only the top two hits will be considered as they were the only ones to meet a 0.05 Bonferroni significance level.
A new spreadsheet, Association Tests (Basic Allelic Tests) - Mapped Sheet 2, is created. This spreadsheet will be used to isolate only these two markers in the original filtered genotype spreadsheet.
A new spreadsheet tab is created, HM_All - Mapped Sheet 3, with only the two significant SNP columns active. Since it is hard to see this with so many columns, you can search for those SNPs in the spreadsheet to confirm.
In order to perform regression, the phenotypic variables need to be activated.
To clean up the project a little, first create a column subset spreadsheet from these active columns and then create a top-level spreadsheet.
In order to perform regression analysis on genotypic variables, you need to recode them as integers based on a specified genetic model.
Upon completion, a Regression Statistics Viewer window will appear (Figure 14) with the results for the full vs. reduced model. Notice SNP_A-4236717 has an odds ratio of ~2.92. Not bad...
© 2010 Golden Helix, Inc. All Rights Reserved