This vignette illustrates the basic usage of the SNPknock
package in combination with the phasing software fastphase
(Scheet and Stephens 2006) to create knockoffs of unphased genotypes or phased haplotypes (Sesia, Sabatti, and Candès 2019, Sesia et al. (2019)). Since fastphase
is not available as an R package, this functionality of SNPknock
requires the user to first obtain a copy of fastphase
.
To learn more about the application of SNPknock
to large genome-wide association studies (Sesia et al. 2019), visit: https://msesia.github.io/knockoffzoom/.
fastphase
fastphase
is a phasing and imputation tool based on the hidden Markov model described in (Scheet and Stephens 2006).
Binary executables for Linux and Mac OS are available from http://scheet.org/software.html.
Before continuing with this tutorial, download the fastphase
tarball from the above link and extract the fastphase
executable file into a convenient directory (e.g. “~/bin/”).
Finally, we can use the hidden Markov model created above to generate knockoffs.
## Xk
## 0 1 2
## 130114 90466 25620
Finally, we can use the hidden Markov model created above to generate knockoffs.
## Hk
## 0 1
## 351155 141245
If you want to see some basic usage of SNPknock
, see the introductory vignette.
If you want to learn about SNPknock
for large genome-wide association studies (Sesia et al. 2019), see https://msesia.github.io/knockoffzoom/.
Scheet, P., and M. Stephens. 2006. “A Fast and Flexible Statistical Model for Large-Scale Population Genotype Data: Applications to Inferring Missing Genotypes and Haplotypic Phase.” Am. J. Hum. Genet. 78:629–44. https://doi.org/10.1086/502802.
Sesia, M., E. Katsevich, S. Bates, E. Candès, and C. Sabatti. 2019. “Multi-Resolution Localization of Causal Variants Across the Genome.” bioRxiv. Cold Spring Harbor Laboratory. https://doi.org/10.1101/631390.
Sesia, M., C. Sabatti, and E. J. Candès. 2019. “Gene Hunting with Hidden Markov Model Knockoffs.” Biometrika 106:1–18. https://doi.org/10.1093/biomet/asy033.