Collection of Python and R modules to investigate the genetic mechanisms driving inherited behavioral traits (like pointing) in dogs. Initial round of analyses will be conducted using Illumina Canine HD 173K SNP array.
A pointing breed is a type of gundog typically used in finding game. The name pointer comes from the dog's instinct to point, by stopping and aiming its muzzle towards game. Pointers were selectively bred for dogs who had abundant pointing and backing instinct. They typically start to acquire their hunting instincts at about 2 months of age. We hope to interrogate the canine genome using high-density SNP genotyping arrays to find genetic regions under selective pressure that may be associated with pointing instinct.
- Compute pairwise genetic distances between each breed and every other breed
- E.g. for
n
total breeds, computen - 1
pairwise Fst measures using 173k Arrays
- E.g. for
- Calculate breed specific normalized Fst measures (d(i) from Akey et al 2010) for each pointing breed using two approaches:
- Using all pointer vs. non-pointer comparisons. Differentiated markers in this set presumably include regions under selection for pointing instinct and regions under selection for other breed-specific traits.
- Using all pointer vs. pointer comparisons. Regions associated with pointing instinct should be undifferentiated in these comparisons while regions corresponding to breed specific traits should still show differentiation.
- To remove breed-specific signatures of selection, the d(i) statistics for pointer vs. pointer comparisons will be subtracted from the d(i) statistics resulting from the comparisons with non-pointing breeds to produce a new set of adjusted d(i) values.
The elimination of standing variation in regions linked to a recently fixed beneficial mutation is known as a “selective sweep”.
Easier to locate. Use these to narrow search for selective sweeps.
- Document better
- Add Makefile to make pipeline reproducible
- Mine NGS data to look for overlapping regions of homozygosity