Saturday, September 19, 2009

Genealogical Application of 23andMe data

I don't pretend to understand this   -   but - in case anyone does, here it is.    But, while I don't understand it I do think that I can tell enough of what they're saying to give me hope for some genealogical findings from or 23andMe test results.    Additionally, I don't expect this to be the exclusive protocol for genealogical study using 23andMe data.

Discovering Distant Relatives within a Diverse Set of Populations Using DNA Segments Identical by Descent. L. Hon, B. M. Henn, J. M. Macpherson, N. Eriksson, A. Wojcicki, L. Avey, S. Saxonov, J. L. Mountain 23andMe, Inc, Mountain View, CA.

Close familial relationships, such as parent-offspring relationships, are inferred readily from genotype and allele frequency data for a limited number of single nucleotide polymorphisms(SNPs) or short tandem repeat (STR) polymorphisms.    However, researchers have not focused on accurately inferring the degree of relationship between individuals who share a common ancestor 3-10 generations ago;   such inference requires high density genome-wide information not available until recently.    In order to characterize relatively distant relationships for a sample, we analyzed the sharing of DNA identical by descent (IBD) in a large database of self-selected individuals of European ancestry, including a subset identified as Ashkenazim.    Individuals from more endogamous ethnic populations, such as the Ashkenazim, share on average more DNA identically by descent than do individuals of Asian ancestry or of European ancestry broadly defined.    Extensive population-level sharing of identical genomic segments complicates the prediction of relationship level for a pair of individuals since identical segments may reflect common ancestry older than the most recent ancestor for a pair of individuals in a pedigree.    In order to understand the pattern of the observed population-level sharing, we simulated extended pedigrees using different populations to calculate the expected amounts of sharing for 1st through 10th cousins.    Specifically we assessed the relationship between the length of the longest segment and the most recent common ancestor.    From the simulations, we also determined bounds for predicted cousinships given a specific amount of segmental sharing.    Using these bounds as a guide, we detected at least one distant relative, between 2nd-8th degree cousin, for 90% of individuals in our dataset of more than 5000 individuals with European ancestry.    An even higher fraction of our Ashkenazi sample, 99%, had at least one distant relative between 2nd-7th degree cousinship in our dataset.

0 Comments:

Post a Comment

<< Home