Inference on Recombination and Block Structure Using Unphased Data

Research output: Contribution to journal; Journal article; Research; peer-reviewed

This study discusses compatibility with a tree for unphased genotype data. If the data are compatible with a tree, they are consistent with the assumption that no recombination occurred in the sample's evolutionary history. In that case there is said to be a solution to the perfect phylogeny problem; i.e., a pair of haplotypes can be defined for each individual such that the set of all haplotypes can be explained without invoking recombination. A new algorithm is derived to decide whether or not a sample is compatible with a tree. The new algorithm relies on an equivalence relation between sites that mutually determine each other's phase. (The previous algorithm was based on advanced graph-theoretical tools.) The equivalence relation is also used to derive the number of solutions to the perfect phylogeny problem. Further, a series of statistics, RMj, j ≥ 2, is defined. These can be used to detect recombination events in the sample's history and to divide the sample into regions that are each compatible with a tree. The new statistics are applied to real data from human genes, and the results are discussed with reference to recent suggestions that recombination in the human genome is highly heterogeneous.
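To make the notion of tree compatibility for unphased data concrete, the following is a minimal brute-force sketch, not the algorithm derived in the article. It assumes a hypothetical genotype encoding (0 and 1 for the two homozygous states, 2 for a heterozygous site), enumerates every way of resolving each individual into an unordered pair of haplotypes, and uses the classical four-gamete test on the resulting haplotypes as a stand-in criterion for compatibility with a tree when ancestral states are unknown. The function names are illustrative only, and the running time grows exponentially with the number of heterozygous sites, so it is suitable only for very small examples.

```python
from itertools import combinations, product


def passes_four_gamete_test(haplotypes):
    """Return True if no pair of sites shows all four gametes 00, 01, 10, 11."""
    if not haplotypes:
        return True
    n_sites = len(haplotypes[0])
    for i, j in combinations(range(n_sites), 2):
        gametes = {(h[i], h[j]) for h in haplotypes}
        if len(gametes) == 4:
            return False
    return True


def phasings(genotype):
    """Yield each unordered haplotype pair consistent with one genotype.

    Assumed encoding: 0 and 1 are homozygous, 2 is heterozygous.
    """
    het = [k for k, g in enumerate(genotype) if g == 2]
    base = [0 if g == 2 else g for g in genotype]
    if not het:
        yield tuple(base), tuple(base)
        return
    # Fix the phase of the first heterozygous site so that each
    # unordered pair is generated exactly once.
    first, rest = het[0], het[1:]
    for bits in product((0, 1), repeat=len(rest)):
        h1, h2 = base[:], base[:]
        h1[first], h2[first] = 0, 1
        for site, b in zip(rest, bits):
            h1[site], h2[site] = b, 1 - b
        yield tuple(h1), tuple(h2)


def count_tree_compatible_phasings(genotypes):
    """Count joint phasings whose haplotypes pass the four-gamete test."""
    count = 0
    for choice in product(*(list(phasings(g)) for g in genotypes)):
        haps = [h for pair in choice for h in pair]
        if passes_four_gamete_test(haps):
            count += 1
    return count


if __name__ == "__main__":
    # One double heterozygote plus three homozygotes at two sites.
    sample = [(2, 2), (0, 0), (0, 1), (1, 0)]
    print(count_tree_compatible_phasings(sample))
```

Under these assumptions, a count of zero means that no phasing of the sample passes the four-gamete test, while a positive count is the number of distinct haplotype-pair assignments that do. The equivalence relation described in the abstract obtains this number without enumeration.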

Original language: English
Journal: Genetics
Volume: 166
Issue number: 1
Pages (from-to): 537-545
Number of pages: 9
ISSN: 0016-6731
Publication status: Published - 1 Jan 2004
Externally published: Yes

ID: 203902508