The inference of demographic history from genome data is hindered by a lack of
efficient computational approaches. In particular, it has proved difficult to
exploit the information contained in the distribution of genealogies across the
genome. We have previously shown that the generating function (GF) of genealogies
can be used to analytically compute likelihoods of demographic models from configurations
of mutations in short sequence blocks (Lohse et al. 2011). Although the GF has
a simple, recursive form, the size of such likelihood calculations explodes quickly
with the number of individuals, and applications of this framework have so far
been mainly limited to small samples (pairs and triplets) for which the GF can
be written by hand. Here we investigate several strategies for exploiting the
inherent symmetries of the coalescent. In particular, we show that the GF of genealogies
can be decomposed into a set of equivalence classes that allows likelihood calculations
from nontrivial samples. Using this strategy, we automated blockwise likelihood
calculations for a general set of demographic scenarios in Mathematica. These
histories may involve population size changes, continuous migration, discrete
divergence, and admixture between multiple populations. To give a concrete example,
we calculate the likelihood for a model of isolation with migration (IM), assuming
two diploid samples without phase and outgroup information. We demonstrate the
new inference scheme with an analysis of two individual butterfly genomes from
the sister species Heliconius melpomene rosina and H. cydno.
Author list:
Konrad Lohse
Martin Chmelik
Simon Martin
Nicholas H Barton
DOI: 10.1534/genetics.115.183814
Issue: 2
Volume: 202
Date: 2016
Language: eng
Publisher: Genetics Society of America
Title: Efficient strategies for calculating blockwise likelihoods under the coalescent
