Inference in two dimensions: Allele frequencies versus lengths of shared sequence blocks

Barton NH, Etheridge A, Kelleher J, Véber A. 2013. Inference in two dimensions: Allele frequencies versus lengths of shared sequence blocks. Theoretical Population Biology. 87(1), 105–119.


Journal Article | Published | English

Scopus indexed
Author
Barton, Nicholas H; Etheridge, Alison; Kelleher, Jerome; Véber, Amandine
Department
Abstract
We outline two approaches to inference of neighbourhood size, N, and dispersal rate, σ², based on either allele frequencies or on the lengths of sequence blocks that are shared between genomes. Over intermediate timescales (10–100 generations, say), populations that live in two dimensions approach a quasi-equilibrium that is independent of both their local structure and their deeper history. Over such scales, the standardised covariance of allele frequencies (i.e. pairwise F_ST) falls with the logarithm of distance, and depends only on neighbourhood size, N, and a 'local scale', κ; the rate of gene flow, σ², cannot be inferred. We show how spatial correlations can be accounted for, assuming a Gaussian distribution of allele frequencies, giving maximum likelihood estimates of N and κ. Alternatively, inferences can be based on the distribution of the lengths of sequence that are identical between blocks of genomes: long blocks (>0.1 cM, say) tell us about intermediate timescales, over which we assume a quasi-equilibrium. For large neighbourhood size, the distribution of long blocks is given directly by the classical Wright-Malécot formula; this relationship can be used to infer both N and σ². With small neighbourhood size, there is an appreciable chance that recombinant lineages will coalesce back before escaping into the distant past. For this case, we show that if genomes are sampled from some distance apart, then the distribution of lengths of blocks that are identical in state is geometric, with a mean that depends on N and σ².
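The abstract refers to the classical Wright-Malécot formula without stating it. As a rough illustration only, the sketch below evaluates one commonly quoted two-dimensional form, F(x) ≈ K_0(x√(2μ)/σ) / (N + ln(σ/(κ√(2μ)))), valid for distances x larger than κ. This parameterisation, the mutation rate `mu`, and the function names `bessel_k0` and `wright_malecot_f` are assumptions made for the example and are not taken from this record; the paper itself should be consulted for the exact expressions used.

```python
import math


def bessel_k0(z, steps=2000):
    """Modified Bessel function K_0(z), z > 0, from the integral
    K_0(z) = integral over t in [0, inf) of exp(-z * cosh(t))."""
    if z <= 0:
        raise ValueError("z must be positive")
    # Past this point the integrand is below exp(-50), so truncate there.
    upper = math.acosh(1.0 + 50.0 / z)
    h = upper / steps
    # Trapezoid rule: half weight on the two end points.
    total = 0.5 * (math.exp(-z) + math.exp(-z * math.cosh(upper)))
    for i in range(1, steps):
        total += math.exp(-z * math.cosh(i * h))
    return total * h


def wright_malecot_f(x, N, kappa, sigma, mu):
    """Assumed classical form of identity by descent at distance x > kappa.

    N     : neighbourhood size
    kappa : local scale
    sigma : dispersal distance per generation (sigma**2 is the dispersal rate)
    mu    : mutation rate (hypothetical parameter for this illustration)
    """
    if x <= kappa:
        raise ValueError("the approximation is only meant for x > kappa")
    scale = sigma / math.sqrt(2.0 * mu)  # characteristic length sigma / sqrt(2 mu)
    return bessel_k0(x / scale) / (N + math.log(scale / kappa))


if __name__ == "__main__":
    # Illustrative parameter values only; they are not estimates from the paper.
    for distance in (2.0, 5.0, 10.0, 20.0, 50.0):
        f = wright_malecot_f(distance, N=20.0, kappa=1.0, sigma=1.0, mu=1e-4)
        print(distance, f)
```

For distances well below σ/√(2μ), K_0(z) behaves like −ln(z/2) minus the Euler-Mascheroni constant, which is why identity in this assumed form declines roughly with the logarithm of distance, in line with the behaviour the abstract describes.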
Publishing Year
2013
Date Published
2013-08-01
Journal Title
Theoretical Population Biology
Volume
87
Issue
1
Page
105 - 119
IST-REx-ID

Cite this

Barton NH, Etheridge A, Kelleher J, Véber A. Inference in two dimensions: Allele frequencies versus lengths of shared sequence blocks. Theoretical Population Biology. 2013;87(1):105-119. doi:10.1016/j.tpb.2013.03.001
Barton, N. H., Etheridge, A., Kelleher, J., & Véber, A. (2013). Inference in two dimensions: Allele frequencies versus lengths of shared sequence blocks. Theoretical Population Biology. Elsevier. https://doi.org/10.1016/j.tpb.2013.03.001
Barton, Nicholas H, Alison Etheridge, Jerome Kelleher, and Amandine Véber. “Inference in Two Dimensions: Allele Frequencies versus Lengths of Shared Sequence Blocks.” Theoretical Population Biology. Elsevier, 2013. https://doi.org/10.1016/j.tpb.2013.03.001.
N. H. Barton, A. Etheridge, J. Kelleher, and A. Véber, “Inference in two dimensions: Allele frequencies versus lengths of shared sequence blocks,” Theoretical Population Biology, vol. 87, no. 1. Elsevier, pp. 105–119, 2013.
Barton, Nicholas H., et al. “Inference in Two Dimensions: Allele Frequencies versus Lengths of Shared Sequence Blocks.” Theoretical Population Biology, vol. 87, no. 1, Elsevier, 2013, pp. 105–19, doi:10.1016/j.tpb.2013.03.001.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Main File(s)
Access Level
OA Open Access
Date Uploaded
2018-12-12
MD5 Checksum
9bf9d9a6fd03dd9df50906891f393bf8
Access Level
OA Open Access
Date Uploaded
2018-12-12
MD5 Checksum
2bceddb76edacd0cd5fad73051e2a928

