# Computerintensive Methoden - Coalescent Theory - Project A

## Contents

The following data were taken from the segregating sites in a nucleotide sequence from the Y chromosome of 355 Europeans. Sixteen segregating sites and 11 distinct alleles were found. At each site, 0 represents the ancestral variant (as observed in the majority of a reasonably large sample of chimpanzees). The alleles observed and their frequencies are given below.

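Alleles (rows) by segregating site (columns):

| Allele | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 |
|--------|---|---|---|---|---|---|---|---|---|----|----|----|----|----|----|----|
| C | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| E | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| F | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| G | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 |
| I | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| J | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 |
| K | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| L | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 0 |
| N | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Q | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| R | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |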

Calculate the matrix of Hamming distances between each pair of alleles.

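In R (for 0/1 data the Manhattan distance equals the Hamming distance):

    # alleles: 0/1 matrix with one row per allele and one column per site
    d <- dist(alleles, method = "manhattan")
    d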
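The `dist` call relies on a matrix named `alleles` that is not defined in this section. A minimal sketch of one way to build it from the table above is shown here; the row and column names are an assumption chosen for readability, and the values are copied from the table.

```r
# Rows are alleles, columns are the 16 segregating sites (0 = ancestral variant).
alleles <- matrix(c(
  0,1,0,0,1,0,0,1,1,0,0,0,0,1,0,0,  # C
  0,1,0,0,1,0,1,0,0,0,0,0,0,0,0,0,  # E
  0,1,0,0,1,1,0,0,0,0,0,0,0,0,0,0,  # F
  0,1,0,0,1,0,0,1,1,0,0,0,0,1,0,1,  # G
  0,1,0,0,1,0,0,1,1,0,0,0,1,0,0,0,  # I
  0,1,0,0,1,0,0,1,1,1,1,1,0,0,0,0,  # J
  0,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,  # K
  0,1,0,0,1,0,0,1,1,0,0,0,0,1,1,0,  # L
  1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,  # N
  0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,  # Q
  0,1,0,0,1,0,0,1,0,0,0,0,0,0,0,0   # R
), nrow = 11, byrow = TRUE,
dimnames = list(
  c("C", "E", "F", "G", "I", "J", "K", "L", "N", "Q", "R"),
  as.character(1:16)
))

# On 0/1 data the Manhattan distance counts the sites at which two alleles differ.
d <- dist(alleles, method = "manhattan")

# Full symmetric matrix, convenient for looking up a single pair by name.
hamming <- as.matrix(d)
hamming["C", "G"]
```

Converting with `as.matrix` is optional; it only makes individual entries easier to inspect than the lower-triangle print of a `dist` object.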

Calculate the nucleotide diversity.
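As a sketch, the nucleotide diversity can be estimated as the mean number of pairwise differences in the sample, weighting each pair of alleles by how often it occurs. The function name `mean_pairwise_diff` and its arguments are hypothetical. The allele frequencies are not reproduced in this section, so they have to be supplied as `counts`; the toy input below is illustrative only and is not the project data.

```r
# Mean number of pairwise differences in a sample.
# dmat:   symmetric matrix (or dist object) of Hamming distances between alleles
# counts: number of sampled individuals carrying each allele, in the same order
mean_pairwise_diff <- function(dmat, counts) {
  dmat <- as.matrix(dmat)
  stopifnot(nrow(dmat) == length(counts))
  n <- sum(counts)
  # t(counts) %*% dmat %*% counts sums n_i * n_j * d_ij over ordered pairs (i, j),
  # so every unordered pair appears twice; the zero diagonal contributes nothing.
  total <- as.numeric(t(counts) %*% dmat %*% counts) / 2
  total / choose(n, 2)
}

# Hypothetical toy input: two alleles that differ at 3 sites,
# sampled 2 times and 1 time respectively.
toy_d <- matrix(c(0, 3,
                  3, 0), nrow = 2, byrow = TRUE)
toy_pi <- mean_pairwise_diff(toy_d, c(2, 1))
toy_pi
```

This returns the diversity for the whole sequence; dividing by the sequence length would give a per-site value, but that length is not stated here.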

Carry out the Tajima test to verify the Wright-Fisher model.
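A minimal sketch of Tajima's $D$ statistic follows, using the standard constants from Tajima's test. The function name `tajima_d` is hypothetical. The estimator $S / a_1$ is Watterson's estimator, assumed here to correspond to $\hat{\theta}_L$ in the answers below. The sample size `n = 355` and `S = 16` come from the data description, while `pi_hat = 3` is only a placeholder, since the allele frequencies needed to compute $\hat{\theta}_\pi$ are not reproduced in this section.

```r
# Tajima's D from the sample size n, the number of segregating sites S,
# and the mean number of pairwise differences pi_hat.
tajima_d <- function(n, S, pi_hat) {
  i  <- seq_len(n - 1)
  a1 <- sum(1 / i)
  a2 <- sum(1 / i^2)
  b1 <- (n + 1) / (3 * (n - 1))
  b2 <- 2 * (n^2 + n + 3) / (9 * n * (n - 1))
  c1 <- b1 - 1 / a1
  c2 <- b2 - (n + 2) / (a1 * n) + a2 / a1^2
  e1 <- c1 / a1
  e2 <- c2 / (a1^2 + a2)
  theta_w <- S / a1                                # Watterson's estimator
  D <- (pi_hat - theta_w) / sqrt(e1 * S + e2 * S * (S - 1))
  list(theta_w = theta_w, D = D)
}

# n and S are from the data description; pi_hat = 3 is a placeholder value.
res <- tajima_d(n = 355, S = 16, pi_hat = 3)
res$theta_w
res$D
```

A positive $D$ means $\hat{\theta}_\pi$ exceeds Watterson's estimator, and a negative $D$ the reverse; whether $H_0$ is rejected depends on comparing $D$ with the critical values used in the course.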

(i) Population growth might not be the dominant factor, because $\hat{\theta}_L < \hat{\theta}_\pi$, which suggests a declining population. (Chapter 1, page 32)

(iii) Because $\hat{\theta}_L < \hat{\theta}_\pi$, population subdivision would be possible, but the migration rate would have to be very high, since we accept $H_0$ (that is, the data are consistent with the Wright-Fisher model). (Chapter 3, page 11)