The following data were taken from the segregating sites in a nucleotide sequence from the Y chromosome of 355 Europeans. Sixteen segregating sites and 11 different alleles were found. At each site, 0 represents the ancestral variant (as observed in the majority of a reasonably large sample of chimpanzees) and 1 the derived variant. The alleles observed and their frequencies are given below.



Alleles (rows) by segregating site (columns):
1   2   3   4   5   6   7   8   9   10   11   12   13   14   15   16
C   0   1   0   0   1   0   0   1   1   0   0   0   0   1   0   0
E   0   1   0   0   1   0   1   0   0   0   0   0   0   0   0   0
F   0   1   0   0   1   1   0   0   0   0   0   0   0   0   0   0
G   0   1   0   0   1   0   0   1   1   0   0   0   0   1   0   1
I   0   1   0   0   1   0   0   1   1   0   0   0   1   0   0   0
J   0   1   0   0   1   0   0   1   1   1   1   1   0   0   0   0
K   0   1   1   1   0   0   0   0   0   0   0   0   0   0   0   0
L   0   1   0   0   1   0   0   1   1   0   0   0   0   1   1   0
N   1   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
Q   0   1   0   0   1   0   0   0   0   0   0   0   0   0   0   0
R   0   1   0   0   1   0   0   1   0   0   0   0   0   0   0   0


Calculate the matrix giving the Hamming distance between each pair of alleles.
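The Hamming distance between two alleles is the number of segregating sites at which they differ. A minimal sketch of the calculation follows, with the haplotypes transcribed from the table as 16-character strings. The names `ALLELES`, `hamming`, `distance_matrix` and `DIST` are illustrative choices and do not come from the exercise.

```python
# Haplotypes transcribed from the allele table: one character per
# segregating site (sites 1..16), 0 = ancestral, 1 = derived.
ALLELES = {
    "C": "0100100110000100",
    "E": "0100101000000000",
    "F": "0100110000000000",
    "G": "0100100110000101",
    "I": "0100100110001000",
    "J": "0100100111110000",
    "K": "0111000000000000",
    "L": "0100100110000110",
    "N": "1000000000000000",
    "Q": "0100100000000000",
    "R": "0100100100000000",
}


def hamming(a: str, b: str) -> int:
    """Number of sites at which two equal-length haplotypes differ."""
    if len(a) != len(b):
        raise ValueError("haplotypes must have the same length")
    return sum(x != y for x, y in zip(a, b))


def distance_matrix(alleles: dict) -> dict:
    """Nested dict d[p][q] holding the Hamming distance between alleles p and q."""
    names = list(alleles)
    return {p: {q: hamming(alleles[p], alleles[q]) for q in names} for p in names}


DIST = distance_matrix(ALLELES)

# Print the symmetric matrix with allele labels on both axes.
names = list(ALLELES)
print("   " + " ".join(f"{q:>2}" for q in names))
for p in names:
    print(f"{p:>2} " + " ".join(f"{DIST[p][q]:>2}" for q in names))
```

The matrix is symmetric with zeros on the diagonal, so only the upper triangle needs to be reported.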

Calculate the nucleotide diversity.
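One common way to compute the nucleotide diversity is as the mean number of pairwise differences between sampled sequences, weighting each pair of alleles by how often it occurs in the sample. The sketch below assumes that definition (per sequence; dividing by the sequence length in nucleotides would give the per-site value). The allele counts must be supplied from the data. The toy haplotypes and counts used here are hypothetical and serve only to show the call.

```python
from itertools import combinations


def hamming(a: str, b: str) -> int:
    """Number of sites at which two equal-length haplotypes differ."""
    if len(a) != len(b):
        raise ValueError("haplotypes must have the same length")
    return sum(x != y for x, y in zip(a, b))


def mean_pairwise_differences(haplotypes: dict, counts: dict) -> float:
    """Average Hamming distance over all n*(n-1)/2 pairs of sampled sequences.

    haplotypes: allele name -> 0/1 string
    counts:     allele name -> number of sampled individuals carrying it
    """
    n = sum(counts.values())
    if n < 2:
        raise ValueError("need at least two sampled sequences")
    # Pairs of identical alleles contribute zero, so only distinct alleles matter.
    total = sum(
        counts[p] * counts[q] * hamming(haplotypes[p], haplotypes[q])
        for p, q in combinations(counts, 2)
    )
    return total / (n * (n - 1) / 2)


# Hypothetical toy data (NOT the exercise data): three alleles, four sequences.
toy_haplotypes = {"A": "00", "B": "01", "C": "11"}
toy_counts = {"A": 2, "B": 1, "C": 1}
toy_pi = mean_pairwise_differences(toy_haplotypes, toy_counts)
print(toy_pi)
```

For the exercise, the same function would be called with the 11 haplotypes from the table and the observed allele counts, which should sum to 355.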

Carry out the Tajima test to check whether the data are consistent with the Wright-Fisher model.
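The test statistic can be sketched as follows, assuming the standard form of Tajima's $D$: the difference between the mean number of pairwise differences and the estimate $S/a_1$ based on the number of segregating sites $S$, divided by an estimate of its standard deviation. The function names are illustrative, and the notation in the course notes (for example $\hat{\theta}_L$) may be defined differently from the quantities used here.

```python
from math import sqrt


def harmonic_sums(n: int):
    """Return a1 = sum of 1/i and a2 = sum of 1/i^2 for i = 1..n-1."""
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i**2 for i in range(1, n))
    return a1, a2


def tajimas_d(n: int, s: int, pi: float) -> float:
    """Tajima's D from sample size n, segregating sites s, and
    mean number of pairwise differences pi (per sequence)."""
    if n < 2 or s < 1:
        raise ValueError("need n >= 2 and at least one segregating site")
    a1, a2 = harmonic_sums(n)
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n**2 + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)
    # Numerator: pairwise-difference estimate minus segregating-sites estimate.
    return (pi - s / a1) / sqrt(e1 * s + e2 * s * (s - 1))


# Segregating-sites estimate for the exercise data (n = 355, S = 16).
a1_355, _ = harmonic_sums(355)
print("S / a1 =", 16 / a1_355)
```

With the nucleotide diversity from the previous question, `tajimas_d(355, 16, pi)` gives the statistic, which is then compared with the critical values used in the course. A positive value corresponds to the pairwise-difference estimate exceeding the segregating-sites estimate.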

Consider the following effects: i) population growth, ii) directional selection, iii) divisions within the population.

(i) Population growth is unlikely to be the dominant factor: $\hat{\theta}_L < \hat{\theta}_\pi$, which suggests a declining rather than a growing population. (Chapter 1; Page 32)
(iii) Because $\hat{\theta}_L < \hat{\theta}_\pi$, division within the population is possible, but the migration rate would have to be very high, since we do not reject $H_0$ (the Wright-Fisher model). (Chapter 3; Page 11)