Distance and Parsimony methods (Part 2)

Parsimony-based methods

Character-based data: 4 nucleotides ACGT or 20 aminoacids => matrix of aligned characters
It does not rely on models of evolution
One seeks the tree that minimizes the amount of evolutionary change required to explain the data
Justification:
- Ockham’s razor: when two hypothesis provide equally valid explanations for a phenomenon, the simpler one should always be preferred
- More character-state changes imply a more complex hypothesis because homoplasy (sharing identical character states that cannot be explained by inheritance from a common ancestor) is an ad hoc hypothesis
Parsimony represents a useful fall back method when model-based methods cannot be used due to computational limitations

Assumptions

Parsimony methods are most effective when rate of evolution is slow, but this is not a necessary assumption
Parsimony methods can perform well under high rates of evolution as long as there are no pathological inequalities (long-branch attraction: Felsenstein zone)
The only real assumption of parsimony is independence among characters

Methodology

Determine the amount of character change required to explain the data by a given tree
Search over all possible tree topologies

We need to be able to calculate the length of a proposed tree which is defined as the amount of character change implied by a most parsimonious reconstruction of internal nodes
Just as in MSA, we need to have costs for substitutions (equal costs or unequal costs)

Example: Evaluate the length of the ((W,Y),(X,Z)); tree given the site:

W:G
X:C
Y:A
Z:C

Full solution: see this YouTube video
Note that this is only one site! We need to repeat this process for every site and add up the lengths
More on Newick (parenthetical) format here

Just as in MSA, we cannot do this by hand and there are dynamic programming algorithms that help us (what was dynamic programming?):

Fitch algorithm (HB Box 8.2) for equal costs
Sankoff algorithm (HB Box 8.1) for unequal costs

Fitch algorithm

1) Root the tree in a random place (parsimony score is not affected by the root)

2) Calculate the state-set $X_i$ for each internal node $i$ corresponding the set of states that can be assigned to each node so that the minimum possible length of the subtree can be achieved. Let $L(i)$ and $R(i)$ be the left and right child descendant nodes of $i$ respectively.

2.1) Form the intersection of the two child state sets: $X_{L(i)} \cap X_{R(i)}$

2.2) If the intersection is non-empty, set $X_i$ equal to this intersection and the accumulated length for this node as the sum of the accumulated lengths for the two child nodes: $s_i=s_{L(i)}+s_{R(i)}$

2.3) If the intersection is empty, let $X_i$ be equal to the union of the two child sets: $X_{L(i)} \cup X_{R(i)}$ and set the accumulated length for this node as the sum of the accumulated lengths for the two child nodes plus one: $s_i=s_{L(i)}+s_{R(i)}+1$

Example: Evaluate the length of the ((W,Y),(X,Z)); tree given the site:

W:G
X:C
Y:A
Z:C

using the Fitch algorithm.

Full solution: see this YouTube video
Homework: Redo the algorithm with different root positions to verify that you get the same length

Phylogenetic inference: Maximum Parsimony (MP) tree: Step 1) Evaluate the parsimony score of a given tree (length) with Fitch algorithm. Step 2) Search the space of trees until you find the optimum.

Some downsides

Parsimony methods have been shown to produce inconsistent trees
Read more in Felsenstein 1978