iEvoBio Submission

Quantifying Information in a Phylogenetic Tree: The Robustness of Phylogenetic Signal

Since Felsenstein (1985)’s influential paper on phylogenetic contrasts, phylogenetically-based inferences of evolutionary patterns have gotten increasingly more complex (Hansen, 1997; Butler and King, 2004; Freckleton and Harvey, 2006; O’Meara et al., 2006; Pagel and Meade, 2006). More and more sophisticated models are applied to datasets of species mean traits on a phylogenetic tree in an attempt to answer such questions as reconstructing ancestral states, identify changes in rates of character evolution across time and across clades, identify the number and location of evolutionary optima or the strength of stabilizing selection. In each of these analyses, an ultrametric phylogenetic tree is the only source of temporal information to quantify these intrinsically temporally-based questions.

The extent to which a phylogenetic tree can reliably shed light into the evolution of traits in the past depends not only on its size but on its structure. For instance, an unresolved or star phylogeny contains no information and renders phylogenetic techniques uninformative, regardless of the number of taxa. While the question of phylogenetic signal is not new (Price, 1997; Losos, 1996), there has been insufficient analysis of when a phylogenetic signal is substantial enough to inform the kinds of inferences we now routinely perform.

I will present examples of how typical phylogenetic analyses are applied to trees that are insufficiently informative to choose between models. I will discuss ways to detect and quantify the power of a phylogenetic tree relative to common methods. I will introduce an actively developing open source software package for addressing these issues. In the spirit of the conference, this research and code are being developed and released in the open under GPL-v3.0 and Creative Commons by-sa licenses.

NESCent proposal style topics:

Possible topics

Comparative Methods Workflow hackathon

Interfaces between programs
Extensibility of existing software
Redundancy of approaches
User / developer dialog and interface

Reliability: evaluating the robustness of current methods

Model choice criteria
Robustness to assumptions in tree, etc

Future of methods

Bayesian implementations
High-dimensional data
databases, standards

Research

Seminars

Discussing doi:10.1126/science.1084786 Harmon et al. (2003) in PDG today.
Considering how best to set up a Mendeley or Cite-U-Like collection for PDG. Ideally find a system where we might share individual comments or a collective rating of the paper.

Mendeley Public Collection
Mendeley Shared Collection
Cite-U-Like group

Notes

inclusion embedding (of Nescent ideas) done using curly brace transclusion, and

<onlyinclude> </onlyinclude>
<noinclude> </noinclude>

See wikimedia information for more details.