Skip to contents

TipInstability() calculates the instability of each leaf in a tree. Unstable leaves are likely to display roguish behaviour.

Usage

TipInstability(
  trees,
  log = TRUE,
  average = "mean",
  deviation = "sd",
  checkTips = TRUE,
  parallel = FALSE
)

ColByStability(
  trees,
  log = TRUE,
  average = "mean",
  deviation = "sd",
  pal = hcl.colors(131, "inferno")[1:101]
)

Arguments

trees

List of trees to analyse.

log

Logical specifying whether to log-transform distances when calculating leaf stability.

average

Character specifying whether to use "mean" or "median" tip distances to calculate leaf stability.

deviation

Character specifying whether to use "sd" or "mad" to calculate leaf stability.

checkTips

Logical specifying whether to check that tips are numbered consistently.

parallel

Logical specifying whether parallel execution should take place in C++.

pal

A vector listing a sequence of colours to be used for plotting. The earliest entries will be assigned to the most stable tips.

Value

ColByStability() returns a named character vector that assigns a colour to each leaf in trees according to their stability.

Details

Smith (2022) defines the instability of a pair of leaves as the median absolute divergence in the graph geodesic (the number of edges in the shortest path between the leaves) across all trees, normalized against the mean graph geodesic. The instability of a single leaf is the mean instability of all pairs that include that leaf; higher values characterise leaves whose position is more variable between trees.

Other concepts of leaf instability include

  • The "taxonomic instability index", as implemented in Mesquite: described by Thomson and Shaffer (2010) as \(\sum\limits_{(x, y), j \neq i}{\frac{|D~ijx~ - D~ijy~|}{(D~ijx~ - D~ijy~)^2}}\), where \(D~ijx~\) is the patristic distance (i.e. length of edges) between leaves \(i\) and \(j\) in tree \(x\).

  • the average stability of triplets (i.e. quartets including the root) that include the leaf (Thorley and Wilkinson 1999) , implemented in "Phyutility" (Smith and Dunn 2008) ; and related to "positional congruence" measures (Estabrook 1992; Pol and Escapa 2009) .

References

Estabrook GF (1992). “Evaluating undirected positional congruence of individual taxa between two estimates of the phylogenetic tree for a group of taxa.” Systematic Biology, 41(2), 172--177. doi:10.1093/sysbio/41.2.172 .

Pol D, Escapa IH (2009). “Unstable taxa in cladistic analysis: identification and the assessment of relevant characters.” Cladistics, 25(5), 515--527. doi:10.1111/j.1096-0031.2009.00258.x .

Smith MR (2022). “Using information theory to detect rogue taxa and improve consensus trees.” Systematic Biology, 71(5), 986--1008. doi:10.1093/sysbio/syab099 .

Smith SA, Dunn CW (2008). “Phyutility: a phyloinformatics tool for trees, alignments and molecular data.” Bioinformatics, 24(5), 715--716. doi:10.1093/bioinformatics/btm619 .

Thomson RC, Shaffer HB (2010). “Sparse supermatrices for phylogenetic inference: taxonomy, alignment, rogue taxa, and the phylogeny of living turtles.” Systematic Biology, 59(1), 42--58. doi:10.1093/sysbio/syp075 .

Thorley JL, Wilkinson M (1999). “Testing the phylogenetic stability of early tetrapods.” Journal of Theoretical Biology, 200(3), 343--344. doi:10.1006/jtbi.1999.0999 .

See also

Other tip instability functions: TipVolatility()

Examples

library("TreeTools", quietly = TRUE)

# Generate some trees with a rogue taxon
trees <- AddTipEverywhere(BalancedTree(8), "Rogue")[3:6]

# Display the strict consensus
plot(consensus(trees), tip.col = ColByStability(trees))

# Add a legend for the colour scale used
PlotTools::SpectrumLegend(
  "bottomleft", bty = "n", # No box
  legend = c("Unstable", "", "Stable"),
  palette = hcl.colors(131, "inferno")[1:101]
)


# Calculate leaf stability
instab <- TipInstability(trees, log = FALSE, ave = "mean", dev = "mad")

# Plot a consensus that omits the least stable leaves
plot(ConsensusWithout(trees, names(instab[instab > 0.2])))