TY - JOUR
T1 - Cherry picking in forests: A new characterization for the unrooted hybrid number of two phylogenetic trees
AU - Huber, Katharina
AU - Linz, Simone
AU - Moulton, Vincent
N1 - Data sharing statement: Data sharing not applicable to this article as no datasets were generated or analysed during
the current study.
PY - 2025/5/20
Y1 - 2025/5/20
N2 - Phylogenetic networks are a special type of graph which generalize phylogenetic trees and that are used to model non-treelike evolutionary processes such as recombination and hybridization. In this paper, we consider {\em unrooted} phylogenetic networks, i.e. simple, connected graphs $\cN =(V,E)$ with leaf set $X$, for $X$ some set of species, in which every internal vertex in $\cN$ has degree three. One approach used to construct such phylogenetic networks is to take as input a collection $\cP$ of phylogenetic trees and to look for a network $\cN$ that contains each tree in $\cP$ and that minimizes the quantity $r(\cN) = |E|-(|V|-1)$ over all such networks. Such a network always exists, and the quantity $r(\cN)$ for an optimal network $\cN$ is called the {\em hybrid number of $\cP$}. In this paper, we give a new characterization for the hybrid number in case $\cP$ consists of two trees. This characterization is given in terms of a {\em cherry picking sequence} for the two trees, although to prove that our characterization holds we need to define the sequence more generally for two forests. Cherry picking sequences have been intensively studied for collections of {\em rooted} phylogenetic trees, but our new sequences are the first variant of this concept that can be applied in the unrooted setting. Since the hybrid number of two trees is equal to the well-known {\em tree bisection and reconnection distance} between the two trees, our new characterization also provides an alternative way to understand this important tree distance.
AB - Phylogenetic networks are a special type of graph which generalize phylogenetic trees and that are used to model non-treelike evolutionary processes such as recombination and hybridization. In this paper, we consider {\em unrooted} phylogenetic networks, i.e. simple, connected graphs $\cN =(V,E)$ with leaf set $X$, for $X$ some set of species, in which every internal vertex in $\cN$ has degree three. One approach used to construct such phylogenetic networks is to take as input a collection $\cP$ of phylogenetic trees and to look for a network $\cN$ that contains each tree in $\cP$ and that minimizes the quantity $r(\cN) = |E|-(|V|-1)$ over all such networks. Such a network always exists, and the quantity $r(\cN)$ for an optimal network $\cN$ is called the {\em hybrid number of $\cP$}. In this paper, we give a new characterization for the hybrid number in case $\cP$ consists of two trees. This characterization is given in terms of a {\em cherry picking sequence} for the two trees, although to prove that our characterization holds we need to define the sequence more generally for two forests. Cherry picking sequences have been intensively studied for collections of {\em rooted} phylogenetic trees, but our new sequences are the first variant of this concept that can be applied in the unrooted setting. Since the hybrid number of two trees is equal to the well-known {\em tree bisection and reconnection distance} between the two trees, our new characterization also provides an alternative way to understand this important tree distance.
KW - TBR distance
KW - cherry picking sequence
KW - forest
KW - hybrid number
KW - phylogenetic network
UR - http://www.scopus.com/inward/record.url?scp=105008414413&partnerID=8YFLogxK
U2 - 10.46298/dmtcs.11633
DO - 10.46298/dmtcs.11633
M3 - Article
SN - 1462-7264
VL - 272
JO - Discrete Mathematics and Theoretical Computer Science
JF - Discrete Mathematics and Theoretical Computer Science
IS - 2
M1 - 15
ER -