TY - JOUR
T1 - On asymptotic joint distributions of cherries and pitchforks for random phylogenetic trees
AU - Choi, Kwok Pui
AU - Kaur, Gursharn
AU - Wu, Taoyang
N1 - Funding Information: K.P. Choi acknowledges the support of Singapore Ministry of Education Academic Research Fund R-155-000-188-114. The work of Gursharn Kaur was supported by NUS Research Grant R-155-000-198-114.
PY - 2021/9/23
Y1 - 2021/9/23
N2 - Tree shape statistics provide valuable quantitative insights into evolutionary mechanisms underpinning phylogenetic trees, a commonly used graph representation of evolutionary relationships among taxonomic units ranging from viruses to species. We study two subtree counting statistics, the number of cherries and the number of pitchforks, for random phylogenetic trees generated by two widely used null tree models: the proportional to distinguishable arrangements (PDA) and the Yule-Harding-Kingman (YHK) models. By developing limit theorems for a version of extended Pólya urn models in which negative entries are permitted for their replacement matrices, we deduce the strong laws of large numbers and the central limit theorems for the joint distributions of these two counting statistics for the PDA and the YHK models. Our results indicate that the limiting behaviour of these two statistics, when appropriately scaled using the number of leaves in the underlying trees, is independent of the initial tree used in the tree generating process.
AB - Tree shape statistics provide valuable quantitative insights into evolutionary mechanisms underpinning phylogenetic trees, a commonly used graph representation of evolutionary relationships among taxonomic units ranging from viruses to species. We study two subtree counting statistics, the number of cherries and the number of pitchforks, for random phylogenetic trees generated by two widely used null tree models: the proportional to distinguishable arrangements (PDA) and the Yule-Harding-Kingman (YHK) models. By developing limit theorems for a version of extended Pólya urn models in which negative entries are permitted for their replacement matrices, we deduce the strong laws of large numbers and the central limit theorems for the joint distributions of these two counting statistics for the PDA and the YHK models. Our results indicate that the limiting behaviour of these two statistics, when appropriately scaled using the number of leaves in the underlying trees, is independent of the initial tree used in the tree generating process.
KW - Joint subtree distributions
KW - Limit distributions
KW - PDA model
KW - Pólya urn model
KW - Tree shape
KW - Yule-Harding-Kingman model
UR - http://www.scopus.com/inward/record.url?scp=85115355124&partnerID=8YFLogxK
U2 - 10.1007/s00285-021-01667-2
DO - 10.1007/s00285-021-01667-2
M3 - Article
SN - 0303-6812
VL - 83
JO - Journal of Mathematical Biology
JF - Journal of Mathematical Biology
IS - 4
M1 - 40
ER -