Parallel Inference of Phylogenetic Stands with Gentrius.

Anastasis Togkousidis,Olga Chernomor,Alexandros Stamatakis

IPDPS Workshops(2023)

引用 0|浏览2
暂无评分
摘要
Multi-locus datasets are frequently used to infer phylogenies instead of using single locus. Missing data constitute a common challenge in such datasets as they can lead to stands, that is, sets of trees that are compatible with the incomplete per-locus trees. Under many common criteria the trees from one stand have identical score. Hence, identifying stands and determining their sizes is of crucial importance for a robust phylogenetic analysis. Recently, Chernomor et al. published Gentrius, a branch-and-bound algorithm that enumerates all stand trees given a set of unrooted incomplete locus trees. Despite its efficiency, the pattern and proportion of missing data in multi-locus datasets can still induce extremely long execution times. Here, we introduce the parallel version of the Gentrius algorithm. Our parallelization deploys a thread-pooling mechanism that maintains threads that finish early in busy-wait mode, such that they can contribute to solving long-running tasks. Thereby, we substantially reduce load imbalance and attain high parallel efficiency. Our performance assessment up to 16 cores yields linear parallel speedups on both, simulated, and empirical data. The parallel version of Gentrius is available as open source code under GNU GPL at https://github.com/togkousa/iqtree2/tree/terragen. All data we used for our analyses, are available for download at https://cme.h-its.org/exelixis/material/gentrius-parallel.tar.gz.
更多
查看译文
关键词
phylogenetic stands, phylogenetic terraces, multi-locus data, parallel computing, shared memory parallelism
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要