# Polynomial-Time Algorithms for Phylogenetic Inference Problems

@inproceedings{Iersel2018PolynomialTimeAF, title={Polynomial-Time Algorithms for Phylogenetic Inference Problems}, author={Leo van Iersel and Remie Janssen and Mark Jones and Yukihiro Murakami and Norbert Zeh}, booktitle={AlCoB}, year={2018} }

A common problem in phylogenetics is to try to infer a species phylogeny from gene trees. We consider different variants of this problem. The first variant, called Unrestricted Minimal Episodes Inference, aims at inferring a species tree based on a model of speciation and duplication where duplications are clustered in duplication episodes. The goal is to minimize the number of such episodes. The second variant, Parental Hybridization, aims at inferring a species network based on a model of…

## Figures and Topics from this paper

## 6 Citations

The rigid hybrid number for two phylogenetic trees

- MedicineJournal of mathematical biology
- 2021

This paper characterize when two trees can be rigidly displayed by a certain type of phylogenetic network called a temporal tree-child network in terms of fork-picking sequences, and presents an infinite family of pairs of trees which demonstrates that the difference between the rigid hybrid number and the temporal-hybrid number for two phylogenetic trees on the same set of n leaves can grow at least linearly with n.

Weakly displaying trees in temporal tree-child network

- Computer Science, MathematicsArXiv
- 2020

This paper characterize when two trees can be rigidly displayed by a temporal tree-child network in terms of fork-picking sequences, a concept that is closely related to that of cherry- picking sequences.

Minimizing genomic duplication episodes

- Computer Science, MedicineComput. Biol. Chem.
- 2020

An efficient quadratic time algorithm to solve the problem of genomic duplication clustering in which input gene trees are rooted, episode locations are restricted to preserve the minimal number of single gene duplications, and clustering rules are described by minimum episodes method.

Heading in the right direction? Using head moves to traverse phylogenetic network space

- Mathematics, Biology
- 2018

It is proved that finding the shortest sequence of head moves between two networks is NP-hard and makes head moves a good candidate for local search heuristics.

Reconstructibility of unrooted level-k phylogenetic networks from distances

- Mathematics, Computer ScienceAdv. Appl. Math.
- 2020

It is proved that level-$1$ networks and level-$2 networks are reconstructible from their shortest distances and multisets of distances, respectively, and it is shown that, in general, networks of level higher than~$1$ are not reconstructable from shortest distances or from their multiset of distances.

Solving Phylogenetic Network Containment Problems using Cherry-picking Sequences

- Mathematics
- 2018

Phylogenetic networks are used to represent evolutionary scenarios in biology and linguistics. To find the most probable scenario, it may be necessary to compare candidate networks: to distinguish…

## References

SHOWING 1-10 OF 36 REFERENCES

From Gene Trees to Species Trees

- Biology, Computer ScienceSIAM J. Comput.
- 2000

This paper studies various algorithmic issues in reconstructing a species tree from gene trees under the duplication and the mutation cost model and proposes a heuristic method that is significantly better than the existing program in Page's GeneTree 1.0 that starts the search from a random tree.

Computing the minimum number of hybridization events for a consistent evolutionary history

- Computer Science, MathematicsDiscret. Appl. Math.
- 2007

The main results of this paper show that computing this smallest number of hybridization events required to explain a given (input) set of data in a single (hybrid) phylogeny is APX- hard, and thus NP-hard, in the case the input is a collection of phylogenetic trees on sets of present-day species.

A Reduction Algorithm for Computing The Hybridization Number of Two Trees

- Computer Science, MedicineEvolutionary bioinformatics online
- 2007

A new reduction-based algorithm for computing the minimum number of hybridization events, when the initial data set consists of two trees, which always gives the exact solution and runs efficiently on many real biological problems.

Efficient Algorithms for Genomic Duplication Models

- Computer Science, MedicineIEEE/ACM Transactions on Computational Biology and Bioinformatics
- 2018

This work studies the method of clustering called minimum episodes for several models of allowed evolutionary scenarios with a focus on interval models in which every gene duplication has an interval consisting of allowed locations in the species tree.

On the Multiple Gene Duplication Problem

- Computer ScienceISAAC
- 1998

It is shown that the general form of this problem is NP-hard and various parameterized versions are hard for the complexity class W[1].

Reconciliation with Non-Binary Species Trees

- Computer Science, MedicineJ. Comput. Biol.
- 2008

This work presents the first formal algorithm for reconciling binary gene trees with non-binary species trees under a duplication-loss parsimony model, and presents a dynamic programming algorithm for a combined loss model, in which losses in sibling species may be represented as a single loss in the common ancestor.

Reconstruction of ancient molecular phylogeny.

- Biology, MedicineMolecular phylogenetics and evolution
- 1996

Under the assumption that differences among gene trees can be explained by gene duplications, and consequent losses, it is developed a method to obtain the global phylogeny minimizing the total number of postulated duplications and losses and to trace back such individual gene duplication to global genome duplications.

Linear-Time Algorithms for the Multiple Gene Duplication Problems

- Biology, MedicineIEEE/ACM Transactions on Computational Biology and Bioinformatics
- 2011

Two variations of the MULTIPLE GENE DUPLICATION problems: the EPISODE-CLUSTERING (EC) problem and the MINIMUM EPISODES (ME) problem are studied and an optimal linear-time algorithm is proposed.

Hybridization Number on Three Rooted Binary Trees is EPT

- Computer Science, MathematicsSIAM J. Discret. Math.
- 2016

The techniques generalize to more than three input trees with the exception of one key lemma that maps nodes in the network to tree nodes and, thus, minimizes the amount of guessing involved in constructing the network.

Locating Multiple Gene Duplications through Reconciled Trees

- Biology, Computer ScienceRECOMB
- 2008

The first exact and efficient algorithm that determines a minimum number of locations for gene duplication events from the gene trees on the species tree is introduced, which can provide hypotheses for precise locations of large-scale gene duplication Events with data from relatively few gene trees.