Showing 1–9 of 9 results for author: Matsen, F A

Searching in archive cs.
  1. arXiv:2408.05058  [pdf, other]

    stat.ML cs.LG

    Variational Bayesian Phylogenetic Inference with Semi-implicit Branch Length Distributions

    Authors: Tianyu Xie, Frederick A. Matsen IV, Marc A. Suchard, Cheng Zhang

    Abstract: Reconstructing the evolutionary history relating a collection of molecular sequences is the main subject of modern Bayesian phylogenetic inference. However, the commonly used Markov chain Monte Carlo methods can be inefficient due to the complicated space of phylogenetic trees, especially when the number of sequences is large. An alternative approach is variational Bayesian phylogenetic inference…

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 26 pages, 7 figures

  2. arXiv:2204.07747  [pdf, other]

    stat.ML cs.LG

    A Variational Approach to Bayesian Phylogenetic Inference

    Authors: Cheng Zhang, Frederick A. Matsen IV

    Abstract: Bayesian phylogenetic inference is currently done via Markov chain Monte Carlo (MCMC) with simple proposal mechanisms. This hinders exploration efficiency and often requires long runs to deliver accurate posterior estimates. In this paper, we present an alternative approach: a variational framework for Bayesian phylogenetic analysis. We propose combining subsplit Bayesian networks, an expressive g…

    Submitted 22 May, 2024; v1 submitted 16 April, 2022; originally announced April 2022.

  3. arXiv:1811.11007  [pdf, other]

    q-bio.PE cs.DS

    Systematic Exploration of the High Likelihood Set of Phylogenetic Tree Topologies

    Authors: Chris Whidden, Brian C. Claywell, Thayer Fisher, Andrew F. Magee, Mathieu Fourment, Frederick A. Matsen IV

    Abstract: Bayesian Markov chain Monte Carlo explores tree space slowly, in part because it frequently returns to the same tree topology. An alternative strategy would be to explore tree space systematically, and never return to the same topology. In this paper, we present an efficient parallelized method to map out the high likelihood set of phylogenetic tree topologies via systematic search, which we show…

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: 25 pages, 16 figures

  4. arXiv:1611.02351  [pdf, ps, other]

    cs.DM

    Chain Reduction Preserves the Unrooted Subtree Prune-and-Regraft Distance

    Authors: Chris Whidden, Frederick A. Matsen IV

    Abstract: The subtree prune-and-regraft (SPR) distance metric is a fundamental way of comparing evolutionary trees. It has wide-ranging applications, such as to study lateral genetic transfer, viral recombination, and Markov chain Monte Carlo phylogenetic inference. Although the rooted version of SPR distance can be computed relatively efficiently between rooted trees using fixed-parameter-tractable algori…

    Submitted 7 November, 2016; originally announced November 2016.

    Comments: 15 pages, 5 figures. Split from arXiv:1511.07529 and revised as a conference paper after feedback suggested that work was too long

  5. arXiv:1606.08893  [pdf, other]

    cs.DS

    Efficiently Inferring Pairwise Subtree Prune-and-Regraft Adjacencies between Phylogenetic Trees

    Authors: Chris Whidden, Frederick A. Matsen IV

    Abstract: We develop a time-optimal $O(mn^2)$-time algorithm to construct the subtree prune-regraft (SPR) graph on a collection of m phylogenetic trees with n leaves. This improves on the previous bound of $O(mn^3)$. Such graphs are used to better understand the behaviour of phylogenetic methods and recommend parameter choices and diagnostic criteria. The limiting factor in these analyses has been the diffi…

    Submitted 26 April, 2017; v1 submitted 28 June, 2016; originally announced June 2016.

    Comments: 21 pages, 3 figures. Revised in response to peer review

  6. arXiv:1511.07529  [pdf, ps, other]

    cs.DS q-bio.PE

    Calculating the Unrooted Subtree Prune-and-Regraft Distance

    Authors: Chris Whidden, Frederick A. Matsen IV

    Abstract: The subtree prune-and-regraft (SPR) distance metric is a fundamental way of comparing evolutionary trees. It has wide-ranging applications, such as to study lateral genetic transfer, viral recombination, and Markov chain Monte Carlo phylogenetic inference. Although the rooted version of SPR distance can be computed relatively efficiently between rooted trees using fixed-parameter-tractable maximum…

    Submitted 3 November, 2017; v1 submitted 23 November, 2015; originally announced November 2015.

    Comments: 21 double-column pages, 11 figures. Revised in response to peer review. The sections introducing socket forests and on chain reduction were spun off into a conference-length paper arXiv:1611.02351 to reduce the length and complexity of the manuscript

  7. arXiv:1504.00304  [pdf, other]

    cs.DM cs.CE q-bio.PE

    Ricci-Ollivier Curvature of the Rooted Phylogenetic Subtree-Prune-Regraft Graph

    Authors: Chris Whidden, Frederick A. Matsen IV

    Abstract: Statistical phylogenetic inference methods use tree rearrangement operations to perform either hill-climbing local search or Markov chain Monte Carlo across tree topologies. The canonical class of such moves are the subtree-prune-regraft (SPR) moves that remove a subtree and reattach it somewhere else via the cut edge of the subtree. Phylogenetic trees and such moves naturally form the vertices an…

    Submitted 3 November, 2015; v1 submitted 1 April, 2015; originally announced April 2015.

    Comments: 17 2-column pages, 6 figures, 2 tables. To appear in the Proceedings of the Thirteenth Workshop on Analytic Algorithmics and Combinatorics (ANALCO)

  8. arXiv:1205.6867  [pdf, other]

    q-bio.PE cs.DM

    Minimizing the average distance to a closest leaf in a phylogenetic tree

    Authors: Frederick A. Matsen, Aaron Gallagher, Connor McCoy

    Abstract: When performing an analysis on a collection of molecular sequences, it can be convenient to reduce the number of sequences under consideration while maintaining some characteristic of a larger collection of sequences. For example, one may wish to select a subset of high-quality sequences that represent the diversity of a larger collection of sequences. One may also wish to specialize a large datab…

    Submitted 31 August, 2012; v1 submitted 30 May, 2012; originally announced May 2012.

    Comments: Please contact us with any comments or questions!

  9. arXiv:1109.5423  [pdf, other]

    q-bio.PE cs.DS

    Reconciling taxonomy and phylogenetic inference: formalism and algorithms for describing discord and inferring taxonomic roots

    Authors: Frederick A. Matsen, Aaron Gallagher

    Abstract: Although taxonomy is often used informally to evaluate the results of phylogenetic inference and find the root of phylogenetic trees, algorithmic methods to do so are lacking. In this paper we formalize these procedures and develop algorithms to solve the relevant problems. In particular, we introduce a new algorithm that solves a "subcoloring" problem for expressing the difference between the tax…

    Submitted 1 October, 2011; v1 submitted 25 September, 2011; originally announced September 2011.

    Comments: Version submitted to Algorithms for Molecular Biology. A number of fixes from previous version