Exact Algorithms and Experiments for Hierarchical Tree Clustering

Authors

  • Sepp Hartung University of Jena
  • Jiong Guo Universität des Saarlandes
  • Christian Komusiewicz University of Jena
  • Rolf Niedermeier University of Jena
  • Johannes Uhlmann University of Jena

DOI:

https://doi.org/10.1609/aaai.v24i1.7684

Keywords:

fixed-parameter algorithmics, hierarchical clustering

Abstract

We perform new theoretical as well as first-time experimental studies for the NP-hard problem to find a closest ultrametric for given dissimilarity data on pairs. This is a central problem in the area of hierarchical clustering, where so far only polynomial-time approximation algorithms were known. In contrast, we develop efficient preprocessing algorithms (known as kernelization in parameterized algorithmics) with provable performance guarantees and a simple search tree algorithm. These are used to find optimal solutions. Our experiments with synthetic and biological data show the effectiveness of our algorithms and demonstrate that an approximation algorithm due to Ailon and Charikar [FOCS 2005] often gives (almost) optimal solutions.

Downloads

Published

2010-07-03

How to Cite

Hartung, S., Guo, J., Komusiewicz, C., Niedermeier, R., & Uhlmann, J. (2010). Exact Algorithms and Experiments for Hierarchical Tree Clustering. Proceedings of the AAAI Conference on Artificial Intelligence, 24(1), 457-462. https://doi.org/10.1609/aaai.v24i1.7684