Institute for Program Structures and Data Organization (IPD), Chair Prof. Böhm

Discovering Multiple Clustering Solutions: Grouping Objects in Different Views of the Data

  • Author: Emmanuel Müller, Stephan Günnemann, Ines Färber and Thomas Seidl
  • Source:

    Proceedings of the 11th SIAM International Conference on Data Mining (SDM 2011), Mesa, USA

  • Traditional clustering algorithms identify just a single clustering of the data. Today's complex data, however, allow multiple interpretations leading to several valid groupings hidden in different views of the database. Each of these multiple clustering solutions is valuable and interesting as different perspectives on the same data and several meaningful groupings for each object are given. Especially for high dimensional data, where each object is described by multiple attributes, alternative clusters in different attribute subsets are of major interest.

    In this tutorial, we describe several real world application scenarios for multiple clustering solutions. We abstract from these scenarios and provide the general challenges in this emerging research area. We describe state-of-the-art paradigms, we highlight specific techniques, and we give an overview of this topic by providing a taxonomy of the existing clustering methods. By focusing on open challenges, we try to attract young researchers for participating in this emerging research field.