Visualizing High-density Clusters in Multidimensional Data

  • The analysis of multidimensional multivariate data has been studied in various research areas for many years. The goal of the analysis is to gain insight into the specific properties of the data by scrutinizing the distribution of the records at large and finding clusters of records that exhibit correlations among the dimensions or variables. As large data sets become ubiquitous but the screen space for displaying is limited, the size of the data sets exceeds the number of pixels on the screen. Hence, we cannot display all data values simultaneously. Another problem occurs when the number of dimensions exceeds three dimensions. Displaying such data sets in two or three dimensions, which is the usual limitation of the displaying tools, becomes a challenge. The main approach consists of two major steps: clustering and visualizing. In the clustering step, we propose two clustering algorithms to construct hierarchical density clusters. In the visualizing step, we propose two methods to visually analyze the hierarchical density clusters. An optimized star coordinates approach is used to project high-dimensional data into the (two- or three-dimensional) visual space, in which the leaf clusters of hierarchical density clusters (well-separated in the original data space) are projected into visual space with minimizing the overlapping. The second method, we developed to visualize the hierarchical density cluster tree, combines several information visualization techniques in linked and embedded displays: radial layout for hierarchical structures, linked parallel coordinates, and embedded circular parallel coordinates. By combining cluster analysis with star coordinates or parallel coordinates, we extend these visualization techniques to cluster visualizations. We display clusters instead of data points. The advantage of this combination is scalability with both the size and dimensions of data set.

Download full text

Cite this publication

  • Export Bibtex
  • Export RIS

Citable URL (?):

Search for this publication

Search Google Scholar Search Catalog of German National Library Search OCLC WorldCat Search Catalog of GBV Common Library Network Search Catalog of Jacobs University Library Search Bielefeld Academic Search Engine
Meta data
Publishing Institution:IRC-Library, Information Resource Center der Jacobs University Bremen
Granting Institution:Jacobs Univ.
Author:Van Long Tran
Referee:Lars Linsen, Adalbert F. X. Wilhelm, Daniel Keim
Advisor:Lars Linsen
Persistent Identifier (URN):urn:nbn:de:101:1-2013051611109
Document Type:PhD Thesis
Date of Successful Oral Defense:2009/12/11
Year of Completion:2009
Date of First Publication:2010/01/27
PhD Degree:Computer Science
School:SES School of Engineering and Science
Library of Congress Classification:Q Science / QA Mathematics (incl. computer science) / QA71-90 Instruments and machines / QA75.5-76.95 Electronic computers. Computer science / QA76.9.I52 Information visualization
Call No:Thesis 2009/39

$Rev: 13581 $