ISilDR
Isometric Seriation-based Dimensionality Reduction for Visual Cluster Analysis
Abstract: Visual cluster analysis is a central task in exploring multidimensional (MD) data, typically supported by Dimensionality Reduction (DR) techniques that spatialize MD data similarities as point patterns in scatterplots. However, these layouts are often limited by unavoidable distortions—specifically false neighbors (where separated clusters appear to overlap) and missing neighbors (where true clusters are split into falsely separated groups). In this work, the authors propose Isometric Seriation-based Dimensionality Reduction (ISilDR), a family of techniques that provably generate at most missing neighbors. This contrasts with orthogonal linear projections (OLP) like Principal Component Analysis (PCA), which never produce missing neighbors but do produce false neighbors. ISilDR creates a seriation of MD data points (an ordering along a 1D projection axis) and spaces consecutive points by their MD distance. An mD ISilDR can be obtained by combining m 1D ISilDRs. The authors provide a systematic, formal analysis using epsilon-neighborhood graphs to study the theoretical and empirical characteristics of ISilDR and OLP variants. They derive rules for discovering true MD cluster patterns by interactively linking ISilDR and OLP coordinated layouts, illustrating through case studies how this combination enables more trustworthy visual cluster analysis.
