AVID: GPU-enabled Visual Analytics with GPU-FAST-PROCLUS

By Jakob Rødsgaard Jørgense, Ira Assent, and Hans-Jörg Schulz

Abstract

GPU-FAST-PROCLUS is a GPU-parallelized algorithm for projected clustering based on the \(k\)-medoids approach. It speeds up clustering to allow for real-time interaction - even for datasets of millions of items. Interactivity allows users to quickly determine sensible clustering parameters such as the number of clusters \(k\), provided a suitable visualization is available. Yet, as clustering and visualization are usually decoupled, cluster results are funneled from the GPU back to the CPU, only to be mapped onto appropriate graphics, which are then rendered on the GPU again. This introduces a bottleneck that hinders fluid interaction with clustering.

As a solution to this, we propose AVID (Analysis and Visualization In Device). Following the principle "What happens on the GPU, stays on the GPU", AVID removes the round trip to the CPU and keeps clustering results on the GPU to render them on the GPU directly. By doing so, users can interactively tune projected clustering parameters and observe the effects without noticeable delay. In our demo system, we showcase the efficiency of our data management strategies for projected clustering as well as the efficacy of data visualization.

The layout of AVID.

Resources

Below, you find links to primary resources for our paper and repository.

github paper video