More about HKUST
Combining Automated Analysis with Interactive Visualizations for Spatio-temporal Data Analysis
PhD Thesis Proposal Defence
Title: "Combining Automated Analysis with Interactive Visualizations for
Spatio-temporal Data Analysis"
by
Mr. Dongyu LIU
Abstract:
The rapid advances in sensing technologies and large-scale computing
infrastructures lead to explosive growth in data. Spatio-temporal data, as
one of the ubiquitous data types, is increasingly collected and
extensively studied in various scientific domains such as geology,
climatology, sociology, epidemiology, and transportation science. This
kind of data is distinct from other data types due to the simultaneous
presence of spatial and temporal dimensions, which increase analysis
complexity substantially. Purely automatic data analysis techniques,
therefore, are insufficient to handle such complexity immaculately. Humans
not only have inherently good senses for perceiving space and time, but
also have creativity, flexibility, and domain expertise. Hence, an
appropriate manner to involve these humans' traits into the automatic data
analysis process would be tremendously helpful.
In this thesis, we introduce the basic idea of combining automatic data
analysis techniques with interactive visualizations in the context of
spatio-temporal data analysis and discuss the main challenges. We present
two advanced interactive visual analysis techniques that are developed on
the basis of intelligent mining models to demonstrate the strength of such
a combination. In particular, we first study the problem of billboard
location selection using massive taxi trajectory data. The problem is
tough because it not only has vast solution search space but also involves
multiple factors to judge the optimal. The former requires a mass of
computing that would be too large for a human to effectively tackle alone,
while the latter highly demands the involvement of humans as every person
may have different criteria. To tackle the challenges, we present
SmartAdP, a system combining a visualization-driven mining model with
several well-designed visualization and interaction techniques, to
facilitate creating and comparing multiple solutions in an efficient and
human-steerable manner. Secondly, an interactive visualization system,
TPFlow, is presented for exploring large-scale multidimensional
spatio-temporal data. Spatio-temporal patterns at different granularity
levels are usually hidden within different subsets of data. In TPFlow, we
model the spatio-temporal dataset as a tensor and propose a novel
tensor-based algorithm to support automated tensor (dataset) partitioning
and multidimensional pattern extraction simultaneously. Built upon the
algorithm, the TPFlow system features a novel combination of visualization
and interaction designs to facilitate pattern discovery, comparison, and
verification.
The effectiveness and usefulness of the above techniques are all validated
through case studies on real-world datasets from various application
domains and interviews of domain experts. Notice that the proposed
techniques are not limited to the presented example application scenarios
but can be easily adapted to other applications with similar problems as
well.
Date: Friday, 3 May 2019
Time: 10:00am - 12:00noon
Venue: Room CYTG003 (CYT Building)
(lifts 35/36)
Committee Members: Prof. Huamin Qu (Supervisor)
Prof. Tin-Yau Kwok (Chairperson)
Prof. Cunsheng Ding
Dr. Qifeng Chen
**** ALL are Welcome ****