More about HKUST
Exploring Deep Learning Architectures for Spatiotemporal Sequence Forecasting
PhD Thesis Proposal Defence
Title: "Exploring Deep Learning Architectures for Spatiotemporal Sequence
Forecasting"
by
Mr. Xingjian SHI
Abstract:
Spatiotemporal systems are common in the real world. Forecasting the
multi-step future of these spatiotemporal systems based on the past
observations, or, Spatiotemporal Sequence Forecasting (STSF), is a
significant and challenging problem. Due to the complex spatial and
temporal relationships within the data and the potential long forecasting
horizon, it is challenging to design appropriate Deep Learning (DL)
architectures for STSF. In this thesis proposal, we explore DL
architectures for STSF. We first define the STSF problem and classify it
into three subcategories: Trajectory Forecasting of Moving Point Cloud
(TF-MPC), STSF on Regular Grid (STSF-RG), and STSF on Irregular Grid
(STSF-IG). We then propose architectures for STSF-RG and STSF-IG problems.
For STSF-RG problems, we propose the Convolutional Long-Short Term Memory
(ConvLSTM) and the Trajectory Gated Recurrent Unit (TrajGRU). ConvLSTM
uses convolution in both the input-state and state-state transitions of
LSTM and is better at capturing the spatiotemporal correlations that the
Fully-connected LSTM (FC-LSTM). TrajGRU improves upon ConvLSTM by actively
learning the recurrent connection structure, which achieves better
prediction performance with less number of parameters. To better
investigate the effectiveness of our proposed architectures and other DL
models for STSF-RG, we choose to tackle the precipitation nowcasting
problem, which is a representative STSF-RG problem with huge real-world
impact. By incorporating ConvLSTM into an Encoder-Forecaster (EF)
structure, we propose the first machine learning based solution to
precipitation nowcasting that outperforms the operational algorithm. To
facilitate future researches for this problem and gauge the state of the
art, we propose the first large-scale benchmark for precipitation
nowcasting: HKO-7. HKO-7 has new evaluation metrics and has both the
offline setting and the online setting in the evaluation protocol.
For STSF-IG problems, we convert the sparsely distributed observations
into a spatiotemporal graph and utilize graph convolution operators, or
graph aggregators, to build the model. We propose a new graph aggregator
called Gated Attention Network (GaAN). GaAN not only uses multiple
attention heads to aggregate information from the neighborhoods but also
uses another set of gates to control each attention head's importance.
With experiments on two large-scale inductive node classification
datasets, we show that GaAN outperforms the baseline graph aggregators.
Also, we propose a unified framework called Graph GRU (GGRU), which
transforms any valid graph convolution operators, or graph aggregators, to
RNNs that are designed for STSF-IG.
Date: Thursday, 30 August 2018
Time: 2:00pm - 4:00pm
Venue: Room 5501
(lifts 25/26)
Committee Members: Prof. Dit-Yan Yeung (Supervisor)
Dr. Yangqiu Song (Chairperson)
Dr. Raymond Wong
Prof. Nevin Zhang
**** ALL are Welcome ****