More about HKUST
Learning Approach for Querying Neural Graph Databases
PhD Thesis Proposal Defence
Title: "Learning Approach for Querying Neural Graph Databases"
by
Mr. Zihao WANG
Abstract:
Neural Graph Databases (NGDB) augment classic graph databases with neural
representations that embed both features and relations, but this integration
also introduces new and complex challenges in handling logical queries. The
logical calculus required to answer queries is not readily found in continuous
spaces, making querying such NGDB a highly intricate and non-trivial task. An
important example of neural graph databases is knowledge graph with relation
and entity representations, which are then our key research object.
The first part of this thesis discusses learning approaches for tree-formed
queries. For such queries, the logical queries are solved by set operators over
geometric embedding spaces of a fixed dimension, where only constant data
complexity is required. We introduce the large-scale query-answering framework,
demonstrating the challenge of generalizing such approaches in the
combinatorial spaces of logical queries due to the conflict between rigorous
logical reasoning and geometric embeddings. To overcome this, we first
understand the truth value matrix between queries and all entities from a
matrix-decomposition perspective. We then show that the fuzzy logical t-norm
operations can be integrated into the embeddings under specific conditions.
This leads to developing a new metric space inspired by unbalanced optimal
transport for more combinatorically generalizable query answering and its
computation with convolution operations.
In the second part of the thesis, we take one step back from the Tree Form (TF)
queries and turn to Existential First Order (EFO) queries. We present
syntactical and complexity gaps between the TF and EFO queries, which motivate
further research in learning methods beyond tree-formed queries. With an
abstract interface of link predictors, the query answering process of EFO query
can be regarded as an optimization process but requires at least linear data
complexity. We then propose a learning-to-optimization framework to answer such
queries. Compared to direct global optimization, we introduce four-types the
one-hop inference of single predicate whose results are then combined by a
graph neural network for global solution. Notably, we derive the closed-form
formulations of the one-hop inferences for widely used six KG embeddings, which
further facilitate the inference process.
Date: Friday, 2 August 2024
Time: 10:00am - 12:00noon
Venue: Room 5506
Lifts 25/26
Committee Members: Dr. Yangqiu Song (Supervisor)
Prof. Dit-Yan Yeung (Chairperson)
Prof. Raymond Wong
Prof. Xiaofang Zhou