More about HKUST
Inductive Graph Inference: Challenges and Solutions
PhD Thesis Proposal Defence Title: "Inductive Graph Inference: Challenges and Solutions" by Mr. Meng QIN Abstract: For various complex systems, graphs serve as a generic abstract of the available entities and associated relations using a set of nodes and edges. Most graph inference tasks have both transductive and inductive settings. Compared with transductive inference, which only focuses on currently observed graph information, inductive graph inference is a more advanced setting that can transfer knowledge learned from known topology to new unseen nodes or graphs. However, the original designs of several graph inference techniques (e.g., link prediction and community detection) only focus on transductive settings. Some of them are also inapplicable to their inductive settings. This thesis studies several challenging inductive graph inference tasks seldom considered in related research, which covers both inference settings for new unseen nodes and across graphs. First, we consider inductive temporal link prediction (TLP) for weighted dynamic graphs with non- fixed node sets, which should (i) determine the existence of future links, (ii) predict associated link weights, and (iii) deal with the variation of node sets. To support this challenging task, we develop an inductive dynamic embedding aggregation (IDEA) method based on novel designs of (i) a stacked GNN-RNN cell, (ii) an adaptive embedding aggregation module, and (iii) a hybrid training objective regarding adversarial learning and scale difference minimization. Experiments on public datasets of various scenarios demonstrate that IDEA can handle the variation of node sets and derive high-quality prediction results for weighted dynamic graphs. Second, we consider community detection (CD) and explore the potential of an inductive scheme across graphs to achieve a better trade-off between the inference quality and efficiency of CD. Following this scheme, we propose an inductive community detection (ICD) method. It first (i) conducts the offline training of an advanced graph neural network (GNN) model on existing known graphs and then (ii) generalizes the model to new graphs for fast online inference without retraining. Novel designs of (i) an adversarial dual GNN and (ii) a clustering regularization objective also enable ICD to capture the permutation-invariant label information of CD during the offline training. Experiments on various synthetic and real benchmarks demonstrate that ICD can achieve a significant trade-off between the inference quality and efficiency of CD. Date: Monday, 27 May 2024 Time: 3:00pm - 5:00pm Venue: Room 4475 Lifts 25/26 Committee Members: Prof. Dit-Yan Yeung (Supervisor) Prof. Raymond Wong (Chairperson) Dr. Long Chen Dr. Dan Xu