More about HKUST
Theory and Practice: Graph Representation Learning For Subgraph Isomorphisms And Substructure-Augmented Applications
The Hong Kong University of Science and Technology Department of Computer Science and Engineering PhD Thesis Defence Title: "Theory and Practice: Graph Representation Learning For Subgraph Isomorphisms And Substructure-Augmented Applications" By Mr. Xin LIU Abstract: Graphs are one of the most widely used data structures to model various domains, with subgraph matching being a fundamental and critical demand. In graph theory, subgraph matching involves identifying subgraph isomorphisms between two graphs, whose decision problem of existence and counting is known to be NP-complete. A more challenging task is to locate all isomorphic subgraphs and their corresponding mappings. Numerous enumeration and approximation algorithms have been developed for subgraph isomorphisms, but their generality and scalability are limited by computational complexity or memory usage. To overcome these issues, we explore different graph representation techniques from the perspective of learning. We highlight the constraints of current graph kernels for subgraph isomorphism counting and suggest modifications to enhance performance. To improve the generalization capabilities of graph representations, we introduce a deep learning framework that integrates various neural representation learning architectures to simulate searching and pruning for subgraph isomorphism counting and matching. We demonstrate that graph neural networks consistently outperform sequence models, with the sum pooling operation playing a significant role. Moreover, we develop an efficient dynamic memory network that improves the perception of the global graph structure and strengthens the transfer capability through pretraining on synthetic data. Additionally, we successfully apply the ideas of neural subgraph matching to anonymous recommendation systems: we extract frequent attribute patterns from transition graphs and utilize them to enhance the user intent capture, attribute preference estimation, and long-period recommendation. We also conduct a theoretical analysis of the shortcomings of existing message passing graph neural networks when dealing with heterogeneous multigraphs. By examining the connection between the edge-to-vertex transform and the duality of isomorphisms in heterogeneous multigraphs, we propose an advanced dual message passing mechanism for learning edge representations, leading to comprehensive graph representations for subgraph matching and unsupervised node classification. We also show that incorporating a simple dummy node that connects all existing vertices allows for the preservation of graph information without loss during transformation and learning. These structures directly boost the performance of subgraph matching and graph classification, and the dual graph can further improve the measurement of substructure similarity. Extensive experimental results highlight the potential of our proposed neural framework and algorithms in addressing subgraph isomorphism problems, particularly in large-scale graphs and patterns. And applying them to downstream tasks for enhancing the awareness of substructures can yield direct performance benefits. Date: Thursday, 29 June 2023 Time: 4:00pm - 6:00pm Venue: Room 4475 lifts 25/26 Chairperson: Prof. Qing CHEN (MAE) Committee Members: Prof. Yangqiu SONG (Supervisor) Prof. Raymond WONG Prof. Dan XU Prof. Can YANG (MATH) Prof. Sinno PAN (CUHK)