More about HKUST
Enhancing Recommender Systems with Rich Side Information
PhD Thesis Proposal Defence
Title: "Enhancing Recommender Systems with Rich Side Information"
by
Mr. Huan ZHAO
Abstract:
Collaborative filtering (CF) based methods have become the most popular
technique for recommender systems (RSs). In recent years, various types of
side information such as social connections among users and metadata of
items have been introduced into CF and shown to be effective for improving
recommendation performance. Moreover, side information helps to alleviate
data sparsity and cold start problems of conventional CF. However,
previous works process different types of information separately, thus
losing information that might exist across different types of side
information.
In this proposal, we explore methods to enhance RS with various side
information. We start with the incorporation of one important type of side
information, i.e., social connections among users, into a state-of-the-art
matrix factorization (MF) method, Local LOw Rank Matrix Approximation
(LLORMA), and propose our Social LOcal Matrix Approximation (SLOMA).
Experimental results on two real-world datasets demonstrate the
superiority of SLOMA to LLORMA in the rating prediction task.
Next, we study the application of Heterogeneous Information Network (HIN),
which offers a flexible representation for different types of information,
to enhance CF based recommendation methods. Since HIN could be a complex
graph representing multiple types of relations existing between two entity
types, we need to tackle two challenging issues facing HIN-based RSs: How
to capture the complex semantics driving the similarities between users
and items in a HIN, and how to fuse the heterogeneous information to
support recommendation. We propose to apply metagraph to HIN-based RSs to
overcome the former problem and the ``matrix factorization (MF) +
factorization machine (FM)'' framework for the latter. For the MF part, we
obtain the user-item similarity matrix from each metagraph and then apply
low-rank matrix approximation to obtain latent features for both users and
items. For the FM part, we propose to apply FM with Group lasso (FMG) on
the features obtained from the MF part to train the recommendation model
and at the same time identify the useful metagraphs. Experimental results
on two large real-world datasets, i.e., Amazon and Yelp, show that our
proposed approach is better than FM and other state-of-the-art HIN-based
recommendation methods.
Finally, besides metagraph, we further propose Motif Enhanced MetaPath
(MEMP) based similarities between users and items in HIN-based RSs. Motif
is a local structure that can capture higher-order relations among nodes
in homogeneous graphs. We argue that such higher-order relations also
exist among nodes of same types in HIN. Thus, existing metapath based
similarities can also be enhanced by integrating these motif-based
higher-order relations. After computing the MEMP based similarities
between users and items, our proposed ``MF+FM'' framework is adopted to
fuse the similarities and for rating prediction. Experiments have been
conducted on two real-word datasets, Epinions and CiaoDVD, the results
demonstrate the effectiveness of MEMP in HIN-based RSs.
Date: Monday, 22 October 2018
Time: 3:30pm - 5:30pm
Venue: Room 2408
(lifts 17/18)
Committee Members: Prof. Dik-Lun Lee (Supervisor)
Dr. Yangqiu Song (Chairperson)
Dr. Wilfred Ng
Prof. Nevin Zhang
**** ALL are Welcome ****