More about HKUST
An Associative Characterization of Click Models in Web Search
The Hong Kong University of Science and Technology Department of Computer Science and Engineering PhD Thesis Defence Title: "An Associative Characterization of Click Models in Web Search" By Mr. Weizhu Chen Abstract Web search has become a fundamental means for massive users to find information. As a result, huge amounts of user interaction data are generated and acting as a valuable source for many web tasks. An important task is to understand user preference of each query-document pair based on their click behavior, so as to allow search engines to deliver better search results to effectively serve their users. In this thesis, we study the problem of modeling user click behavior in Web search, which is often formalized as a click model problem. Click models can automatically infer user-perceived relevance for each search result. This in turn enforces the search engines to deliver better search results. In the context of a click, there are multiple objects: user, query, session, task, search result, page region, etc. Previous models generally treat each object in isolation, disregarding their associations by only considering individual queries and search results. This may bring an over-simplification to a model but sacrifice valuable associative information. The main contribution of this thesis is a family of models and algorithms to address these limitations via modeling the associations between these objects. The proposed model and algorithm family characterizes the associations from six facets. We first put forward a whole-page model to describe the interplay between organic search and sponsored search. We then propose a session-based model and an intent-bias model to study multiple queries with their corresponding clicks collectively. We then introduce a user-based model to complement query and document with user and characterize this triple relationship. We continue with a novel noise-aware model to capture the noise of a click by leveraging above objects as its context. Finally, we provide a new solution to combine multiple proposed click models together and solve a relevance prediction challenge. Furthermore, we verify all the proposed models through extensive experiments using large-scale data collected from a commercial search engine. Experimental results demonstrate the significant improvements over the state-of-the-art. Date: Thursday, 2 August 2012 Time: 10:00am – 12:00noon Venue: Room 3501 Lifts 25/26 Chairman: Prof. Chi-Ming Chan (ENVR) Committee Members: Prof. Qiang Yang (Supervisor) Prof. Dik-Lun Lee Prof. Dit-Yan Yeung Prof. Rong Zheng (ISOM) Prof. Michael Lyu (Comp. Sci. & Engg., CUHK) **** ALL are Welcome ****