More about HKUST
A Family of Algorithms to Characterize Association in Click Model for Web Search
PhD Thesis Proposal Defence Title: "A Family of Algorithms to Characterize Association in Click Model for Web Search" by Mr. Weizhu Chen ABSTRACT: One of the major changes in the past decade is the heavy use of search engines which generates large-scale user activity data in Web search. These data have been in turn contributing to many critical Web tasks, such as optimizing search or sponsored results. As the most informative and reliable source of user action, click is believed to be the most important user activity in the data. Understanding the motivation behind a click or the decision making process to trigger it becomes the key to uncover the myth encoded in the data. Yet, user click behaviour is complex, varying with users and implicit under various contexts. This poses challenges to characterize a click comprehensively. Many recent research works have attempted to model user click behaviour in a structural manner and positioned it as a click model problem, with the intention to better exploit user click behaviour so as to predict user click or estimate a user-perceived relevance for each query-document pair. Despite of their success, most existing click models treat the modelled objects, such as queries, users, sessions, in isolation, disregarding their relationships. This may bring a simplification to the model but simultaneously sacrifices much valuable information, and hence interpret user click data in an inaccurate or biased way. This thesis proposal puts forward a family of algorithms to address these limitations. Our object is to characterize multiple associations among objects as well as design novel collective click models, where multiple modelled objects and their relationships are involved and associated together. This algorithm family will depict the associations from three facets: region-based, query-based and user-based associations. Region-based models focus on the interplaying between organic search and sponsored search, so that it can depict user behaviour in the whole page thoroughly. Query-based models first collectively investigate the multiple queries with their corresponding clicks in a same session by designing a session-based click model. Then, it scrutinizes and uses the rich information in the high-frequent queries to alleviate the sparseness of long-tailed queries. User-based models characterize the user-centric click behaviour to design a personalized click model to entertain each individual user. These user-based models can also be tailored to better solve the sparseness challenges in the long-tailed queries. Finally, an ongoing feasibility study with extensive investigations showcases the practicality of the associative click models and future works are proposed to solve the problem to a next level. Date: Friday, 25 November 2011 Time: 1:00pm - 3:00pm Venue: Room 3304 lifts 17/18 Committee Members: Prof. Qiang Yang (Supervisor) Prof. Dit-Yan Yeung (Chairperson) Prof. Dik-Lun Lee Dr. Raymond Wong **** ALL are Welcome ****