More about HKUST
Evaluation of an Entropy-based Online Sequence Learning Algorithm in Nonstationary Zero-Sum Games
The Hong Kong University of Science and Technology Department of Computer Science and Engineering FYT Presentation and Demonstration Title: "Evaluation of an Entropy-based Online Sequence Learning Algorithm in Nonstationary Zero-Sum Games" by Miss YAN, Jialei abstract: Constructing agents that can learn and adapt to changing environments is a key challenge in intelligent systems, especially in multiagent domains, with the presence of other intelligent agents which are also learning and adapting. Recently, an entropy-based online sequence learning algorithm, called Entropy Learning Pruned Hypothesis (ELPH) space, has been proposed for fast learning in nonstationary environments. This paper presents our evaluation of ELPH in zero-sum games that simulate nonstationary multiagent environments. With a properly selected observation history length of seven and a pruning threshold of 0.5, the ELPH player learns and adapts to stochastic and nonstationary deterministic agents quickly. ELPH plays at Nash equilibrium in self-plays of zero-sum games. However, it loses by a noticeable amount in plays against simulated aggressive human players. We will analyze the reasons behind the wins and losses. Date : 28 July 2008, Monday Time : 2:30pm to 3:30pm Venue : Room 3501 Advisor : Prof. D.Y. Yeung 2nd Reader : Dr. S.C. Cheung