Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection

Published in International Conference on Learning Representations 2023 (ICLR 2023, Oral — Notable Top 5%), 2023

Recommended citation: Jiajun Fan, Yuzheng Zhuang, Yuecheng Liu, Jianye Hao, Bin Wang, Jiangcheng Zhu, Hao Wang, Shu-Tao Xia. "Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection." ICLR 2023, oral (ranked 5/4176). https://openreview.net/forum?id=FeWvD0L_a4

We propose LBC (Learnable Behavior Control), a unified framework enabling significantly enlarged behavior selection space via a hybrid behavior mapping. Our agents achieved 10077.52% mean human normalized score and surpassed 24 human world records within 1B training frames, demonstrating SOTA performance with exceptional sample efficiency.