Adaptive action utilization in reinforcement learning from real-world multi-agent demonstrations

Keisuke Fujii; Kazushi Tsutsui; Atom Scott; Hiroshi Nakahara; Naoya Takeishi; Yoshinobu Kawahara

doi:10.11517/pjsai.JSAI2024.0_1E5GS504

38th (2024)

Session ID : 1E5-GS-5-04

Conference information

Host: The Japanese Society for Artificial Intelligence

Name : The 38th Annual Conference of the Japanese Society for Artificial Intelligence

Number : 38

Date : May 28, 2024 - May 31, 2024

Adaptive action utilization in reinforcement learning from real-world multi-agent demonstrations

*Keisuke FUJII, Kazushi TSUTSUI, Atom SCOTT, Hiroshi NAKAHARA, Naoya TAKEISHI, Yoshinobu KAWAHARA

Keywords: Reinforcement learning, Machine Learning, Sports, Deep Learning

Abstract

When real-world biological multi-agent systems are modeled with reinforcement learning, there is a domain gap between the source (real-world data) and the target (the reinforcement learning environment), so the target dynamics have to be adapted to the unknown source dynamics. In this study, as a domain adaptation method for multi-agent reinforcement learning from real-world demonstrations, we propose a reinforcement learning method that uses information obtained by adapting source actions to target actions in a supervised manner. In limited settings, namely a 2-vs-1 chase-and-escape task and 2-vs-2 and 4-vs-8 soccer scenarios, we show that, compared to the baseline, the agents learned to imitate the demonstrations and to obtain rewards.
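
As a rough illustration of what adapting source actions to target actions could look like, the sketch below labels each demonstrated transition with the target-environment action whose simulated outcome is closest to what was observed. This is not the authors' method: the 1-D positions, the three discrete actions, and the names `TARGET_ACTIONS`, `target_step` and `adapt_actions` are all assumptions made for the example.

```python
# Hypothetical toy setup (not from the paper): 1-D positions and three
# discrete target actions, each moving the agent by a fixed displacement.
TARGET_ACTIONS = {0: -1.0, 1: 0.0, 2: 1.0}  # action id -> displacement per step


def target_step(position, action_id):
    """Assumed target-environment dynamics: shift by a fixed displacement."""
    return position + TARGET_ACTIONS[action_id]


def adapt_actions(demo_positions):
    """Label each demonstrated transition with the target action whose
    simulated next position is closest to the demonstrated next position."""
    labels = []
    for current, observed_next in zip(demo_positions, demo_positions[1:]):
        best = min(
            TARGET_ACTIONS,
            key=lambda a: abs(target_step(current, a) - observed_next),
        )
        labels.append(best)
    return labels


# A made-up demonstration whose step sizes do not match the target actions.
demo = [0.0, 0.7, 1.9, 2.0, 1.2]
labels = adapt_actions(demo)
print(labels)
```

Labels produced this way could, for instance, serve as targets for a supervised loss term alongside the usual reinforcement learning objective. Whether the proposed method combines them in this manner is not stated in the abstract.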
