figment posted on 2025-3-30 09:48:31

http://reply.papertrans.cn/17/1622/162162/162162_51.png

项目 posted on 2025-3-30 14:45:50

http://reply.papertrans.cn/17/1622/162162/162162_52.png

烧瓶 posted on 2025-3-30 16:40:31

https://doi.org/10.1007/978-3-030-59908-9

re in the action space can improve sample efficiency during exploration. To show this, we focus on concurrent action spaces, where the RL agent selects multiple actions per timestep. Concurrent action spaces are challenging to learn in, especially if the number of actions is large, as this can lead to a

门闩 posted on 2025-3-30 22:35:25

http://reply.papertrans.cn/17/1622/162162/162162_54.png
View full version: Titlebook: Artificial Intelligence XXXVI; 39th SGAI Internatio Max Bramer,Miltos Petridis Conference proceedings 2019 Springer Nature Switzerland AG 2