duplicate 发表于 2025-3-23 11:01:31

http://reply.papertrans.cn/83/8260/825929/825929_11.png

Fulminate 发表于 2025-3-23 17:09:45

http://reply.papertrans.cn/83/8260/825929/825929_12.png

打折 发表于 2025-3-23 21:03:46

http://reply.papertrans.cn/83/8260/825929/825929_13.png

Hectic 发表于 2025-3-23 23:10:55

http://reply.papertrans.cn/83/8260/825929/825929_14.png

别名 发表于 2025-3-24 02:27:45

http://reply.papertrans.cn/83/8260/825929/825929_15.png

戏法 发表于 2025-3-24 07:01:11

Book 2024ions. Starting from a uniform mathematical framework, this book derives the theory of modern reinforcement learning systematically and introduces all mainstream reinforcement learning algorithms such as PPO, SAC, and MuZero. It also covers key technologies of GPT training such as RLHF, IRL, and PbRL

护身符 发表于 2025-3-24 11:06:48

http://reply.papertrans.cn/83/8260/825929/825929_17.png

fibroblast 发表于 2025-3-24 15:51:11

http://reply.papertrans.cn/83/8260/825929/825929_18.png

Absenteeism 发表于 2025-3-24 22:52:10

http://reply.papertrans.cn/83/8260/825929/825929_19.png

AER 发表于 2025-3-25 02:47:59

http://reply.papertrans.cn/83/8260/825929/825929_20.png
页: 1 [2] 3 4 5 6
查看完整版本: Titlebook: Reinforcement Learning; Theory and Python Im Zhiqing Xiao Book 2024 Beijing Huazhang Graphics & Information Co., Ltd, China Machine Press 2