CLOUT 发表于 2025-3-30 11:30:09

Differential Safety Testing of Deep RL Agents Enabled by Automata Learninguracy guarantees on learned models are not strictly necessary. Through a combination of automata learning, testing, and statistics, we perform testing-based verification with statistical guarantees in the absence of guarantees on the learned automata. We showcase our approach by testing deep reinfor

MOT 发表于 2025-3-30 15:58:57

http://reply.papertrans.cn/20/1908/190775/190775_52.png

记忆 发表于 2025-3-30 19:26:32

http://reply.papertrans.cn/20/1908/190775/190775_53.png

远地点 发表于 2025-3-30 21:48:44

http://reply.papertrans.cn/20/1908/190775/190775_54.png

生存环境 发表于 2025-3-31 03:11:29

Deep Neural Networks, Explanations, and Rationalityal “explanation” for a decision is a chronicle of the steps used to arrive at the decision. Herb Simon’s “bounded rationality” is the observation that the ability of a human brain to handle algorithmic complexity and data is limited. As a consequence, human decision-making in complex cases mixes som

opinionated 发表于 2025-3-31 08:25:00

Shielded Reinforcement Learning for Hybrid Systemss state, is known to be intricately hard. Reinforcement learning has been leveraged to construct near-optimal controllers, but their behavior is not guaranteed to be safe, even when it is encouraged by reward engineering. One way of imposing safety to a learned controller is to use a ., which is cor

GNAT 发表于 2025-3-31 10:43:16

What, Indeed, is an Achievable Provable Guarantee for Learning-Enabled Safety-Critical Systemsnges. Among the challenges, it is known that a rigorous, yet practical, way of achieving safety guarantees is one of the most prominent. In this paper, we first discuss the engineering and research challenges associated with the design and verification of such systems. Then, based on the observation

VICT 发表于 2025-3-31 13:27:37

DeepAbstraction++: Enhancing Test Prioritization Performance via Combined Parameterized Boxess. Subsequently, the DeepAbstraction algorithm has recently become one of the leading techniques in this area. It employs a box-abstraction concept, the efficiency of which depends on the tau parameter, the clustering parameter, that influences the size of these boxes. The conclusion of the previous

邪恶的你 发表于 2025-3-31 20:42:29

http://reply.papertrans.cn/20/1908/190775/190775_59.png
页: 1 2 3 4 5 [6]
查看完整版本: Titlebook: Bridging the Gap Between AI and Reality; First International Bernhard Steffen Conference proceedings 2024 The Editor(s) (if applicable) an