阐释 发表于 2025-3-28 15:36:58
Mixed-Policy Asynchronous Deep Q-Learning such as deep neural networks, have been successfully used in both single- and multi-agent environments with high dimensional state-spaces. The multi-agent learning paradigm faces even more problems, due to the effect of several agents learning simultaneously in the environment. One of its main conc袋鼠 发表于 2025-3-28 21:41:48
Reward-Weighted GMM and Its Application to Action-Selection in Robotized Shoe Dressingask and must select the action that maximizes success probability among a repertoire of pre-trained actions. We investigate the case in which sensory data is only available before making the decision, but not while the action is being performed. In this paper we propose to use a Gaussian Mixture Mod吼叫 发表于 2025-3-29 01:14:26
http://reply.papertrans.cn/83/8203/820228/820228_43.png朴素 发表于 2025-3-29 04:51:56
Tactile Sensing and Machine Learning for Human and Object Recognition in Disaster Scenariosrios where haptic feedback provides a valuable information for the search of potential victims. To extract haptic information from the environment, a tactile sensor attached to a lightweight robotic arm is used. Then, methods based on the SURF descriptor, support vector machines (SVM), Deep Convolut不来 发表于 2025-3-29 11:01:43
http://reply.papertrans.cn/83/8203/820228/820228_45.png全等 发表于 2025-3-29 11:42:05
http://reply.papertrans.cn/83/8203/820228/820228_46.png祝贺 发表于 2025-3-29 16:59:17
http://reply.papertrans.cn/83/8203/820228/820228_47.pngcondone 发表于 2025-3-29 21:28:56
http://reply.papertrans.cn/83/8203/820228/820228_48.pnghedonic 发表于 2025-3-30 01:19:04
http://reply.papertrans.cn/83/8203/820228/820228_49.pngMercantile 发表于 2025-3-30 07:28:18
Reward-Weighted GMM and Its Application to Action-Selection in Robotized Shoe Dressingorithm to use the result of each execution to update the model, thus adapting the robot behavior to the user and evaluating the effectiveness of each pre-trained action. The proposed algorithm is applied to a robotic shoe-dressing task. Simulated and real experiments show the validity of our approach.