finale 发表于 2025-3-26 21:22:03
http://reply.papertrans.cn/71/7033/703220/703220_31.png清澈 发表于 2025-3-27 04:49:02
A Learning-Based Iterated Local Search Algorithm for Solving the Traveling Salesman Probleme well-known NP-Hard Traveling Salesman Problem. This metaheuristic basically employs single local search and perturbation operators for finding the (near-) optimal solution. In this paper, by incorporating multiple local search and perturbation operators, we explore the use of reinforcement learnin障碍 发表于 2025-3-27 08:50:53
http://reply.papertrans.cn/71/7033/703220/703220_33.pngImmunotherapy 发表于 2025-3-27 13:18:42
A Comparison of Learnheuristics Using Different Reward Functions to Solve the Set Covering Problem machine learning. The concept behind the hybridization of both worlds is called Learnheuristics which allows to improve optimization methods through machine learning techniques where the input data for learning is the data produced by the optimization methods during the search process. Among the mophytochemicals 发表于 2025-3-27 13:49:39
A Bayesian Optimisation Approach for Multidimensional Knapsack Problemultidimensional knapsack problem with a large number of items and knapsack constraints, a two-level formulation is presented to take advantage of the global optimisation capability of the Bayesian optimisation approach, and the efficiency of integer programming solvers on small problems. The first lSLING 发表于 2025-3-27 20:28:02
http://reply.papertrans.cn/71/7033/703220/703220_36.png令人悲伤 发表于 2025-3-28 00:41:53
Guiding Representation Learning in Deep Generative Models with Policy Gradientsion. Using such a representation as input to Reinforcement Learning (RL) approaches may reduce learning time, enable domain transfer or improve interpretability of the model. However, current state-of-the-art approaches that combine VAE with RL fail at learning good performing policies on certain RL耐寒 发表于 2025-3-28 02:22:02
Deep Reinforcement Learning for Dynamic Pricing of Perishable Productsic pricing of perishable products using DQN value function approximator. A model-free reinforcement learning approach is used to maximize revenue for a perishable item with fixed initial inventory and selling horizon. The demand is influenced by the price and freshness of the product. The conventionSLAG 发表于 2025-3-28 09:54:26
An Exploratory Analysis on a Disinformation Datasete the effects of this type of content have their impacts in the most diverse areas and generate more and more impacts within society. Automated fact-checking systems have been proposed by applying supervised machine learning techniques to assist in filtering fake news. However, two challenges are st共同时代 发表于 2025-3-28 11:38:34
http://reply.papertrans.cn/71/7033/703220/703220_40.png