finale
发表于 2025-3-26 21:22:03
http://reply.papertrans.cn/71/7033/703220/703220_31.png
清澈
发表于 2025-3-27 04:49:02
A Learning-Based Iterated Local Search Algorithm for Solving the Traveling Salesman Probleme well-known NP-Hard Traveling Salesman Problem. This metaheuristic basically employs single local search and perturbation operators for finding the (near-) optimal solution. In this paper, by incorporating multiple local search and perturbation operators, we explore the use of reinforcement learnin
障碍
发表于 2025-3-27 08:50:53
http://reply.papertrans.cn/71/7033/703220/703220_33.png
Immunotherapy
发表于 2025-3-27 13:18:42
A Comparison of Learnheuristics Using Different Reward Functions to Solve the Set Covering Problem machine learning. The concept behind the hybridization of both worlds is called Learnheuristics which allows to improve optimization methods through machine learning techniques where the input data for learning is the data produced by the optimization methods during the search process. Among the mo
phytochemicals
发表于 2025-3-27 13:49:39
A Bayesian Optimisation Approach for Multidimensional Knapsack Problemultidimensional knapsack problem with a large number of items and knapsack constraints, a two-level formulation is presented to take advantage of the global optimisation capability of the Bayesian optimisation approach, and the efficiency of integer programming solvers on small problems. The first l
SLING
发表于 2025-3-27 20:28:02
http://reply.papertrans.cn/71/7033/703220/703220_36.png
令人悲伤
发表于 2025-3-28 00:41:53
Guiding Representation Learning in Deep Generative Models with Policy Gradientsion. Using such a representation as input to Reinforcement Learning (RL) approaches may reduce learning time, enable domain transfer or improve interpretability of the model. However, current state-of-the-art approaches that combine VAE with RL fail at learning good performing policies on certain RL
耐寒
发表于 2025-3-28 02:22:02
Deep Reinforcement Learning for Dynamic Pricing of Perishable Productsic pricing of perishable products using DQN value function approximator. A model-free reinforcement learning approach is used to maximize revenue for a perishable item with fixed initial inventory and selling horizon. The demand is influenced by the price and freshness of the product. The convention
SLAG
发表于 2025-3-28 09:54:26
An Exploratory Analysis on a Disinformation Datasete the effects of this type of content have their impacts in the most diverse areas and generate more and more impacts within society. Automated fact-checking systems have been proposed by applying supervised machine learning techniques to assist in filtering fake news. However, two challenges are st
共同时代
发表于 2025-3-28 11:38:34
http://reply.papertrans.cn/71/7033/703220/703220_40.png