dragon 发表于 2025-3-28 14:42:54
https://doi.org/10.1007/978-3-662-43079-8to a partnership. We assume that the agent has a model of the likelihood of different outcomes and corresponding utilities for each such partnership. Given a fixed, finite number of interactions, the problem is to choose a particular partner to interact with where the goal is to maximize the sum of