找回密码
 To register

QQ登录

只需一步,快速开始

扫一扫,访问微社区

Titlebook: Markov Decision Processes with Their Applications; Qiying Hu,Wuyi Yue Book 2008 Springer-Verlag US 2008 Markov decision process.Observable

[复制链接]
查看: 23766|回复: 45
发表于 2025-3-21 17:00:57 | 显示全部楼层 |阅读模式
书目名称Markov Decision Processes with Their Applications
编辑Qiying Hu,Wuyi Yue
视频video
概述Presents new branches for Markov Decision Processes (MDP).Applies new methodology for MDPs with discounted total reward criterion.Offers new applications of MDPs in areas such as the control of discre
丛书名称Advances in Mechanics and Mathematics
图书封面Titlebook: Markov Decision Processes with Their Applications;  Qiying Hu,Wuyi Yue Book 2008 Springer-Verlag US 2008 Markov decision process.Observable
描述.Markov decision processes (MDPs), also called stochastic dynamic programming, were first studied in the 1960s. MDPs can be used to model and solve dynamic decision-making problems that are multi-period and occur in stochastic circumstances. There are three basic branches in MDPs: discrete-time MDPs, continuous-time MDPs and semi-Markov decision processes. Starting from these three branches, many generalized MDPs models have been applied to various practical problems. These models include partially observable MDPs, adaptive MDPs, MDPs in stochastic environments, and MDPs with multiple objectives, constraints or imprecise parameters...Markov Decision Processes With Their Applications examines MDPs and their applications in the optimal control of discrete event systems (DESs), optimal replacement, and optimal allocations in sequential online auctions. The book presents four main topics that are used to study optimal control problems: a new methodology for MDPs with discounted total reward criterion; transformation of continuous-time MDPs and semi-Markov decision processes into a discrete-time MDPs model, thereby simplifying the application of MDPs; MDPs in stochastic environments, wh
出版日期Book 2008
关键词Markov decision process; Observable; Optimal control; decision making problems; decision processes; discr
版次1
doihttps://doi.org/10.1007/978-0-387-36951-8
isbn_softcover978-1-4419-4238-8
isbn_ebook978-0-387-36951-8Series ISSN 1571-8689 Series E-ISSN 1876-9896
issn_series 1571-8689
copyrightSpringer-Verlag US 2008
The information of publication is updating

书目名称Markov Decision Processes with Their Applications影响因子(影响力)




书目名称Markov Decision Processes with Their Applications影响因子(影响力)学科排名




书目名称Markov Decision Processes with Their Applications网络公开度




书目名称Markov Decision Processes with Their Applications网络公开度学科排名




书目名称Markov Decision Processes with Their Applications被引频次




书目名称Markov Decision Processes with Their Applications被引频次学科排名




书目名称Markov Decision Processes with Their Applications年度引用




书目名称Markov Decision Processes with Their Applications年度引用学科排名




书目名称Markov Decision Processes with Their Applications读者反馈




书目名称Markov Decision Processes with Their Applications读者反馈学科排名




单选投票, 共有 1 人参与投票
 

1票 100.00%

Perfect with Aesthetics

 

0票 0.00%

Better Implies Difficulty

 

0票 0.00%

Good and Satisfactory

 

0票 0.00%

Adverse Performance

 

0票 0.00%

Disdainful Garbage

您所在的用户组没有投票权限
发表于 2025-3-21 22:05:44 | 显示全部楼层
发表于 2025-3-22 01:40:21 | 显示全部楼层
发表于 2025-3-22 05:09:50 | 显示全部楼层
Semi-Markov Decision Processes,t decision epochs are not considered. Those in CTMDPs are continuous time Markov chains, where the decision is chosen every time. In this chapter, we study a stationary semi-Markov decision processes (SMDPs) model, where the underlying stochastic processes are semi-Markov processes. Here, the decisi
发表于 2025-3-22 09:01:23 | 显示全部楼层
Markovdecisionprocessesinsemi-Markov Environments,m that itself can be modeled by a Markov decision process, but the system is influenced by its environment which is modeled by a semi-Markov process. The influence of the environment on the system occurs when the environment state changes, and consists of the following three things: (1) an instantan
发表于 2025-3-22 14:19:16 | 显示全部楼层
Optimal control of discrete event systems: I, new optimal control problem in DESs. The performance measure is to maximize the maximal discounted total reward among all possible strings (i.e., paths) of the controlled system. The condition we need for this is only that the performance measure is well defined. By using the method and ideas prese
发表于 2025-3-22 19:27:18 | 显示全部楼层
Optimal control of discrete event systems: II,three ways. First, the discrete event system is defined as a collection of event sets that depend on strings. Whenthe system generates a string, the next event that occurs should be in the corresponding event set. Second, the rewards are for choosing control inputs at strings. Finally, the control p
发表于 2025-3-22 23:44:07 | 显示全部楼层
Optimal replacement under stochastic Environments,nd thus should be replaced by a new one when it is too bad. There are two types of deterioration considered in reliability literature. The first one is due to the operation of the system itself, and the second one is caused by the influence of the environment, for example, shocks to the system. We c
发表于 2025-3-23 04:40:45 | 显示全部楼层
Optimalal location in sequential online Auctions,quential auctions on the Web and has a reserve price set on the items. We present two such Internet auction cases: one is where the reserve price is private (known only by the seller). The other one is where the reserve is public (known to all). The buyers arrive according to a Poisson process. The
发表于 2025-3-23 09:14:41 | 显示全部楼层
 关于派博传思  派博传思旗下网站  友情链接
派博传思介绍 公司地理位置 论文服务流程 影响因子官网 SITEMAP 大讲堂 北京大学 Oxford Uni. Harvard Uni.
发展历史沿革 期刊点评 投稿经验总结 SCIENCEGARD IMPACTFACTOR 派博系数 清华大学 Yale Uni. Stanford Uni.
|Archiver|手机版|小黑屋| 派博传思国际 ( 京公网安备110108008328) GMT+8, 2025-5-24 11:16
Copyright © 2001-2015 派博传思   京公网安备110108008328 版权所有 All rights reserved
快速回复 返回顶部 返回列表