决策不确定性下的规划与强化学习：理论与应用

5星 · 超过95%的资源需积分: 14 65 浏览量更新于2024-07-20 6 收藏 5.93MB PDF 举报

《决策在不确定性下的挑战》是一本深入介绍决策支持系统设计者如何处理不确定性和多目标问题的著作。作者们聚焦于规划与强化学习这两种设计决策代理的方法，涵盖了概率模型、贝叶斯网络、效用理论、马尔可夫决策过程等多个核心概念。首先，贝叶斯网络作为一种图形模型，被用来捕捉变量之间的概率关系，这对于理解决策中的不确定性至关重要。效用理论则提供了一种框架，指导在面临未知结果时做出最优决策。马尔可夫决策过程被用于处理序列决策问题，通过动态地更新状态和预测可能的结果来指导行动。书中还探讨了模型不确定性，即决策者对系统模型的不完全理解，以及状态不确定性，即决策过程中实际状态的不确定性。此外，它涵盖了合作决策，即涉及多个相互作用的智能体共同决策的情况，这种复杂性在现实世界的许多应用场景中是常见的，如人搜索系统、语音应用、航空碰撞避免和无人驾驶飞机持久监视。作者们威廉·德莱尼、艾伦·芬恩、彼得·赫斯特、迈克尔·科亨德弗、查尔斯-宾·张和凯尔-平·邓恩等，都是该领域的知名专家，他们的贡献确保了本书的深度和实用性。书中的实例应用广泛，包括基于属性的人脸搜索、语音识别技术、飞行器避障，以及无人侦察机的持续监控，这些都是将理论概念转化为实际解决方案的典型例子。这本书以一致的符号系统统一了来自不同研究社区的研究成果，适合工程学科（如计算机科学、航空航天和管理科学）有概率论和微积分基础的学生和研究人员作为高级教材。同时，对于跨学科的研究人员而言，这也将是一部宝贵的参考文献，反映了麻省理工学院林肯实验室在国家安全领域应用先进技术所作出的卓越贡献。整个系列书籍延续了麻省理工辐射实验室系列的传统，致力于知识共享和技术传播。

5BCMF PG $POUFOUT YW

 'FBUVSF &YUSBDUJPO 

 )JEEFO .BSLPW .PEFMT 

 (BVTTJBO .JYUVSF .PEFMT 

 &YQFDUBUJPO.BYJNJ[BUJPO "MHPSJUIN 

 4QFFDI 3FDPHOJUJPO 

 5PQJD *EFOUJýDBUJPO 

 -BOHVBHF 3FDPHOJUJPO 

 4QFBLFS *EFOUJýDBUJPO 

 'PSFOTJD 4QFBLFS 3FDPHOJUJPO 

 .BDIJOF 5SBOTMBUJPO 

 4VNNBSZ 

3FGFSFODFT 

 0QUJNJ[FE "JSCPSOF $PMMJTJPO "WPJEBODF 

 "JSCPSOF $PMMJTJPO "WPJEBODF 4ZTUFNT 

 5SBGýD "MFSU BOE $PMMJTJPO "WPJEBODF 4ZTUFN 

 -JNJUBUJPOT PG &YJTUJOH 4ZTUFN 

 6ONBOOFE "JSDSBGU 4FOTF BOE "WPJE 

 "JSCPSOF $PMMJTJPO "WPJEBODF 4ZTUFN 9 

 $PMMJTJPO "WPJEBODF 1SPCMFN 'PSNVMBUJPO 

 3FTPMVUJPO "EWJTPSJFT 

 %ZOBNJD .PEFM 

 3FXBSE 'VODUJPO 

 %ZOBNJD 1SPHSBNNJOH 

 4UBUF &TUJNBUJPO 

 4FOTPS &SSPS 

1SFGBDF

This book provides an introduction to decision making under uncertainty from a

computational perspective. The aim of the ﬁrst part of the book is to familiarize the

reader with the foundations of probabilistic models and decision theory. The second part

of the book discusses the application of the theory to problems relevant to a variety of

mission areas. The subject of decision making under uncertainty is quite broad and has

its origins in several dierent ﬁelds. The text aims to be as concise as possible, providing

references to additional material that may be relevant to a wide set of applications.

The target audience for this book includes advanced engineering undergraduate and

graduate students as well as professionals. Disciplines for which the book would be

especially useful include computer science, aerospace, electrical engineering, and opera-

tions research. The text is intended to be introductory in nature. Although algorithms

are outlined in the text, proofs are omitted. The book requires some mathematical

maturity and assumes some prior exposure to probability theory and calculus. The ﬁrst

ﬁve chapters can be used as the basis of an undergraduate or graduate course. The topics

in Chapters 6 and 7 are more appropriate for the graduate level.

The book was written over the course of two years while I was at Lincoln Laboratory,

a federally funded research and development center of the Massachusetts Institute of

Technology. While teaching a course on decision making under uncertainty, I was

invited by a member of the Lincoln Laboratory book series to prepare a volume. Much

of the material in this book originated from the course. The later part of the course

consisted of guest lectures from researchers from Lincoln Laboratory and campus with

the aim to show how the principles and techniques discussed in the ﬁrst part of the

course can be applied to problems of national interest. Some of these guest lectures have

become chapters in this book.

M J. K

Stanford, Calif.

February 6, 2015

Ancillary material is available on the book’s webpage:

http://mitpress.mit.edu/decision-making-under-uncertainty

YJY

剩余349页未读，继续阅读

SorelCheung

粉丝: 61
资源: 120

决策不确定性下的规划与强化学习：理论与应用

Uncertain Computation-based Decision Theory 无水印原版pdf

Algorithms for Decision Making.pdf

Foundations for Logical Reasoning and Decision Making (Cutting-edge Exploration)

The Power Tool for Quantifying Uncertainty: Monte Carlo Simulation in MATLAB

Sequential Decision Making

简单的基于 Kotlin 和 JavaFX 实现的推箱子小游戏示例代码

基于simulink建立的PEMFC燃料电池机理模型（国外团队开发的，密歇根大学)，包含空压机模型，空气路，氢气路，电堆等模型 可以正常进行仿真

基于springboot的高校教学档案管理系统设计与实现源码（java毕业设计完整源码+LW）.zip

物流工厂往复式升降机2018可编辑全套技术资料100%好用.zip

基于USuperStar酒店管理系统（java web课程设计）、全部资料+详细文档+高分项目.zip

最新资源

基于simulink建立的PEMFC燃料电池机理模型（国外团队开发的，密歇根大学)，包含空压机模型，空气路，氢气路，电堆等模型可以正常进行仿真