
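University of Science and Technology of China: Fundamentals of Artificial Intelligence, course teaching resources (lecture slides), Lecture 06 Game Playing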

Outline:
▶ Games
▶ Perfect play (optimal strategy): minimax decisions, α-β pruning
▶ Resource limits and approximate evaluation
▶ Games of chance (games that include an element of chance)
▶ Games of imperfect information


Document contents (about 78 pages)

Normal-Form Game: Prisoner's Dilemma
▶ Two prisoners are questioned in isolated cells.
▶ Each prisoner can Cooperate (C) or Defect (D).
▶ Utilities (row: agent 1, column: agent 2):

         C          D
  C   -1, -1     -5,  0
  D    0, -5     -3, -3

Normal-Form Game: Rock-Paper-Scissors
▶ Two players, three actions.
▶ Rock beats Scissors, Scissors beats Paper, Paper beats Rock.
▶ Utilities (row: player 1, column: player 2):

         R         P         S
  R    0,  0    -1,  1     1, -1
  P    1, -1     0,  0    -1,  1
  S   -1,  1     1, -1     0,  0

Optimality Concepts in Normal-Form Games
▶ Best-Response Function: the set of optimal strategies given the other agents' current strategies.
  $\pi_i^* \in BR_i(\pi_{-i})$ iff $\forall \pi_i \in PD(A_i):\ R_i(\langle \pi_i^*, \pi_{-i} \rangle) \ge R_i(\langle \pi_i, \pi_{-i} \rangle)$,
  where $PD(A_i)$ is the set of probability distributions over agent $i$'s actions $A_i$.
▶ Nash Equilibrium: all agents are using best-response strategies.
  $\forall i = 1 \dots n:\ \pi_i \in BR_i(\pi_{-i})$
▶ Every finite normal-form game has at least one Nash equilibrium (possibly in mixed strategies).
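The best-response condition can be checked mechanically for pure strategies. The following is a minimal sketch, not part of the original slides: the function name `pure_nash_equilibria` and the dictionary layout are illustrative assumptions, and the payoffs are those of the Prisoner's Dilemma and Rock-Paper-Scissors examples (with the (C, C) entry taken to be -1, -1). It only searches pure strategies, so it can miss equilibria that exist only in mixed strategies.

```python
from itertools import product

def pure_nash_equilibria(payoffs, actions1, actions2):
    """Return all pure-strategy profiles in which both actions are best responses."""
    equilibria = []
    for a1, a2 in product(actions1, actions2):
        u1, u2 = payoffs[(a1, a2)]
        # Player 1 cannot gain by deviating while player 2 keeps playing a2.
        best1 = all(payoffs[(d, a2)][0] <= u1 for d in actions1)
        # Player 2 cannot gain by deviating while player 1 keeps playing a1.
        best2 = all(payoffs[(a1, d)][1] <= u2 for d in actions2)
        if best1 and best2:
            equilibria.append((a1, a2))
    return equilibria

# Payoffs are (utility of row agent, utility of column agent).
PD_ACTIONS = ["C", "D"]
PRISONERS_DILEMMA = {
    ("C", "C"): (-1, -1), ("C", "D"): (-5, 0),
    ("D", "C"): (0, -5),  ("D", "D"): (-3, -3),
}

RPS_ACTIONS = ["R", "P", "S"]
ROCK_PAPER_SCISSORS = {
    ("R", "R"): (0, 0),  ("R", "P"): (-1, 1), ("R", "S"): (1, -1),
    ("P", "R"): (1, -1), ("P", "P"): (0, 0),  ("P", "S"): (-1, 1),
    ("S", "R"): (-1, 1), ("S", "P"): (1, -1), ("S", "S"): (0, 0),
}

print(pure_nash_equilibria(PRISONERS_DILEMMA, PD_ACTIONS, PD_ACTIONS))
print(pure_nash_equilibria(ROCK_PAPER_SCISSORS, RPS_ACTIONS, RPS_ACTIONS))
```

Under these payoffs the Prisoner's Dilemma has the single pure equilibrium (D, D), while Rock-Paper-Scissors has none: its only equilibrium is mixed, which is why the existence result is stated over probability distributions $PD(A_i)$ rather than over single actions.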

Game Classification: Zero-sum
▶ 2 players with opposing objectives.
▶ There is usually only one Nash equilibrium.
▶ Minimax finds it.
[Figure: (a) Reward function for player 1; (b) Reward function for player 2]

Two-Player Zero-Sum Games
▶ Characteristics:
  ▶ Two opponents play against each other.
  ▶ Symmetrical rewards (they always sum to zero).
  ▶ Usually there is only one equilibrium; if more exist, they are interchangeable.
  ▶ Interchangeable: if $\langle \pi_1, \pi_2 \rangle$ and $\langle \mu_1, \mu_2 \rangle$ are two Nash equilibria, then $\langle \pi_1, \mu_2 \rangle$ and $\langle \mu_1, \pi_2 \rangle$ are also Nash equilibria, and all of them have the same utility.
▶ Minimax finds an equilibrium of the game $(2, A, O, R, -R)$:
  $\max_{\pi \in PD(A)} \min_{o \in O} \sum_{a \in A} \pi(a) R(a, o)$
▶ It can be formulated as a linear program.
▶ The solution lies in the strategy space: simultaneous play invalidates deterministic strategies.
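The minimax objective can be evaluated directly for Rock-Paper-Scissors. As a linear program it reads: maximize $v$ subject to $\sum_{a \in A} \pi(a) R(a, o) \ge v$ for every $o \in O$, $\sum_{a \in A} \pi(a) = 1$, and $\pi(a) \ge 0$. The sketch below does not call an LP solver; as a stated simplification it replaces the LP with a brute-force search over a coarse grid of mixed strategies, which is only practical for very small games. The names `security_level` and `grid_maximin` and the grid resolution are illustrative assumptions, not from the slides.

```python
from fractions import Fraction
from itertools import product

ACTIONS = ["R", "P", "S"]
# R[a][o]: reward to the maximizing player for action a against opponent action o.
R = {
    "R": {"R": 0, "P": -1, "S": 1},
    "P": {"R": 1, "P": 0, "S": -1},
    "S": {"R": -1, "P": 1, "S": 0},
}

def security_level(pi):
    """Worst-case expected reward of mixed strategy pi: min_o sum_a pi(a) R(a, o)."""
    return min(sum(pi[a] * R[a][o] for a in ACTIONS) for o in ACTIONS)

def grid_maximin(steps=12):
    """Approximate max_pi min_o by exhaustive search on a grid over the simplex."""
    best_pi, best_value = None, None
    for i, j in product(range(steps + 1), repeat=2):
        if i + j > steps:
            continue  # probabilities must sum to 1, so the third weight is steps - i - j
        weights = (Fraction(i, steps), Fraction(j, steps), Fraction(steps - i - j, steps))
        pi = dict(zip(ACTIONS, weights))
        value = security_level(pi)
        if best_value is None or value > best_value:
            best_pi, best_value = pi, value
    return best_pi, best_value

always_rock = {"R": 1, "P": 0, "S": 0}
print(security_level(always_rock))  # worst case for a deterministic strategy
print(grid_maximin())               # best mixed strategy found on the grid
```

In this game the deterministic strategy "always Rock" guarantees only -1, because the opponent can always answer with Paper, whereas the grid search finds the uniform strategy (1/3, 1/3, 1/3) with a guaranteed value of 0. This illustrates why the solution must be sought among mixed strategies when both players move simultaneously.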
