[Figure: example MDP transition graph. Action sets: A(high) = {search, wait}, A(low) = {search, wait, recharge}; edges labeled with transition probabilities (α, 1−α, β, 1−β, 1.0) and rewards (R_search, R_wait, −3).]
[Figure: agent–environment interaction loop. The agent observes the state and reward from the environment and selects an action.]

Goal: Learn to choose actions that maximize r₀ + γr₁ + γ²r₂ + …, where 0 ≤ γ < 1
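As a quick illustration of this discounted-return objective, here is a minimal sketch; the reward sequence and γ = 0.9 are hypothetical, not from the slides:

```python
# Minimal sketch: truncated discounted return r0 + γ·r1 + γ²·r2 + ...
# for a hypothetical reward sequence and discount factor.
gamma = 0.9                    # discount factor, 0 <= gamma < 1
rewards = [0, 0, 1, 0, 5]      # hypothetical rewards r0, r1, r2, ...

discounted_return = sum(gamma**i * r for i, r in enumerate(rewards))
print(discounted_return)       # 0.9**2 * 1 + 0.9**4 * 5 ≈ 4.09
```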
Markov Decision Processes

Assume
● finite set of states S
● set of actions A
● at each discrete time, agent observes state sₜ ∈ S and chooses action aₜ ∈ A
● then receives immediate reward rₜ, and state changes to sₜ₊₁
● Markov assumption: sₜ₊₁ = δ(sₜ, aₜ) and rₜ = r(sₜ, aₜ)
  - i.e., rₜ and sₜ₊₁ depend only on the current state and action
  - functions δ and r may be nondeterministic
  - functions δ and r not necessarily known to the agent
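To make the notation concrete, here is a minimal sketch of a deterministic MDP represented as lookup tables for δ and r; the states, actions, and rewards below are hypothetical, not from the slides:

```python
# Minimal sketch of a deterministic MDP: delta maps (state, action) to the
# next state, r maps (state, action) to the immediate reward.
# All states, actions, and rewards below are hypothetical.
states = ["s1", "s2", "s3"]
actions = ["left", "right"]

delta = {   # transition function δ : S × A → S
    ("s1", "right"): "s2", ("s2", "right"): "s3", ("s3", "right"): "s3",
    ("s1", "left"):  "s1", ("s2", "left"):  "s1", ("s3", "left"):  "s2",
}

r = {(s, a): 0.0 for s in states for a in actions}   # reward r : S × A → ℝ
r[("s2", "right")] = 100.0   # entering the (hypothetical) goal state pays 100

# One step of agent–environment interaction from s2, taking "right":
s, a = "s2", "right"
print(r[(s, a)], delta[(s, a)])   # 100.0 s3
```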
Value Function

To begin, consider deterministic worlds.

For each possible policy π the agent might adopt, we can define an evaluation function over states:

    V^π(s) ≡ rₜ + γrₜ₊₁ + γ²rₜ₊₂ + … ≡ Σᵢ₌₀^∞ γⁱ rₜ₊ᵢ

where rₜ, rₜ₊₁, … are generated by following policy π starting at state s.

Restated, the task is to learn the optimal policy π*:

    π* ≡ argmax_π V^π(s), (∀s)
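Since the world is assumed deterministic, V^π(s) can be approximated by simply rolling the policy forward from s and summing discounted rewards; below is a sketch with a hypothetical two-state MDP, truncating the infinite sum at a fixed horizon:

```python
# Sketch: estimating V^π(s) in a deterministic world by following policy π
# from s and summing discounted rewards (infinite sum truncated at `horizon`).
def value_of_policy(s, policy, delta, r, gamma=0.9, horizon=100):
    total, discount = 0.0, 1.0
    for _ in range(horizon):
        a = policy[s]                    # action chosen by π in state s
        total += discount * r[(s, a)]    # accumulate γ^i * r_{t+i}
        discount *= gamma
        s = delta[(s, a)]                # deterministic transition
    return total

# Hypothetical two-state example: moving "right" from s1 earns 100 once.
delta  = {("s1", "right"): "s2", ("s2", "right"): "s2"}
r      = {("s1", "right"): 100.0, ("s2", "right"): 0.0}
policy = {"s1": "right", "s2": "right"}
print(value_of_policy("s1", policy, delta, r))   # 100.0
```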
What to learn

We might try to have the agent learn the evaluation function V^π* (which we write as V*).

It could then do a lookahead search to choose the best action from any state s because

    π*(s) = argmax_a [r(s, a) + γV*(δ(s, a))]

A problem:
● This works well if the agent knows δ : S × A → S and r : S × A → ℝ
● But when it doesn't, it can't choose actions this way
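Here is a sketch of that lookahead rule, assuming the agent does know δ and r; the model and the V* values are hypothetical but self-consistent:

```python
# Sketch of the lookahead rule π*(s) = argmax_a [ r(s,a) + γ·V*(δ(s,a)) ].
# It requires the agent to know δ and r; everything below is hypothetical.
def greedy_action(s, actions, delta, r, V_star, gamma=0.9):
    return max(actions, key=lambda a: r[(s, a)] + gamma * V_star[delta[(s, a)]])

# Known (hypothetical) model: s2 is an absorbing zero-reward state.
actions = ["left", "right"]
delta = {("s1", "left"): "s1", ("s1", "right"): "s2",
         ("s2", "left"): "s2", ("s2", "right"): "s2"}
r     = {("s1", "left"): 0.0, ("s1", "right"): 100.0,
         ("s2", "left"): 0.0, ("s2", "right"): 0.0}
V_star = {"s1": 100.0, "s2": 0.0}    # self-consistent optimal values

print(greedy_action("s1", actions, delta, r, V_star))   # right
```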