麻省理工学院:《自制决策制造原则》英文版 Planning to Maximize Reward: Markov Decision processes

How Might a mouse search a Maze for Cheese? heese · State Space Search? As a Constraint Satisfaction Problem? Goal-directed Planning As a rule or production System? What is missing? Ideas in this lecture Objective is to accumulate rewards rather than goal states Task is to generate policies for how to act in all situations rather than a plan for a single starting situation
文件格式:PDF,文件大小:187.97KB,售价:7.3元
文档详细内容(约25页)
点击进入文档下载页(PDF格式)

您可能感兴趣的文档

点击购买下载(PDF)

下载及服务说明

  • 购买前请先查看本文档预览页,确认内容后再进行支付;
  • 如遇文件无法下载、无法访问或其它任何问题,可发送电子邮件反馈,核实后将进行文件补发或退款等其它相关操作;
  • 邮箱: