Model Selection Criteria

Bayesian score: posterior probability P(m|D)
  P(m|D) = P(m) P(D|m) / P(D) = P(m) ∫ P(D|m, θ) P(θ|m) dθ / P(D)

BIC score: large-sample approximation of the Bayesian score
  BIC(m|D) = log P(D|m, θ*) − (d/2) log N
  d: number of free parameters; N: sample size.
  θ*: MLE of θ, estimated using the EM algorithm.

Likelihood term of BIC: measures how well the model fits the data.
Second term: penalty for model complexity.
The use of the BIC score indicates that we are looking for a model that fits the data well and, at the same time, is not overly complex.
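As a quick illustration, here is a minimal Python sketch of the BIC computation, assuming the log-likelihood of the data at the MLE θ* (e.g. obtained via EM), the parameter count d, and the sample size N are already available; the function name and arguments are placeholders, not part of the tutorial's software.

```python
import math

def bic_score(loglik_mle, num_free_params, sample_size):
    """BIC(m|D) = log P(D|m, theta*) - (d/2) * log N.

    loglik_mle      -- log P(D|m, theta*), log-likelihood at the MLE theta*
    num_free_params -- d, the number of free parameters of model m
    sample_size     -- N, the number of data cases
    """
    return loglik_mle - 0.5 * num_free_params * math.log(sample_size)
```

Under this sign convention a higher BIC is better: the first term rewards fit, while the second term penalizes complexity and grows with the sample size.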
Model Selection Criteria

AIC (Akaike, 1974):
  AIC(m|D) = log P(D|m, θ*) − d

Holdout likelihood:
  Data => training set, validation set.
  Model parameters are estimated on the training set.
  Model quality is measured by the likelihood on the validation set.

Cross validation: too expensive.
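A minimal sketch of these two alternatives, again with higher values indicating better models; fit_model and loglik stand in for whatever estimation (e.g. EM) and scoring routines are used, and are hypothetical names rather than the tutorial's API.

```python
import random

def aic_score(loglik_mle, num_free_params):
    # AIC(m|D) = log P(D|m, theta*) - d
    return loglik_mle - num_free_params

def holdout_loglik(data, fit_model, loglik, train_fraction=0.8, seed=0):
    """Holdout likelihood: estimate parameters on a training split,
    measure model quality on the held-out validation split."""
    rng = random.Random(seed)
    cases = list(data)
    rng.shuffle(cases)
    cut = int(train_fraction * len(cases))
    train, validation = cases[:cut], cases[cut:]
    model = fit_model(train)          # parameters from the training set only
    return loglik(model, validation)  # likelihood on the validation set
```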
Search Algorithms

Double hill climbing (DHC) (Zhang 2002, 2004): 7 manifest variables.
Single hill climbing (SHC) (Zhang and Kocka 2004): 12 manifest variables.
Heuristic SHC (HSHC) (Zhang and Kocka 2004): 50 manifest variables.
EAST (Chen et al. 2011): 100+ manifest variables.
Double Hill Climbing (DHC)

Two search procedures:
  One for the model structure.
  One for the cardinalities of the latent variables.
Very inefficient: tested only on data sets with 7 or fewer variables (Zhang 2004).

DHC was tested on synthetic and real-world data sets with BIC, AIC, and holdout likelihood in turn. The best models were found when BIC was used, so subsequent work is based on BIC.
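Both procedures are greedy searches over candidate models scored by the chosen criterion. The skeleton below is a generic hill-climbing loop of that kind, offered as a hedged sketch: neighbors and score are hypothetical placeholders (e.g. structural edits or cardinality changes, and BIC), not the tutorial's operators or code.

```python
def hill_climb(initial_model, data, neighbors, score):
    """Greedy hill climbing: repeatedly move to a better-scoring
    neighbor until no candidate improves the current score.

    neighbors(model)   -- yields candidate models (placeholder)
    score(model, data) -- model selection criterion, e.g. BIC (placeholder)
    """
    current = initial_model
    current_score = score(current, data)
    improved = True
    while improved:
        improved = False
        for candidate in neighbors(current):
            s = score(candidate, data)
            if s > current_score:
                current, current_score = candidate, s
                improved = True
    return current
```

Scoring each candidate requires estimating its parameters (e.g. by EM), which is one reason running two such searches becomes expensive as the number of variables grows.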
Single Hill Climbing (SHC)

Determines both the model structure and the cardinalities of the latent variables using a single search procedure.
Uses five search operators (see the sketch after this list):
  Node Introduction (NI)
  Node Deletion (ND)
  Node Relocation (NR)
  State Introduction (SI)
  State Deletion (SD)
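One way to read "a single search procedure" is that, at each step, the candidates produced by all five operators compete in one pool and the best-scoring candidate is taken; the generator below sketches that reading, with the operator functions as hypothetical placeholders rather than the tutorial's implementation.

```python
def shc_candidates(model, operators):
    """Pool the candidates produced by every search operator.

    operators is assumed to be a list of functions, one per operator
    (NI, ND, NR, SI, SD), each yielding modified copies of the model.
    """
    for op in operators:
        yield from op(model)

# Hypothetical usage with the generic hill-climbing loop sketched earlier
# (all names below are placeholders, not the tutorial's API):
# best = hill_climb(
#     initial_model, data,
#     neighbors=lambda m: shc_candidates(m, [node_introduction, node_deletion,
#                                            node_relocation, state_introduction,
#                                            state_deletion]),
#     score=bic_of_model)
```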