Bootstrap

◼ Bootstrap
  ◆ Works well with small data sets
  ◆ Samples the given training tuples uniformly with replacement, i.e., each time a tuple is selected, it is equally likely to be selected again and re-added to the training set
◼ Several bootstrap methods; a common one is the .632 bootstrap
  ◆ A data set with d tuples is sampled d times, with replacement, resulting in a training set of d samples. The tuples that did not make it into the training set form the test set. About 63.2% of the original tuples end up in the bootstrap sample, and the remaining 36.8% form the test set (since (1 − 1/d)^d ≈ e^(−1) = 0.368)
  ◆ Repeat the sampling procedure k times; overall accuracy of the model:
    Acc(M) = (1/k) Σ_{i=1}^{k} (0.632 · Acc(M_i)_testset + 0.368 · Acc(M_i)_trainset)
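The two ideas above can be sketched in a few lines of Python: a simulation confirming that one bootstrap round leaves out about 36.8% of the tuples, and the .632 accuracy combination applied to hypothetical per-round accuracies (the accuracy numbers are made up for illustration).

```python
import random

random.seed(0)

d = 100_000                                    # number of tuples in the data set

# One bootstrap round: sample d times, uniformly, with replacement.
train = [random.randrange(d) for _ in range(d)]

# Tuples never drawn form the test set; the fraction left out
# approaches (1 - 1/d)^d ~ e^-1 = 0.368 as d grows.
test_fraction = 1 - len(set(train)) / d
print(f"fraction left out: {test_fraction:.3f}")

# .632 bootstrap accuracy over k rounds (hypothetical per-round accuracies).
k = 5
acc_test  = [0.81, 0.79, 0.83, 0.80, 0.82]     # Acc(M_i) on the test set
acc_train = [0.95, 0.96, 0.94, 0.95, 0.97]     # Acc(M_i) on the train set

acc = sum(0.632 * te + 0.368 * tr for te, tr in zip(acc_test, acc_train)) / k
print(f"Acc(M) = {acc:.3f}")
```

Note the weighting: the test-set accuracy (the pessimistic estimate) gets the larger weight 0.632, matching the expected fraction of distinct tuples in the bootstrap sample.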
Confidence Intervals

◼ Suppose we have 2 classifiers, M1 and M2; which one is better?
◼ Use 10-fold cross-validation to obtain error(M1) and error(M2)
◼ These mean error rates are just estimates of error on the true population of future data cases
◼ What if the difference between the 2 error rates is just attributed to chance?
  ◆ Use a test of statistical significance
  ◆ Obtain confidence limits for our error estimates
Null Hypothesis

◼ Perform 10-fold cross-validation
◼ Assume the samples follow a t-distribution with k − 1 degrees of freedom (here, k = 10)
◼ Use the t-test (or Student's t-test)
◼ Null hypothesis: M1 and M2 are the same
◼ If we can reject the null hypothesis, then
  ◆ We conclude that the difference between M1 and M2 is statistically significant
  ◆ Choose the model with the lower error rate
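A minimal sketch of this decision procedure, assuming we already have per-fold error rates for both models from the same 10-fold partitioning (the numbers below are made up). The two-sided critical value 2.262 for 9 degrees of freedom at significance level 0.05 comes from a standard t table.

```python
import math

# Hypothetical per-fold error rates from one 10-fold cross-validation run.
err_m1 = [0.15, 0.12, 0.18, 0.14, 0.16, 0.13, 0.17, 0.15, 0.14, 0.16]
err_m2 = [0.20, 0.19, 0.22, 0.18, 0.21, 0.20, 0.23, 0.19, 0.20, 0.21]

k = len(err_m1)                                     # k = 10 folds
diffs = [a - b for a, b in zip(err_m1, err_m2)]     # paired differences
mean_diff = sum(diffs) / k
var_diff = sum((d - mean_diff) ** 2 for d in diffs) / k

# t-statistic with k - 1 = 9 degrees of freedom.
t = mean_diff / math.sqrt(var_diff / k)

# Two-sided critical value at significance level 0.05, df = 9 (t table).
t_crit = 2.262

if abs(t) > t_crit:
    print(f"t = {t:.2f}: reject the null hypothesis; "
          "difference is statistically significant")
else:
    print(f"t = {t:.2f}: cannot reject the null hypothesis")
```

With these numbers M1's error is consistently lower across folds, so the null hypothesis is rejected and M1 would be chosen.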
Estimating Confidence Intervals: t-test

◼ If only 1 test set available: pairwise comparison
  ◆ For the ith round of 10-fold cross-validation, the same cross partitioning is used to obtain error(M1)_i and error(M2)_i
  ◆ Average over the 10 rounds to get error(M1) and error(M2)
  ◆ The t-test computes the t-statistic with k − 1 degrees of freedom:
    t = (error(M1) − error(M2)) / √(var(M1 − M2)/k)
    where
    var(M1 − M2) = (1/k) Σ_{i=1}^{k} [error(M1)_i − error(M2)_i − (error(M1) − error(M2))]²
◼ If two test sets available: use the non-paired t-test, with
    var(M1 − M2) = var(M1)/k1 + var(M2)/k2
    where k1 and k2 are the number of cross-validation samples used for M1 and M2, respectively
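A sketch of the non-paired variant, for the case where the two models were evaluated on independent sets of cross-validation samples, so k1 and k2 may differ and per-fold differences cannot be paired (the error values are made-up illustrations).

```python
import math

# Hypothetical error estimates for M1 and M2 on independent
# cross-validation samples.
err_m1 = [0.15, 0.12, 0.18, 0.14, 0.16, 0.13, 0.17, 0.15]              # k1 = 8
err_m2 = [0.20, 0.19, 0.22, 0.18, 0.21, 0.20, 0.23, 0.19, 0.20, 0.21]  # k2 = 10

k1, k2 = len(err_m1), len(err_m2)
mean1, mean2 = sum(err_m1) / k1, sum(err_m2) / k2

# Variance of each model's error estimates.
var1 = sum((e - mean1) ** 2 for e in err_m1) / k1
var2 = sum((e - mean2) ** 2 for e in err_m2) / k2

# Non-paired case: var(M1 - M2) = var(M1)/k1 + var(M2)/k2
var_diff = var1 / k1 + var2 / k2
t = (mean1 - mean2) / math.sqrt(var_diff)
print(f"t = {t:.2f}")
```

Because the samples are not paired, the fold-to-fold correlation that the pairwise test exploits is unavailable, so the combined variance is generally larger and the test less sensitive for the same data.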