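University of Electronic Science and Technology of China: Statistical Learning Theory and Applications, course teaching resources (lecture slides), Lecture 9: Data Representation - Non-Parametric Model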

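Contents: 1. Probability density estimation; 2. Histogram method; 3. Parzen window; 4. K-nearest-neighbor density estimation and the k-nearest-neighbor classifier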

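[Figure: distribution of k/n for n = 20, 50 and 100, with P = 0.7]

Let k be the number of the n samples that fall inside the region R, and let P be the probability that a sample falls in R. When n is large, the ratio k/n is sharply peaked around its mean P:
$E[k/n] = P$, $\mathrm{var}[k/n] = P(1 - P)/n$
Therefore: $P \approx k/n$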
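The peaking of k/n around P can be checked numerically. The following is a minimal sketch, assuming P = 0.7 and the sample sizes n = 20, 50, 100; the seed, the trial count and the function names are illustrative choices, not part of the lecture.

```python
import random

def simulate_k_over_n(P, n, trials, rng):
    """Draw `trials` values of k/n, where k counts successes in n Bernoulli(P) draws."""
    ratios = []
    for _ in range(trials):
        k = sum(1 for _ in range(n) if rng.random() < P)
        ratios.append(k / n)
    return ratios

def mean_and_variance(values):
    """Empirical mean and (population) variance of a list of numbers."""
    m = sum(values) / len(values)
    v = sum((x - m) ** 2 for x in values) / len(values)
    return m, v

rng = random.Random(0)  # fixed seed, chosen arbitrarily for reproducibility
P = 0.7                 # assumed probability of falling inside R
results = {}
for n in (20, 50, 100):
    ratios = simulate_k_over_n(P, n, trials=5000, rng=rng)
    results[n] = mean_and_variance(ratios)
    m, v = results[n]
    # The empirical variance should track the theoretical value P(1 - P)/n.
    print(f"n={n:4d}  mean={m:.4f}  var={v:.6f}  P(1-P)/n={P * (1 - P) / n:.6f}")
```

The empirical mean should stay close to P for every n, while the variance shrinks roughly in proportion to 1/n, which is what makes k/n a usable estimate of P.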

▶ If p(x) is assumed to be continuous and the region R is small enough that p(x) hardly varies over R, we obtain the following approximation:
$P = \int_R p(x')\,dx' \approx p(x)V$
where x is a point in R and V is the volume of R (the area, in the two-dimensional case).
▶ Since $P \approx k/n$, the probability density in the region R can be approximated as:
$p(x) \approx \frac{P}{V} \approx \frac{k/n}{V}$
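As a worked illustration of $p(x) \approx (k/n)/V$, the sketch below estimates the density of a standard normal at x = 0 in one dimension, where R is an interval and V is its length. The distribution, the sample size, the interval width and the function name `local_density_estimate` are all assumptions made for the example.

```python
import math
import random

def local_density_estimate(samples, x, width):
    """Estimate p(x) as (k/n)/V, with R = [x - width/2, x + width/2] and V = width."""
    n = len(samples)
    k = sum(1 for s in samples if abs(s - x) <= width / 2)
    return (k / n) / width

rng = random.Random(1)  # arbitrary fixed seed
samples = [rng.gauss(0.0, 1.0) for _ in range(20000)]

estimate = local_density_estimate(samples, x=0.0, width=0.2)
true_density = 1.0 / math.sqrt(2.0 * math.pi)  # standard normal density at 0

print(f"estimate={estimate:.4f}  true={true_density:.4f}")
```

With a narrow interval and many samples, the estimate should land near the true value of about 0.399.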

$p(x) \approx \frac{k/n}{V}$
▶ Its validity depends on two contradictory assumptions:
  ○ The region R must be sufficiently small that the density is approximately constant over the region.
  ○ The region R must be sufficiently large (in relation to the value of that density) that the number k of samples falling inside the region is sufficient for the binomial distribution to be sharply peaked.
▶ Conditions for converging to the true probability density in the limit $n \to \infty$:
  ○ V shrinks suitably with n
  ○ k grows with n
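The two convergence conditions can be illustrated with a region whose volume shrinks as n grows. The sketch below assumes the schedule $V_n = 1/\sqrt{n}$, which is one common choice and is not stated in the slides, and again uses a one-dimensional standard normal evaluated at x = 0.

```python
import math
import random

def shrinking_region_estimate(n, rng):
    """Estimate p(0) for a standard normal using a region of volume V_n = 1/sqrt(n)."""
    volume = 1.0 / math.sqrt(n)  # assumed schedule: V shrinks with n
    # Count samples inside [-V_n/2, V_n/2] without storing them.
    k = sum(1 for _ in range(n) if abs(rng.gauss(0.0, 1.0)) <= volume / 2)
    return volume, k, (k / n) / volume

rng = random.Random(2)  # arbitrary fixed seed
true_density = 1.0 / math.sqrt(2.0 * math.pi)

history = []
for n in (100, 10_000, 1_000_000):
    volume, k, estimate = shrinking_region_estimate(n, rng)
    history.append((n, volume, k, estimate))
    print(f"n={n:8d}  V={volume:.4f}  k={k:4d}  estimate={estimate:.4f}  true={true_density:.4f}")
```

Under this schedule V tends to zero while the expected count, roughly $n V_n p(x) = \sqrt{n}\,p(x)$, still grows, so both conditions hold at once and the estimate should settle near the true density for the largest n.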

$p(x) \approx \frac{k/n}{V}$
▶ In practice, we have to find a compromise for V:
  ○ Large enough to include enough examples within R
  ○ Small enough to support the assumption that p(x) is constant within R
▶ Two ways to calculate p(x):
  ○ Fix V and determine k from the data, giving rise to the kernel approach (for example the histogram and the Parzen window)
  ○ Fix k and determine V from the data, which gives rise to the k-nearest-neighbor approach
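The two strategies can be put side by side in one dimension. This is a sketch under stated assumptions: the data are standard normal, the fixed width 0.4 and the fixed count k = 400 are arbitrary, and the function names are hypothetical. The fixed-volume version uses a plain interval count, the simplest window, rather than a smooth kernel.

```python
import math
import random

def fixed_volume_estimate(samples, x, width):
    """Fix V = width, then count the k samples inside [x - width/2, x + width/2]."""
    n = len(samples)
    k = sum(1 for s in samples if abs(s - x) <= width / 2)
    return k / (n * width)

def fixed_k_estimate(samples, x, k):
    """Fix k, then take V as the length of the smallest interval around x holding k samples."""
    n = len(samples)
    distances = sorted(abs(s - x) for s in samples)
    radius = distances[k - 1]  # distance to the k-th nearest sample
    volume = 2.0 * radius      # length of [x - radius, x + radius]; assumes radius > 0
    return k / (n * volume)

rng = random.Random(3)  # arbitrary fixed seed
samples = [rng.gauss(0.0, 1.0) for _ in range(20000)]
true_density = 1.0 / math.sqrt(2.0 * math.pi)

by_volume = fixed_volume_estimate(samples, x=0.0, width=0.4)
by_neighbors = fixed_k_estimate(samples, x=0.0, k=400)
print(f"fixed V: {by_volume:.4f}  fixed k: {by_neighbors:.4f}  true: {true_density:.4f}")
```

Both functions evaluate the same ratio k/(nV); they differ only in which of k and V is chosen in advance and which is read off the data.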

9.2. Histogram Method
▶ A very simple method is to partition the space into a number of equally sized cells (bins) and compute a histogram.
Figure 1: Histogram in one dimension.
▶ The estimate of the density at a point x becomes
$p(x) = \frac{k}{NV}$
where N is the total number of samples, k is the number of samples in the cell that includes x, and V is the volume of that cell.
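The histogram estimate $p(x) = k/(NV)$ can be sketched for one-dimensional data as follows. The toy samples, the range [0, 2) and the four bins are made up for illustration, and `histogram_density` is a hypothetical helper name.

```python
def histogram_density(samples, x, low, high, num_bins):
    """Histogram estimate p(x) = k / (N * V) with equal-width bins on [low, high)."""
    width = (high - low) / num_bins  # V: the volume (length) of each cell
    if not (low <= x < high):
        return 0.0                   # x lies outside every cell
    target = int((x - low) / width)  # index of the cell containing x
    k = sum(
        1
        for s in samples
        if low <= s < high and int((s - low) / width) == target
    )
    return k / (len(samples) * width)

# Toy data, invented for this example: N = 8 samples on [0, 2).
samples = [0.1, 0.2, 0.25, 0.6, 0.7, 0.9, 1.4, 1.5]

# Cells are [0, 0.5), [0.5, 1), [1, 1.5), [1.5, 2); the first holds 3 samples.
density_a = histogram_density(samples, 0.3, low=0.0, high=2.0, num_bins=4)
density_b = histogram_density(samples, 1.2, low=0.0, high=2.0, num_bins=4)
print(density_a)  # 3 / (8 * 0.5) = 0.75
print(density_b)  # 1 / (8 * 0.5) = 0.25
```

Because every sample falls in exactly one cell, the estimated density multiplied by the cell volume and summed over all cells equals 1 whenever all samples lie inside [low, high).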
