Formal Definition of a Binary Linear Classifier: Functional Margin
• w: decision hyperplane normal vector (labeled a in the figure)
• b: intercept of the decision hyperplane
• x_i: data point i
• y_i: class of data point i (+1 or -1)
• Classifier is: f(x_i) = sign(w^T x_i + b)
• Functional margin of x_i is: y_i (w^T x_i + b)
• Functional margin of the dataset is twice the minimum functional margin over all points
• The factor of 2 comes from measuring the whole width of the margin
• Distance between two sets: dist(C, D) = inf{ ||u - v||_2 : u ∈ C, v ∈ D }
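As a concrete illustration, here is a minimal Python sketch of the functional-margin computation; the toy points, labels, w, and b are made-up values for illustration, not parameters learned from data.

```python
import numpy as np

# Hypothetical toy data: rows of X are points x_i, y holds labels in {+1, -1}.
X = np.array([[2.0, 2.0], [3.0, 3.0], [-1.0, -1.5], [-2.0, -1.0]])
y = np.array([1, 1, -1, -1])

# An assumed separating hyperplane w^T x + b = 0 (illustrative, not learned).
w = np.array([1.0, 1.0])
b = -0.5

# Functional margin of each point: y_i * (w^T x_i + b).
functional_margins = y * (X @ w + b)

# Functional margin of the dataset: twice the minimum over all points.
dataset_margin = 2 * functional_margins.min()
print(functional_margins, dataset_margin)
```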
Geometric Margin
• Distance from an example to the separator: r = y (w^T x + b) / ||w||, i.e., the length of the projection of x onto the normal vector w
• Examples closest to the hyperplane are support vectors
• Margin ρ of the separator is the width of separation between the support vectors of the two classes: the maximum width of the empty band that separates them

[Figure: point x, its foot x′ on the hyperplane, distance r, and margin ρ]

Derivation of r: the dotted line x′ − x is perpendicular to the decision boundary, so it is parallel to w. The unit vector is w/||w||, so the line is r w/||w||. Thus x′ = x − y r w/||w||, and x′ satisfies w^T x′ + b = 0. So w^T (x − y r w/||w||) + b = 0. Recall that ||w|| = sqrt(w^T w). Solving for r gives r = y (w^T x + b) / ||w||, where y is the class of the data point (+1 or -1).
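A quick numeric check of this derivation (Python; the point, label, w, and b are assumed illustrative values): projecting x onto the hyperplane via x′ = x − y r w/||w|| should land exactly on w^T x′ + b = 0.

```python
import numpy as np

# Hypothetical point, label, and hyperplane parameters (not from the figure).
x = np.array([3.0, 1.0])
y_label = 1
w = np.array([2.0, 1.0])
b = -4.0

norm_w = np.linalg.norm(w)           # ||w|| = sqrt(w^T w)

# Geometric distance from x to the hyperplane: r = y (w^T x + b) / ||w||.
r = y_label * (w @ x + b) / norm_w

# Foot of the perpendicular: x' = x - y r w/||w|| should lie on the hyperplane.
x_prime = x - y_label * r * w / norm_w
print(r, w @ x_prime + b)            # second value should be ~0
```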
Linear SVM Mathematically: The Linearly Separable Case
• Since the functional margin can be rescaled arbitrarily, for convenience in solving large-scale SVM problems we require every point to have functional margin at least 1, with at least one point achieving exactly 1.
• Assume that all data is at least distance 1 from the hyperplane; then the following two constraints follow for a training set {(x_i, y_i)}:
  w^T x_i + b ≥ 1 if y_i = 1
  w^T x_i + b ≤ -1 if y_i = -1
• For support vectors, the inequality becomes an equality. Then, since each example's distance from the hyperplane is r = y (w^T x + b) / ||w||, the margin is ρ = 2/||w||, as spelled out below.
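Spelling out the step from the equality constraint to the margin, using only the slide's definitions (a short LaTeX derivation):

```latex
\begin{align*}
% For a support vector the constraint is tight: y_i(w^T x_i + b) = 1,
% so its distance to the hyperplane is
r &= \frac{y_i(\mathbf{w}^\top \mathbf{x}_i + b)}{\lVert\mathbf{w}\rVert}
   = \frac{1}{\lVert\mathbf{w}\rVert}, \\
% and the full margin spans support vectors on both sides:
\rho &= 2r = \frac{2}{\lVert\mathbf{w}\rVert}.
\end{align*}
```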
Linear Support Vector Machine (SVM)
• Hyperplane: w^T x + b = 0
• Extra scale constraint: min_{i=1,…,n} |w^T x_i + b| = 1
• This implies, for two points x_a and x_b on the opposite margin hyperplanes along the normal direction (w^T x_a + b = 1, w^T x_b + b = -1): w^T (x_a − x_b) = 2, so ρ = ||x_a − x_b||_2 = 2/||w||_2
• Our goal is to maximize the separation margin ρ

[Figure: hyperplane w^T x + b = 0 with parallel planes w^T x_a + b = 1 and w^T x_b + b = -1 bounding the margin ρ]
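The claim ρ = 2/||w||_2 can be verified numerically. In this sketch, w, b, and x_b are assumed illustrative values chosen so the scale constraint holds, and x_a is constructed by moving from x_b along the normal.

```python
import numpy as np

# Assumed hyperplane parameters already satisfying the scale constraint
# min_i |w^T x_i + b| = 1 (illustrative values, not from the slides).
w = np.array([3.0, 4.0])
b = 1.0

# Pick any point x_b on the "-1" hyperplane: w^T x_b + b = -1.
x_b = np.array([0.0, -0.5])          # 3*0 + 4*(-0.5) + 1 = -1

# Moving from x_b along the normal by (2/||w||^2) * w reaches the "+1" plane,
# because then w^T (x_a - x_b) = 2.
x_a = x_b + 2 * w / (w @ w)

print(w @ x_a + b)                   # ~ +1
print(np.linalg.norm(x_a - x_b))     # margin rho = ||x_a - x_b||_2
print(2 / np.linalg.norm(w))         # same value: 2/||w||_2
```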
Linear SVMs Mathematically (cont.)
• Then we can formulate the quadratic optimization problem:
  Find w and b such that ρ = 2/||w|| is maximized, and for all {(x_i, y_i)}:
  w^T x_i + b ≥ 1 if y_i = 1; w^T x_i + b ≤ -1 if y_i = -1
• A better formulation (min ||w|| = max 1/||w||):
  Find w and b such that Φ(w) = ½ w^T w is minimized, and for all {(x_i, y_i)}: y_i (w^T x_i + b) ≥ 1
• This is a quadratic objective optimized under linear constraints, one of the basic problems in mathematics, and many algorithms exist for solving it (e.g., quadratic-programming libraries).
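For illustration, here is a minimal sketch that feeds this primal formulation to a general-purpose constrained solver (scipy's SLSQP) on made-up separable data; production SVM solvers instead use specialized QP algorithms, typically on the dual.

```python
import numpy as np
from scipy.optimize import minimize

# Hypothetical linearly separable toy data (not from the slides).
X = np.array([[2.0, 2.0], [3.0, 3.0], [-1.0, -1.0], [-2.0, -1.5]])
y = np.array([1.0, 1.0, -1.0, -1.0])

# Variables packed as z = [w_1, w_2, b]; objective Phi(w) = 1/2 w^T w.
def objective(z):
    w = z[:-1]
    return 0.5 * w @ w

# One inequality per point: y_i (w^T x_i + b) - 1 >= 0 (SLSQP's "ineq" means fun >= 0).
constraints = [
    {"type": "ineq", "fun": (lambda z, xi=xi, yi=yi: yi * (z[:-1] @ xi + z[-1]) - 1)}
    for xi, yi in zip(X, y)
]

res = minimize(objective, x0=np.zeros(3), method="SLSQP", constraints=constraints)
w, b = res.x[:-1], res.x[-1]
print(w, b, 2 / np.linalg.norm(w))   # maximal margin rho = 2/||w||
```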