当前位置：和泉文库 > 计算机 > 浏览文档

南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 05 Online Convex Optimization - OGD, convex functions, strongly convex functions, online Newton step, exp-concave functions

• Online Learning • Online Convex Optimization • Convex Functions • Strongly Convex Functions • Exp-concave Functions

文件格式：PDF，文件大小：20.81MB，售价：18.77元

文档详细内容（约84页）

Offline Towards Online Learning Traditional statistical machine learning The training data are available offline Learning model is trained based on the offline data in a batch setting Online learning scenario In real applications,data are in the form of stream New data are being collected all the time:after observing a new data point,the model should be online updated at a constant cost Advanced Optimization(Fall 2023) Lecture 5.Online Convex Optimization 6

Advanced Optimization (Fall 2023) Lecture 5. Online Convex Optimization 6 Offline Towards Online Learning • Traditional statistical machine learning • The training data are available offline • Learning model is trained based on the offline data in a batch setting • Online learning scenario • In real applications, data are in the form of stream • New data are being collected all the time: after observing a new data point, the model should be online updated at a constant cost

A Formulation of Online Learning We introduce a game-theoretic view to model online learning. Online learning is formulated as a repeated game between Player:essentially the learner,or you can think as the"learning model" Environments:an abstraction of all factors evaluating the model. At each round t =1,2,... (1)the player first picks a model wEW; (2)and simultaneously environments pick an online function f:w->R; (3)the player suffers loss f(wt),observes some information about fr and updates the model. Advanced Optimization(Fall 2023) Lecture 5.Online Convex Optimization 7

Advanced Optimization (Fall 2023) Lecture 5. Online Convex Optimization 7 A Formulation of Online Learning • We introduce a game-theoretic view to model online learning. • Online learning is formulated as a repeated game between • Player: essentially the learner, or you can think as the “learning model" • Environments: an abstraction of all factors evaluating the model

Online Learning:Formulation At each round t=1,2,... (1)the player first picks a model wW; (2)and simultaneously environments pick an online function fr:W->R; (3)the player suffers loss ft(wt),observes some information about ft and updates the model. ·An example of online function f:W→R. Considering the task of online classification,we have (i)the loss e:Jy×Jy→R,and fi(w)=l(h(w;x:),) (i)the hypothesis function h:W×X→). =e(w xt,Ut)for simplicity Advanced Optimization(Fall 2023) Lecture 5.Online Convex Optimization 8

Advanced Optimization (Fall 2023) Lecture 5. Online Convex Optimization 8 Online Learning: Formulation for simplicity • Considering the task of online classification, we have

Online Learning:Formulation At each round t=l,2,· (1)the player first picks a model wtW; (2)and simultaneously environments pick an online function fr:W->R; (3)the player suffers loss fi(wt),observes some information about fi and updates the model. Spam filtering (1)Player submits a spam classifier w ↓ (2)A mail is revealed whether it is a spam ☒ (3)Player suffers loss fi(w)and updates model Advanced Optimization(Fall 2023) Lecture 5.Online Convex Optimization 9

Advanced Optimization (Fall 2023) Lecture 5. Online Convex Optimization 9 Online Learning: Formulation Spam filtering

Applications spam detection(online classification/regression):At each timet=1,2,... ·receive an email xt∈R, ·predict whether it is a spam∈{-l,+li SPAM 。see its true label y∈{-1，+1l} aggregating weather prediction(the expert problem):At each day t=1,2,... obtain temperature predictions from N models; make the final prediction by randomly following a model according to the probability p∈△w; on the next day observe the loss of each model f0,1]N. Advanced Optimization(Fall 2023) Lecture 5.Online Convex Optimization 10

Advanced Optimization (Fall 2023) Lecture 5. Online Convex Optimization 10 Applications

点击进入文档下载页（PDF格式）

共84页，可试读20页，点击继续阅读 ↓↓

您可能感兴趣的文档

南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 04 GD Methods II - GD method, smooth optimization, Nesterov’s AGD, composite optimization
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 03 GD Methods I - GD method, Lipschitz optimization
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 02 Convex Optimization Basics; Function Properties
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 01 Introduction; Mathematical Background
南京大学：《数字图像处理》课程教学资源（课件讲义）11 图像特征分析
南京大学：《数字图像处理》课程教学资源（课件讲义）10 图像分割
南京大学：《数字图像处理》课程教学资源（课件讲义）09 形态学及其应用
南京大学：《数字图像处理》课程教学资源（课件讲义）08 压缩编码
南京大学：《数字图像处理》课程教学资源（课件讲义）07 频域滤波器
南京大学：《数字图像处理》课程教学资源（课件讲义）06 图像频域变换
南京大学：《数字图像处理》课程教学资源（课件讲义）05 代数运算与几何变换
南京大学：《数字图像处理》课程教学资源（课件讲义）04 图像复原及锐化
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 06 Prediction with Expert Advice - Hedge, minimax bound, lower bound; mirror descent（motivation and preliminary）
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 07 Online Mirror Descent - OMD framework, regret analysis, primal-dual view, mirror map, FTRL, dual averaging
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 08 Adaptive Online Convex Optimization - problem-dependent guarantee, small-loss bound, self-confident tuning, small-loss OCO, self-bounding property bound
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 09 Optimistic Online Mirror Descent - optimistic online learning, predictable sequence, small-loss bound, gradient-variance bound, gradient-variation bound
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 10 Online Learning in Games - two-player zero-sum games, repeated play, minimax theorem, fast convergence
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 11 Adversarial Bandits - MAB, IW estimator, Exp3, lower bound, BCO, gradient estimator, self-concordant barrier
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 12 Stochastic Bandits - MAB, UCB, linear bandits, self-normalized concentration, generalized linear bandits
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 13 Advanced Topics - non-stationary online learning, universal online learning, online ensemble, base algorithm, meta algorithm
南京大学：《组合数学》课程教学资源（课堂讲义）课程简介 Combinatorics Introduction（主讲：尹一通）
南京大学：《组合数学》课程教学资源（课堂讲义）基本计数 Basic enumeration
南京大学：《组合数学》课程教学资源（课堂讲义）生成函数 Generating functions
南京大学：《组合数学》课程教学资源（课堂讲义）筛法 Sieve methods

点击购买下载（PDF）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录