当前位置：和泉文库 > 计算机 > 浏览文档

南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 07 Online Mirror Descent - OMD framework, regret analysis, primal-dual view, mirror map, FTRL, dual averaging

• Algorithmic Framework • Regret Analysis • Interpretation from Primal-Dual View • Follow-the-Regularized Leader

文件格式：PDF，文件大小：13.11MB，售价：13.81元

文档详细内容（约59页）

Stability Lemma Lemma 2(Stability Lemma).Consider the following updates: x=arg minxex(g:x)+D (x,c) x'=arg minxex (g',x)+D (x,c) When the regularizer:XR is a X-strongly convex function with respect to norm,we have 入x-x'I≤g-gl*: Proof. x'-x,g-g〉≥(7(x)-7(x),x-x')(1) Vψ(x)-Vb(x),x-x)≥Ax-x12 (2) > λx-x≤(7b(x)-7b(x),x-x)≤x-x,g-g) <lx-x'll lg-g'll,(Holder's inequality) > λx-x'l≤g-gl 口 Advanced Optimization(Fall 2023) Lecture 7.Online Mirror Descent 16

Advanced Optimization (Fall 2023) Lecture 7. Online Mirror Descent 16 Stability Lemma (2) (1) (Hölder’s inequality) Proof

Proof of Mirror Descent Lemma Proof. fi(x:)-fi(u)<(Vfi(x:),xi-x+1)+Vfi(x:),x+1-u) term (a) term (b) We further introduce following lemma to analyze term(b). Lemma 3(Bregman Proximal Inequality).Let Y be a convex set in a Banach space B.Let f:X→R be a closed proper convex function onX.Given a convex regularizerψ：X→R, we denote its induced Bregman divergence by D(,).Then,any update of the form x+1 arg min {(gt,x)+D(x,x:)} x∈X satisfies the following inequality for any uX: (gt;x+1-u)<D(u,xt)-Do(u,xt+1)-D(xi+1;xi). Crucial for analysis of first-order optimization methods based on Bregman divergence. Advanced Optimization(Fall 2023) Lecture 7.Online Mirror Descent 17

Advanced Optimization (Fall 2023) Lecture 7. Online Mirror Descent 17 Proof of Mirror Descent Lemma Proof. We further introduce following lemma to analyze term (b). term (a) term (b) Crucial for analysis of first-order optimization methods based on Bregman divergence

Bregman Proximal Inequality Lemma 3(Bregman Proximal Inequality).The Bregman proximal update in the form of Xt+1=arg minxex {(gt,x)+Du(x,xt)}satisfies (gt,xt+1-u）≤Db(u,xt)-Db(u,xt+1）-Db(xt+1,xt). Proof.Recall that for any convex function f,we have the following first-order optimality condition: f(x)≤f(y)y∈X→Vf(x)T(y-x)≥0y∈X Therefore,for x+1=arg minxex {(gt,x)+D(x,x)},we have (gt+7(x+1)-7(xt),u-x+i〉≥0 holds for any u∈X. Advanced Optimization(Fall 2023) Lecture 7.Online Mirror Descent 18

Advanced Optimization (Fall 2023) Lecture 7. Online Mirror Descent 18 Bregman Proximal Inequality Proof

Bregman Proximal Inequality Lemma 3(Bregman Proximal Inequality).The Bregman proximal update in the form of x+1=arg minxex {(gt,x)+Du(x,x)}satisfies (gt,X:+1-u)<Du(u,xi)-Do(u,xt+1)-Du(xt+1;xi). Proof. (gt +Vu(x+1)-Vu(x),u-x+1)>0holds for any uE On the other hand,the right side of Lemma 3 is: Dv(u,xt)-Dv(u,xi+1)-Dv(Xt+i;xi) =-xt)-(V(xt),u)-)+(41)+(7(xt+1),u-xt+1〉》 -(x+1)+)+((xt),xt+1一x) =(V(xt+1)-7(xt),u-xt+1）. Rearranging the terms can finish the proof. 口 Advanced Optimization(Fall 2023) Lecture 7.Online Mirror Descent 19

Advanced Optimization (Fall 2023) Lecture 7. Online Mirror Descent 19 Bregman Proximal Inequality Proof

Proof of Mirror Descent Lemma Proof. f(x)-f(u)≤(Tf(xt),xt-xt+1）+(f(x),xt+1-u term (a) term (b) Lemma 2(Stability Lemma). 入x1-X2‖≤g1-g2l* tema)=W(x),x-x+)≤IVfx,服 (think of two updates:one for x with Vfi(x)and another one for x,with 0) Lemma 3(Bregman Proximal Inequality). (gi,Xi+1-u)<Du(u,xi)-Du(u,Xi+1)-Du(xi+1;xi) temb)≤L（Dw(u,x-Do(u,x+i）-Du(x+1,x (negative term,usually dropped; nt but sometimes highly useful) →ix)-i@≤D,(ux)-D,(ux+》+张IVx训2-D(x+,x刘 t Advanced Optimization(Fall 2023) Lecture 7.Online Mirror Descent 20

Advanced Optimization (Fall 2023) Lecture 7. Online Mirror Descent 20 Proof of Mirror Descent Lemma Proof. term (a) term (b) (negative term, usually dropped; but sometimes highly useful)

点击进入文档下载页（PDF格式）

共59页，试读已结束，阅读完整版请下载

您可能感兴趣的文档

南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 06 Prediction with Expert Advice - Hedge, minimax bound, lower bound; mirror descent（motivation and preliminary）
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 05 Online Convex Optimization - OGD, convex functions, strongly convex functions, online Newton step, exp-concave functions
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 04 GD Methods II - GD method, smooth optimization, Nesterov’s AGD, composite optimization
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 03 GD Methods I - GD method, Lipschitz optimization
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 02 Convex Optimization Basics; Function Properties
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 01 Introduction; Mathematical Background
南京大学：《数字图像处理》课程教学资源（课件讲义）11 图像特征分析
南京大学：《数字图像处理》课程教学资源（课件讲义）10 图像分割
南京大学：《数字图像处理》课程教学资源（课件讲义）09 形态学及其应用
南京大学：《数字图像处理》课程教学资源（课件讲义）08 压缩编码
南京大学：《数字图像处理》课程教学资源（课件讲义）07 频域滤波器
南京大学：《数字图像处理》课程教学资源（课件讲义）06 图像频域变换
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 08 Adaptive Online Convex Optimization - problem-dependent guarantee, small-loss bound, self-confident tuning, small-loss OCO, self-bounding property bound
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 09 Optimistic Online Mirror Descent - optimistic online learning, predictable sequence, small-loss bound, gradient-variance bound, gradient-variation bound
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 10 Online Learning in Games - two-player zero-sum games, repeated play, minimax theorem, fast convergence
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 11 Adversarial Bandits - MAB, IW estimator, Exp3, lower bound, BCO, gradient estimator, self-concordant barrier
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 12 Stochastic Bandits - MAB, UCB, linear bandits, self-normalized concentration, generalized linear bandits
南京大学：《高级优化 Advanced Optimization》课程教学资源（讲稿）Lecture 13 Advanced Topics - non-stationary online learning, universal online learning, online ensemble, base algorithm, meta algorithm
南京大学：《组合数学》课程教学资源（课堂讲义）课程简介 Combinatorics Introduction（主讲：尹一通）
南京大学：《组合数学》课程教学资源（课堂讲义）基本计数 Basic enumeration
南京大学：《组合数学》课程教学资源（课堂讲义）生成函数 Generating functions
南京大学：《组合数学》课程教学资源（课堂讲义）筛法 Sieve methods
南京大学：《组合数学》课程教学资源（课堂讲义）Cayley公式 Cayley's formula
南京大学：《组合数学》课程教学资源（课堂讲义）Pólya计数法 Pólya's theory of counting

点击购买下载（PDF）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录