Parallel and Distributed Stochastic Learning

To further improve learning scalability (speed):
- Parallel stochastic learning: one machine with multiple cores and a shared memory
- Distributed stochastic learning: a cluster with multiple machines

Key issue: cooperation
- Parallel stochastic learning: lock vs. lock-free (waiting cost and lock cost); see the sketch below
- Distributed stochastic learning: synchronous vs. asynchronous (waiting cost and communication cost)
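To make the lock vs. lock-free trade-off concrete, here is a minimal Python sketch (not from the slides; the least-squares objective, the constants, and the names sgd_worker_locked / sgd_worker_lock_free are illustrative assumptions) in which p threads apply stochastic gradient updates to a shared parameter vector. The locked worker pays waiting and lock costs on every step; the lock-free worker writes directly and tolerates overlapping updates.

    # Illustrative sketch: locked vs. lock-free updates of a shared parameter vector
    import threading
    import numpy as np

    n, d, p = 1000, 10, 4                  # examples, dimension, threads (assumed values)
    X = np.random.randn(n, d)
    y = X.dot(np.random.randn(d))
    w = np.zeros(d)                        # shared parameter vector in shared memory
    eta = 0.001                            # step size (assumed)
    lock = threading.Lock()

    def grad_i(u, i):
        # stochastic gradient of a least-squares loss on example i (illustrative objective)
        return (X[i].dot(u) - y[i]) * X[i]

    def sgd_worker_locked(steps):
        for _ in range(steps):
            i = np.random.randint(n)
            with lock:                     # waiting cost + lock cost: threads serialize here
                w[:] = w - eta * grad_i(w, i)

    def sgd_worker_lock_free(steps):
        for _ in range(steps):
            i = np.random.randint(n)
            w[:] = w - eta * grad_i(w, i)  # no lock: concurrent writes may overlap

    threads = [threading.Thread(target=sgd_worker_lock_free, args=(500,))
               for _ in range(p)]
    for th in threads:
        th.start()
    for th in threads:
        th.join()

The lock-free style of update is the one used by Hogwild!-type methods and by the AsySVRG algorithm presented later.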
Our Contributions

- Parallel stochastic learning: AsySVRG
  Fast Asynchronous Parallel Stochastic Gradient Descent: A Lock-Free Approach with Convergence Guarantee
- Distributed stochastic learning: SCOPE
  Scalable Composite Optimization for Learning
Outline

1. Introduction
2. AsySVRG
3. SCOPE
4. Conclusion
Motivation and Contribution

Motivation:
- Existing asynchronous parallel SGD: Hogwild! [Recht et al. 2011] and PASSCoDe [Hsieh, Yu, and Dhillon 2015]
- No parallel methods for SVRG
- Lock-free: empirically effective, but no theoretical proof

Contribution:
- A fast asynchronous method to parallelize SVRG, called AsySVRG
- A lock-free parallel strategy for both read and write
- Linear convergence rate with theoretical proof
- Outperforms Hogwild! in experiments

AsySVRG is the first lock-free parallel SGD method with theoretical proof of convergence.
AsySVRG: a multi-thread version of SVRG

Initialization: p threads; initialize w_0 and the step size η;
for t = 0, 1, 2, ... do
    u_0 = w_t;
    All threads compute the full gradient in parallel: ∇f(u_0) = (1/n) Σ_{i=1}^{n} ∇f_i(u_0);
    u = w_t;
    Each thread does:
    for m = 1 to M do
        Read the current value of u, denoted û, from the shared memory, and randomly pick an i from {1, ..., n};
        Compute the update vector: v̂ = ∇f_i(û) − ∇f_i(u_0) + ∇f(u_0);
        u ← u − η v̂;
    end for
    Take w_{t+1} to be the current value of u in the shared memory;
end for
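The loop above translates directly into a shared-memory program. Below is a minimal Python sketch of this multi-thread SVRG scheme, assuming a least-squares objective and illustrative constants (n, d, p, T, M, η); it is not the authors' implementation, and a practical lock-free version would use native threads with in-place atomic updates rather than Python threads sharing a numpy array.

    # Sketch of multi-thread SVRG on f(w) = (1/2n) * sum_i (x_i^T w - y_i)^2
    # (objective, constants, and the serial full-gradient step are assumptions)
    import threading
    import numpy as np

    n, d, p = 1000, 20, 4                   # examples, dimension, threads
    T, M, eta = 10, 2000, 0.01              # outer iters, inner iters per thread, step size
    X = np.random.randn(n, d)
    y = X.dot(np.random.randn(d))
    w = np.zeros(d)                         # w_t

    def grad_i(u, i):                       # stochastic gradient of f_i at u
        return (X[i].dot(u) - y[i]) * X[i]

    def full_grad(u):                       # full gradient of f at u
        return X.T.dot(X.dot(u) - y) / n

    for t in range(T):
        u0 = w.copy()                       # snapshot u_0 = w_t
        g0 = full_grad(u0)                  # ∇f(u_0) (computed serially in this sketch)
        u = w.copy()                        # shared iterate u, updated without locks

        def worker():
            for _ in range(M):
                u_hat = u.copy()            # read current u from shared memory (may be stale)
                i = np.random.randint(n)
                v_hat = grad_i(u_hat, i) - grad_i(u0, i) + g0
                u[:] = u - eta * v_hat      # lock-free write back to shared memory

        threads = [threading.Thread(target=worker) for _ in range(p)]
        for th in threads:
            th.start()
        for th in threads:
            th.join()
        w = u.copy()                        # w_{t+1} = current value of u

Note that each thread reads a possibly stale copy û of the shared iterate and writes its update back without any synchronization; this is exactly the lock-free read/write behavior that the convergence analysis of AsySVRG has to account for.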