• Multi-Armed Bandits • Explore-Then-Exploit • Upper Confidence Bound • Linear Bandits • LinUCB Algorithm • Generalized Linear Bandits • Advanced Topics
文件格式: PDF大小: 13.8MB页数: 50
• Two-player Zero-sum Games • Minimax Theorem • Repeated Play • Faster Convergence via Adaptivity
文件格式: PDF大小: 9.25MB页数: 33
• Optimistic Online Mirror Descent • A Unified Framework • Small-Loss bound • Gradient-Variance bound • Gradient-Variation bound
文件格式: PDF大小: 16.17MB页数: 66
• Algorithmic Framework • Regret Analysis • Interpretation from Primal-Dual View • Follow-the-Regularized Leader
文件格式: PDF大小: 13.11MB页数: 59
• Online Learning • Online Convex Optimization • Convex Functions • Strongly Convex Functions • Exp-concave Functions
文件格式: PDF大小: 20.81MB页数: 84
• GD for Smooth Optimization • Smooth and Convex Functions • Smooth and Strongly Convex Functions • Nesterov’s Accelerated GD • Extension to Composite Optimization
文件格式: PDF大小: 18.3MB页数: 74
©2025 mall.hezhiquan.com 和泉文库
帮助反馈侵权