Deep CNN in AlphaGo

Policy network p(a|s):
• Input: 19×19 board, 48 input channels
• Layer 1: 5×5 kernel, 192 filters
• Layers 2 to 12: 3×3 kernel, 192 filters
• Layer 13: 1×1 kernel, 1 filter

The value network v(s) has a similar architecture to the policy network (Silver et al., 2016).
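To make the layer list concrete, here is a minimal sketch of the policy network's shape in PyTorch (an assumption: the paper does not prescribe a framework, and the padding, ReLU activations, and softmax placement here are illustrative choices, not Silver et al.'s exact details):

```python
import torch
import torch.nn as nn

class PolicyNet(nn.Module):
    """Illustrative 13-layer CNN matching the slide's layer list."""
    def __init__(self, in_channels=48, filters=192):
        super().__init__()
        layers = [nn.Conv2d(in_channels, filters, 5, padding=2), nn.ReLU()]  # layer 1: 5x5, 192 filters
        for _ in range(11):                                                  # layers 2-12: 3x3, 192 filters
            layers += [nn.Conv2d(filters, filters, 3, padding=1), nn.ReLU()]
        layers.append(nn.Conv2d(filters, 1, 1))                              # layer 13: 1x1, 1 filter
        self.body = nn.Sequential(*layers)

    def forward(self, x):                        # x: (batch, 48, 19, 19)
        logits = self.body(x).flatten(1)         # (batch, 361) move logits
        return torch.softmax(logits, dim=1)      # p(a|s): distribution over the 361 points

probs = PolicyNet()(torch.zeros(1, 48, 19, 19))  # probs.shape == (1, 361)
```

The final 1×1 layer collapses the 192 feature maps into a single 19×19 map of per-point scores, which the softmax turns into the move distribution p(a|s).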
Sequence Modelling
• Why do we need RNNs?
• What are RNNs?
• RNN Extensions
• What can RNNs do?
Why do we need RNNs?

Limitations of standard neural networks (e.g., CNNs):
• They rely on the assumption of independence among the (training and test) examples.
  – After each data point is processed, the entire state of the network is lost.
• They rely on examples being vectors of fixed length.

We need to model data with temporal or sequential structure and varying lengths of inputs and outputs, e.g.:
– Frames from a video
– Snippets of audio
– Words pulled from sentences
What are RNNs?

Recurrent neural networks (RNNs) are connectionist models with the ability to selectively pass information across sequence steps while processing sequential data one element at a time. They allow a 'memory' of previous inputs to persist in the network's internal state, and thereby influence the network output.

[Diagram: inputs x feed into hidden units h(t), which feed back into themselves through a delay and forward into the outputs y(t).]

The simplest form of fully recurrent neural network is an MLP with the previous set of hidden unit activations feeding back into the network along with the inputs:

h(t) = f_H(W_IH x(t) + W_HH h(t−1))
y(t) = f_O(W_HO h(t))

where f_H and f_O are the activation functions for the hidden and output units, and W_IH, W_HH, and W_HO are connection weight matrices which are learnt by training.
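A minimal sketch of this recurrence in Python/NumPy (assumptions: tanh for f_H and an identity output for f_O are illustrative choices; the variable names mirror the slide's W_IH, W_HH, W_HO):

```python
import numpy as np

def rnn_step(x_t, h_prev, W_IH, W_HH, W_HO, f_H=np.tanh, f_O=lambda z: z):
    """One step of the recurrence:
    h(t) = f_H(W_IH x(t) + W_HH h(t-1)),  y(t) = f_O(W_HO h(t))."""
    h_t = f_H(W_IH @ x_t + W_HH @ h_prev)   # new hidden state from input + previous state
    y_t = f_O(W_HO @ h_t)                   # output read out from the hidden state
    return h_t, y_t
```

The hidden state h_t returned here is fed back in as h_prev at the next step, which is exactly what gives the network its 'memory' of previous inputs.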
What are RNNs?

• The recurrent network can be converted into a feed-forward network by unfolding it over time.

An unfolded recurrent network: each node represents a layer of network units at a single time step. The weighted connections from the input layer to the hidden layer are labelled 'w1', those from the hidden layer to itself (i.e. the recurrent weights) are labelled 'w2', and the hidden-to-output weights are labelled 'w3'. Note that the same weights are reused at every time step. Bias weights are omitted for clarity.
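Unfolding is then just a loop that reapplies the same weights at every time step; a minimal self-contained sketch (hypothetical sizes, with W_IH, W_HH, W_HO playing the roles of w1, w2, and w3 in the diagram):

```python
import numpy as np

def run_rnn(xs, W_IH, W_HH, W_HO):
    """Unfold the recurrence over a whole input sequence xs."""
    h = np.zeros(W_HH.shape[0])              # initial hidden state h(0)
    ys = []
    for x_t in xs:                           # one unfolded "layer" per time step
        h = np.tanh(W_IH @ x_t + W_HH @ h)   # w1 (W_IH) and w2 (W_HH) reused each step
        ys.append(W_HO @ h)                  # w3 (W_HO) reused each step
    return ys

rng = np.random.default_rng(0)
W_IH, W_HH, W_HO = rng.normal(size=(8, 4)), rng.normal(size=(8, 8)), rng.normal(size=(3, 8))
ys = run_rnn(rng.normal(size=(5, 4)), W_IH, W_HH, W_HO)  # 5 inputs -> 5 outputs
```

Because the weights are shared across steps, the unfolded network can process a sequence of any length with a fixed number of parameters, which addresses the fixed-length limitation noted earlier.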