Advanced Artificial Intelligence Lecture 7: Recurrent Neural Network
Outline
▪ Recurrent Neural Network
  ▪ Vanilla RNNs
  ▪ Some RNN Variants
  ▪ Backpropagation through time
  ▪ Gradient Vanishing / Exploding
▪ Long Short-term Memory
  ▪ LSTM Neuron
  ▪ Multiple-layer LSTM
  ▪ Backpropagation through time in LSTM
▪ Time-Series Prediction
Vanilla RNNs
▪ Sequential data
  So far, we have assumed that the data points (x, y) in a dataset are i.i.d. (independent and identically distributed)
  This does not hold in many applications
  Sequential data: data points come in order, and successive points may be dependent, e.g.,
    Letters in a word
    Words in a sentence/document
    Phonemes in a spoken word utterance
    Page clicks in a Web session
    Frames in a video, etc.
Vanilla RNNs
▪ Sequence Modeling
  How to model sequential data?
  Recurrent neural networks (vanilla RNNs): the output c(t) depends on x(1), ···, x(t)
  The output a(L,t) depends on the hidden activations (bias terms omitted):
    a(k,t) = act(z(k,t)) = act(U(k) a(k,t−1) + W(k) a(k−1,t))
  a(·,t) summarizes x(t), ···, x(1); earlier points are less important
Source of slide: https://www.youtube.com/watch?v=2btuy_-Fw3c&list=PLlPcwHqLqJDkVO0zHMqswX1jA9Xw7OSOK
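To make the recurrence concrete, below is a minimal NumPy sketch of a single-layer vanilla RNN forward pass. This is an illustration, not code from the lecture: it assumes tanh as the activation act, omits bias terms as the slide does, and the function name rnn_forward and all dimensions are hypothetical.

```python
import numpy as np

def rnn_forward(xs, U, W):
    """Run a single-layer vanilla RNN over a sequence.

    xs : list of input vectors x(1), ..., x(T)
    U  : recurrent weight matrix (hidden -> hidden)
    W  : input weight matrix (input -> hidden)
    Returns the hidden activations a(1), ..., a(T).
    """
    a = np.zeros(U.shape[0])      # a(0): initial hidden state
    activations = []
    for x in xs:
        z = U @ a + W @ x         # z(t) = U a(t-1) + W x(t)  (bias omitted)
        a = np.tanh(z)            # a(t) = act(z(t))
        activations.append(a)
    return activations

# Toy usage: T = 5 steps, 3-dim inputs, 4-dim hidden state
rng = np.random.default_rng(0)
xs = [rng.standard_normal(3) for _ in range(5)]
U = rng.standard_normal((4, 4)) * 0.1
W = rng.standard_normal((4, 3)) * 0.1
acts = rnn_forward(xs, U, W)
print(acts[-1])  # a(T) summarizes x(T), ..., x(1)
```

Note how a(T) is the only state carried forward, which is why it summarizes the whole prefix and why earlier inputs have progressively less influence on it.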
Vanilla RNNs
▪ Sequence Modeling
  a(k,t) = act(z(k,t)) = act(U(k) a(k,t−1) + W(k) a(k−1,t))
  Weights are shared across time instances (W(k))
  Assumes that the “transition functions” are time-invariant (U(k))
  Our goal is to learn U(k) and W(k) for k = 1, ···, L
Source of slide: https://www.youtube.com/watch?v=2btuy_-Fw3c&list=PLlPcwHqLqJDkVO0zHMqswX1jA9Xw7OSOK
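The sketch below shows how the same per-layer weights U(k) and W(k) are reused at every time step in a multi-layer vanilla RNN, which is exactly the weight sharing / time-invariance assumption above. Again this is illustrative rather than lecture code: tanh is assumed for act, a(0,t) is taken to be the input x(t), and the function name and sizes are hypothetical.

```python
import numpy as np

def deep_rnn_forward(xs, Us, Ws):
    """Multi-layer vanilla RNN: the same U(k), W(k) are applied at every
    time step t (time-invariant transitions), for layers k = 1, ..., L.

    xs : list of input vectors x(1), ..., x(T)  (treated as a(0,t))
    Us : list of L recurrent matrices U(1), ..., U(L)
    Ws : list of L input matrices     W(1), ..., W(L)
    Returns the top-layer activations a(L,1), ..., a(L,T).
    """
    L = len(Us)
    prev = [np.zeros(U.shape[0]) for U in Us]   # a(k,0) = 0 for all k
    top = []
    for x in xs:
        below = x                               # a(0,t) = x(t)
        for k in range(L):
            # a(k,t) = act(U(k) a(k,t-1) + W(k) a(k-1,t))
            prev[k] = np.tanh(Us[k] @ prev[k] + Ws[k] @ below)
            below = prev[k]
        top.append(prev[-1])
    return top

# Toy usage: L = 2 layers, 3-dim inputs, hidden size 4 in each layer
rng = np.random.default_rng(1)
xs = [rng.standard_normal(3) for _ in range(6)]
Us = [rng.standard_normal((4, 4)) * 0.1, rng.standard_normal((4, 4)) * 0.1]
Ws = [rng.standard_normal((4, 3)) * 0.1, rng.standard_normal((4, 4)) * 0.1]
print(deep_rnn_forward(xs, Us, Ws)[-1])  # a(L,T)
```

Only the 2L matrices in Us and Ws are learned, no matter how long the sequence is; this is what makes the parameter count independent of T.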