[Figure: Self-attention calculation — the Query is compared with each Key via a similarity function F(Q,K), the scores are normalized with SoftMax, and the resulting weights are used to sum the Values.]
Self-Attention • Calculation process: • Step 1: calculate the similarity between the query and each key to get the weights • Step 2: normalize the weights with SoftMax • Step 3: sum the weighted values to get the hidden state
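To make these three steps concrete, here is a minimal NumPy sketch. It assumes the dot product as the similarity function F(Q, K); the slide leaves the exact choice of F open, and all variable names are illustrative.

```python
import numpy as np

def softmax(x, axis=-1):
    # Step 2: normalize the scores so they sum to 1
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(query, keys, values):
    """Three-step attention as described on the slide.

    query:  (d,)      a single query vector
    keys:   (n, d)    one key per source position
    values: (n, d_v)  one value per source position
    """
    # Step 1: similarity F(Q, K) between the query and each key
    # (dot product is one common choice for F)
    scores = keys @ query            # shape (n,)
    # Step 2: SoftMax turns the scores into normalized weights
    weights = softmax(scores)        # shape (n,)
    # Step 3: weighted sum of the values gives the hidden state
    return weights @ values          # shape (d_v,)

# toy example: 4 keys/values of dimension 3
rng = np.random.default_rng(0)
q = rng.normal(size=3)
K = rng.normal(size=(4, 3))
V = rng.normal(size=(4, 3))
print(attention(q, K, V))
```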
Outline: 1. Self-attention 2. Transformer 3. Pre-training LM
Transformer • As of last week: recurrent models for (most) NLP! • Circa 2016, the de facto strategy in NLP was to encode sentences with a bidirectional LSTM (for example, the source sentence in a translation). • Define your output (parse, sentence, summary) as a sequence, and use an LSTM to generate (decode) it. • Use attention to allow flexible access to memory.
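As a rough illustration of that 2016-style recipe, the PyTorch sketch below wires together a bidirectional-LSTM encoder, a single attention-equipped LSTM decoder step, and a projection-based dot-product score. The dimensions, class names, and scoring function are illustrative assumptions, not the lecture's exact model.

```python
import torch
import torch.nn as nn

class BiLSTMEncoder(nn.Module):
    """Encode the source sentence with a bidirectional LSTM."""
    def __init__(self, vocab_size, emb_dim=64, hidden_dim=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden_dim, bidirectional=True, batch_first=True)

    def forward(self, src_ids):
        # outputs: (batch, src_len, 2 * hidden_dim) — the "memory" attention will read
        outputs, _ = self.lstm(self.embed(src_ids))
        return outputs

class AttentionDecoderStep(nn.Module):
    """One decoding step: attend over the encoder memory, then update an LSTM cell."""
    def __init__(self, vocab_size, emb_dim=64, hidden_dim=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.cell = nn.LSTMCell(emb_dim + 2 * hidden_dim, hidden_dim)
        self.attn_proj = nn.Linear(hidden_dim, 2 * hidden_dim)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, prev_token, state, memory):
        h, c = state
        # attention: score each encoder position against the current decoder state
        scores = torch.bmm(memory, self.attn_proj(h).unsqueeze(-1)).squeeze(-1)
        weights = torch.softmax(scores, dim=-1)                   # (batch, src_len)
        context = torch.bmm(weights.unsqueeze(1), memory).squeeze(1)
        # feed the previous token embedding plus the context into the LSTM cell
        h, c = self.cell(torch.cat([self.embed(prev_token), context], dim=-1), (h, c))
        return self.out(h), (h, c)

# toy usage with random token ids
vocab = 100
enc, dec = BiLSTMEncoder(vocab), AttentionDecoderStep(vocab)
src = torch.randint(0, vocab, (2, 7))                             # batch of 2, length 7
memory = enc(src)
state = (torch.zeros(2, 64), torch.zeros(2, 64))
logits, state = dec(torch.zeros(2, dtype=torch.long), state, memory)
print(logits.shape)   # (2, vocab)
```

At each decoding step the attention weights let the decoder read any encoder position directly, which is the "flexible access to memory" the slide refers to.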
Transformer • Today: same goals, different building blocks • Last week, we learned about sequence-to-sequence problems and encoder-decoder models