Transformer | Today: Same goals, different building blocks
• Last week, we learned about sequence-to-sequence problems and encoder-decoder models.
• Today, we're not trying to motivate entirely new ways of looking at problems.
• Instead, we're trying to find the best building blocks to plug into our models and enable broad progress.
[Timeline figure: 2014-2017, recurrence and lots of trial and error; 2021, ??????]
Transformer | Issues with recurrent models: Linear interaction distance
• RNNs are unrolled "left-to-right".
• This encodes linear locality: a useful heuristic!
• Nearby words often affect each other's meanings ("tasty pizza").
Transformer | Issues with recurrent models: Linear interaction distance
• RNNs are unrolled "left-to-right".
• This encodes linear locality: a useful heuristic!
• Nearby words often affect each other's meanings ("tasty pizza").
• Problem: RNNs take O(sequence length) steps for distant word pairs to interact.
[Figure: "The chef who ... was"; "chef" and "was" are O(sequence length) steps apart.]
Transformer | Issues with recurrent models: Linear interaction distance
• O(sequence length) steps for distant word pairs to interact means:
  • Hard to learn long-distance dependencies (because gradient problems!)
[Figure: "The chef who ... was"]
Transformer | Issues with recurrent models: Linear interaction distance
• O(sequence length) steps for distant word pairs to interact means:
  • Hard to learn long-distance dependencies (because gradient problems!)
  • Linear order of words is "baked in"; we already know linear order isn't the right way to think about sentences.
[Figure: "The chef who ... was"; info about "chef" has gone through O(sequence length) many layers!]
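As a concrete illustration of the gradient problem (a sketch under assumed settings, not material from the slides): if we backpropagate from the last hidden state of a vanilla RNN, the gradient with respect to early input positions typically shrinks, because the signal has to travel back through O(sequence length) tanh/matrix-multiply steps.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Illustrative sketch (assumed sizes): measure how strongly the final hidden state
# of a vanilla RNN still "feels" each input position, via gradient norms.
seq_len, d = 40, 32
cell = nn.RNNCell(d, d)

x = torch.randn(seq_len, d, requires_grad=True)
h = torch.zeros(1, d)
for t in range(seq_len):
    h = cell(x[t].unsqueeze(0), h)

# Gradient of the last hidden state with respect to every input position.
h.sum().backward()
grad_norms = x.grad.norm(dim=1)

# Typically the earliest positions receive far smaller gradients than the latest ones:
# the learning signal had to pass back through O(sequence length) recurrent steps.
print("grad norm at t=0:        ", grad_norms[0].item())
print("grad norm at t=seq_len-1:", grad_norms[-1].item())
```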