Pre-training LM
Pretraining decoders
It's natural to pretrain decoders as language models and then use them as generators, finetuning their $p_\theta(w_t \mid w_{1:t-1})$!
This is helpful in tasks where the output is a sequence with a vocabulary like that at pretraining time!
• Dialogue (context = dialogue history)
• Summarization (context = document)
$h_1, \dots, h_T = \mathrm{Decoder}(w_1, \dots, w_T)$
$w_t \sim A h_{t-1} + b$
Where $A$, $b$ were pretrained in the language model!
[Note how the linear layer has been pretrained.]
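A minimal PyTorch sketch of this idea, assuming a toy decoder-only Transformer: the class name DecoderLM, the hyperparameters, and the checkpoint path "pretrained_lm.pt" are illustrative assumptions, not the setup from the slides. The point it shows is that the linear output layer (the $A$, $b$ above) is part of the pretrained language model and is simply kept and finetuned when the decoder is reused as a generator.

```python
# Sketch (not from the slides): a decoder-only LM whose output layer A, b
# (lm_head) is pretrained with the language model and reused during finetuning.
import torch
import torch.nn as nn

class DecoderLM(nn.Module):
    def __init__(self, vocab_size=32000, d_model=512, n_layers=6, n_heads=8):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        # An encoder stack with a causal mask behaves as a decoder-only LM
        # (self-attention only, no cross-attention).
        self.decoder = nn.TransformerEncoder(layer, n_layers)
        # Linear output layer: logits_t = A h_{t-1} + b; A, b are learned
        # during LM pretraining and kept for downstream generation.
        self.lm_head = nn.Linear(d_model, vocab_size)

    def forward(self, tokens):  # tokens: (batch, T) of token ids
        T = tokens.size(1)
        causal_mask = nn.Transformer.generate_square_subsequent_mask(T)
        h = self.decoder(self.embed(tokens), mask=causal_mask)  # h_1, ..., h_T
        return self.lm_head(h)  # logits for the next token at every position

# Finetuning as a generator (e.g. summarization: context = document):
# load the pretrained weights, including lm_head, and keep training with
# the same next-token loss on (context, target) sequences.
model = DecoderLM()
# model.load_state_dict(torch.load("pretrained_lm.pt"))  # hypothetical checkpoint
tokens = torch.randint(0, 32000, (2, 16))  # toy batch of token ids
logits = model(tokens[:, :-1])             # predict w_2, ..., w_T
loss = nn.functional.cross_entropy(
    logits.reshape(-1, logits.size(-1)), tokens[:, 1:].reshape(-1)
)
loss.backward()  # finetune all parameters, A and b included
```

Because the objective and the output vocabulary are unchanged from pretraining, no new layers are needed for tasks like dialogue or summarization; only the training data switches to (context, target) sequences.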
Outline
1. Pre-training LM
2. GPT
3. BERT
4. T5
Pre-training LM
Generative Pretrained Transformer (GPT)
2018's GPT was a big success in pretraining a decoder!
Radford A, Narasimhan K, Salimans T, et al. Improving Language Understanding by Generative Pre-Training. 2018.