Word2vecWord2vecTomas Mikolov,et al.2013Awordvectorlearningframework麦通大学1.MikolovT,ChenK,CorradoG,etal.Efficientestimation of wordrepresentationsinvectorspace[J].arXivpreprintarXiv:1301.3781,2013
Word2vec • Tomáš Mikolov, et al. 2013 • A word vector learning framework Word2vec 1. Mikolov T, Chen K, Corrado G, et al. Efficient estimation of word representations in vector space[J]. arXiv preprint arXiv:1301.3781, 2013
Word2vecWord2vecTomas Mikolov, et al. 2013AwordvectorlearningframeworkBasicideaAlarge text corpus (corpus)交通大学1.MikolovT,ChenK,CorradoG,etal.Efficientestimation of wordrepresentationsinvectorspace[J].arXivpreprintarXiv:1301.3781,2013
Word2vec • Tomáš Mikolov, et al. 2013 • A word vector learning framework Word2vec • A large text corpus (corpus). Basic idea 1. Mikolov T, Chen K, Corrado G, et al. Efficient estimation of word representations in vector space[J]. arXiv preprint arXiv:1301.3781, 2013
Word2vecWord2vecTomas Mikolov, et al. 2013AwordvectorlearningframeworkBasicideaAlarge text corpus (corpus)Represent each word in a fixed-size vocabulary as a vector.交道大学1.MikolovT,ChenK,CorradoG,etal.Efficientestimationofwordrepresentationsinvectorspace[J].arXivpreprintarXiv:1301.3781,2013
Word2vec • Tomáš Mikolov, et al. 2013 • A word vector learning framework Word2vec • A large text corpus (corpus). • Represent each word in a fixed-size vocabulary as a vector. Basic idea 1. Mikolov T, Chen K, Corrado G, et al. Efficient estimation of word representations in vector space[J]. arXiv preprint arXiv:1301.3781, 2013
Word2vecWord2vecTomas Mikolov, et al. 2013AwordvectorlearningframeworkBasicideaAlarge text corpus (corpus),Representeachwordinafixed-sizevocabularyasavectorAt each position t in the text, there is a head word c and a context word o交道大学1.MikolovT,ChenK,CorradoG,etal.Efficientestimationofwordrepresentationsinvectorspace[J].arXivpreprintarXiv:1301.3781,2013
Word2vec • Tomáš Mikolov, et al. 2013 • A word vector learning framework Word2vec • A large text corpus (corpus). • Represent each word in a fixed-size vocabulary as a vector. • At each position t in the text, there is a head word c and a context word o. Basic idea 1. Mikolov T, Chen K, Corrado G, et al. Efficient estimation of word representations in vector space[J]. arXiv preprint arXiv:1301.3781, 2013
Word2vecWord2vecTomasMikolov,etal.2013AwordvectorlearningframeworkBasicideaA large text corpus (corpus)Representeachword inafixed-sizevocabularyasavectorAt each position t in the text, there is a head word c and a context word oUsing the word vector similarity of c and o, calculate the probability of o given通大c, and vice versa.1.MikolovT,ChenK,CorradoG,etal.Efficientestimationofwordrepresentationsinvectorspace[J].arXivpreprintarXiv:1301.3781,2013
Word2vec • Tomáš Mikolov, et al. 2013 • A word vector learning framework Word2vec • A large text corpus (corpus). • Represent each word in a fixed-size vocabulary as a vector. • At each position t in the text, there is a head word c and a context word o. • Using the word vector similarity of c and o, calculate the probability of o given c, and vice versa. Basic idea 1. Mikolov T, Chen K, Corrado G, et al. Efficient estimation of word representations in vector space[J]. arXiv preprint arXiv:1301.3781, 2013