Web Search and Mining (网络搜索和挖掘关键技术) course teaching resources (PPT lecture slides): Lecture 08, Scoring and results assembly

Document contents (about 48 slides)

Computing Scores in a Complete Search System
Efficient Ranking

Computing cosine scores

COSINESCORE(q)
  float Scores[N] = 0
  float Length[N]
  for each query term t
  do calculate w_{t,q} and fetch postings list for t
     for each pair (d, tf_{t,d}) in postings list
     do Scores[d] += w_{t,d} × w_{t,q}
  Read the array Length
  for each d
  do Scores[d] = Scores[d] / Length[d]
  return Top K components of Scores[]
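The CosineScore procedure on this slide can be sketched in Python roughly as follows. This is a minimal illustration, not the course's reference implementation: the function name `cosine_score`, the in-memory `postings`, `idf`, and `doc_lengths` dictionaries, and the log-tf times idf weighting are all assumptions made for the example. A sparse dictionary of accumulators stands in for the dense `Scores[N]` array.

```python
import heapq
import math
from collections import Counter, defaultdict


def cosine_score(query_terms, postings, idf, doc_lengths, k):
    """Term-at-a-time cosine scoring over an in-memory inverted index.

    postings:    term -> list of (doc_id, tf) pairs  (assumed layout)
    idf:         term -> inverse document frequency  (assumed precomputed)
    doc_lengths: doc_id -> Euclidean length of the document vector
    """
    scores = defaultdict(float)  # accumulators, only for docs that match a term
    for term, tf_q in Counter(query_terms).items():
        if term not in postings:
            continue
        # Assumed weighting: log-scaled tf times idf on the query side.
        w_tq = (1 + math.log10(tf_q)) * idf[term]
        for doc_id, tf_td in postings[term]:
            # Assumed weighting: log-scaled tf on the document side.
            w_td = 1 + math.log10(tf_td)
            scores[doc_id] += w_td * w_tq
    # Length-normalize each accumulated score.
    normalized = {d: s / doc_lengths[d] for d, s in scores.items()}
    # Return the K highest-scoring (doc_id, score) pairs.
    return heapq.nlargest(k, normalized.items(), key=lambda item: item[1])


# Toy index with made-up numbers, purely for illustration.
postings = {"car": [(0, 1), (1, 10)], "insurance": [(1, 1), (2, 1)]}
idf = {"car": 1.0, "insurance": 2.0}
doc_lengths = {0: 1.0, 1: 2.0, 2: 4.0}
result = cosine_score(["car", "insurance"], postings, idf, doc_lengths, k=2)
```

With this toy data, document 1 ranks first and document 0 second; document 2 matches only one term and has the largest length, so it falls outside the top two.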

Efficient cosine ranking
▪ Find the K docs in the collection "nearest" to the query, that is, the K largest query-doc cosines.
▪ Efficient ranking:
  ▪ Computing a single cosine efficiently.
  ▪ Choosing the K largest cosine values efficiently.
    ▪ Can we do this without computing all N cosines?
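Choosing the K largest cosine values does not require sorting every score. One common approach, sketched below under the assumption that scores are already available in a dictionary, keeps a min-heap of at most K entries, which costs roughly O(N log K) rather than the O(N log N) of a full sort. The function name `top_k` is hypothetical.

```python
import heapq


def top_k(scores, k):
    """Return the k largest (score, doc_id) pairs in descending score order."""
    heap = []  # min-heap holding the best k candidates seen so far
    for doc_id, score in scores.items():
        if len(heap) < k:
            heapq.heappush(heap, (score, doc_id))
        elif score > heap[0][0]:
            # The new score beats the weakest kept candidate: swap it in.
            heapq.heapreplace(heap, (score, doc_id))
    return sorted(heap, reverse=True)


# Made-up scores for illustration.
example_scores = {"d1": 0.2, "d2": 0.9, "d3": 0.5, "d4": 0.7}
best_two = top_k(example_scores, 2)
```

The heap root is always the weakest of the current candidates, so each new score needs only one comparison to decide whether it can enter the top K.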

Efficient cosine ranking
▪ What we're doing in effect: solving the K-nearest neighbor problem for a query vector.
▪ In general, we do not know how to do this efficiently for high-dimensional spaces.
▪ But it is solvable for short queries, and standard indexes support this well.
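For contrast, the brute-force form of the K-nearest-neighbor problem computes a cosine against every document vector. The sketch below assumes small dense vectors held in Python lists; `cosine` and `brute_force_knn` are hypothetical names. It is the baseline whose cost (all N cosines, each over the full vocabulary dimension) an inverted index avoids for short queries, because only documents that share a query term are ever touched.

```python
import heapq
import math


def cosine(u, v):
    """Cosine similarity of two equal-length dense vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def brute_force_knn(query_vec, doc_vecs, k):
    """Return indices of the k documents with the largest cosine to the query."""
    return heapq.nlargest(
        k, range(len(doc_vecs)), key=lambda d: cosine(query_vec, doc_vecs[d])
    )


# Four toy documents in a three-term vocabulary (made-up data).
docs = [[1, 0, 0], [0, 1, 0], [1, 1, 0], [0, 0, 1]]
nearest = brute_force_knn([1, 0, 0], docs, k=2)
```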

Special case: unweighted queries
▪ No weighting on query terms.
▪ Assume each query term occurs only once.
▪ Then for ranking, we don't need to normalize the query vector.
▪ Slight simplification of the algorithm from Lecture 7.
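A short derivation shows why the query normalization can be dropped. Assume, for illustration, that the query has m distinct terms, each with weight w_{t,q} = 1 (the symbol m is introduced here and does not appear on the slide):

```latex
\cos(\vec{q}, \vec{d})
  = \frac{\sum_{t \in q} w_{t,q}\, w_{t,d}}{\lVert \vec{q} \rVert \, \lVert \vec{d} \rVert}
  = \frac{1}{\sqrt{m}} \cdot \frac{\sum_{t \in q} w_{t,d}}{\lVert \vec{d} \rVert},
\qquad \lVert \vec{q} \rVert = \sqrt{m}.
```

The factor 1/\sqrt{m} is the same for every document, so ordering documents by \sum_{t \in q} w_{t,d} / \lVert \vec{d} \rVert gives the same ranking as ordering by the full cosine.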

Faster cosine: unweighted query

FASTCOSINESCORE(q)
  float Scores[N] = 0
  for each d
  do Initialize Length[d] to the length of doc d
  for each query term t
  do calculate w_{t,q} and fetch postings list for t
     for each pair (d, tf_{t,d}) in postings list
     do add wf_{t,d} to Scores[d]
  Read the array Length[d]
  for each d
  do Divide Scores[d] by Length[d]
  return Top K components of Scores[]

Figure 7.1 A faster algorithm for vector space scores.
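A possible Python rendering of FastCosineScore is sketched below. It assumes, for illustration only, that each posting already stores the document-side weight wf_{t,d} and that document lengths are precomputed; the name `fast_cosine_score` and the dictionary layout are hypothetical. Because every query term has weight 1, the inner loop is a plain addition with no multiplication.

```python
import heapq
from collections import defaultdict


def fast_cosine_score(query_terms, postings, doc_lengths, k):
    """Cosine ranking for an unweighted query (each term counted once).

    postings:    term -> list of (doc_id, wf_td) pairs  (assumed layout)
    doc_lengths: doc_id -> precomputed document vector length
    """
    scores = defaultdict(float)
    for term in set(query_terms):  # each query term contributes once
        for doc_id, wf_td in postings.get(term, []):
            scores[doc_id] += wf_td  # query weight is 1, so just add
    # Normalize by document length only; the query length is a constant factor.
    normalized = ((d, s / doc_lengths[d]) for d, s in scores.items())
    return heapq.nlargest(k, normalized, key=lambda item: item[1])


# Toy index with made-up weights.
fast_postings = {"car": [(0, 1.0), (1, 2.0)], "insurance": [(1, 1.0), (2, 1.0)]}
fast_lengths = {0: 1.0, 1: 2.0, 2: 4.0}
fast_result = fast_cosine_score(["car", "insurance"], fast_postings, fast_lengths, k=2)
```

The returned scores are not true cosines, since the query length is never divided out, but the resulting order matches the cosine ranking.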
