当前位置：和泉文库 > 计算机 > 浏览文档

《网络搜索和挖掘关键技术 Web Search and Mining》课程教学资源（PPT讲稿）Lecture 09 Evaluation

文件格式：PPT，文件大小：1.34MB，售价：11.2元

文档详细内容（约50页）

Evaluation Measures easuring user happiness Issue: who is the user we are trying to make happy? Depends on the setting Web engine User finds what they want and return to the engine Can measure rate of return users a User completes their task -search as a means, not end SeeRussellhttp://dmrussellgooglepages.com/jcdl-talk- June-2007-short. pdf e Commerce site: user finds what they want and buy Is it the end-user, or the e Commerce site, whose happiness we measure? Measure time to purchase, or fraction of searchers who become buyers?

Evaluation 6 Measuring user happiness ▪ Issue: who is the user we are trying to make happy? ▪ Depends on the setting ▪ Web engine: ▪ User finds what they want and return to the engine ▪ Can measure rate of return users ▪ User completes their task – search as a means, not end ▪ See Russell http://dmrussell.googlepages.com/JCDL-talkJune-2007-short.pdf ▪ eCommerce site: user finds what they want and buy ▪ Is it the end-user, or the eCommerce site, whose happiness we measure? ▪ Measure time to purchase, or fraction of searchers who become buyers? Measures

Evaluation Measures easuring user happiness Enterprise( company/govt/academic): Care about user productivity How much time do my users save when looking for information lany other criteria having to do with breadth of access secure access, etc

Evaluation 7 Measuring user happiness ▪ Enterprise (company/govt/academic): Care about “user productivity” ▪ How much time do my users save when looking for information? ▪ Many other criteria having to do with breadth of access, secure access, etc. Measures

Evaluation Measures Happiness: elusive to measure Most common proxy: relevance of search results But how do you measure relevance? We will detail a methodology here then examine Its IsSues Relevance measurement requires 3 elements: 1. a benchmark document collection 2. a benchmark suite of queries 3. a usually binary assessment of either relevant or Nonrelevant for each query and each document Some work on more-than- binary, but not the standard 8

Evaluation 8 Happiness: elusive to measure ▪ Most common proxy: relevance of search results ▪ But how do you measure relevance? ▪ We will detail a methodology here, then examine its issues ▪ Relevance measurement requires 3 elements: 1. A benchmark document collection 2. A benchmark suite of queries 3. A usually binary assessment of either Relevant or Nonrelevant for each query and each document ▪ Some work on more-than-binary, but not the standard Measures

Evaluation Measures Evaluating an iR system Note: the information need is translated into a quer Relevance is assessed relative to the information need not the query E. g. Information need /'m looking for information on whether drinking red wine is more effective at reducing your risk of heart attacks than white wine Query: wine red white heart attack effective You evaluate whether the doc addresses the information need not whether it has these words

Evaluation 9 Evaluating an IR system ▪ Note: the information need is translated into a query ▪ Relevance is assessed relative to the information need not the query ▪ E.g., Information need: I'm looking for information on whether drinking red wine is more effective at reducing your risk of heart attacks than white wine. ▪ Query: wine red white heart attack effective ▪ You evaluate whether the doc addresses the information need, not whether it has these words Measures

Evaluation Benchmarks Standard relevance benchmarks TREC -National Institute of standards and Technology nist) has run a large ir test bed for many years Reuters and other benchmark doc collections used Retrieval tasks" specified sometimes as queries Human experts mark, for each query and for each doc, Relevant or nonrelevant or at least for subset of docs that some system returned for that query

Evaluation 10 Standard relevance benchmarks ▪ TREC - National Institute of Standards and Technology (NIST) has run a large IR test bed for many years ▪ Reuters and other benchmark doc collections used ▪ “Retrieval tasks” specified ▪ sometimes as queries ▪ Human experts mark, for each query and for each doc, Relevant or Nonrelevant ▪ or at least for subset of docs that some system returned for that query Benchmarks

点击进入文档下载页（PPT格式）

共50页，可试读17页，点击继续阅读 ↓↓

您可能感兴趣的文档

长春工业大学：《网页设计与制作》课程教学资源（PPT课件）第5章 Div+CSS布局技术
合肥工业大学：《计算机网络技术》课程教学资源（PPT课件讲稿）第4章交换网的运行
山东大学软件学院：非线性规划（PPT讲稿）一维搜索方法
《并发控制技术》课程教学资源（PPT课件讲稿）第7章事务管理 transaction management
北京师范大学现代远程教育：《计算机应用基础》课程教学资源（PPT课件讲稿）第1章计算机常识（主讲：马秀麟）
南京大学：《面向对象技术 OOT》课程教学资源（PPT课件讲稿）面向对象的分析与设计简介 OOA & OOD：An introduction
中国科学技术大学：《计算机体系结构》课程教学资源（PPT课件讲稿）向量体系结构
中国科学技术大学：《现代密码学理论与实践》课程教学资源（PPT课件讲稿）第二部分公钥密码和散列函数第8章数论入门（苗付友）
《计算机网络技术》课程教学资源（PPT课件讲稿）第5章广域网
香港城市大学：Rank Aggregation in MetaSearch
Vitebi 译码
图形处理及多媒体应用（PPT课件讲稿）
上海交通大学：《计算机图形学 Computer Graphics》课程教学资源（PPT讲稿）CHAPTER 4 THE VISUALIZATION PIPELINE
香港中文大学：XML for Interoperable Digital Video Library
中国医科大学计算机中心：《虚拟现实与增强现实技术概论》课程教学资源（PPT课件讲稿）第3章虚拟现实系统的输出设备
同济大学：《大数据分析与数据挖掘 Big Data Analysis and Mining》课程教学资源（PPT课件讲稿）K-means & EM
北京大学：文本挖掘技术（PPT讲稿）文本分类 Text Categorization
《网页设计与制作》课程教学资源（PPT课件讲稿）第一章 HTML基础
清华大学：《计算机导论》课程电子教案（PPT教学课件）第1章计算机发展简史
《网络搜索和挖掘关键技术 Web Search and Mining》课程教学资源（PPT讲稿）Lecture 06 Index Compression
嵌入式交叉开发环境的建立（PPT实验讲稿）
西安交通大学：《微型计算机接口技术》课程教学资源（PPT课件讲稿）第五章输入/输出控制接口
《TCP/IP协议及其应用》课程教学资源（PPT课件讲稿）第3章 IP寻址与地址解析
中国医科大学：《计算机网络实用教程》课程教学资源（PPT讲稿）高速局域网技术、交换式局域网技术、虚拟局域网技术、主要的城域网技术

点击购买下载（PPT）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录