Unranked retrieval evaluation: Precision and Recall
▪ Precision: fraction of retrieved docs that are relevant = P(relevant|retrieved)
▪ Recall: fraction of relevant docs that are retrieved = P(retrieved|relevant)
▪ Precision P = tp/(tp + fp)
▪ Recall R = tp/(tp + fn)

                Relevant   Nonrelevant
Retrieved       tp         fp
Not Retrieved   fn         tn
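A minimal sketch of these two formulas, assuming the retrieved and relevant documents are given as sets of document IDs (the function name and the toy sets below are illustrative, not from the slides):

```python
def precision_recall(retrieved, relevant):
    """Compute unranked precision and recall from two sets of doc IDs."""
    tp = len(retrieved & relevant)   # relevant docs that were retrieved
    fp = len(retrieved - relevant)   # retrieved docs that are not relevant
    fn = len(relevant - retrieved)   # relevant docs that were missed
    precision = tp / (tp + fp) if retrieved else 0.0
    recall = tp / (tp + fn) if relevant else 0.0
    return precision, recall

# Toy example: 3 of the 4 retrieved docs are relevant; 2 relevant docs were missed.
retrieved = {1, 2, 3, 4}
relevant = {2, 3, 4, 5, 6}
print(precision_recall(retrieved, relevant))  # (0.75, 0.6)
```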
Should we instead use the accuracy measure for evaluation?
▪ Given a query, an engine classifies each doc as "Relevant" or "Nonrelevant"
▪ The accuracy of an engine: the fraction of these classifications that are correct
▪ Accuracy = (tp + tn) / (tp + fp + fn + tn)
▪ Accuracy is a commonly used evaluation measure in machine learning classification work
▪ Why is this not a very useful evaluation measure in IR?
Why not just use accuracy?
▪ How to build a 99.9999% accurate search engine on a low budget: answer every query with nothing.
  (Mock "Noodle.com" search box: Search for: [        ] → 0 matching results found.)
▪ People doing information retrieval want to find something and have a certain tolerance for junk.
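A worked example of why this happens, using hypothetical numbers for a heavily skewed collection (the counts are made up for illustration):

```python
# Hypothetical: a collection of 10,000,000 docs where only 10 are relevant
# to the query. An engine that retrieves nothing at all gets:
tp, fp = 0, 0          # nothing retrieved, so no true or false positives
fn = 10                # all 10 relevant docs are missed
tn = 10_000_000 - 10   # everything else is correctly "not retrieved"

accuracy = (tp + tn) / (tp + fp + fn + tn)
print(accuracy)        # 0.999999 -- looks superb, yet the engine is useless

# Recall is 0 and precision is undefined (0/0), which exposes the failure
# that accuracy hides on skewed data.
```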
Precision/Recall
▪ You can get high recall (but low precision) by retrieving all docs for all queries!
▪ Recall is a non-decreasing function of the number of docs retrieved (see the sketch below)
▪ In a good system, precision decreases as either the number of docs retrieved or recall increases
▪ This is not a theorem, but a result with strong empirical confirmation
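A small sketch of the trade-off, computing precision and recall as more and more of a toy ranked list is retrieved (the relevance judgments here are invented for illustration):

```python
# Hypothetical ranking: 1 marks a relevant doc, 0 a nonrelevant one.
ranking = [1, 1, 0, 1, 0, 0, 1, 0, 0, 0]
relevant_total = sum(ranking)  # 4 relevant docs in this toy collection

for k in range(1, len(ranking) + 1):
    retrieved = ranking[:k]
    tp = sum(retrieved)
    precision = tp / k
    recall = tp / relevant_total
    print(f"top-{k:2d}: P={precision:.2f}  R={recall:.2f}")

# Recall never drops as k grows (retrieving everything gives recall 1.0),
# while precision tends downward once nonrelevant docs dominate the
# retrieved set -- matching the empirical observation above.
```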
Difficulties in using precision/recall
▪ Should average over large document collection/query ensembles
▪ Need human relevance assessments
  ▪ People aren't reliable assessors
▪ Assessments have to be binary
  ▪ Nuanced assessments?
▪ Heavily skewed by collection/authorship
  ▪ Results may not translate from one domain to another