当前位置：和泉文库 > 计算机 > 浏览文档

《电子商务 E-business》阅读文献：Meeting user information needs in recommender systems

文件格式：PDF，文件大小：10.89MB，售价：50.1元

文档详细内容（约254页）

Content-Based Recommenders Content-based recommenders use information from the items themselves to generate recommendations. For example, in a research paper recommender, text extracted from the papers could be used to generate recommendations. Such recommenders use information retrieval and filtering algorithms to generate recommendations. A complete review of information retrieval(IR)and information filtering(IF)is beyond the scope of this dissertation, but a high-level overview helps place our research in context. We will first discuss the similarities and differences between IR and IF, review common models used in these systems, and finally discuss the relationship between IR, IF, and content based recommenders In their influential 1992 paper, Belkin and Croft provided a clear argument that information filtering(IF)and information retrieval (Ir) were much closer related than had been previously discussed in their respective communities [7]. In it, they argue that both IR and IF share the same five characteristics: A predefined representation and organization of documents, a representation of a user's current information state, a comparison step in which relevant documents are selected, an evaluation step where the user reviews the selected documents, and a possible iteration on the user's information state. The two key points are the user's information state, and the possibility of iterating on this state. In combined IR/IF systems, this information state must to be translated into a query that the system can parse. Such queries are comprised of keywords describing the user's need [5]. This implied translation could seriously affect the user's ability to find documents that meet her information need. Once given this translation of state, the iteration step becomes essential for helping users meet their information need. These problems also appear in recommender systems and we discuss them in this dissertation While there are many similarities between IR and IF, there are differences as well The differences between IR and IF can be expressed in a few salient points 15 Reproduced with permission of the copyright owner. Further reproduction prohibited without permission

space, where each dimension represents a word in the corpus. Similarities between documents can be computed using cosine similarity measures. One extremely popular version of this model weights the different word dimensions based on the frequency of the word in the document and in the corpus and is called TD/IDF similarity [127]. We will discuss using TF/IDF as an IR algorithm nside a content-based recommender in Chapter 3 3. The Probabilistic Model The probabilistic model uses probability calculations to determine the relevance of a query to different documents. For example, a Bayesian inference network can be used to model the relationships between documents in the corpus. When a query is presented to the system, the network ' propagates a signal depending on he probabilities between nodes, and returns the nodes representing the documents with highest probability of being related to the query [7]. We will discuss a different probabilistic model, the Naive Bayes Classifier, for use in recommender systems in Chapter 3 The relationship between IR/IF systems and content-based recommender systems is one of abstraction and purpose. Recommenders require elements of both kinds of systems, including the ability to search an existing corpus and streams of information make use of an existing user model, and return the most relevant information. In essence we argue a content-based recommender is equal to the abstraction above an ir or an IF system-an information processing system with the explicit goal of generating meaningful recommendations to users. This goal, we believe is a different goal from IR or IF systems. In an IR/IF system, the goal is to provide the most relevant documents to meet a user's information need, either from the corpus or from the incoming streams. In a recommender, the goal is the similar, but instead returning the most relevant documents for a user's information need, they return the most salient. That is, they should return not only relevant documents, but those which have the greatest impact on the user's Reproduced with permission of the copyright owner. Further reproduction prohibited without permission

perception of completing her information seeking task. This is a point we will discuss in detail when we present Human-Recommender Interaction theory in Chapter 6 Collaborative Filterin Collaborative filtering-based recommenders(CF)work by gathering the opinions of users about items in a domain( e.g. movie ratings) placing this information into a user-item ratings matrix. Algorithms then compare either rows(users)or columns (items )to predict values for empty entries in the matrix. The idea of using the opinions of others to generate item recommendations combined with the phrase"collaborative filtering"was first discussed by Goldberg et al [41]. Their vision was that the opinions of others could help a person manage email and electronic documents by providing opinions and annotations to each item, giving the current user extra information about each of the Items In 1994, Resnick et al. published"GroupLens: an open architecture for collaborative filtering of netnews "in which they proposed a k-nearest neighbor algorithm for generating recommendations based on user opinions of netnews news articles [120] This algorithm, now commonly referred to as User-based Collaborative Filtering, was the rst algorithm widely used in recommender systems. Herlocker et al. performed a detailed analysis and proposed several important modifications [55]. a more technical discussion will appear in Chapter 3. The first experiments in CF were in netnews news articles [120, 143], but this soon expanded into several other domains, such as: jokes [42], movies [55, 58, and music [135] The k-nearest neighbor algorithm is considered an instance-based machine learning algorithm used for instance classification [97] before becoming known as'the CF algorithm. Soon, other machine learning algorithms were also explored recommenders. The first to be tried were statistical methods such as Bayesian networks [13] and clustering methods [15, 147]. Just as important, these papers brought with them machine learning evaluation methodologies for evaluating recommendation quality, such as k-folding and leave-n-out, the implications of which will be discussed in detail later Reproduced with permission of the copyright owner. Further reproduction prohibited without permission

点击进入文档下载页（PDF格式）

共254页，可试读40页，点击继续阅读 ↓↓

您可能感兴趣的文档

点击购买下载（PDF）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录