
E-business reading: A Two-Level Learning Hierarchy of Concept Based Keyword Extraction for Tag Recommendations


Document content (about 305 pages)


In this stage, we collected the social tags that are potentially relevant for describing the input bookmarked document based on a set of related bookmarks. We assigned a weight to each tag capturing the strength of its contribution to the bookmark description. However, we realised that this measure is not enough for tag recommendation purposes, and that global metrics regarding the folksonomy graph, such as tag popularities and tag correlations, have to be taken into consideration.

4.4 Building the global social tag co-occurrence sub-graph

In the fourth stage (label 4 in Figure 1), we interconnect the social tags obtained in the previous stage through the co-occurrence values of each pair of tags. The co-occurrence of two tags $t_i$ and $t_j$ is usually defined in terms of the number of resources (bookmarks) that have been tagged with both $t_i$ and $t_j$. In this work, we make use of the asymmetric co-occurrence metric:

$$\mathrm{co}(t_i, t_j) = \frac{\#\{n : t_i \in \mathrm{tags}(b_n) \wedge t_j \in \mathrm{tags}(b_n)\}}{\#\{n : t_i \in \mathrm{tags}(b_n)\}},$$

which assigns different values for $\mathrm{co}(t_i, t_j)$ and $\mathrm{co}(t_j, t_i)$, dividing the number of resources tagged with the two tags by the number of resources tagged with one of them.

Computing the co-occurrence values for each pair of tags existing in a training dataset, we build a global graph where the vertices correspond to the available tags, and the edges link tags that co-occur within at least one resource. This graph is directed and weighted: each pair of co-occurring tags is linked by two edges whose weights are the asymmetric co-occurrence values of the tags.

We propose to exploit this global graph to interconnect the tags obtained in the previous stage, and extract the ones that are more related to the input bookmark. Specifically, we create a sub-graph where the vertices are the above tags, and the edges are the same as these tags have in the global co-occurrence graph.
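
To make the asymmetric metric concrete, the following minimal Python sketch counts tag co-occurrences over a toy set of bookmarks and builds the directed, weighted global graph as a dictionary of edges. The function name `cooccurrence_graph`, the dictionary representation, and the toy data are illustrative assumptions and do not come from the paper.

```python
from collections import defaultdict
from itertools import permutations


def cooccurrence_graph(bookmark_tags):
    """Return {(t_i, t_j): co(t_i, t_j)} for every ordered pair of co-occurring tags.

    bookmark_tags maps a bookmark id to the list of tags assigned to it
    (a hypothetical input format chosen for this sketch).
    """
    tag_count = defaultdict(int)   # number of bookmarks tagged with t_i
    pair_count = defaultdict(int)  # number of bookmarks tagged with both t_i and t_j
    for tags in bookmark_tags.values():
        unique = set(tags)         # a tag counts once per bookmark
        for t in unique:
            tag_count[t] += 1
        for ti, tj in permutations(unique, 2):
            pair_count[(ti, tj)] += 1
    # Asymmetric weight: shared bookmarks divided by bookmarks carrying t_i.
    return {(ti, tj): n / tag_count[ti] for (ti, tj), n in pair_count.items()}


# Toy folksonomy (assumed data, for illustration only).
bookmarks = {
    "b1": ["python", "programming"],
    "b2": ["python", "programming", "tutorial"],
    "b3": ["programming"],
    "b4": ["programming", "java"],
}
graph = cooccurrence_graph(bookmarks)
print(graph[("python", "programming")])  # every "python" bookmark also has "programming"
print(graph[("programming", "python")])  # only half of the "programming" bookmarks have "python"
```

The two printed values differ because the denominator depends on which tag comes first, which is exactly why each pair of co-occurring tags is linked by two edges.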
From this sub-graph, we remove those edges whose co-occurrence values $\mathrm{co}(t_i, t_j)$ are lower than the average co-occurrence value of the sub-graph vertices:

$$\mathrm{avg\_co}(b_n) = \frac{\sum_{i,j} \mathrm{co}(t_i, t_j)}{\#\{(i,j) : \mathrm{co}(t_i, t_j) > 0\}},$$

where $t_i$ and $t_j$ are the pairs of social tags related to the input bookmark $b_n$. Removing these edges, we aim to isolate (and later discard) "noise" tags that less frequently appear in bookmark annotations.

We hypothesise that vertices of the generated sub-graph that are most "strongly" connected with the rest of the vertices correspond to tags that should be recommended, assuming that high graph vertex centralities are associated with the most informative or representative vertices. In this context, it is important to note that related tags with high weights do not necessarily have to be the ones with highest vertex centralities in the co-occurrence sub-graph. We hypothesise that a combination
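
The pruning step described above can be sketched in Python as follows. The excerpt does not state which vertex centrality measure is used, so the sketch substitutes a simple stand-in (the sum of the weights of the edges that survive pruning); the function names, the edge-dictionary format, and the toy values are all assumptions made for illustration.

```python
def prune_subgraph(global_graph, candidate_tags):
    """Restrict the global graph to the candidate tags, then drop edges
    whose weight is lower than the average weight of the sub-graph edges."""
    candidates = set(candidate_tags)
    sub = {(ti, tj): w for (ti, tj), w in global_graph.items()
           if ti in candidates and tj in candidates}
    if not sub:
        return {}, 0.0
    # Every stored edge has co > 0, so len(sub) is the number of pairs with co > 0.
    avg_co = sum(sub.values()) / len(sub)
    kept = {edge: w for edge, w in sub.items() if w >= avg_co}
    return kept, avg_co


def weighted_degree(edges, tags):
    """Stand-in centrality (an assumption): total weight of incident edges."""
    score = {t: 0.0 for t in tags}
    for (ti, tj), w in edges.items():
        score[ti] += w
        score[tj] += w
    return score


# Toy global graph (assumed values, for illustration only).
global_graph = {
    ("python", "programming"): 1.0, ("programming", "python"): 0.5,
    ("python", "tutorial"): 0.5, ("tutorial", "python"): 1.0,
    ("programming", "tutorial"): 0.25, ("tutorial", "programming"): 1.0,
    ("programming", "java"): 0.25, ("java", "programming"): 1.0,
    ("misc", "python"): 0.2, ("python", "misc"): 0.1,
}
candidates = ["python", "programming", "tutorial", "misc"]

kept, avg_co = prune_subgraph(global_graph, candidates)
scores = weighted_degree(kept, candidates)
for tag in sorted(scores, key=scores.get, reverse=True):
    print(tag, scores[tag])
```

In this toy example the weakly connected tag "misc" loses all of its edges and ends up isolated with a score of zero, which is the kind of "noise" tag the pruning step is meant to expose.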
