中国科学技术大学学报 ›› 2014, Vol. 44 ›› Issue (7): 576-581.DOI: 10.3969/j.issn.0253-2778.2014.07.006

• 论著 • 上一篇    下一篇

一种融合PageRank的协同过滤帖子推荐方法

曹阳,刘松,郭剑毅,余正涛,周枫,毛存礼   

  1. 1.昆明理工大学信息工程与自动化学院,云南昆明 650504; 2.昆明理工大学智能信息处理重点实验室;云南昆明 650504
  • 收稿日期:2014-03-21 修回日期:2014-06-15 接受日期:2014-06-15 出版日期:2023-05-11 发布日期:2014-06-15
  • 通讯作者: 郭剑毅
  • 作者简介:曹阳,男,1987年生,硕士生. 研究方向:数据挖掘,机器学习. E-mail:724728777@qq.com
  • 基金资助:
    国家自然科学基金(61175068),云南省教育厅基金重大专项项目(KKJI201203001)资助.

A posts recommendation method based on the collaborative filtering and PageRank

CAO Yang, LIU Song, GUO Jianyi, YU Zhengtao, ZHOU Feng, MAO Cunli   

  1. 1.School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, China; 2.Key Laboratory of Intelligent Information Processing Kunming University of Science and Technology, Kunming 650504, China
  • Received:2014-03-21 Revised:2014-06-15 Accepted:2014-06-15 Online:2023-05-11 Published:2014-06-15

摘要: 针对贴吧用户面临严重的信息过载问题,提出一种基于用户信息的协同过滤帖子推荐方法.分析帖子推荐的属性特点后,首先利用一个融合了用户评论行为的PageRank算法去判断参与一个帖子讨论中各用户的重要性,主要考虑各用户之间的回复关系以及各用户之间回复的时间关系;然后把PageRank得分高的用户作为聚类中心进行k-means聚类;最后把帖子中聚类得到的用户与推荐系统使用者通过协同过滤算法计算相似度,并结合用户的PageRank得分,选择与用户相关度较高的帖子作为推荐结果.实验结果表明,该模型比现在使用的热门帖子推荐有着更好的表现.

关键词: 帖子推荐, PageRank, 协同过滤, 百度贴吧

Abstract: In order to solve the problem of information overload in the post bar, a method of information filtration was proposed based on the users commenting behavior. After analyzing the properties of the recommended posts, the importance of an individual user was evaluated by the PageRank algorithm, in which the weight of replies to the posts among users and the weight of reply intervals were taken into consideration. The users with a high PageRank score were then taken as a cluster center in k-means clustering. The similarity between two groups of users (one from the clustering analysis and the other from the recommending system) was calculated by a collaborative filtering algorithm. The posts with high correlations to the users were presented as the recommended results. Experimental results show that the proposed method performs better than the recommending methods in use.

Key words: topics recommendations, PageRank, collaborative filtering, Baidu Post Bar

中图分类号: