A Social Network Collaborative Filtering Algorithm for Improving Recommendation Accuracy and Easing the Cold Start Problem

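"A Collaborative Filtering Algorithm Based on Social Network Information" is a research paper that examines the limitations of traditional recommendation systems. Traditional collaborative filtering is hampered by rating-matrix sparsity and by the new-user ("cold start") problem, both of which reduce recommendation accuracy. To address this, the paper incorporates neighborhood relationships and user keywords from a social network into the collaborative filtering process. The authors, Rui Wang, Bailing Wang, and Junheng Huang, are from the Department of Computer Science and Technology at Harbin Institute of Technology at Weihai. The core contribution is an extension, from two aspects, of how the TOP N (most relevant) neighbors are computed. First, user relationships in the social network give the algorithm additional evidence of similarity between users, which increases the diversity of recommendations and improves accuracy. Second, the keywords attached to a user in the social network help reveal that user's interests, which eases the cold start problem: a new user with no behavioral history can still receive personalized suggestions. The experiments use the KDD 2012 data set, and the reported results indicate that the method outperforms traditional collaborative filtering. The paper therefore points to social network information as a way past the sparsity and cold start limits of traditional recommendation systems.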

A Collaborative Filtering Algorithm Based on Social Network Information

Rui Wang
Department of Computer Science and Technology
Harbin Institute of Technology at Weihai
Weihai, China
rena_wang521@163.com

Bailing Wang
Department of Computer Science and Technology
Harbin Institute of Technology at Weihai
Weihai, China
wbl@hit.edu.cn

Junheng Huang
Department of Computer Science and Technology
Harbin Institute of Technology at Weihai
Weihai, China
hithjh@163.com

Abstract: In traditional collaborative filtering recommendation, matrix sparsity and cold start restrict the accuracy of the system. In this paper, we develop a way to enhance recommendation effectiveness by merging neighborhood relationships and users' keywords from social network information into collaborative filtering. We extend, from two aspects, the calculation of the TOP N neighbors, which is the most important step. Our method expands the information available to collaborative filtering, improves recommendation accuracy, and eases the cold start problem in recommendation systems. We conduct experiments on the KDD 2012 real data set. The results indicate that our algorithm outperforms the traditional collaborative filtering algorithm.

Keywords: social network; recommendation system; data mining; collaborative filtering

I. INTRODUCTION

With the exponential growth of Internet data, human society has entered the age of big data, and it is increasingly difficult for users to find the information they need in such a huge body of data. Traditional search engines return the same ranked results to different users, but users want personalized recommendations that match their own preferences. Researchers have proposed a variety of recommendation algorithms and built corresponding personalized recommendation systems, some of which have been applied successfully in industry. Among these approaches, collaborative filtering (CF) has become the most popular because of its ease of implementation and good extensibility [1]. To predict user u’s preference for item i, CF first finds the user set N whose members share similar rating behavior with u according to past rating records, and then estimates u’s preference for i from the preferences for i of all users in N [2]. CF has been applied successfully on Amazon and other sites; however, it has several problems: (1) Data sparsity. The rating matrix formed by users’ ratings of items is very sparse in most cases, so a user’s neighborhood cannot be calculated correctly and effectively. (2) Cold start. Because a new user has no rating records, that user’s neighborhood cannot be calculated, and the user cannot receive effective recommendations. (3) CF calculates the neighborhood on the basis of similar interests alone, so it cannot distinguish a friend from a stranger who has similar interests.
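The neighborhood-based prediction described above can be illustrated with a minimal sketch. It assumes cosine similarity over co-rated items and a similarity-weighted average of neighbor ratings, which are common choices but are not confirmed by this excerpt as the ones the paper uses. The toy `ratings` data and all function names are hypothetical.

```python
from math import sqrt

# Hypothetical toy ratings: user -> {item: rating}. Not taken from the paper.
ratings = {
    "u1": {"i1": 5.0, "i2": 3.0, "i3": 4.0},
    "u2": {"i1": 4.0, "i2": 3.0, "i3": 5.0, "i4": 4.0},
    "u3": {"i1": 1.0, "i2": 5.0, "i4": 2.0},
    "u4": {},  # new user with no rating history (cold start)
}


def cosine_similarity(a, b):
    """Cosine similarity restricted to the items both users have rated."""
    common = set(a) & set(b)
    if not common:
        return 0.0
    dot = sum(a[i] * b[i] for i in common)
    norm_a = sqrt(sum(a[i] ** 2 for i in common))
    norm_b = sqrt(sum(b[i] ** 2 for i in common))
    return dot / (norm_a * norm_b)


def top_n_neighbors(user, item, n=2):
    """Up to n users most similar to `user` among those who rated `item`."""
    scored = [
        (cosine_similarity(ratings[user], ratings[v]), v)
        for v in ratings
        if v != user and item in ratings[v]
    ]
    # Users with zero similarity carry no signal, so they are dropped.
    scored = [(s, v) for s, v in scored if s > 0.0]
    return sorted(scored, reverse=True)[:n]


def predict(user, item, n=2):
    """Similarity-weighted average of neighbor ratings, or None."""
    neighbors = top_n_neighbors(user, item, n)
    if not neighbors:
        return None  # no usable neighborhood: sparsity or cold start
    total = sum(s for s, _ in neighbors)
    return sum(s * ratings[v][item] for s, v in neighbors) / total


print(predict("u1", "i4"))  # a value between the neighbors' ratings of i4
print(predict("u4", "i1"))  # None: u4 has no ratings to compare
```

In this sketch the cold start problem shows up directly: `u4` shares no rated items with anyone, so every similarity is zero and no prediction can be made from ratings alone.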

Previous recommendation systems all rest on the hypothesis that users are independent and identically distributed. In practice, however, people often ask their friends for advice on some questions, and that advice plays an important role in the final decision. The rapid development of social networking services (SNS), represented by Facebook, Twitter, and Tencent, provides a great social platform for communication. Friend relationships and user-related information in SNS give a recommendation system more information to work with. Recently, the use of social network information to improve recommendation performance has attracted considerable attention from scholars [3]-[6].

This paper proposes a collaborative filtering algorithm that combines neighborhood relationships and users’ tags in a social network, and extends the key problem in the algorithm, the “user’s neighborhood”, from two aspects. The algorithm expands the information available to CF, improves recommendation accuracy, and eases the cold start problem in recommendation systems. We conducted experiments on the KDD 2012 real data set. The results indicate that our algorithm outperforms the traditional CF algorithm.
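To make the idea of extending the user's neighborhood concrete, the following is a rough sketch of one way rating similarity, friend relationships, and keyword overlap could be blended into a single neighbor score. This excerpt does not give the paper's actual formulas, so everything here is an assumption made for illustration: the linear blend, the weights `alpha`, `beta`, and `gamma`, the use of Jaccard overlap for keywords, and all names and data.

```python
def jaccard(a, b):
    """Jaccard overlap of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def blended_similarity(u, v, rating_sim, follows, keywords,
                       alpha=0.5, beta=0.3, gamma=0.2):
    """Hypothetical linear mix of rating, social, and keyword signals."""
    social = 1.0 if v in follows.get(u, set()) else 0.0
    keyword = jaccard(keywords.get(u, set()), keywords.get(v, set()))
    return alpha * rating_sim.get((u, v), 0.0) + beta * social + gamma * keyword


def social_top_n(u, candidates, n, **signals):
    """Up to n candidates with the highest positive blended score."""
    scored = [(blended_similarity(u, v, **signals), v)
              for v in candidates if v != u]
    return [v for s, v in sorted(scored, reverse=True)[:n] if s > 0.0]


# Hypothetical toy data: "new" has no ratings, only social information.
rating_sim = {("u1", "u2"): 0.9}
follows = {"new": {"u3"}, "u1": {"u2"}}
keywords = {
    "new": {"music", "film"},
    "u1": {"music"},
    "u2": {"sports"},
    "u3": {"film", "travel"},
}

print(social_top_n("new", ["u1", "u2", "u3"], 2,
                   rating_sim=rating_sim, follows=follows, keywords=keywords))
# → ['u3', 'u1']
```

Under these assumptions, a user with no rating history still obtains a non-empty neighborhood from followed users and shared keywords, which is the sense in which social information can ease cold start.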

II. RELATED WORK

A. Preliminaries

The most fundamental elements in a recommendation system are the user set and the item set. We denote the user set by $S_u = \{u_1, u_2, \ldots, u_m\}$, where $m$ is the number of users, and the item set by $S_i = \{i_1, i_2, \ldots, i_n\}$, where $n$ is the number of items.
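As a small illustration of these two sets, the sketch below stores only the observed ratings and measures how sparse the implied m-by-n rating matrix is. The data, the `(user, item)` dictionary layout, and the helper names are assumptions made for the example, not structures defined in the paper.

```python
users = ["u1", "u2", "u3", "u4"]          # S_u, so m = 4
items = ["i1", "i2", "i3", "i4", "i5"]    # S_i, so n = 5

# Only observed ratings are stored: (user, item) -> rating.
observed = {
    ("u1", "i1"): 5.0,
    ("u1", "i3"): 3.0,
    ("u2", "i2"): 4.0,
    ("u3", "i5"): 2.0,
}


def sparsity(observed, users, items):
    """Fraction of the m x n rating matrix that has no rating."""
    return 1.0 - len(observed) / (len(users) * len(items))


def cold_start_users(observed, users):
    """Users who have not rated any item."""
    rated = {u for u, _ in observed}
    return [u for u in users if u not in rated]


print(sparsity(observed, users, items))   # 4 of 20 cells filled
print(cold_start_users(observed, users))  # → ['u4']
```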
