经济策略：PPRank——预算下最大化社交网络影响力选取

132 浏览量更新于2024-08-26 收藏 1.09MB PDF 举报

PPRank是一项针对社交网络中影响力最大化的经济性种子用户选择策略的研究论文。传统上，影响最大化问题主要关注如何在有限的预算内选取k个（通常固定数量）个体，即所谓的“种子”，来触发大规模的行为传播。然而，大部分现有工作并未考虑个体间的差异，即每个人对新行为的易受影响程度不同，这可能导致说服某些种子用户接受新行为的成本不一。论文提出了一种创新的启发式算法——价格性能比(Popularity Performance Ratio, PPRank)，旨在解决这一问题。PPRank的核心思想是经济地分配预算，同时最大化传播过程。该算法的主要贡献包括： 1. 理论框架：首先，它引入了一个新的视角，考虑了每个个体不同的影响力易感度。这与以往仅依赖于数量选择的假设形成对比，强调了个体特性在选择决策中的重要性。 2. 经济性设计：PPRank通过计算每个个体的“价格”（即影响其采纳的成本）和“性能”（即其潜在的扩散效果），构建了一个经济性评价体系。这使得算法能够根据每个用户的性价比进行选择，优先挑选那些成本效益最高的种子用户。 3. 优化算法：算法采用了迭代方法，通过动态调整预算分配和种子用户的选择，以逐步提高整体的影响力。这种方法不仅考虑了预算约束，还试图最大化传播的范围和深度。 4. 实证分析：论文提供了详尽的实验分析，通过在各种社交网络模型和真实数据集上的测试，验证了PPRank在实际应用中的有效性和效率。结果表明，相比于传统的种子选择方法，PPRank能够在有限预算下获得更好的影响力扩散效果。 5. 未来方向：最后，作者指出PPRank还有进一步扩展的可能性，如结合更复杂的网络结构、动态变化的环境等因素，以实现更精准和灵活的影响最大化策略。综上，PPRank为社交网络中的影响力最大化问题提供了一个经济、高效且个性化的解决方案，有望在实际商业推广和公共信息传播中发挥重要作用。

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.

WANG et al.: SELECTING INITIAL USERS FOR INFLUENCE MAXIMIZATION IN SOCIAL NETWORKS 3

the strength with which node i governs node j (i.e., node j is

inﬂuenced by a neighbor node i according to a weighted value

i,j

); w

i,j

and w

j,i

are generally different from each other.

Basically, there are two popular operational diffusion models

in the literature that capture underlying dynamics of diffusion

process: general threshold model and cascade model.

In the general threshold model, consider a potential node

i, represent its incoming neighbors by set N

, and assume



j∈N

j,i

≤ 1; then, the decision of node i to become active

depends on a threshold function (f

) of the set of active neigh-

bors of i and a threshold (called θ

) chosen uniformly at random

by node i from the interval [0,1]. The threshold θ

represents the

weighted fraction of node i’s neighbors that must become active

in order for the node i to become active. The process runs until

no more active users occur.

The cascade model is a type of probabilistic models in which

a user “catches” a speciﬁc behavior from her friends. It starts

with an initial set of active nodes A

, and the process unfolds

in discrete steps according to the following randomized rule.

At time step t, when there exists a directed (or undirected)

edge (i, j), such that i is active and j is not, node i is given

a single chance to activate node j (this activation succeeds with

some probability that might depend on properties of nodes i

and j and/or on the set of nodes that have already tried and

failed to activate j.); If i succeeds, then j will become active in

step (t +1), but whether i succeeds, it cannot make any further

attempts to activate j in subsequent rounds. Again, the process

runs until no more activations are possible. It should be noted

that, while the cascade model seems syntactically different from

the general threshold model, these two models are, in fact,

semantically equivalent [14].

In our paper, we utilize WC diffusion model for the problem

of inﬂuence maximization. Speciﬁcally, it can be described as

follows: assuming d

to be the indegree of node j in social

graph G and (i, j) an edge in G,ifi is activated in round t,

then with probability 1/d

, j is activated by i in round (t +1).

As described previously, due to the huge computational

overhead in greedy-based schemes, we seek to propose a new

heuristic method that could be cost-effective and achieve in-

ﬂuential range as large as possible. There exist many heuristic

schemes to solve the problem of seed selection. A degree dis-

count heuristic algorithm called DegreeDiscount was proposed

in [10], in which the IC diffusion model is used. The basic

idea in DegreeDiscount is the following. Assuming node j be

a neighbor of vertex i,ifi has been selected as a seed, then

when considering whether to select j as a new seed based

on its degree, the edge (i, j) toward node j’s degree is not

counted, i.e., discounting j’s degree by one due to the presence

of i in the seed set, and the same discount will be done on

j’s degree, if every other neighbor of j is already in the seed

set. Simulations show that the performance of this heuristic

algorithm is comparable to that of the greedy algorithm for the

independent cascade model, while its running time is much less

than that of the greedy algorithm. Furthermore, [11] extended

the DegreeDiscount scheme with WC diffusion model.

Unlike the aforementioned works, our paper investigates

how to select the initial seeds from cost-effective viewpoint

and designs a new heuristic scheme, PPRank, that considers

various factors: persuasion cost, user’s inﬂuence power, and

overlapping effect.

Information diffusion models and the top-k node problem

are also appropriately considered from the view of blogspace,

where a blogger may have a certain level of interest in a topic

and is thus susceptible to talking about it. By discussing the

topic, the blogger may inﬂuence other bloggers [15]. A mech-

anism for detecting contagious outbreaks in social networks

was proposed in [16], which demonstrated that, by monitoring

only the friends of these randomly selected students, an early

detection of ﬂu by up to 13.9 days at Harvard College can

be obtained. Based on the observation that people with larger

numbers of friends may have a high probability of being ob-

served among one’s friend circle (i.e., the friends of randomly

selected individuals may have higher centrality in friendship

graphs than average), a lightweighted, distributed, and random

walk-based protocol, iWander, was proposed for identifying

inﬂuential users in mobile social networks [9]. Reference [17]

investigated the connection between PageRank algorithm (orig-

inally designed for web graphs) and the problem of inﬂuence

maximization, by reversing all of the links of the original

networks (so-called reverse PageRank), because, in web graph,

receiving links increases page’s ranking, which is opposite to

the content of the inﬂuence. Furthermore, PRDiscount [26]

was proposed to alleviate the “overlapping effect” existing

in reverse PageRank-like schemes. Interestingly, greedy-based

algorithm and PageRank-inspired heuristic are integrated by

[18], which conducted the greedy algorithm on a small set

of nodes, consisting of the top nodes ranked by PageRank

algorithm on social networks.

As emphasized previously, all aforementioned existing

works ignore one key aspect of inﬂuence propagation that

we usually experience in the real social life: The cost used

to persuade individuals might vary highly (due to their dif-

ferent susceptibility of being inﬂuenced). Note that, given a

ﬁxed budget and an arbitrary cost for selecting each node, the

problem of budgeted inﬂuence maximization (BIM) has been

investigated recently in [13]. Our paper is signiﬁcantly different

from BIM in following aspects. First, BIM belongs to the cate-

gory of greedy-based schemes, while instead of focusing effort

on further improving the running time of greedy algorithms,

we argue that ﬁne-tuned heuristics may provide truly scalable

solutions to the inﬂuence maximization problem with satisfying

inﬂuence spread and blazingly fast running time. Therefore,

our paper aims to design a budget-bounded heuristic scheme;

second, complying with empirical experience, our scheme,

PPRank, explicitly distinguishes and formally characterizes

each individual’s with two factors: IP and SI, and incorporates

two metrics as a selection criterion: Price-Performance-Ratio

and IP, which could maximize inﬂuence diffusion under the

constraint of a given marketing budget.

III. E

CONOMICAL SELECTION OF INITIAL SEEDS BASED

PRICE-PERFORMANCE RATIO

A. Problem Statement

The main motivation of our paper stems from the following

consideration: Individuals may be heterogeneous in terms of

剩余11页未读，继续阅读

weixin_38608873

粉丝: 6
资源: 980

经济策略：PPRank——预算下最大化社交网络影响力选取

TIFIM：社交网络中影响力最大化的两阶段迭代框架

git新手入门：全面教程包括初始化、克隆与代码管理

C/C++线性表操作详解：顺序表的初始化、插入

HP 3PAR 存储详尽配置教程：从导入到初始化与概念解析

std::string初始化陷阱：memset与内存异常

STM32CubeMX中文指南：配置与初始化C代码生成

C++实现顺序表：初始化、长度计算与节点插入详解

C语言实现线性表操作：初始化、插入、删除、查找等

Python Numpy：掌握数组初始化与基本操作

NullReferenceException异常总结：对象未初始化的常见场景

最新资源