Distributed Graph Perturbation Algorithm: Preserving Reachability in Social Networks

Updated 2024-07-15 · 1.24 MB PDF


"具有可达性保留的社交网络分布式图摄动算法是针对日益增长的社交网络数据规模和有向图中的节点可达性查询需求提出的一种解决方案。该算法利用分布式图形处理系统GraphX，旨在保护节点隐私的同时，保持网络的可达性特性，提高处理效率。" 在社交网络中，数据的爆炸式增长对现有的匿名社交网络方法提出了挑战，因为这些方法可能无法有效处理大规模的图数据。节点可达性查询是理解网络中节点间关系和信息传播方向的关键。因此，设计一个能处理大规模有向社交网络并保持节点可达性的算法显得尤为重要。本文提出的"可达性保留分布扰动算法"（RPDP）基于Apache Spark的GraphX框架，这是一个强大的分布式图处理系统。GraphX提供了高效的消息传递机制和"探针"功能，这些对于处理大规模图数据十分有用。RPDP算法首先为每个节点生成一个随机邻域表（RNT），这个表由四个元组组成，用于模拟和保护节点的真实邻接关系。这样做的目的是在不泄露实际连接信息的前提下，尽可能地保留图的结构特性。算法的核心是通过RNT和GraphX的消息传递机制来扰动原始的邻接关系。消息传递允许节点间进行通信，更新各自的属性，而"探针"则是一种检查和调整节点状态的方法，以确保在保持隐私的同时，节点的可达性不会被破坏。通过这种方式，RPDP算法能够在大规模的社交网络中实现高效的隐私保护和可达性保持。实验部分使用了真实社交网络数据进行验证，结果显示，该算法不仅能够有效地处理大量数据，而且在保持图结构特征的同时，成功地保持了节点间的可达性。这表明RPDP算法在实际应用中具有广阔前景，可以用于保护用户隐私，同时保证社交网络服务的正常运行。 "具有可达性保留的社交网络分布式图摄动算法"结合了分布式计算的优势和图处理的策略，为解决大型有向社交网络的隐私和可达性问题提供了一个创新的解决方案。通过生成随机邻域表和利用GraphX的特性，该算法在保护节点隐私的同时，维持了网络的可达性，对于未来社交网络的发展和隐私保护技术有着重要的理论与实践意义。


In addition, with the rapid development of Internet technology, the scale of online social network data has grown substantially. Faced with social network data at this scale, traditional anonymization techniques cannot meet practical needs, and parallel algorithms are an effective way to improve execution efficiency. At present, parallel processing techniques for social network anonymization fall into two main categories: one is based on the Secure Multi-Party Computation (SMC) model [9], and the other makes use of the MapReduce data processing framework [10, 11]. However, both types of technique target relational data and do not consider the graph properties of individuals in social networks, such as node degree and neighbor subgraphs, so they cannot protect private information in social networks. Graph modeling and parallel processing of social networks are effective solutions to this problem.

The research work of this paper targets a large-scale social network graph G. Its goal is to generate an anonymized version of G quickly and efficiently while maintaining the reachability of nodes. The main work and contributions are as follows.

(1) We propose a distributed Random Neighborhood Search (DRNS) algorithm. This algorithm generates a Random Neighbor Table (RNT) for nodes and implements a fast lookup of random neighbor sets based on the message passing mechanism of GraphX.

(2) We propose two different distributed graph perturbation algorithms, Distribution Neighborhood Randomization (DNR) and Reachability Preserving Distribution Perturbation (RPDP). Based on DRNS and the graph construction operations in GraphX, DNR implements fast edge perturbation of large-scale social networks. RPDP introduces a “probe” mechanism that makes it possible to maintain node reachability during rapid edge perturbation.
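As a rough illustration of contribution (1), the sketch below collects candidate random neighbors with synchronous message rounds in plain Python. It is a stand-in for message passing in general, not GraphX code and not the paper's DRNS algorithm. The function names, the `r` and `size` parameters, and the table layout (node mapped to a list of sampled candidates) are assumptions; the excerpt does not specify the actual structure of the RNT.

```python
import random


def r_hop_candidates(edges, r):
    """For every node, collect nodes within r hops that are not direct out-neighbors.

    Each round, every edge (u, v) delivers v's current neighborhood set to u,
    mimicking one superstep of a message passing system.
    """
    nodes = {n for e in edges for n in e}
    out = {n: set() for n in nodes}
    for u, v in edges:
        out[u].add(v)
    # known[u] holds the nodes known to be within k hops of u after round k.
    known = {n: set(out[n]) for n in nodes}
    for _ in range(r - 1):
        inbox = {n: set() for n in nodes}
        for u, v in edges:
            inbox[u] |= known[v]  # message from destination back to source
        known = {n: known[n] | inbox[n] for n in nodes}
    return {n: known[n] - out[n] - {n} for n in nodes}


def build_random_neighbor_table(edges, r=2, size=2, seed=0):
    """Sample up to `size` candidates per node (hypothetical RNT layout)."""
    rng = random.Random(seed)
    candidates = r_hop_candidates(edges, r)
    return {
        n: rng.sample(sorted(c), min(size, len(c)))
        for n, c in candidates.items()
    }


# Hypothetical toy chain a -> b -> c -> d.
chain = [("a", "b"), ("b", "c"), ("c", "d")]
table = build_random_neighbor_table(chain, r=2, size=2, seed=0)
```

Precomputing the table once means a later perturbation step can look up a replacement neighbor for a node without traversing the graph again, which is the kind of fast lookup the contribution describes.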

2 Related Work

A reachability query asks whether one node can reach another node in a directed graph [5]. To protect link relationships in a social network, researchers have proposed protecting sensitive links through random perturbation [3]. This technique randomly modifies the social network graph according to an edge probability, so that an attacker cannot accurately guess the real data in the original social network.
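A reachability query of the kind defined above can be answered with a breadth-first search. The following is a minimal sketch; the adjacency-dictionary representation and the toy graph are assumptions made for illustration.

```python
from collections import deque


def reachable(adj, source, target):
    """Return True if target can be reached from source along directed edges."""
    seen = {source}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            return True
        for nxt in adj.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return False


# Hypothetical toy graph: a -> b -> c and a -> d.
toy_graph = {"a": ["b", "d"], "b": ["c"], "c": [], "d": []}

# The same graph after a perturbation that deletes the edge b -> c.
perturbed_graph = {"a": ["b", "d"], "b": [], "c": [], "d": []}
```

In `toy_graph`, `c` is reachable from `a` but `a` is not reachable from `c`, since edges are directed. In `perturbed_graph`, `c` is no longer reachable from `a`, which shows how unconstrained edge perturbation can change the answers to reachability queries.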

An edge perturbation technique based on subgraph structure [12] divides the original graph into several subgraphs and then randomly adds or deletes m edges in the subgraph. However, this increases the degree of some nodes in the anonymous graph, and with it the probability that such nodes will be identified in the subgraph. To solve this problem, a random neighbor edge perturbation technique [13] has been proposed. The edge <u, v> in the graph is retained with a certain probability p (0 ≤ p ≤ 1). If <u, v> needs to be deleted, the destination node v is replaced with the r-hop (r  2) neighbor node w of the node u. Paper [14] proposed using secure grouping to protect the link relationships of interactive social networks. The idea is to abstract the network into bipartite graphs, then group the network nodes, and the nodes

196 X. Zhang et al.
