NEIWalk: Community Detection in Dynamic Content-Based Networks

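NEIWalk is a community discovery method for dynamic content-based networks. It integrates linkage structure, node content, and edge content to find meaningful dynamic communities. The method incrementally maintains the NEI network using differential activity, and designs a transition probability matrix to capture the semantic effect of different edge types. Dynamic community detection is then performed via heterogeneous random walk, with demonstrated effectiveness and efficiency.

Community discovery has become a key task in network analysis, especially for communities that change over time. Traditional community discovery algorithms rely mainly on the linkage structure of the network. However, the rich content found in social networks, such as node content and edge content, is essential for discovering communities with a specific topic. NEIWalk addresses this by bringing content information into the community discovery process.

At the core of NEIWalk is the Node-Edge Interaction (NEI) network, a model that combines linkage structure, node content, and edge content so that all of this information can be used together. As content changes over time, NEIWalk uses a differential activity strategy to incrementally update and maintain the NEI network, keeping it current.

To handle the semantic differences between edge types, NEIWalk designs a transition probability matrix. This matrix quantifies how strongly each type of edge influences the community structure, which helps capture both topological features and content information more accurately. On this basis, heterogeneous random walks are run on the NEI network to discover dynamic communities, so that the algorithm adapts as the network changes.

Theoretical analysis indicates that, although NEIWalk may lose some accuracy because of the limitations of random-walk sampling, its overall performance is good. Experimental results confirm its effectiveness and efficiency in dynamic community detection, and support the value of combining content information with structural information. The method is relevant to understanding group behavior and patterns in complex networks, with applications in areas such as social network analysis and information recommendation.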
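The idea of a random walk whose transition probabilities depend on edge type can be illustrated with a minimal sketch. The edge-type labels (`t1`, `t2`, `t3`), the weights in `TYPE_WEIGHT`, and the toy graph are all hypothetical; the actual NEIWalk transition probability matrix is not specified in this section, so this only shows the general mechanism of weighting edges by type and normalizing.

```python
import random

# Hypothetical per-edge-type weights (an assumption for illustration only;
# these are not the values or the formula used by NEIWalk).
TYPE_WEIGHT = {"t1": 1.0, "t2": 0.5, "t3": 0.5}


def transition_probs(graph, node, type_weight=TYPE_WEIGHT):
    """Return {neighbor: probability} for one step of the walk from `node`.

    `graph[node]` is a list of (neighbor, edge_type) pairs. Each edge
    contributes the weight of its type; the scores are then normalized.
    """
    scores = {}
    for neighbor, edge_type in graph[node]:
        scores[neighbor] = scores.get(neighbor, 0.0) + type_weight[edge_type]
    total = sum(scores.values())
    return {nbr: score / total for nbr, score in scores.items()}


def random_walk(graph, start, length, rng):
    """Sample a walk of `length` steps, returning the visited nodes."""
    path = [start]
    for _ in range(length):
        probs = transition_probs(graph, path[-1])
        neighbors = list(probs)
        weights = [probs[n] for n in neighbors]
        path.append(rng.choices(neighbors, weights=weights)[0])
    return path


# Toy heterogeneous graph: every node has at least one outgoing edge.
toy_graph = {
    "a": [("b", "t1"), ("c", "t2")],
    "b": [("a", "t1"), ("c", "t3")],
    "c": [("a", "t2"), ("b", "t3")],
}

walk = random_walk(toy_graph, "a", 5, random.Random(0))
```

With these assumed weights, a walker at `a` prefers the `t1` edge to `b` over the `t2` edge to `c` by a factor of two. Changing `TYPE_WEIGHT` changes which edge types dominate the walk.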

1736 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 7, JULY 2014

the effectiveness and efficiency of the proposed method in Section 5. The paper is concluded in Section 6.

2 RELATED WORK

As revealed in [2], [3], [32], although there are a large number of algorithms for discovering static communities, the study of detecting dynamic communities is still in its infancy. One of the earliest studies was conducted by Hopcroft et al. [9], which detects evolving communities via agglomerative hierarchical clustering. Backstrom et al. conducted a case study on two large sources of data [10], which addresses the questions concerning community membership, growth and evolution. In [4], the dynamic relationship between vertices and communities was explored, where communities are found by the MCL method [33]. The temporal smoothness framework trading off the history quality with the snapshot quality has been investigated in [5], [6], [8]. Based on the latent space model, Fu et al. [34] proposed a dynamic mixed membership stochastic block model (dMMSB) to track the evolving roles of the actors across timestamps. Similarly, a dynamic stochastic block model was developed for modeling communities and their evolution in a unified probabilistic framework, where a Bayesian treatment is presented for parameter estimation [32]. Focusing on identifying communities in a multi-mode network, where both actor membership and interactions can evolve, an evolutionary multi-mode clustering was proposed by Tang et al. using the temporal information [7], [35]. Apart from the aforementioned approaches, evolving communities can also be discovered by means of information compression, e.g., [11]. Readers can refer to [2], [3] for an overview of community detection in evolving networks.

In the case of static graphs, some recent works have shown significant improvements achieved by integrating node/edge content and linkage structure in community detection [17]–[21]. A discriminative model was proposed in [18] to combine linkage and content analysis for community detection, where a conditional model and a discriminative model were respectively used for linkage analysis and node content analysis. In [20], an edge-induced matrix factorization (EIMF) approach was used to integrate linkage structure and edge content for community detection. Liu et al. [17] developed a Topic-Link LDA model, which combines the topic similarity (edge content similarity) and linkage structure to jointly model topics and author community. Zhou et al. [19] proposed to integrate the structural and attribute similarities into a unified framework through graph augmentation, so as to consider both linkage structure and node attribute. Sachan et al. [21] addressed the problem of discovering topically meaningful communities from social networks by combining three types of information, namely, discussed topics, graph topology and nature of user interactions, whereby generative Bayesian models were introduced for extracting latent communities.

Apart from that, some works on supervised node classification have also validated the effectiveness of combining linkage structure and content information in the supervised learning scenario [22], [23]. However, previous works are either limited to static graphs [17]–[21] or proposed for supervised learning [22], [23]. Additionally, it is difficult to extend these methods for discovering dynamic communities in content-based networks in an unsupervised manner. For instance, to extend [20], a full recomputation from scratch is required at each timestamp, which is time-consuming.

Random walk has been widely used in various research fields, e.g., [26], [27], [36]–[38]. In [26], an information theoretic approach was proposed, which uses the probability flow of random walks in a network as a proxy for information flows. Community structure can then be discovered by compressing the description of the probability flow. In [27], a new distance metric was proposed between vertices and between sets of vertices to quantify the structural similarities using random walks. Then a hierarchical agglomerative clustering algorithm is performed using the distance metric between vertices and between sets of vertices to find community structures at different scales. Finally the community structure containing the required number of communities is output. These two approaches are batch processing in nature and hence do not meet the efficiency requirement of the on-line dynamic detection of communities. Similarly, based on the Markov-chain model of random walk, Fouss et al. computed quantities such as average commute time as the similarity between vertices, which are used in collaborative recommendation [37]. In [38], a random walk based clustering algorithm was developed, which is based on the asymmetric pairwise similarity measure of random walk hitting time and K-destinations.
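To make the notion of a random-walk-based distance between vertices concrete, the following is a minimal sketch. It assumes one common construction: compare the t-step random-walk distributions started from two vertices, using a degree-normalized Euclidean distance. The exact metric of [27] is not reproduced in this section, so the formula, the function names, and the toy graph below are illustrative assumptions only.

```python
import math


def step(adj, dist):
    """Advance a probability distribution by one uniform random-walk step.

    Assumes every vertex has at least one neighbor.
    """
    nxt = {v: 0.0 for v in adj}
    for v, p in dist.items():
        if p == 0.0:
            continue
        share = p / len(adj[v])
        for u in adj[v]:
            nxt[u] += share
    return nxt


def walk_distribution(adj, start, t):
    """Distribution over vertices after t steps of a walk from `start`."""
    dist = {v: 0.0 for v in adj}
    dist[start] = 1.0
    for _ in range(t):
        dist = step(adj, dist)
    return dist


def walk_distance(adj, i, j, t=3):
    """Degree-normalized distance between the t-step distributions of i and j."""
    p_i = walk_distribution(adj, i, t)
    p_j = walk_distribution(adj, j, t)
    return math.sqrt(sum((p_i[k] - p_j[k]) ** 2 / len(adj[k]) for k in adj))


# Toy graph: two triangles {0, 1, 2} and {3, 4, 5} joined by the edge 2-3.
adj = {
    0: [1, 2], 1: [0, 2], 2: [0, 1, 3],
    3: [2, 4, 5], 4: [3, 5], 5: [3, 4],
}

same_triangle = walk_distance(adj, 0, 1)
across_bridge = walk_distance(adj, 0, 5)
```

In this toy graph, vertices in the same triangle end up closer to each other than vertices in different triangles, which is the property an agglomerative clustering step would rely on.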

3 NEI NETWORK TRANSFORMATION

To combine linkage structure, node content and edge content for dynamic community detection, one needs to design some structure for simultaneously representing link-based similarity and content-based similarity. To this end, we propose a Node-Edge Interaction (NEI) network, into which the content-based network is transformed. In this section, we will first describe the construction of the NEI network from a content-based network. Then we introduce how to update the NEI network as the content-based network evolves.

3.1 NEI Network Construction

For notational simplicity, we ignore the time-subscript in this subsection. Assume that we are given a content-based network containing a node set $N = \{v_1, v_2, \ldots, v_n\}$ and an undirected edge set $E = \{e_1, e_2, \ldots, e_m\}$. There are contents associated with each node $v_i$ and each edge $e_j$. To integrate linkage structure, node content and edge content, this content-based network is transformed into a Node-Edge Interaction (NEI) network, which is a multi-mode network comprising two types of nodes and three types of edges, as shown in Fig. 2. The two types of nodes correspond to $N$ and $E$, which are called n-nodes and e-nodes respectively. The three types of edges are defined as follows:

1) Each n-node $v_i$ and each e-node $e_j$ are connected if the edge $e_j$ is incident upon the node $v_i$ in the original network, i.e., $\exists v_k \in N$ s.t. $e_j = (v_i, v_k)$. In the NEI network shown in Fig. 2, such edge is indicated
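The first edge type defined above (an n-node is linked to an e-node when the original edge is incident upon the original node) can be sketched as follows. This is a minimal illustration under assumptions: the function name `build_incidence_edges`, the tuple encoding of n-nodes and e-nodes, and the toy network are hypothetical, and the other two edge types are not covered because their definitions do not appear in this excerpt.

```python
def build_incidence_edges(nodes, edges):
    """Build n-nodes, e-nodes, and the first type of NEI edge.

    nodes: iterable of node ids from the original network.
    edges: dict mapping an edge id to its (u, v) endpoints.

    Returns (n_nodes, e_nodes, type1_edges), where each NEI node is a
    ("n", id) or ("e", id) pair and each type-1 edge joins an n-node to
    the e-node of an edge incident upon it.
    """
    n_nodes = [("n", v) for v in nodes]
    e_nodes = [("e", e) for e in edges]
    type1_edges = set()
    for e, (u, v) in edges.items():
        # Every original edge is incident upon exactly its two endpoints.
        type1_edges.add((("n", u), ("e", e)))
        type1_edges.add((("n", v), ("e", e)))
    return n_nodes, e_nodes, type1_edges


# Toy content-based network: a path v1 - v2 - v3 (contents omitted).
toy_nodes = ["v1", "v2", "v3"]
toy_edges = {"e1": ("v1", "v2"), "e2": ("v2", "v3")}

n_nodes, e_nodes, type1_edges = build_incidence_edges(toy_nodes, toy_edges)
```

Each original edge yields exactly two type-1 edges, so the toy network above produces four of them; `v2` is linked to both `e1` and `e2`, while `v3` is linked only to `e2`.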
