Iterative sIB algorithm ☆

Huaqiang Yuan a,⁎, Yangdong Ye b

a Engineering and Technology Institute, Dongguan University of Technology, Dongguan 523808, China
b Information Engineering School, Zhengzhou University, Zhengzhou 450001, China
Article info
Article history:
Received 3 August 2007
Available online 26 November 2010
Communicated by W. Pedrycz
Keywords:
Information Bottleneck
sIB algorithm
Mutation
Mutual information
Abstract
Recent years have witnessed a growing interest in the information bottleneck theory. Among the relevant algorithms in the extant literature, the sequential Information Bottleneck (sIB) algorithm is recognized for its balance between accuracy and complexity. However, like many other optimization techniques, it still suffers from the problem of getting easily trapped in local optima. To address this problem, our study proposes an iterative sIB algorithm (isIB) based on mutation for the clustering problem. Starting from initial solution vectors of cluster labels generated by a seeding run of the sIB algorithm, our algorithm randomly selects a subset of elements and mutates their cluster labels according to the optimal mutation rate. The results are then further optimized iteratively using genetic algorithms. Finally, experimental results on benchmark data sets validate the advantage of our iterative sIB algorithm over the sIB algorithm in terms of both accuracy and efficiency.
© 2010 Elsevier B.V. All rights reserved.
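The mutation step outlined in the abstract can be sketched as follows. This is an illustrative sketch, not the authors' implementation: the function name, its signature, and the choice of mutation-rate value are assumptions for exposition only.

```python
import random

def mutate_labels(labels, n_clusters, mutation_rate, rng=None):
    """Randomly reassign the cluster labels of a subset of elements.

    `labels` is a solution vector of cluster labels, such as one produced
    by a seeding sIB run; `mutation_rate` is the fraction of elements to
    perturb. (Illustrative sketch -- not the authors' implementation.)
    """
    rng = rng or random.Random()
    mutated = list(labels)
    n_mutate = max(1, int(mutation_rate * len(labels)))
    # Pick a random subset of element indices to mutate.
    for i in rng.sample(range(len(labels)), n_mutate):
        # Move element i to a different, randomly chosen cluster.
        choices = [c for c in range(n_clusters) if c != mutated[i]]
        mutated[i] = rng.choice(choices)
    return mutated
```

Each mutated solution would then serve as a new starting point for further sequential optimization, with the best-scoring solution retained across iterations.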
1. Introduction
Tishby et al. proposed the Information Bottleneck (IB) method (Tishby et al., 1999) in 1999 for data analysis based on information theory. The method rests on the view that the hidden patterns in data are what we expect to find during data analysis: when we analyze a data object, the result should unveil its relationship with other objects. Along this line, while the IB method compresses a data object into a defined bottleneck variable, it tries to maintain the data object's relationship with other relevant data objects. In this way, IB is able to effectively mine the correlation patterns among data objects. So far, IB has been widely applied in many fields, including text clustering (Slonim et al., 2002; Slonim and Tishby, 2000, 2001), image clustering (Goldberger et al., 2006; Winston et al., 2005), galaxy spectra analysis (Slonim et al., 2001), gene expression analysis (Tishby and Slonim, 2001), neural system coding analysis (Schneidman et al., 2002), natural language processing (Gorodetsky, 2002), and voice recognition (Hecht and Tishby, 2005). In 2002, Slonim summarized the IB principles, algorithms, and applications in his doctoral thesis and proposed the framework of Information Bottleneck (IB) theory (Slonim, 2002). Recently, IB theory has been further expanded with many attractive variants, such as IBSI (Chechik and Tishby, 2002), CIB (Gondek and Hofmann, 2003), CCIB (Gondek and Hofmann, 2004), MIB (Friedman et al., 2001; Elidan and Friedman, 2005), GIB (Chechik et al., 2005), and SDR (Globerson and Tishby, 2003).
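Concretely, the compression trade-off described above is captured by the standard IB functional of Tishby et al. (1999): with source variable $X$, relevant variable $Y$, and bottleneck variable $T$, one minimizes

```latex
\mathcal{L}\left[p(t \mid x)\right] = I(X;T) - \beta\, I(T;Y)
```

where $I(\cdot;\cdot)$ denotes mutual information and the Lagrange multiplier $\beta$ controls the trade-off between compressing $X$ into $T$ and preserving the information $T$ carries about $Y$.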
IB theory originated from the well-known rate-distortion theory. To tackle the lack of objectivity in the selection of distortion functions in rate-distortion theory, Tishby derived a reasonable distortion function by defining a relevant variable with respect to the source data. With this mathematical framework and its formal solution, Tishby et al. thereby established IB theory (Tishby et al., 1999). Later, many variants of the IB algorithm were proposed, such as iterative IB (iIB) (Tishby et al., 1999), agglomerative IB (aIB) (Slonim and Tishby, 1999), deterministic IB (dIB) (Slonim, 2002), and sequential IB (sIB) (Slonim et al., 2002). Iterative IB is, as its name implies, an iterative algorithm similar to the EM algorithm; aIB is a bottom-up agglomerative algorithm; and dIB is a top-down divisive algorithm. The last two are costly in time and space (Slonim, 2002; Slonim and Tishby, 1999).
Compared with other IB algorithms, the sIB algorithm yields an optimal solution for the IB problem with lower time and space complexity (Slonim et al., 2002; Slonim, 2002). It has found wide application, especially in practical clustering problems. However, the algorithm may offer only a locally optimal solution, because it starts from a random partition and will produce different solutions in different runs. To address this problem, Slonim ran sIB several times and chose the best solution (Slonim et al., 2002). Unfortunately, several drawbacks still exist with Slonim's method, including randomness in the results and insufficient optimization. In 2004, Peltonen et al. leveraged the Bayes factor to propose the fsIB algorithm, which yielded better clustering performance on small and sparse data sets (Peltonen et al., 2004). Still, fsIB could not completely solve the problems of the sIB algorithm.
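The restart strategy attributed to Slonim above (run the randomly initialized algorithm several times and keep the best result) can be sketched generically. Here `run_sib` and `score` are hypothetical callables standing in for one sIB run and for the quality criterion (e.g. the mutual information I(T;Y) a solution preserves); neither name comes from the cited work.

```python
def best_of_restarts(run_sib, score, n_runs=10):
    """Rerun a randomly initialized clustering and keep the best solution.

    `run_sib` performs one run from a random partition and returns a
    label vector; `score` evaluates a solution (higher is better).
    (Generic sketch -- the callables are hypothetical placeholders.)
    """
    best, best_score = None, float("-inf")
    for _ in range(n_runs):
        labels = run_sib()
        s = score(labels)
        if s > best_score:
            best, best_score = labels, s
    return best, best_score
```

This mitigates but does not remove the randomness the paragraph above criticizes: each run still converges to some local optimum, which motivates the mutation-based refinement proposed in this paper.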
doi:10.1016/j.patrec.2010.11.020
☆ Supported by the National Natural Science Foundation of China (Grant Nos. 60573029, 60773050, and 60773048).
⁎ Corresponding author. Fax: +86 76922862911. E-mail address: yuanhq@dgut.edu.cn (H. Yuan).
Pattern Recognition Letters 32 (2011) 606–614