图聚类与排序驱动的多样化图像搜索策略

47 浏览量更新于2024-08-26 收藏 1008KB PDF 举报

本文是一篇研究论文，发表在《多媒体系统》(Multimedia Systems)杂志上，其标题为“基于图的聚类和排序以实现多样化的图像搜索”(Graph-based clustering and ranking for diversified image search)，作者包括 Yan Yan、Gao Wen Liu、Sen Wang 和 Jian Zhang，以及 Kai Zheng。该研究专注于利用图形理论在图像检索领域进行深入探索，目的是提升搜索结果的多样性，以便用户能够找到更加符合他们需求的、丰富的图像内容。在论文中，作者探讨了图论方法在图像聚类和排序中的应用。图是一种数学结构，其中节点代表图像特征或对象，边则表示这些特征之间的相似性或关联。通过构建图像间的连接网络，可以有效地组织和分析大量的图像数据。聚类技术在此过程中扮演关键角色，通过对图像进行分组，识别出具有相似特性的图像簇，这有助于减少冗余信息，突出各簇的独特性和差异性。同时，排序算法在这个框架下被用来优化搜索结果的展示顺序，确保最相关的图像排在前面，从而增强用户体验。通过结合聚类后的结构和排序策略，文章提出了一种新颖的方法来个性化地满足不同用户的图像搜索需求，使得搜索结果不仅准确，而且具有较高的多样性。值得注意的是，作者强调了版权问题，指出该论文是受版权保护的，只能用于个人阅读，并且在正式发表后12个月后或更晚才可公开存档。如果要自我存档，必须使用接受稿版本，并在链接中引用原始出版源以及包含“最终出版物可在link.springer.com找到”的文字。这篇论文的研究成果对于图像搜索引擎的设计者和开发者来说具有很高的参考价值，它展示了如何利用先进的图论技术提升搜索算法的效率和用户满意度，是多媒体信息检索领域的一个重要贡献。通过深入理解并应用这些方法，研究人员和工程师们可以设计出更加智能、适应性强的图像搜索工具，为用户提供更加个性化的服务。

1 3

DOI 10.1007/s00530-014-0419-4

Multimedia Systems

SPECIAL ISSUE PAPER

Graph‑based clustering and ranking for diversiﬁed image search

Yan Yan · Gaowen Liu · Sen Wang ·

Jian Zhang · Kai Zheng

used to cluster images into topics in this paper. In order to

perform CCCMRW, a two-layer image graph is constructed

with image cluster nodes as upper layer added to a base

image graph. Conditioned on the image cluster information

from upper layer, Markov random walk is constrained to

incline to walk across different image clusters, so as to give

high rank scores to images of different topics and therefore

gain the diversity. Encouraging clustering and re-ranking

outputs on Google image search results are reported in this

paper.

Keywords Web image clustering · Ranking · Diversity ·

Visibility · Graph model

1 Introduction

Using keywords to search images are currently the most

popular approach [11], such as the Google and Yahoo!

image search engines. However, most keywords used by

persons for queries are visually polysemous words, which

means a word has several dictionary senses that are visu-

ally distinct [30]. Taken visually polysemous words as

queries the image search results always comprise images

of multiple topics and images of different topics are mixed

together. Furthermore, high ranking items always come

from a few topics, i.e. one or two topics. For instance, the

query word jaguar represents an example of visually poly-

semous word. In response to this query, images returned by

Google image search mainly fall into following four topics:

jaguar cat, jaguar car, jaguar logo and jaguar plane. How-

ever, the top 20 search results ranked by Google only cover

two limited topics: jaguar cat and jaguar car, which is in

poor diversity. The importance of result diversiﬁcation has

been recognized since early work on information retrieval

Abstract In this paper, we consider the problem of clus-

tering and re-ranking web image search results so as to

improve diversity at high ranks. We propose a novel ranking

framework, namely cluster-constrained conditional Markov

random walk (CCCMRW), which has two key steps: ﬁrst,

cluster images into topics, and then perform Markov ran-

dom walk in an image graph conditioned on constraints of

image cluster information. In order to cluster the retrieval

results of web images, a novel graph clustering model is

proposed in this paper. We explore the surrounding text to

mine the correlations between words and images and there-

fore the correlations are used to improve clustering results.

Two kinds of correlations, namely word to image and word

to word correlations, are mainly considered. As a standard

text process technique, tf-idf method cannot measure the

correlation of word to image directly. Therefore, we pro-

pose to combine tf-idf method with a novel feature of word,

namely visibility, to infer the word-to-image correlation.

By latent Dirichlet allocation model, we deﬁne a topic rel-

evance function to compute the weights of word-to-word

correlations. Taking word to image correlations as hetero-

geneous links and word-to-word correlations as homoge-

neous links, graph clustering algorithms, such as complex

graph clustering and spectral co-clustering, are respectively

Y. Yan · S. Wang · K. Zheng

School of Information Technology and Electrical Engineering,

The University of Queensland, Brisbane, QLD, Australia

G. Liu

Department of Information Engineering and Computer Science,

University of Trento, Trento, Italy

J. Zhang (*)

School of Science and Technology, Zhejiang International

Studies University, Zhejiang, China

e-mail: jeyzhang@outlook.com

Author's personal copy

剩余13页未读，继续阅读

weixin_38725260

粉丝: 2
资源: 909

图聚类与排序驱动的多样化图像搜索策略

diverse-image-search:MediaEval 多样化图像搜索任务中使用的多样化图像搜索方法的实现

optics聚类算法

高光谱图像聚类分析与基本处理技术研究

hclust包深度解析：如何在R语言中实现高效聚类分析

【聚类分析技术入门】：一步到位掌握聚类算法的精髓

TRS:毕设：基于图像理解和文本分析的移动应用众包测试报告选择系统

Text2Palette： 基于互联网图像的颜色主题自动生成方法.pdf

基于领域变换的图像滤波技术探索

【大规模数据聚类策略】：Python算法实战指南

【聚类算法评估与选择】：Python方法论全解析

最新资源

Text2Palette：基于互联网图像的颜色主题自动生成方法.pdf