696 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 11, NO. 6, JUNE 2001
The MPEG-7 Visual Standard for Content
Description—An Overview
Thomas Sikora, Senior Member, IEEE
Abstract—The MPEG-7 Visual Standard under development
specifies content-based descriptors that allow users or agents (or
search engines) to measure similarity in images or video based
on visual criteria, and can be used to efficiently identify, filter, or
browse images or video based on visual content. More specifically,
MPEG-7 specifies color, texture, object shape, global motion, or
object motion features for this purpose. This paper outlines the
aim, methodologies, and broad details of the MPEG-7 Standard
development for visual content description.
Index Terms—Coding, descriptors, MPEG-7, similarity-based
retrieval, standardization, visual information.
I. INTRODUCTION
RECENT years have seen a rapid increase in the volume of
image and video collections. A huge amount of information is available, and every day, gigabytes of new visual information are being generated, stored, and transmitted. However, it
is difficult to access this visual information unless it is orga-
nized in a suitable way—to allow efficient browsing, searching,
and retrieval. Image retrieval has been a very active research and development domain since the early 1970s. During the early
1990s—with the advent of digital video—research on video re-
trieval became of equal importance. A very popular means for
image or video retrieval is to annotate images or video with text,
and to use text-based database management systems to perform
image retrieval. However, text-based annotation has significant
drawbacks when confronted with large volumes of images: annotation can in these circumstances become significantly labor intensive, and since images are rich in content, text may in many applications not be expressive enough to describe them.
To overcome these difficulties, content-based image retrieval emerged in the early 1990s as a promising means for describing and retrieving images. Content-based image retrieval systems describe images by their own visual content, such as color, texture, and object shape, rather than by text.
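As a toy illustration of this idea (not part of the MPEG-7 specification, and with all image data and bin sizes invented for the example), the sketch below quantizes pixel colors into a coarse histogram and compares two such histograms with an L1 distance:

```python
from collections import Counter

def color_histogram(pixels, bins=4):
    """Quantize each RGB pixel into one of bins^3 color cells and return
    a normalized histogram -- a toy stand-in for a color descriptor."""
    step = 256 // bins
    counts = Counter((r // step, g // step, b // step) for (r, g, b) in pixels)
    total = len(pixels)
    return {cell: n / total for cell, n in counts.items()}

def l1_distance(h1, h2):
    """L1 distance between two histograms: 0 means identical color
    distributions; larger values mean less similar images."""
    cells = set(h1) | set(h2)
    return sum(abs(h1.get(c, 0.0) - h2.get(c, 0.0)) for c in cells)

# Three tiny synthetic "images": two mostly red, one mostly blue.
red_a = [(250, 10, 10)] * 90 + [(10, 10, 250)] * 10
red_b = [(240, 20, 20)] * 85 + [(20, 240, 20)] * 15
blue  = [(10, 10, 250)] * 95 + [(250, 10, 10)] * 5

ha, hb, hc = map(color_histogram, (red_a, red_b, blue))
# The two red images are closer to each other than to the blue one.
assert l1_distance(ha, hb) < l1_distance(ha, hc)
```

A real system would of course extract far richer features (texture, shape, motion), but the principle of comparing compact numeric descriptions instead of text annotations is the same.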
In the late 1990s—with the large scale introduction of
digital images and video to the market [1]—the necessity
for interworking between image/video retrieval systems of
different vendors arose. For this purpose, in 1997 the ISO
MPEG Group initiated the “MPEG-7 Multimedia Description
Language” work item. The target of this activity was to issue
an international MPEG-7 Standard, defining standardized
descriptions and description systems that allow users or agents
Manuscript received January 2, 2001; revised March 12, 2001.
The author is with the Heinrich-Hertz-Institute for Communication Tech-
nology (HHI), Interactive Media—Human Factors, D-10587 Berlin, Germany
(e-mail: sikora@hhi.de).
Publisher Item Identifier S 1051-8215(01)04986-2.
to search, identify, filter, and browse audiovisual content
[2], [3]. MPEG-7 is currently still under definition and will become an international standard in July 2001. Besides support for meta-data and text descriptions of the audiovisual content, much of the focus in the development of MPEG-7 has been on the definition of efficient content-based description and retrieval specifications.
The purpose of this paper is to provide a broad overview of
the MPEG-7 content-based visual descriptors. For an overall overview of the MPEG-7 Standard and for more detailed descriptions of the MPEG-7 content-based visual, audio, and speech descriptors, the reader is referred to the literature in [2], [3] and to the remaining papers of this Special Issue on MPEG-7 in [5]–[12].
II. SCOPE OF MPEG-7 VISUAL STANDARD
The ultimate goal of the MPEG-7 Visual Standard
is to provide standardized descriptions of streamed or stored im-
ages or video—standardized header bits (visual low-level De-
scriptors) that help users or applications to identify, categorize
or filter images or video. These low-level descriptors can be
used to compare, filter, or browse image or video purely based
on nontext visual descriptions of the content, or if required, in
combination with common text-based queries. The challenge in developing such MPEG-7 Visual nontext descriptors is that they must be meaningful in the context of various applications: they will be used differently in different user domains and different application environments.
Selected application examples include digital libraries (image
and video catalogue), broadcast media selection (TV channels),
and multimedia editing (personalised electronic news service,
media authoring). Across this diversity of possible applications, the MPEG-7 Visual feature descriptors allow users or agents to perform tasks such as the following.
1) Graphics: Draw a few lines on a screen and get, in return,
a set of images containing similar graphics or logos.
2) Images: Define objects, including color patches or tex-
tures, and get, in return, examples among which you se-
lect the ones of interest.
3) Video: On a given set of video objects, describe object
movements, camera motion, or relations between objects
and get, in return, a list of videos with similar or dissimilar
temporal and spatial relations.
4) Video Activity: On a given video content, describe actions
and get a list of videos where similar actions happen.
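At matching time, each of the query-by-example tasks above reduces to ranking database items by the distance between descriptor values. A minimal sketch of that ranking step is given below; the feature vectors and clip ids are invented for illustration, and MPEG-7 itself standardizes the descriptors, not any particular matching method:

```python
import math

def rank_by_similarity(query, database, k=3):
    """Return the ids of the k database items whose feature vectors are
    closest (Euclidean distance) to the query descriptor."""
    def dist(v):
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(query, v)))
    ranked = sorted(database.items(), key=lambda item: dist(item[1]))
    return [item_id for item_id, _ in ranked[:k]]

# Hypothetical database: id -> low-level feature vector (e.g., a few
# color/texture/motion values extracted offline by an indexing tool).
videos = {
    "clip_a": [0.9, 0.1, 0.0],
    "clip_b": [0.8, 0.2, 0.1],
    "clip_c": [0.0, 0.1, 0.9],
}

print(rank_by_similarity([1.0, 0.0, 0.0], videos, k=2))
# → ['clip_a', 'clip_b']
```

Whether the query originates from a drawn sketch, an example image, or a described motion pattern, only the feature-extraction step differs; the similarity ranking itself stays the same.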
The MPEG-7 Visual Descriptors describe the basic content of media based on visual information. For images and
video, the content may be described, for example by the shape
1051–8215/01$10.00 ©2001 IEEE