0.2.1 Symbol coding
Lossy symbol coding provides a natural way of performing lossy coding of text regions. The idea is to allow small
differences between the original symbol bitmap and the one indexed in the symbol dictionary. Compression gain
is achieved by not having to code a large dictionary and, subsequently, by the cheaper coding of symbol indices
that the smaller dictionary allows. It is up to the encoder to decide when two bitmaps are essentially the same
or essentially different. This technique was first described in [1].
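The "essentially the same" decision can be illustrated with a minimal sketch. The XOR-count measure and the tolerance value below are illustrative assumptions, not a criterion prescribed by this Recommendation; practical encoders use more discriminating measures:

```python
def bitmaps_match(a, b, max_fraction=0.02):
    """Decide whether two 1-bit symbol bitmaps are 'essentially the
    same' by counting differing pixels (an XOR count).
    a, b: lists of rows of 0/1 values.
    max_fraction is an illustrative tolerance, not a standard value."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        return False  # different dimensions: treat as different symbols
    diff = sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    total = sum(len(row) for row in a)
    return diff <= max_fraction * total
```

An encoder would compare each new symbol bitmap against the dictionary with such a test, indexing an existing entry on a match and adding a new entry otherwise.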
The hazard of lossy symbol coding is substitution errors: the encoder may replace a bitmap corresponding
to one character with a bitmap depicting a different character, so that a human reader misreads the
character. The risk of substitution errors can be reduced by using intricate measures of difference between bitmaps
and/or by making sure that the critical pixels of the indexed bitmap are correct. One way to control this, described
in [5], is to index the possibly wrong symbol and then to apply refinement coding to that symbol bitmap. The idea
is to encode the basic character shape at little cost, then correct pixels that the encoder believes alter the meaning
of the character.
The process of beneficially introducing loss in textual regions may also take simpler forms, such as removing
flyspecks from documents or regularizing the edges of letters. Such changes will most likely lower the code length
of the region without affecting its general appearance, and may even improve it.
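Flyspeck removal can be sketched as deleting small connected components of black pixels. The 4-connectivity and the size threshold below are illustrative assumptions; an encoder would tune both:

```python
from collections import deque

def remove_flyspecks(img, max_size=3):
    """Remove 4-connected black components of at most max_size pixels.
    img: list of rows of 0/1 (1 = black). Returns a cleaned copy.
    The size threshold is illustrative, not a standard value."""
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]
    seen = [[False] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if img[y][x] and not seen[y][x]:
                comp, queue = [], deque([(y, x)])
                seen[y][x] = True
                while queue:  # breadth-first flood fill of one component
                    cy, cx = queue.popleft()
                    comp.append((cy, cx))
                    for ny, nx in ((cy-1, cx), (cy+1, cx), (cy, cx-1), (cy, cx+1)):
                        if 0 <= ny < h and 0 <= nx < w and img[ny][nx] and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                if len(comp) <= max_size:
                    for cy, cx in comp:
                        out[cy][cx] = 0  # erase the speck
    return out
```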
A number of examples of performing this sort of lossy symbol coding with JBIG2 can be found in [7].
NOTE — Although the term “text region” is used for regions of the page coded using symbol coding, other
possible uses of symbol coding include coding line-art and other non-textual data.
0.2.2 Generic coding
To effect near-lossless coding using generic coding, the encoder applies a preprocess to an original image and
encodes the changed image losslessly. The difficulties are to ensure that the changes result in a lower code length
and that the quality of the changed image does not suffer badly from the changes. Two possible preprocesses are
given in [11]. These preprocesses flip pixels that, when flipped, significantly lower the total code length of the
region, but can be flipped without seriously impairing the visual quality. The preprocesses provide for effective
near-lossless coding of periodic halftones and for a moderate gain in compression for other data types. The
preprocesses are not well suited to error-diffused images or images dithered with blue noise, as perceptually
lossless compression will not be achieved at a rate significantly below the lossless rate.
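A much-simplified stand-in for such a preprocess flips only pixels that disagree with all four of their orthogonal neighbours: such isolated pixels are typically expensive for a context-based generic coder and barely visible. This rule is an illustrative assumption; the preprocesses of [11] estimate the actual code-length change of each candidate flip:

```python
def flip_isolated_pixels(img):
    """Flip interior pixels whose four orthogonal neighbours all hold
    the opposite value. Such isolated pixels are usually costly for a
    context-based generic coder and visually insignificant, so flipping
    them tends to lower the code length. (Simplified illustration; the
    preprocesses in [11] weigh the real code-length saving per flip.)"""
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            nbrs = (img[y-1][x], img[y+1][x], img[y][x-1], img[y][x+1])
            if all(n != img[y][x] for n in nbrs):
                out[y][x] = nbrs[0]  # agree with the neighbourhood
    return out
```

The changed image is then coded losslessly with the ordinary generic coding procedure.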
0.2.3 Halftone coding
Halftone coding is the natural way to obtain very high compression for periodic halftones, such as clustered-dot
ordered dithered images. In contrast to lossy generic coding as described above, halftone coding does not aim to
preserve the original bitmap, although this is possible in special cases. Loss can also be introduced for additional
compression by not putting all the patterns of the original image into the dictionary, thereby reducing both the
number of halftone patterns and the number of bits required to specify which pattern is used in which location.
For lossy coding of error diffused images and images dithered with blue noise it is advisable to use halftone
coding with a small grid size. A reconstructed image will lack fine details and may display blockiness but will be
clearly recognizable. The blockiness may be reduced on the decoder side in a postprocess; for instance, by using
other reconstruction patterns than those that appear in the dictionary. Error diffused images may also be coded
losslessly, or with controlled loss as described above, using generic coding.
More details on performing this halftone coding can be found in [12].
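The dictionary-and-grid mechanism can be sketched as follows: the image is divided into cells, each cell's grey level (its black-pixel count) selects a pattern index, and the decoder reassembles the page from the indexed patterns. The square cells and uniform quantization below are simplifying assumptions, not the full halftone region procedure:

```python
def halftone_indices(img, cell=4, levels=16):
    """Map each cell x cell block of a 1-bit image to a grey-level
    index in [0, levels). This index grid, together with a dictionary
    of 'levels' halftone patterns, is what a halftone region codes.
    Square cells and uniform quantization are illustrative choices."""
    h, w = len(img), len(img[0])
    grid = []
    for y in range(0, h, cell):
        row = []
        for x in range(0, w, cell):
            black = sum(img[yy][xx]
                        for yy in range(y, min(y + cell, h))
                        for xx in range(x, min(x + cell, w)))
            area = min(cell, h - y) * min(cell, w - x)
            row.append(min(levels - 1, black * levels // area))
        grid.append(row)
    return grid
```

A smaller grid (larger cells) or fewer dictionary patterns lowers the rate at the cost of fine detail, which is the trade-off described above for error-diffused and blue-noise-dithered images.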
0.2.4 Consequences of inadequate segmentation
In order to obtain optimum coding, both in terms of quality and file size, the correct form of encoding should
be used for the appropriate regions of the document pages. This subclause briefly describes the consequences of
errors in this segmentation.
Using lossy symbol coding for a document containing both text and halftone data will result in poor compression.
Depending on the encoder, the quality of the halftone data may be good or bad. Using the form of lossy
symbol coding described in [5], the visual quality will probably not suffer.
Using lossy generic coding (with the preprocesses given in [11]) for a document containing both symbol and
halftone data usually results in good quality and moderate compression.