Extended Linear Prediction Tools for Enhancing Predictive Lossless Audio Coding


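This document, "EXTENDED LINEAR PREDICTION TOOLS FOR LOSSLESS AUDIO CODING", examines how extended linear prediction can be applied to lossless audio coding. The authors, Takehiro Moriya, Dai Tracy Yang and Tilman Liebchen, are from NTT Cyber Space Labs (Japan), the University of Southern California (USA) and the Technical University of Berlin (Germany), respectively. They propose two tools for prediction-based lossless audio coding.

The first tool is progressive-order prediction. At a random access point, where no previous samples are available, the prediction order is raised step by step: the first sample is coded as is, the second is predicted by first-order prediction, the third by second-order prediction, and so on. This can be carried out efficiently with PARCOR (partial autocorrelation) coefficients, and it is aimed at material that needs frequent random access.

The second tool is inter-channel joint coding. Both the predictive coefficients and the prediction error signals are coded by inter-channel differential coding or by three-tap adaptive prediction, which exploits the information shared between channels.

According to the abstract, the two tools lead to a steady reduction in bit rate when random access is activated and the inter-channel correlation is strong. This is relevant to audio storage, transmission and online playback, where a lower bit rate reduces storage and bandwidth requirements.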

EXTENDED LINEAR PREDICTION TOOLS FOR LOSSLESS AUDIO CODING

Takehiro Moriya *, Dai Tracy Yang ** and Tilman Liebchen ***

* NTT Cyber Space Labs., Tokyo, Japan

** University of Southern California, Los Angeles, USA

*** Technical University of Berlin, Berlin, Germany

ABSTRACT

Two extension tools for enhancing the compression performance of prediction-based lossless audio coding are proposed. One is progressive-order prediction of the starting samples at the random access points, where the information of previous samples is not available. The first sample is coded as is, the second is predicted by first-order prediction, the third is predicted by second-order prediction, and so on. This can be efficiently carried out with PARCOR (PARtial autoCORrelation) coefficients. The second tool is inter-channel joint coding. Both predictive coefficients and prediction error signals are efficiently coded by inter-channel differential or three-tap adaptive prediction. These new prediction tools lead to a steady reduction in bit rate when random access is activated and the inter-channel correlation is strong.

1. INTRODUCTION

For the archiving and broadband transmission of music signals, lossless reconstruction is becoming a more important feature than high efficiency in compression by means of perceptual coding as defined in MPEG standards such as MP3 or AAC. Although DVD-Audio and Super Audio CD [1, 2] include proprietary lossless compression schemes, there is a demand for an open and general compression scheme among content-holders and broadcasters. In response to this demand, a new lossless coding scheme has been considered as an extension to the MPEG-4 Audio standard [3, 4].

In the course of the standardization process, a time-domain compression scheme based on linear predictive coding (LPC) has been defined as a reference model. This model was proposed by the Technical University of Berlin, and its decoding process is shown in Fig. 1 [5]. For every frame, the optimum LPC coefficients are calculated and the associated PARCOR coefficients [6, 7] are quantized in an arcsine-transformed domain. The prediction error signal is derived using the quantized predictive coefficients and coded with a Rice code. For stereo signals, simple inter-channel coding is applied, where either the L-channel or the R-channel is coded together with the difference between the R- and L-channels.
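As an illustration of the last two steps above, the following is a minimal Python sketch of Rice coding applied to an inter-channel difference signal. It is not the reference model's bitstream format: the signed-to-unsigned mapping (`zigzag`), the bit-string representation, the fixed Rice parameter `k` and all function names are assumptions made for this example.

```python
def zigzag(e: int) -> int:
    """Map a signed value to a non-negative one: 0, -1, 1, -2, ... -> 0, 1, 2, 3, ..."""
    return 2 * e if e >= 0 else -2 * e - 1


def unzigzag(u: int) -> int:
    """Inverse of zigzag()."""
    return u // 2 if u % 2 == 0 else -(u + 1) // 2


def rice_encode(values, k: int) -> str:
    """Rice-code each value: unary quotient, a '0' terminator, then k remainder bits."""
    bits = []
    for e in values:
        u = zigzag(e)
        quotient, remainder = u >> k, u & ((1 << k) - 1)
        bits.append("1" * quotient + "0")
        if k:
            bits.append(format(remainder, f"0{k}b"))
    return "".join(bits)


def rice_decode(bits: str, k: int, count: int):
    """Decode `count` values from a bit string produced by rice_encode()."""
    out, pos = [], 0
    for _ in range(count):
        quotient = 0
        while bits[pos] == "1":
            quotient += 1
            pos += 1
        pos += 1  # skip the '0' terminator
        remainder = int(bits[pos:pos + k], 2) if k else 0
        pos += k
        out.append(unzigzag((quotient << k) | remainder))
    return out


# Simple inter-channel coding: keep L, and code the difference R - L instead of R.
left = [10, 12, 15, 13]
right = [11, 12, 14, 13]
diff = [r - l for l, r in zip(left, right)]

code = rice_encode(diff, k=1)
decoded_diff = rice_decode(code, k=1, count=len(diff))
restored_right = [l + d for l, d in zip(left, decoded_diff)]
```

When the two channels are strongly correlated, the difference signal has small amplitudes, so the unary part of each codeword stays short. A real coder would apply this to prediction error signals and choose `k` adaptively; both are left out here.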

This paper proposes two extension tools for prediction-based lossless coding. One is progressive-order prediction to improve performance in the compression of starting samples at random-access points. The other is inter-channel joint coding for both predictive coefficients and prediction error signals. Both tools are described in detail, and the results of performance evaluation are given.

2. PROGRESSIVE ORDER PREDICTION

2.1. Random access

Samples of an audio signal usually have strong correlation in the time domain. Auto-regressive linear prediction is well known as one of the most powerful and simple tools for reducing the amplitudes of error signals, enabling reductions in bit rate [2, 8]. However, in the editing and playback of compressed signals, the ability to start from a random access point is desirable. We thus have to reconstruct the signal perfectly without using any of the previous signal information. Ensuring this property for auto-regressive linear prediction leads to a significant loss of compression performance, since prediction must be shut off at the accessible points. Until now, the first p samples, where p is the prediction order, have been kept unchanged and have required separate coding because of their large amplitudes.

2.2. Progressive prediction

For starting samples in the random access frames, progressive-order prediction is useful as a way of making full use of the available samples and thus reducing prediction error as much as possible. While it is of course impossible to predict the first sample, the second sample is predictable by first-order prediction from the previous sample alone. In general, the prediction error at the (q+1)-th sample is derivable by q-th order prediction.
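The scheme described above can be sketched in Python as follows. This is a hedged illustration, not the procedure from the paper's figures: the sign convention (error filter A(z) = 1 + sum of a_i z^-i), the use of floating-point arithmetic with `round()` in place of fixed-point arithmetic, and the names `step_up`, `predict`, `encode` and `decode` are all assumptions of this example. The order-q predictor is built from the first q PARCOR coefficients by the standard step-up recursion.

```python
def step_up(parcor):
    """Return predictor coefficient lists for orders 0..p from PARCOR coefficients.

    Assumed recursion: a_q[i] = a_{q-1}[i] + k_q * a_{q-1}[q-i], with a_q[q] = k_q.
    """
    coeffs = [[]]  # order 0: no prediction
    a = []
    for q, k in enumerate(parcor, start=1):
        a = [a[i] + k * a[q - 2 - i] for i in range(q - 1)] + [k]
        coeffs.append(a)
    return coeffs


def predict(history, a):
    """Predict the next sample; a[i] weights the sample (i + 1) steps back."""
    return round(-sum(c * history[-1 - i] for i, c in enumerate(a)))


def encode(samples, parcor):
    """Residual for a random-access frame: sample n uses order min(n, p)."""
    coeffs, p = step_up(parcor), len(parcor)
    return [x - predict(samples[:n], coeffs[min(n, p)])
            for n, x in enumerate(samples)]


def decode(residual, parcor):
    """Rebuild the samples using only the residual and the PARCOR coefficients."""
    coeffs, p = step_up(parcor), len(parcor)
    out = []
    for n, e in enumerate(residual):
        out.append(e + predict(out, coeffs[min(n, p)]))
    return out


# Hypothetical frame and PARCOR values, chosen only for illustration.
samples = [100, 102, 104, 105, 105, 104, 102, 99]
parcor = [-0.98, 0.9]

residual = encode(samples, parcor)
restored = decode(residual, parcor)
```

The first residual equals the first sample, since nothing is available to predict it. From the second sample on, the residual is the error of a predictor whose order grows by one per sample until it reaches p. The decoder repeats the same rounding as the encoder, so reconstruction is exact on a given machine; a bit-exact codec would use integer arithmetic instead.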

For this progressive-order prediction, PARCOR coefficients are convenient, since each coefficient is independent of the prediction order p, while normal auto-regressive LPC coefficients need to be calculated for every prediction order q up to p. The associated lattice filter is shown in Fig. 2, where k_q represents the q-th PARCOR coefficient. An example procedure of PARCOR-based progressive-order prediction is shown in Fig. 3. It is understood

[Figure 1: block diagram; block labels: entropy decoder, prediction error, side information, LPC, output (L-ch), output (R-ch)]

Fig. 1. Decoding process of reference predictive coding system with simple inter-channel prediction.
