MPEG-4 Overview © MPEG 1999-2002 – unlimited reproduction permitted if not modified 16
2. Speech signals, to extend the capabilities currently provided by MPEG-4 speech coders.
A single technology that addresses both of these signals is preferred. This technology shall be
both forward and backward compatible with existing MPEG-4 technology. In other words, an
MPEG-4 decoder can decode an enhanced stream and a new technology decoder can decode an
MPEG-4 stream. There are two possible configurations for the enhanced stream: MPEG-4
AAC streams can carry the enhancement information in the DataStreamElement, while all
MPEG-4 systems know the concept of elementary streams, which allow second Elementary
Stream for a given audio object, containing the enhancement information.
b) Parametric coding
The MPEG-4 standard already provides a parametric coding scheme for coding of general
audio signals for low bit-rates (HILN, "Harmonic Individual Lines and Noise"). The extension
investigates parametric coding of general audio signals for the higher quality range, to extend
the capabilities currently provided by HILN. Whenever possible this technology will build
upon the existing MPEG-4 HILN technology.
5 Profiles in MPEG-4
MPEG-4 provides a large and rich set of tools for the coding of audio-visual objects. In order
to allow effective implementations of the standard, subsets of the MPEG-4 Systems, Visual,
and Audio tool sets have been identified, that can be used for specific applications. These
subsets, called ‘Profiles’, limit the tool set a decoder has to implement. For each of these
Profiles, one or more Levels have been set, restricting the computational complexity. The
approach is similar to MPEG-2, where the most well known Profile/Level combination is
‘Main Profile @ Main Level’. A Profile@Level combination allows:
a codec builder to implement only the subset of the standard he needs, while maintaining
interworking with other MPEG-4 devices built to the same combination, and
checking whether MPEG-4 devices comply with the standard (‘conformance testing’).
Profiles exist for various types of media content (audio, visual, and graphics) and for scene
descriptions. MPEG does not prescribe or advise combinations of these Profiles, but care has
been taken that good matches exist between the different areas.
5.1 Visual Profiles
The visual part of the standard provides profiles for the coding of natural, synthetic, and
synthetic/natural hybrid visual content. There are five profiles for natural video content:
1. The Simple Visual Profile provides efficient, error resilient coding of rectangular video
objects, suitable for applications on mobile networks, such as PCS and IMT2000.
2. The Simple Scalable Visual Profile adds support for coding of temporal and spatial
scalable objects to the Simple Visual Profile, It is useful for applications which provide
services at more than one level of quality due to bit-rate or decoder resource limitations,
such as Internet use and software decoding.
3. The Core Visual Profile adds support for coding of arbitrary-shaped and temporally
scalable objects to the Simple Visual Profile. It is useful for applications such as those
providing relatively simple content-interactivity (Internet multimedia applications).
4. The Main Visual Profile adds support for coding of interlaced, semi-transparent, and
sprite objects to the Core Visual Profile. It is useful for interactive and entertainment-
quality broadcast and DVD applications.