Developments in TV audio
Jan 1, 2010 12:00 PM, By James Castelton
Broadcast audio has evolved from mono to multichannel.
Television audio has progressed in a matter of 20 years from monophonic analog transmissions using FM modulation to digital surround sound carried in the MPEG transport stream. In addition, new features such as audio description and control metadata are all improving the service for listeners.
The evolution
Worldwide broadcast technology is evolving in both the state and private sectors as broadcasters migrate from analog to digital transmissions. Across Europe, companies are launching HD terrestrial services as part of the switchover. This year, the FIFA World Cup is being used as a starting point from which broadcasters will roll out new HDTV 5.1 services.
DVB SD services have generally carried stereo audio using MPEG-1 Layer II (ISO/IEC 11172-3) and MPEG-2 Layer II (ISO/IEC 13818-3) encoding. Some broadcasters have used surround sound as part of their premium services for sporting events and movies.
As broadcasters move from SD to HD services, part of the more immersive viewing experience is to provide surround sound. The existing DVD market, plus growing penetration of HDTV, Blu-ray and 5.1 home theater systems, adds to viewers' expectations for enhanced audio services with broadcast TV.
New platforms can also deliver multichannel audio, including online services to complement digital TV channels. Also, the online aggregators are engaging with game console makers as well as set-top box (STB) and TV receiver manufacturers to deliver services into the home. In Europe, surround services are available via more than 300 channels, and that number is growing.
Efficient bandwidth use is always an issue, whatever the platform, because it enables service providers to offer higher quality or more channels. There are always limits to compression efficiency, and the limits are not entirely the development of new algorithms, but the cost of implementing the algorithms in silicon. As semiconductor technology advances, what can be achieved in a chip increases yearly.
Compression schemes like MPEG-2, Layers II and III, were all that was possible in the 1990s. Ten years later, much more is possible at the low prices demanded for the components used in STBs and integrated receivers.
New requirements
This evolution from mono to multichannel must be carefully man-aged. The public expects a reasonable life from their receiver and will resist rapid obsolescence through constant standards changes to trans-missions; thus, cable operators have a large investment in their STBs.
Broadcasters also need to amortize audio infrastructure, so, again, expect a standard to run for some years before change. They also have to be mindful of changes to production techniques and workflow in the migration from stereo to multichannel audio.
This means that change to delivery standards must be carefully considered and well researched. In the past, early adopters have been left with less-than-optimal systems and out of step with international standards. It is in the interest of both viewers and CE manufacturers to agree to standards across a wider territory to increase market size and lower costs. Organizations such as the DVB Forum, the European Information and Communication Technology Association (EICTA) and Consumer Electronics Association (CEA) have created a framework for the delivery of multichannel audio that constrains delivery standards. Many national governments have derived standards to support the local markets from these frameworks that take account of any multilingual requirements within their territory. (See Table 1 on page 8.)
Dolby Digital Plus (E-AC-3) and HE-AAC (an amendment to MPEG-4 Part 3, ISO/IEC 14496-3) are the two audio codecs that have been chosen as next-generation broadcast audio codecs to provide facilities with high-quality audio, control, consistency, compression efficiency and surround sound (5.1). These codecs both use spectral extensions to improve compression efficiency and to make improvements to coding algorithms now possible with increased processing power.
Dolby Digital Plus
Dolby Digital Plus or Enhanced AC-3 (E-AC-3) adds extensions to the AC-3 compression algorithm used by the ATSC HDTV system in the United States, among other applications. New coding tools include spectral extension, enhanced channel coupling, transient prenoise processing and improved filter bank and quantization. The result of the enhancements is that some applications can use lower data rates (than AC-3), although the improvements also deliver support for higher data rates up to 6.144Mb/s. A typical 5.1 surround-sound bit rate can be about 256kb/s.
Spectral extension replaces higher-frequency coefficients with lower-frequency spectral segments translated up in frequency. Much high-frequency content is the harmonics of lower-frequency sound.
New features include simultaneous support for more channels beyond 5.1, 7.1 (as used in Blu-ray) and 9.1 (for height perception) — allowing for at least 13.1 for future applications. The standard also supports multiple program streams through substreams, all multiplexed into a single bit stream. This could be used for audio description, commentaries and the like.
HE-AAC
The high-efficiency advanced audio codec (HE-AAC) is a development of the MPEG-4 AAC audio codec. Specifically, it adds spectral band replication (SBR) and, in the Version 2 profile, parametric stereo (PS).
If a codec like MP3 is to be used at low data rates, to avoid audible quantizing noise, the bandwidth is limited to about 7kHz. Much like the spectral extension of E-AC-3, HE-AAC uses SBR to reconstruct higher frequencies from the lower frequency range. The reconstruction is aided by the SBR signal, multiplexed with the lower-frequency data; thus, the aim, to limit the audio bandwidth of the codec while maintaining the full output audio bandwidth, can be realized.
Parametric stereo relies on the strong correlation between left and right signals. A composite mono signal is derived, and separate parameters to describe the stereo signals are derived (at a fraction of the rate of composite mono data), and used by the decoder to reconstruct the original stereo signal. The stereo signal is described as a panorama parameter (L/R differences) and an ambience parameter. The Dolby implementation of HE-AAC v2, which adds Dolby metadata and full 5.1 surround sound, is called Dolby Pulse.
Metadata
One key difference among current audio codecs is the inclusion of metadata in the bit stream. This allows broadcasters more control over delivery to a wide range of receivers — from a mono portable receiver to a home theater.
| Want to use this article? Click here for options! |






















