Controlling loudness

Feb 1, 2010 12:00 PM, By Guy Marquis

New audio processing technology allows broadcasters to overcome traditional loudness control issues.

    
Figure 1. Wideband algorithms continuously control a full audio bandwidth gain stage in 
real time.

Figure 1. Wideband algorithms continuously control a full audio bandwidth gain stage in real time.
Select figure to enlarge.

Loudness control is probably the source of more complaints for broadcasters than any other technical issue. Viewers have grown weary of continuously adjusting the television volume, especially between programs and commercials.

This lack of audio consistency between segments has arisen due to the technical difficulties inherent with stabilizing loudness across multiple sources of programming, along with a desire to make commercials more noticeable to viewers. For a long time, the audio for commercials has been compressed to generate a more even but louder soundtrack.

However, this landscape is changing quickly, and there is new impetus behind measures to control loudness. This is being driven by multiple factors, including upcoming legislation around the world, like the CALM Act in the United States, and new audio recommended practices, such as ATSC A/85. There's also a desire to offer viewers a better quality of experience and, perhaps most importantly, new loudness control technology that's creating more options for broadcasters.

Up until now, broadcasters have typically relied on PPM meters to measure audio levels. However, the reality is that two different pieces of audio, with the same level characteristics, can sound very different when it comes to their loudness impression. Perceptions of loudness are affected by the speed of gain changes and by sudden jumps in audio levels. Therefore, an audio operator cannot simply look at the volume units, or peak meters, to determine the true loudness level of the program as perceived by the viewer at home.

Much more effective loudness measurement is offered by the ITU's BS.1770 algorithm, which is based on human hearing characteristics and provides a numerical value indicating the perceived loudness of the content being measured. A number of equipment vendors have already incorporated ITU-R BS.1770 in their audio loudness measurement and processing solutions.

Traditional loudness control during production

Figure 2. Multiband algorithms apply their gain compensation on different frequency bands independently, reflecting the fact that the perception of loudness by the human brain varies with the different frequencies of the audio spectrum.

Figure 2. Multiband algorithms apply their gain compensation on different frequency bands independently, reflecting the fact that the perception of loudness by the human brain varies with the different frequencies of the audio spectrum.
Select figure to enlarge.

There are now multiple different approaches to loudness control, including new recommendations for content origination, as well as new devices providing a loudness control safety net during playout. Let's consider the content origination option first.

Dolby has long been active in loudness measurement and control through the use of dialog normalization (dialnorm). During content production, audio engineers can set the dialnorm value and profile metadata using their Dolby E encoders. This dialnorm value can be preserved in the VANC all the way through the playout process and ultimately to the viewer's set-top box. Whenever the set-top box detects a change in the dialnorm value, a different attenuation is applied to bring the loudness back to the desired range, an average dialog level at -31dBFS.

This is an elegant solution that preserves audio characteristics, while minimizing both channel-to-channel and intrachannel loudness differences. However, many broadcasters have found it difficult to preserve the dialnorm metadata throughout their playout chains, which typically comprise a large number of devices from multiple vendors. Another issue is that broadcasters do not typically have full production control over all the content they broadcast. A typical facility will use feeds from other broadcasters, live news feeds and even Web content. Live news feeds are especially problematic because the dialnorm value is usually set after the program is produced, as it represents the average dialog loudness of the entire program. However, this process is just not conducive to fast-turnaround news environments. Naturally, all these different factors can undermine this loudness control process.

To help address these problems associated with broadcasting content from multiple third-party sources, the ATSC A/85 Recommended Practice proposes using a standardized, dialog average loudness level of -24dBFS for content creation. In essence, the aim here is to offer more control to broadcasters by limiting the variation in the loudness level from different content producers. However, this proposal has not been widely adopted at this point.

Figure 3. Shown here is a playout application for automatic loudness control.

Figure 3. Shown here is a playout application for automatic loudness control.
Select figure to enlarge.

New loudness control safety net in playout

In view of the difficulties associated with using third-party content, it makes sense to reinforce any loudness control processes used during content production with a safety net that can be activated during playout. This can be provided by a new generation of loudness control processors. Typically, these comprise an advanced automatic gain control processor, with a range of processing options to best suit individual broadcasters. Choices include set-and-forget type loudness processing, and active management of the processing according to the content type. Other options include wideband or multiband processing, which each offer different strengths to different types of facilities.

When a loudness control processor is used in a set-and-forget mode, a target average loudness is set per channel program, and this will be maintained by the processor during commercial breaks, as well as when sources from different origins are played out.

A more sophisticated loudness control method involves adapting the loudness processing according the type of content. Different profiles can be used, which offer processing for music, speech and other forms of content. With this method, a broadcaster's traffic team will tag the content according to its type so the playout automation can select different profiles for the loudness processor, using GPI or serial commands. Another variant of this method involves using the automation to trigger a bypass of the processing for certain types of content. For instance, some engineers may like to use only the loudness processing for content that has been generated by third parties in order to preserve original dynamics as much as possible for their own originated and reliable material.

A good loudness controller must be able to react swiftly to fast loudness changes, such as going to a commercial, as well as over the longer term to manage an entire program that might be too low or too high in relation to the dialnorm value in use. So, a good solution will always be composed of a long-term automatic gain control and a short-term compressor/expander/limiter. For equipment vendors, the challenge has been to come up with good controlling algorithms and, more importantly, the right parameter combination. Unless these key criteria are addressed, the dynamic range of programs can be adversely affected by loudness control, and artifacts like pumping can become apparent to viewers.




Want to use this article?
Click here for options!
Get Copyright Clearance

Share this article

blog comments powered by Disqus

 

Current Issue

Online captioning compliance

May 2012

The FCC has issued captioning requirements for all online video. Learn how to meet the requirements of the new rules and how to automate the technical process.

Read More articles...

Related Newsletter

Audio Technology Update
A twice-monthly newsletter about audio technology.

Related Posts


Confused about the terminology in an article? Find definitions of common terms and abbreviations in Broadcast Engineering's Glossary.

 


Video Compression, Editing and Displays

Video Compression, Editing and Displays

Video compression, editing and displays is an in-depth tutorial on MPEG compression technology, editing MPEG content and evaluating color video monitors written by long-time video expert, trainer and writer Steve Mullen, Ph. D.

File Based Technology and Workflow

File Based Technology and Workflow

File-based technologies have replaced video tape methods for a majority of production and broadcast operations. The worlds of AV and IT are coalescing to create new methods and workflows for media

Sound Off Podcasts

 

Broadcast Engineering Digital Reference Guide

Browse Back Issues

Back to Top