Format scan and conversion

Dec 1, 2007 12:00 PM, BY DAVID TASKER

    

Processing challenges in motion estimation

In a motion-compensated standards converter, the interfield interpolation axis is not aligned at the time axis in the presence of motion. (See Figure 1 on page 22.) In practice, the interpolation axis is skewed by using the motion vectors to shift parts of the source fields. The displacement is measured in pixels, and the value is divided into the integer part (the nearest whole number of pixels) and the fractional part (the subpixel shift). Pixels from input fields are stored in RAM, which the interpolator addresses to obtain input for filtering.

The integer part of the impulse response shift is simply added to the RAM address so that the pixels from the input field appear to have been shifted. The vertical shift changes the row address, and the horizontal shift changes the column address. Address mapping, commonly used in DVEs, moves the image with pixel accuracy. This stage is followed by using the subpixel shift to control the phase of the interpolator. Combining address mapping and interpolation in this way allows image areas to be shifted by large distances with exceptional accuracy.

What makes the quality of motion estimation so critical is the behavior of outputs when the converter makes a mistake in creating an intermediate field. Say a camera pan causes a person to move from left to right; the motion compensation system must locate that person in a point in time never actually captured by the camera. If the technology moves the head and body differently, for example, the human eye and brain combination will know at some level that something's not right. Thus, a minute error in motion estimation, even for one output picture, is enough to create a significant disturbance for the viewer.

It is widely understood that fast and complex motion can be a challenge for conversion, but in practice, the speed of that movement can help mask the effects of poorly performed motion compensation. (See Figure 2 on page 26.)

The motion estimator's ability to manage still pictures is also important. While it seems like an easier task to convert this type of material, some technologies can turn stills into a picture with bits moving around. Even seemingly benign images can go to pieces, such as a still shot of a building exterior, with the windows in motion.

Moving roller credits are a particular challenge. It takes sophisticated motion estimation to account for the movement of small objects within a picture. Phase-correlated motion compensation can enable high-quality de-interlacing as well as precise, clean frame-rate conversion even for complex graphics, fast-motion sports, film and variable speed camera outputs.

The mark of exceptional motion compensation is not just higher accuracy in creating pictures, but also the way in which it does make mistakes. A mathematical byproduct of phase correlation is a reliability indicator that can tell the system when it is working effectively and when it must tread more carefully. This information provides a graceful fallback mechanism for concealing any errors the system inevitably will make.

Added image and audio processing

When motion-compensation standards conversion became available, phase correlation was so effective that often the only residual artifact for the viewer or clue for the downstream broadcaster was that cuts were no longer clean. Since then, different technologies have been developed, helping ensure clean video transitions between scenes and programs. Some solutions allow operators to choose the field dominance of the converter output. Prior to this development, converters scrambled field dominance, whether or not it was correct on the source.

The problem with fluctuating field dominance is that it becomes difficult to edit programs. It is also an issue with international program exchange. Post-conversion master editing becomes tricky when the field dominance isn't consistent.

Because most content is compressed somewhere within the delivery chain — whether on DVD or over a broadcast media — fluctuating field dominance is also a problem here. The issue makes it difficult for a downstream compression system to insert a single clean I-frame. The efficiency of both workflows — and the quality of the end product — can be compromised when standards conversion doesn't provide clean transitions.

Consider pixel accuracy

Although the number of pixels involved in SD and HD conversion differs, the problems remain the same. If content is shot using an interlaced camera and the content contains interlaced content on a field-by-field basis, the only transparent way to convert between formats is to use motion estimation to measure the movement between fields and compensate for the effects of the movement between them.

The first step is to use motion estimation to nullify the effects of any movement and to make sure that pixels within the input frame are aligned in time. Within this deinterlacing process, it is the motion estimator's job to deliver motion information that can be used to near-perfectly compensate for movement.

Another issue accompanying modern standards conversion is aspect ratio. While most of Europe watches television in a widescreen SD format, the transition to a 16:9 aspect ratio is still relatively new to most U.S. broadcasters. This means that production companies and broadcasters must effectively deal with at least two aspect ratios. Fortunately, every standards converter made today includes built-in aspect ratio conversion with various preset and user-definable modes.

The audio side

Dealing effectively with audio also has become the standards converter's responsibility. It is imperative that standards converters accommodate 16 channels of audio, or eight AES pairs, and be able to resample audio and perform sample rate conversion from the input rate to the output rate. The Dolby audio standard is used extensively throughout the broadcast industry for multichannel surround. This means handling up to 16 channels filled with Dolby E, discrete 5.1 audio, an additional stereo mix and perhaps a second language or soundtrack information.

Dolby E brings its own special requirements to conversion because it is locked to the incoming frame rate of video and must be decoded, recoded and relocked in order to ensure that the audio can be re-edited downstream without corrupting the Dolby E signal. Any standards converter you select should be able to compensate for video delay introduced during conversion and even peripheral delays and, on a per-channel basis, account for lip-sync errors on the incoming program.

Continued product development

Investing in standards conversion technology is an important decision in the life of a broadcast, post or duplication facility. Advanced solutions today include comprehensive film tools for 23p, 24p, 25p, 30p and sF formats, along with 3Gb/s capabilities to handle 1080p. Standards conversion platforms are capable of operating within both the hardware or software domain.

Realize that issues of standards conversion do not disappear in the file-based workflow. Content must remain at the correct frame rate through the entire workflow right to the end of the consumption chain. With the shift to software-based platforms and increasing work with HD, fast and effective motion compensation will depend on manufacturers' ability to maintain accurate motion estimation. This development likely will leverage hardware acceleration to turbo charge software-based solutions to enable efficient, file-based interchange of content.


David Tasker is head of technical sales for Snell & Wilcox.




Want to use this article?
Click here for options!
Get Copyright Clearance

Share this article

blog comments powered by Disqus

 

Current Issue

Online captioning compliance

May 2012

The FCC has issued captioning requirements for all online video. Learn how to meet the requirements of the new rules and how to automate the technical process.

Read More articles...

Related Newsletter

Transition to Digital
A twice per month tutorial on digital technology.

Related Posts


Confused about the terminology in an article? Find definitions of common terms and abbreviations in Broadcast Engineering's Glossary.

 


Video Compression, Editing and Displays

Video Compression, Editing and Displays

Video compression, editing and displays is an in-depth tutorial on MPEG compression technology, editing MPEG content and evaluating color video monitors written by long-time video expert, trainer and writer Steve Mullen, Ph. D.

File Based Technology and Workflow

File Based Technology and Workflow

File-based technologies have replaced video tape methods for a majority of production and broadcast operations. The worlds of AV and IT are coalescing to create new methods and workflows for media

Sound Off Podcasts

 

Broadcast Engineering Digital Reference Guide

Browse Back Issues

Back to Top