Broadcasting In Discrete 5.1 Surround: What's The Cost?


by Steve Church & Michael Dosch
Axia Audio
Cleveland, Ohio, USA

There is growing interest among broadcasters to deliver a surround listening experience to their audiences. Surround is clearly the hot topic at audio, consumer electronics, and computer shops. Visit any of these and you will see plenty of surround audio set-ups. Indeed, it would appear to a casual visitor that stereo has become nearly obsolete. The systems on display are home systems, mostly to be used for “home theater” listening to accompany surround DVD-Video disks, which are near-universally produced with a digital surround audio track. But many people have discovered surround as an impressive way to enhance general music listening as well. Two audio disk formats offer an audiophile grade carrier to provide surround audio to consumers: DVD-Audio and Super Audio CD (SACD). In the shops, you will find DVD players costing around $140 that can play all three formats. In all cases, surround is being delivered to consumers digitally in the so-called 5.1 format, providing six discrete digital channels: Left Front, Right Front, Left Surround, Right Surround, Center, and Subwoofer.

Our current modern systems offer a tremendous jump in quality over the early "quadraphonic" attempts to woo consumers with multi-channel sound. These were “matrix” systems that combined four tracks into two using phase-shifting techniques. They had the advantage that existing stereo vinyl records and FM transmission could be used to convey the audio to consumers. They were even compatible! One record, one broadcast could serve both quad and stereo listeners, so they said. This was all certainly convenient to broadcasters: just playing a surround-encoded record in the same way and with the same equipment that you would play a stereo one made you a surround broadcaster.

Alas, compatibility was achieved only in the imaginings of the record company PR departments. The reality was quite something else, as the systems were heard and their faults became apparent: the vast majority of people still listening in stereo received music that sounded quite different from what they were used to hearing, with distinctively strange placement, reverb, and cancellation effects. And the separation of the surround channels proved disappointing, dipping to as low as 3dB between some of the channels. And so the great quad broadcasting experiment came to an end.

The modern "5.1 channels" idea was the first surround method conceived in the digital era. Work on it was begun in 1987 by the Society of Motion Picture and Television Engineers when it looked to be possible to digitally encode audio for film releases. SMPTE decided that 5.1 channels were satisfactory to create the aural sensations film producers desired. The name was proposed by film sound innovator Tomlinson Holman to initial confusion but eventual acceptance. The “point one” channel is the subwoofer, with the decimal value suggesting the limited frequency response of the channel.

Prior to the establishment of 5.1 as a standard, a few surround films had been released on 70mm that had the capability for six audio channels in the longstanding analog optical format. Star Wars was the first, followed by Close Encounters, Superman, and Apocalypse Now. These were all big hits and all used essentially the 5.1 L-C-R-LS-RS-Sub arrangement, so there had been successful real-world experience with the format. There is limited physical space on film for the marks needed to encode digital values and at first it seemed there would not be enough space to hold six channels. But a new development came along just in time: audio “coding” or compression. With the possibility to reduce the bitrate by a factor of up to 10 over simple PCM, multichannel digital for film became a practical reality. Dolby Digital, DTS, and Sony SDDS were invented to exploit the opportunity and remain in widespread use today.

What about radio?
By now you have probably noticed what’s missing from this picture: Radio. While it always been possible to transmit matrix surround over traditional FM, the introduction of iBiquity HD Radio in the USA offers the opportunity to give listeners digital surround with quality commensurate with their current and soon-to-come experiences with movie theaters, DVD film, TV broadcast, surround music disks, computer audio, and portable players. European DAB offers a similar opportunity for surround upgrade, and first steps are underway to enhance this radio service.

The iBiquity HD Radio service has a bitrate of 100kbps, of which 96kbps is used for audio. Only a couple of years ago, this would have been thought to be too little for high-fidelity stereo. Surround at this rate was but a dream. As with the introduction of digital surround to film, enabled by the just-in-time availability of audio compression technology, another bit of magic seems to have appeared just when needed for surround radio broadcasting: parametric surround coding technology. This amazing development allows a stereo signal to be expanded to surround with an additional 5-32kbps added to the basic stereo rate. Recent tests of the latest surround codec version have shown that 6kbps offers performance as good as our first demonstrations that used 16kbps.

A station using all 96kbps for a single high quality program would probably take a 90/6kbps split for the core stereo and surround streams. Stations wanting to "multicast" will divide the bits according to the needs of the various programs.

iBiquity has a proposal before the FCC to increase the HD bitrate to 150kbps, which would offer more options for additional programs and/or quality enhancement.

 


MPEG Surround System via HD Radio end-to-end

Remember that the quad broadcasting experiments came to an end primarily because it was not possible to achieve acceptable compatibility in stereo. Fortunately, this is guaranteed not to be a problem with the modern MPEG approach because the stereo signal is taken directly from the original source and sent on to the listener without modification. Unlike with the matrix systems, there is no requirement for the original 5.1 source to be downmixed to create the stereo broadcast. Instead the system takes both the stereo and the 5.1 signals from the source, such as a DVD-A or SACD disk, and uses the 5.1 to create the 6kbps surround-coded stream. This stream contains the information necessary to expand the stereo to surround at the decoder; a format that its developers, Fraunhofer Labs, call "2 + 5.1".

So we have a fortunate match of the coder’s characteristics to the application. If for some reason we have only the 5.1 source, it would be possible to downmix it automatically to create the stereo signal, but if we have a handcrafted stereo mix available, as we usually do, we are able to use it. Advances in digital transmission and codec technology let us get that so often sought and so rarely achieved “something for nothing.” Station owners bought an old-fashioned analog stereo license and now find themselves with the potential to offer a state-of-the-art digital surround service just when they need it to compete. Not bad!

Building a modern surround radio studio facility
To broadcast in MPEG surround, a station has to upgrade it's studio facilities to surround. Specifically, we need to store, network, and mix in the 2 + 5.1 format. This is the main objection that proponents of matrix systems proffer – that their systems can be used with existing stereo facilities.

So, since we need to examine the cost associated with a surround upgrade, let’s walk through how one would build a modern 2 + 5.1 plant, with a careful eye to expenses. We'll start with the routing and distribution infrastructure, then the PC-based delivery system, move to the mixing console, then on to the surround encoding, dynamics processing, STL, and transmitter. Then we’ll discuss monitoring and the production studio’s needs. All will be in the context of using computers and computer networking to provide the functions we used to get from the old proprietary radio station machinery.


Surround Radio Station: Functional Perspective

The Routing and Distribution Infrastructure
Here we are talking about the glue that binds the studios together. In smaller stations, this may only be a couple of distribution amps that provide a couple of rooms with network feeds. But larger facilities, including the common consolidated ones in the USA, usually need to have flexible audio routing so that sources originating at any point within the plant may be consumed anywhere else. There is also the need to distribute a multitude of network feeds to a number of studios, switch various studios to transmitters and other outgoing lines, etc. Thus in the past years, we have seen the increasing application of facility-wide audio routers such as have been common in TV facilities for some time. These are proprietary boxes filled with cards that communicate via a backplane and offer various kinds of input/output. They look very much like the telephone PBXs that have been in use over the past decades and share many characteristics. These are manufactured in low volume for our very small industry, and are consequently expensive. Each input or output requires a port on a card which needs physical space, conversion chips, etc. An 8-channel input such as we need for surround would require 8 individual XLRs for analog or 4 for AES3 connections. Same for a surround output.

We propose a system that uses an Ethernet switch as an audio router. When analog or AES3 inputs and outputs are needed, these are converted in “nodes” to Ethernet. But this system requires many fewer of these because most devices communicate directly via a single Ethernet RJ-45.

An Ethernet 100BaseT link has 100Mbps capacity, enough to transport 25 uncompressed stereo signals or 3 8-channel surround signals. And these are bi-directional. One RJ-45 thus substitutes for as many as 100 XLRs!

Delivery System
Most stations are using PC-based delivery systems to play music, promos, commercials, etc. With today’s low-cost, high-capacity hard drives, there is no significant barrier to storing the required 8 channels. A 300 Gigabyte drive costs less than $200 and can store 1200 surround songs with no compression.

With an Ethernet infrastructure, there is no need for soundcards and their associated connectors. There is also no need for router or console inputs and/or outputs at the other end. Driver software passes the audio to and from the audio playback application and the Ethernet. Physical connection is via a single RJ-45. Modern Ethernet switches support “Quality of Service” prioritization, so that general data may share the same link as audio. That means that you can use the same network for both audio playback and for other applications like file downloads from a server.

We propose to store audio in eight 24-bit integer packed PCM (uncompressed) channels in standard Windows interleaved wav format, organized as shown in the table at left.

This layout is standardized within the ITU and SMPTE for interchange of program content accompanying a picture and is widely used with TV digital tape recorders. The Music Producer’s Guild of America has also endorsed it. For Windows PCs, this will be stored in the RIFF/WAVE audio file format, which is a variation of the longstanding .wav format. The “fmt” (format description) chunk is a WAVEFORMATEXTENSIBLE structure that allows description of multichannel formats as well as any other PCM and non-PCM audio formats.

We choose 24-bit because DVD-Audio has this resolution and SACD has dynamic range that could take advantage of this bit depth. The Axia Livewire network also has 24-bit resolution, so we have a match between the source, the storage, and the network. Compact disks have 16 bits and this has been the norm in broadcasting, but 24-bits are the future. 16-bit systems have theoretically 94dB dynamic range, and 24-bit systems 141dB. Both are plenty enough for radio broadcasting, but having more bits means that distortion at low audio levels is reduced, which may be audible – even (or particularly?) after aggressive processing. Were big cheap hard drives not available, we’d probably want to stay with 16 bits – but with drives so cheap, why not splurge?

Audio that is stored on other formats: compressed, fewer bits, only mono, stereo, or surround should be uncompressed and/or up or down-mixed as needed to convert to the network’s standard format. The file header tells the application about the format so that it knows what to do. We could consider compressing the surround channels to extend capacity. Since they are only used as inputs to the surround position encoder and are not actually transmitted, there would be no degradation of the on-air quality at all. On the other hand, swapping to a bigger drive or adding another one is so cheap, so perhaps there is no compelling reason to bother with it. A software “driver” installed in the PC makes the network look like a standard Windows Driver Model (WDM) soundcard, so any audio application that works with usual soundcards should work without modification to send and receive audio from the network.

Mixing Console
A modern mixing console can be built with two ingredients: A control surface and a mixing and processing Engine with PC motherboard, CPU, and Ethernet connection. While the control surface has to be manufactured in the small volumes our industry dictates, the Engine can take advantage of powerful, high-volume, low-cost components from the computer world. A commodity 2.4 Gigahertz Pentium 4 CPU has plenty of horse-power to support mixing, equalization, panning, dynamics control, etc. for a 24-fader surround broadcast console.



Next-generation consoles will have a surround metering option that show both stereo and 5.1 channel levels (as demonstrated by Axia's Element modular control surface, above). Because this is only a software change for a PC display, there is no additional cost compared to stereo.

The Engine has only two connectors: power and Gigabit Ethernet. All audio and control pass via the single RJ-45 Ethernet jack. With no hard drive (software is stored in a Compact Flash card), embedded Linux as the operating system, and all parts mounted on one PCB, reliability is probably higher than a traditional digital mixing engine with its many plug-in DSP, CPU, input/output cards, etc.

Cost to provide surround mixing in this PC Engine-based console is the same as for stereo. There is no incremental increase in cost going to surround from stereo because the P4 platform has so much headroom that surround mixing software can be added without changing any hardware. The Gigabit Ethernet connection has enough capacity as well to support the additional surround signals. Contrast this with a surround upgrade to a traditional console. You would have four times the dozens of audio in/out connectors already needed for stereo and many more plug-in cards, leading to probably having to increase the size of the frame. Your cost increment would be tens of thousands of dollars.

Via the Ethernet switch, the console has access to any audio source in the system. Its various outputs may consumed anywhere within the facility.

There will surely be a lot of experimentation with microphone ideas for surround. In most cases, mono mics will be panned to a position within the surround stage. Perhaps reverb with multiple outputs, time-delay, pitch-shifting, or comb-filtering processing will be used to create a sense of immersive spaciousness. Surround panning will be part of the console and so microphones without additional processing will cost no more to support in surround than in stereo. Other local inputs and outputs would require corresponding ports in the audio-to-Ethernet nodes. Stereo CD players would need only the usual two input ports and would be panned to surround within the console. Only surround SACD and DVD players would require surround inputs. They would normally connect 5.1 channels with the downmixing to stereo happening within the console.

Surround Encoding
The surround encoder can another Ethernet-connected box. One RJ-45 serves all required inputs and outputs. The 2 + 5.1 channels from the console program output are the inputs and the output is a 6kbps coded surround stream that gets sent to the transmitter. Alternatively, the surround encoding could be done within the dynamics processor or HD encoder.

Dynamics Processing
Initial testing indicates that existing stereo processing is satisfactory for the MPEG surround system. Any dynamics processing that is applied to the stereo channel affects the received surround channels as well. Because today’s processing is not enabled for direct Ethernet connection, a node is used adapt the network audio to the processor’s input and to apply the output back to the network.

Future processors may incorporate the surround encoder and offer more sophisticated individual processing control over the stereo and surround channels. For example, it may be interesting to have a way to “deprocess” the surround channels somewhat, while maintaining a more aggressive sound on the stereo program. More than a few listeners exposed to surround have said that the envelopment effect causes a perception of "high energy" similar to what programmers and engineers try to achieve with dynamics compression.

FM and HD require different processing styles. FM needs special attention to pre-emphasis, usually quite a lot of left/right clipping, and perhaps even some composite clipping. The HD encoder has to work harder on a clipped signal and will not have as good a result as from a non-clipped signal since the additional harmonics look like audio that needs to be encoded and therefore attempt to receive bit allocation. So a processor optimized for the HD channel will generally use a look-ahead limiter rather than a clipper. Stations probably will decide to process the HD program less than the FM in order to offer a more “purist” signal to listeners. (For now, anyway. When HD Radio gets popular, all bets are off.) The most popular processors use a common front-end AGC section and follow that with independent limiter sections for FM and HD. Thus, both outputs need to be connected ultimately to their respective transmitters.

HD Radio Encoding
In iBiquity’s second-generation HD Radio system, the encoder is located at the studio. This has the advantage that the processing may be co-located at the studio and the STL only has to convey the encoded HD signal, tremendously reducing its bandwidth requirements. It also gives the benefit that any additional data that needs to be muxed-in can be applied at the studio. This data could be Program Associated Data (PAD) like song titles, or indeed our 6kbps coded surround stream. The input for this data is via Ethernet, so connecting it to the network easily enables a path from the surround encoder.

Studio to Transmitter Link
In most cases, we need to get our FM program audio to the transmitter in either composite stereo or PCM form. And we need to send the 96kbps HD radio signal. We could decide to do this with two independent links, or we could use one STL radio to handle both.

A digital STL such as the Mosley Starlink can be used in this set-up. The FM audio goes via the usual input and the HD radio signal via the ancillary data channel. These radios don’t have much capacity because they operate in the traditional 950 MHz band, where not much bandwidth is available. Because their operating frequencies are protected by license and because the frequencies they use are (relatively) low, they are quite reliable.

Another way would be to use the new Ethernet radios like the BE Big Pipe. These operate with bitrates up to 45Mbps, so there is a lot of capacity for multiple audio channels as well as data, VoIP phone, etc. Since we already have all our facility’s audio on the Ethernet, no format conversion is required – just connect the radio’s Ethernet jack to a port on the Ethernet switch. These operate in the unlicensed ISM band at 5.2 and 5.7 GHz, so there is some risk. However, the few current users report good performance and overall satisfaction.

At the Transmitter
With the HD encoding at the studio, there is not much to be done at the transmitter site. The HD exciter simply accepts the already encoded and multiplexed bitstream from the STL and modulates it for transmission. The FM audio is applied to the FM exciter and transmitted as usual.

Monitoring
It’s going to be necessary to listen to your internal audio and your station’s on-air program in surround. This means 5 small speakers and one subwoofer in each serious monitoring position.

The old quad arrangement, with the speakers in each corner of the room, is not the right way to position your monitoring set-up. Human ears are not front-to-back symmetrical and that set-up not only sounds unnatural, but may indeed provoke stress as your deep genetic wiring causes your brain to tell you that “there is danger behind.”

The right way is defined in ITU standard 775 (illustrated at left). This specifies the left and right front speakers to be placed at 30° from the listening position. The surrounds go at 110° ±10° - just a bit back of straight out to the sides. The center goes in the center and the sub goes wherever it sounds the best or is out of the way. You should not be able to detect the position of the subwoofer. According to well-researched psycho-acoustics, humans are not able to localize frequencies below 80Hz. Our heads are too small and our ears too close together at these long wavelengths to detect any left-right difference. If you are able to locate the sub, it probably means that it is radiating audio at a frequency high enough to be localized. One cause of this is distortion-caused harmonics outside of the sub’s proper operating range.

Another psychoacoustic phenomenon to be aware of is the human ear’s change in frequency response from different positions due to the Head Related Transfer Response (HRTF). Sound entering the ear from the side speakers will be perceived as bright compared to sound panned to the front speakers. The effect is significant – a broad curve starting at 1.6kHz, reaching an 8dB peak at 4kHz, and extending to 7kHz. Music producers have probably already compensated for this in their mixes, so it’s not an issue for normal listening. But if you are checking your set-up with white noise, you will likely notice this.

While the usual set-up calls for a one-to-one correspondence between channels and speakers, when you have small main speakers, you will probably need “bass management” to filter the low frequencies from the small speakers and re-direct them to the subwoofer. This means that the subwoofer will be responsible for the sum of the “.1” bass channel and the filtered lows from each of the other channels. This is how the “theater in a box” systems so popular with consumers do it.

As another compromise, you could leave off the center speaker and add the center signal to both the left and right front speakers. (I actually recommend this for your home music listening system. While the center speaker is helpful to stabilize the dialog that accompanies video, it has been my experience that music is better without it. You have the practical consideration that the center speaker is probably not nearly as good as your two front mains – and it can’t be if you have a TV in front of you since the screen and the speaker can’t share the same space. Movie theaters solve this by putting the speakers behind a screen with holes in it. Punching holes in your TV’s CRT is not very likely to be a satisfying operation…)

Production Studio
Most PC-based audio editors such as Adobe Audition, ProTools, etc. support mixing for surround, a procedure not much more complicated than stereo mixing. For a production studio now equipped with a PC editor (are there many that aren’t these days?), a soundcard upgrade and a surround monitoring loudspeaker set-up may be all that are required to start producing in surround.


Adobe Audition's surround mix-down function.

For dubbing surround music from disks to the delivery system, the production studio will need a DVDAudio and SACD player, which may be one universal device. These players will not output stereo and 5.1 simultaneously, so the tracks need to be recorded separately and synchronized in an audio editor.

Data
If the audio network is engineered with sufficient capacity and it correctly supports modern priority mechanisms, it could also be used for the station’s data needs. Email, web browsing, client-server downloads, etc. may traverse the common network. The Ethernet switch isolates the data traffic from the audio streams. When audio and data need to share the same switch port and link, the audio is assured to have first call on the bandwidth because it has higher priority than the data and the switch knows to hold any data packets until the audio is sent. The TCP (Transmission Control Protocol) part of TCP/IP in the network interfaces of computers automatically regulates the data transmission rate to fill the link capacity not occupied by audio.

A more conservative approach is to have two networks. Computers that need to have access to both could have two Ethernet cards with a connection from one to the audio and the other to the data network. Or an IP router could be used to safely pass data from one network to the other.

Cost Comparison - Conventional vs. Networked Studios
Since our focus has been on the real-world practicality of upgrading a facility to surround, let’s explore the cost to build discrete 5.1 surround-capable studios with different approaches. For this discussion, we will have a basic studio set-up with a 12 fader console, 3 mic inputs, 4 automation source inputs, 2 inputs for codecs and 2 more for phone hybrids, and 1 input for an SACD player. We will compare the cost of building a stereo studio with that of a surround studio.

Axia Stereo Studio   Axia Surround Studio
(1) Element 16-position frame -- $1,595.00
      p/n 2001-00175
  (1) Element 16-position frame -- $1,595.00
      p/n 2001-00175
(1) Element Power Supply + GPIO -- $2,595.00
     p/n 2001-00170
  (1) Element Power Supply + GPIO -- $2,595.00
     p/n 2001-00170
(1) Element Monitor/Navigation Module -- $1,895.00
     p/n 2001-00181
  (1) Element Monitor/Navigation Module -- $1,895.00
     p/n 2001-00181
(3) Element 4-Fader Module -- $1,945.00 each
     p/n 2001-00182
  (3) Element 4-Fader Module -- $1,945.00 each
     p/n 2001-00182
(1) Axia Studio Mix Engine -- $2,995.00
     p/n 2001-00139
  (1) Axia Studio Mix Engine -- $2,995.00
     p/n 2001-00139
(1) Axia Microphone Node -- $2,595.00
     p/n 2001-00136
  (1) Axia Microphone Node -- $2,595.00
     p/n 2001-00136
(2) Axia Analog Line Node -- $2,595.00 each
     p/n 2001-00133
  (2) Axia Analog Line Node -- $2,595.00 each
     p/n 2001-00133
(1) Axia Multi-Channel IP-Audio Driver Software
     -- $795.00
  (1) Axia Multi-Channel IP-Audio Driver Software
     -- $795.00

Total Cost: $23,495

 

Total Cost: $23,495

Compare the cost of networked studios using Axia to the cost of a conventional studio build:

Conventional

Networked

Stereo $40k - $60k

Stereo $24k

Surround $80k

Surround $24k

The Axia networked console approach is much less expensive than the conventional (digital router-based console) systems for stereo studios. The small increment in cost for the networked system when expanding from stereo to surround-capable is due to the inherent characteristics of the networked approach. The networked radio studio has only a single cable for each source, whether stereo or surround. The DSP mixing engine is the same as for stereo. Soundcards are replaced by a software driver and the router is replaced by a low-cost Ethernet switch, which handles stereo or surround equally well. With conventional systems you must increase the quantity of PC soundcard ports, console input cards, output cards, cables, DSP cards, frame size, etc. by a factor of four. These costs can vary significantly between vendors but as a rule of thumb, a networked stereo console should cost about half that of its conventional counterpart. And the upgrade to surround will be only a small incremental cost for speakers, extra hard drive space and some additional I/O.

Axia's Element modular control surface (left) and
Studio Engine mixing engine (above) can be deployed
to mix stereo audio and software-upgraded for discrete surround audio mixing duty.

Cost Comparison - Matrixed vs. Discrete Encoding
Proponents of matrix have been hammering on the point that a studio upgrade to support surround is expensive and therefore impractical. But consider... to go the matrix route and stay with your stereo facility, for a typical station you would need:

  • An encoder in the production studio
  • A decoder in the production studio
  • A decoder for monitoring in the air studio
  • A decoder anywhere else you want to monitor the audio, probably at least one in a TOC position
  • Assuming you want to surround-pan mics in the air studio, you would need another encoder and some kind of outboard mixer to provide surround panning. (All the mics would appear on one main console fader.)
  • Perhaps another encoder for remotes .

This is 2-3 encoders, 3 decoders, and a mixer of some kind. Matrix audio encoders cost north of $5k, so you'd have $25-30k in these boxes, and another, say, $4k for some kind of surround mixer/panner. (Does anyone know of a product that does this without going to a full-blown console?)

On the other hand, a complete Axia surround-ready studio eliminates the need for this expensive extra gear. You have integrated surround panning, mixing, metering, routing, etc., so you don't need an awkward sub-mixer. You can pop a surround SACD or DVD directly on the air without pre-encoding it to matrix. (You could do this with the matrix studio approach, but you'd need yet another encoder.) You can monitor surround or stereo anywhere you can plug-in - no decoder needed. Sure, you will need a bigger hard drive in your delivery system to store your audio in discrete form, but 300 Gigabytes would suffice for most commercial stations, providing something like a 1200-music-piece capacity for less than $200. And you would need one MPEG surround encoder as part of the transmission chain. A stand-alone encoder might cost, say, $5k. But one included in the HD generator or dynamics processor would likely cost less.

And if you need to upgrade your studio anyway, an Axia networked solution gives you surround mixing and routing at no increase in cost over stereo. This becomes a lot *cheaper* than the matrix approach because you don't need all the encoders and decoders everywhere.

To summarize cost (not including the surround loudspeakers, SACD/DVD players, and maybe an upgrade to your production audio editor that you'd need either way):

  • Matrix Surround Studio: $29 - 34k (and your old stereo studio gear)
  • Axia discrete surround studio + MPEG on-air surround encoder: $29k
  • New stereo studio + matrix: $50k+

The big cost benefit claimed by the matrix guys is just not there. Indeed, there is a significant *cost penalty* for those who are starting fresh.

By the way, the two surround approaches are not mutually exclusive. You could (and probably should) go with a discrete studio even if you plan to broadcast in matrix surround. You'd have only one matrix encoder, just after the studio output and before the on-air dynamics processor. You'd get convenient panning, metering, and monitoring with no outboard lash-ups - and save a lot on encoders and decoders.

By the same token, you could (but probably shouldn't) use a matrix system in-house to feed an MPEG on-air system. This could offer those who are adamant they want to stick with their stereo storage and consoles a way to do so without compromising broadcast quality for the rest of us.

Time for a Radio Revolution?
While the world around swirls with change and opportunity, not much has happened to the technology side of radio since the addition of stereo to FM in the early 60s. Other established media (film, TV, music disks) have exploded with innovation, and completely new media (Internet, iPod, satellite broadcasting) have burst onto the scene. All of the preceding have gained capability and appeal from having transitioned to digital. We finally have HD to take our industry into the digital era, but in stereo it offers a very small improvement to the FM listener’s experience. When we were growing up, FM was cool because it was at the pinnacle of audio delivery technology. Just the letters FM connoted a general sense of quality. With competitive media having surpassed radio, this connotation has faded. Without action, we will surely be sidelined.

Digital surround may well be an answer. It’s a way to please older music fans by making the classics fresh and to excite younger listeners with aural fireworks – all the better in cars with huge subwoofers. As film-sound innovator Tomlinson Holman says: “Perceptually, we know that everyone equipped Any PC on the network can listen to surround audio streams. with normal hearing can hear the difference between mono and stereo, and it is a large difference. And…virtually everyone can hear the difference between 2-channel stereo and 5.1 channel sound as a significant improvement.”

There is an argument that the HD channel capacity should be split and used to "multicast" two or more programs. If we divide the current 96kbps channel into two 48kbps channels, we could certainly offer two good-fidelity talk services. (One of them would not benefit from HD Radio’s “revert to analog” feature in the case of digital failure, though.) There is no need for surround in this scenario. But 48kbps is probably not good enough for music. So we could imagine that a station operator could decide to use his available bandwidth for either two channels of talk or one channel of music. Should iBiquity’s proposal to increase HD’s bandwidth succeed, a division of 90/6 + 48 could nicely serve one high-quality surround music service and one talk service. Other splits would be possible, such as 64/6 + 64/6 for two surround services.

Either way, compared to a mere shift to stereo digital, there will be a clear motivation for a consumer to buy an HD-enabled receiver. And a state-of-the-art networked studio facility supporting the creation of on-air product for these services presents an opportunity for both cost savings and operational flexibility.



Return To