Study Hall

Sound System, Loudspeaker & Room Interactions

November 9, 2021

Sam Berkow

Using multiple time windows in a single measurement as a way to measure and optimize the response of a sound system in a room.

If one could listen to only the direct sound of a loudspeaker, the world would be a very different place!

Unfortunately, free field listening, where you have no reflections, room modes or ambient noise, is hard to achieve in everyday life, so we listen to loudspeakers in real rooms. The interaction of a loudspeaker system and a room can be very complex to understand, model or measure.

One way to measure this interaction is to measure the impulse response of the loudspeaker/room system. The impulse response of a typical sound system in a room contains lots of interesting information, including:

1) The delay between the loudspeaker and measurement microphone

2) The direct sound-to-reverberent level ratio

3) The time arrival, frequency content and level of reflections of sound

4) The early and late decay rates of the sound

5) The frequency response of the direct sound.

This last point is particularly interesting. The question is “What do we want to measure and why?”

One question that goes to the heart of “system” measurement and optimization issues is “If the impulse response contains the frequency response of the direct sound, can we separate the loudspeaker response from the room response?” Also “If we can, do we want to?”

Figure 1 shows an impulse response of a 1,250-seat multipurpose hall. displayed in the time domain. The x-axis is time (~0.75 sec) and the y-axis is magnitude in dB. Note the direct sound, reflections, the reverberant decay and the noise floor.

The “spike” that represents the direct sound actually contains the frequency and phase information about the loudspeaker. To see this information we must transform this portion of the impulse response into the frequency domain.

To achieve this isolation of the direct sound from the room response, we must select a time window that includes the direct sound but excludes the reflections and decay of the room.

Figure 2 displays such a time window. This measurement, in the same 1,250-seat hall, was made using a full-range loudspeaker system with the microphone approximately 60 feet from the loudspeaker. Pink noise was used as a reference signal and the impulse response was calculated using a 512K FFT (although only the first ~0.75 seconds are shown).

The vertical lines suggest a time window that ignores most of the effects of the room at frequencies whose periods are longer than the time window (i.e. low frequencies).

We can take the “time windowed” data and transform it into the frequency domain using FFT mathematics. This transformation yields a result that shows how much energy is present at each frequency, demonstrated in Figure 3, showing the frequency response of the direct sound portion of an impulse response in the 1,250-seat hall.

The response was calculated using a 512 point FFT (which equals a 512/48000 or ~11 msec). As you can see the frequency response shows a pronounced LF roll-off. You can also notice the lack of LF resolution in this figure. The lack of resolution at LF is offset by a excess of HF resolution.

This uneven resolution between LF and HF energy is the result of the FFT mathematics used to transform the data from the time domain to the frequency domain. Standard FFTs yield data that is distributed linearly in frequency (one data point every X Hertz). Unfortunately, humans perceive frequency logarithmically.

This lack of LF resolution in Figure 3 is a direct result of the use of a short time window in our transformation from the time domain to the frequency domain. It is interesting to note that this plot does not correlate with what we hear. Simply listening to the full range loudspeaker system we were measuring made it clear that the system was reproducing LF energy down to at least 100 Hz!

I would suggest that a primary goal of an effective measurement system should be to provide results that correlate well with what we hear. So the lack of correlation between what we have heard and what we measured suggests a modification to our approach.

As an alternate approach to trying to find a measurement that correlates with what we hear, we can try using a longer time window to “see” the LF response with better resolution.

A longer time window of approximately 250 msec is depicted in Figure 4, showing the impulse response in the 1,250-seat multipurpose hall. The vertical lines suggest a time window that includes most of the effects of the room. The time window shown is approximately 0.25 seconds.

To transform this longer “slice” of the impulse response into the frequency domain, we will use an 8k FFT which represents 8k/48000 seconds, or 0.171 seconds.

Notice again that this time window includes both the direct sound and the response of the room.

Figure 5 shows the frequency response of the direct sound portion of an impulse response of the 1,250-seat hall. The response was calculated using a 8192 point FFT (which equals a 8192/48000 or ~107 msec). As you can see, the frequency response shows low-frequency energy that is much more pronounced than seen with the shorter time window.

While the low-frequency information is seen in adequate resolution, the high frequency results look confusing. The plot shows data that has 5 Hz resolution (i.e. one data point every 5 Hz). While this resolution provides excellent LF resolution (between 31 Hz and 62.5 Hz there are 15 data points.

However, at HF we have excessive resolution—between 4 kHz and 8 kHz there are approximately 800 data points. Simply stated, the longer time window provides good LF resolution, but excessive HF resolution.

The result of studying these plots might lead you to conclude that in order to make measurements that correlate well with our listening experience, we must use very short time windows that isolate the direct sound at high frequencies, and increasingly longer time windows as we look at lower frequencies. At first glance this idea might seem to violate the often quoted phrase, “One can only affect the direct sound with processing.”

However this is not the case. At mid-low and low frequencies, the interaction of a sound system and a room can be affected and optimized by signal processing. In other words, at low frequencies (long wavelengths) the direct sound and reflections from nearby surfaces combine to form a composite response. It is this composite response that a listener hears.

The ability to measure several time windows simultaneously provides a measurement that both correlates well with human hearing and provides insight into how the signal being sent to the loudspeaker can be tailored (via equalizers, or other processing) to optimize the loudspeaker/room interaction.

The last figure shows a measurement of a loudspeaker system that includes multiple time windows and displays both the magnitude and phase response of the “system.” The use of multiple time windows allows one to isolate the direct sound of a loudspeaker in a real-world situation at high frequencies.

However, at lower frequencies, longer time windows that include the loudspeaker/room interaction have been found to correlate well with our listening experience. Multiple time windows in a single measurement is an extremely interesting way to measure and optimize the response of a sound system in a room.

Sam Berkow

Sam Berkow is the principal of SIA Acoustics and has completed a wide variety of acoustical design projects, including concert halls, recording studios, broadcast facilities, production facilities, house of worship facilities, large multi-purpose venues, amphitheaters and stadiums. His educational background includes a masters degree in Engineering from the Stevens Institute of Technology, where he specialized in acoustic measurement and design. He is also the original developer of Smaart acoustic measurement and system optimization software.

All Posts

Study Hall Top Stories

Ghost In The Machine: Phantom Power

Posted on April 23, 2024

Clearing up the mysteries about phantom power – what exactly is it, and how do we apply it effectively?

The Buck Stops Here: Who’s Responsible For The Overall Success Of Your Integration Project?

Posted on April 19, 2024

Project ownership is a philosophy, an attitude, an ability and willingness to oversee all aspect of a project, and accept the “buck” when it ...read more →

FOH First Aid Kit: Being Prepared For Anything As The Summer Concert Season Rolls In

Posted on April 19, 2024

That beautiful, pristine sound check that happened several hours ago can become a distant memory when the elements take over...

Church Sound: The Power (And Value) Of The Unseen In Worship Tech

Posted on April 19, 2024

Excelling at the invisible side of what we do is one of the biggest ways we can build quality on the visible side.

Lost In Translation? Pro Audio Has A Language All Its Own

Posted on April 17, 2024

Beyond terms and jargon, there is a form of communication based on our passion for the craft.

Helping Build The Future: The Wide-Ranging Educational & Development Efforts Of Tech 25

Posted on April 17, 2024

Inside the work of a Pittsburgh-based non-profit collective network of industry professionals providing production education, workforce programs, hands-on experience and more to the next ...read more →

Three Days In March: A Very Long 72 Hours Battling “Gremlins” At The 1984 Juno Awards

Posted on April 16, 2024

The “normal” amount of time to mount and broadcast the show was about a week to 10 days, but the “actual” amount of time ...read more →

Worthwhile Endeavor? The Case For Deploying Plugins At Corporate Events

Posted on April 15, 2024

Viewpoints from numerous mix engineers who are doing high-profile corporate shows and utilizing external plugins, including some who have been using them for many ...read more →

Tech Focus: Introducing The Lily P4D Microphone Ducking System

Posted on April 15, 2024

Inside a new tool for eliminating microphone bleed in live performances via a simple analog design that's controlled by the artists on stage.

Tandem Touring: Mixing Monitors As A Dual-Engineer Team

Posted on April 15, 2024

In a realm where we engineers are normally the maestro of our own domain, we will be sharing leadership of the department with another ...read more →

In The Studio: Organizing Your Session Files

Posted on April 12, 2024

Some tips to help keep things running smoothly when the track count starts to climb and the chaos starts to ensue...

Sonic Illusions: Creating “Mood” In Recording Mixes

Posted on April 9, 2024

Working to build the "sonic illusions" that help take a song to the next level...

Study Hall

Sound System, Loudspeaker & Room Interactions

Sam Berkow

Ghost In The Machine: Phantom Power

The Buck Stops Here: Who’s Responsible For The Overall Success Of Your Integration Project?

FOH First Aid Kit: Being Prepared For Anything As The Summer Concert Season Rolls In

Church Sound: The Power (And Value) Of The Unseen In Worship Tech

Lost In Translation? Pro Audio Has A Language All Its Own

Helping Build The Future: The Wide-Ranging Educational & Development Efforts Of Tech 25

Three Days In March: A Very Long 72 Hours Battling “Gremlins” At The 1984 Juno Awards

Worthwhile Endeavor? The Case For Deploying Plugins At Corporate Events

Tech Focus: Introducing The Lily P4D Microphone Ducking System

Tandem Touring: Mixing Monitors As A Dual-Engineer Team

In The Studio: Organizing Your Session Files

Sonic Illusions: Creating “Mood” In Recording Mixes

Latest In News

About PSW

News

Gear

Study Hall

Podcasts

Subscribe

Forums

More Content