- •Preface
- •Acknowledgments
- •Contents
- •1 Introduction
- •1.1 Auditory Temporal and Spatial Factors
- •1.2 Auditory System Model for Temporal and Spatial Information Processing
- •2.1 Analysis of Source Signals
- •2.1.1 Power Spectrum
- •2.1.2 Autocorrelation Function (ACF)
- •2.1.3 Running Autocorrelation
- •2.2 Physical Factors of Sound Fields
- •2.2.1 Sound Transmission from a Point Source through a Room to the Listener
- •2.2.2 Temporal-Monaural Factors
- •2.2.3 Spatial-Binaural Factors
- •2.3 Simulation of a Sound Field in an Anechoic Enclosure
- •3 Subjective Preferences for Sound Fields
- •3.2.1 Optimal Listening Level (LL)
- •3.2.4 Optimal Magnitude of Interaural Crosscorrelation (IACC)
- •3.3 Theory of Subjective Preferences for Sound Fields
- •3.4 Evaluation of Boston Symphony Hall Based on Temporal and Spatial Factors
- •4.1.1 Brainstem Response Correlates of Sound Direction in the Horizontal Plane
- •4.1.2 Brainstem Response Correlates of Listening Level (LL) and Interaural Crosscorrelation Magnitude (IACC)
- •4.1.3 Remarks
- •4.2.2 Hemispheric Lateralization Related to Spatial Aspects of Sound
- •4.2.3 Response Latency Correlates of Subjective Preference
- •4.3 Electroencephalographic (EEG) Correlates of Subjective Preference
- •4.3.3 EEG Correlates of Interaural Correlation Magnitude (IACC) Changes
- •4.4.1 Preferences and the Persistence of Alpha Rhythms
- •4.4.2 Preferences and the Spatial Extent of Alpha Rhythms
- •4.4.3 Alpha Rhythm Correlates of Annoyance
- •5.1 Signal Processing Model of the Human Auditory System
- •5.1.1 Summary of Neural Evidence
- •5.1.1.1 Physical Characteristics of the Ear
- •5.1.1.2 Left and Right Auditory Brainstem Responses (ABRs)
- •5.1.1.3 Left and Right Hemisphere Slow Vertex Responses (SVRs)
- •5.1.1.4 Left and Right Hemisphere EEG Responses
- •5.1.1.5 Left and Right Hemisphere MEG Responses
- •5.1.2 Auditory Signal Processing Model
- •5.2 Temporal Factors Extracted from Autocorrelations of Sound Signals
- •5.3 Auditory Temporal Window for Autocorrelation Processing
- •5.5 Auditory Temporal Window for Binaural Processing
- •5.6 Hemispheric Specialization for Spatial Attributes of Sound Fields
- •6 Temporal Sensations of the Sound Signal
- •6.1 Combinations of Temporal and Spatial Sensations
- •6.2 Pitch of Complex Tones and Multiband Noise
- •6.2.1 Perception of the Low Pitch of Complex Tones
- •6.2.3 Frequency Limits of Missing Fundamentals
- •6.3 Beats Induced by Dual Missing Fundamentals
- •6.4 Loudness
- •6.4.1 Loudness of Sharply Filtered Noise
- •6.4.2 Loudness of Complex Noise
- •6.6 Timbre of an Electric Guitar Sound with Distortion
- •6.6.3 Concluding Remarks
- •7 Spatial Sensations of Binaural Signals
- •7.1 Sound Localization
- •7.1.1 Cues of Localization in the Horizontal Plane
- •7.1.2 Cues of Localization in the Median Plane
- •7.2 Apparent Source Width (ASW)
- •7.2.1 Apparent Width of Bandpass Noise
- •7.2.2 Apparent Width of Multiband Noise
- •7.3 Subjective Diffuseness
- •8.1 Pitches of Piano Notes
- •8.2 Design Studies of Concert Halls as Public Spaces
- •8.2.1 Genetic Algorithms (GAs) for Shape Optimization
- •8.2.2 Two Actual Designs: Kirishima and Tsuyama
- •8.3 Individualized Seat Selection Systems for Enhancing Aural Experience
- •8.3.1 A Seat Selection System
- •8.3.2 Individual Subjective Preference
- •8.3.3 Distributions of Listener Preferences
- •8.5 Concert Hall as Musical Instrument
- •8.5.1 Composing with the Hall in Mind: Matching Music and Reverberation
- •8.5.2 Expanding the Musical Image: Spatial Expression and Apparent Source Width
- •8.5.3 Enveloping Music: Spatial Expression and Musical Dynamics
- •8.6 Performing in a Hall: Blending Musical Performances with Sound Fields
- •8.6.1 Choosing a Performing Position on the Stage
- •8.6.2 Performance Adjustments that Optimize Temporal Factors
- •8.6.3 Towards Future Integration of Composition, Performance and Hall Acoustics
- •9.1 Effects of Temporal Factors on Speech Reception
- •9.2 Effects of Spatial Factors on Speech Reception
- •9.3 Effects of Sound Fields on Perceptual Dissimilarity
- •9.3.1 Perceptual Distance due to Temporal Factors
- •9.3.2 Perceptual Distance due to Spatial Factors
- •10.1 Method of Noise Measurement
- •10.2 Aircraft Noise
- •10.3 Flushing Toilet Noise
- •11.1 Noise Annoyance in Relation to Temporal Factors
- •11.1.1 Annoyance of Band-Pass Noise
- •11.2.1 Experiment 1: Effects of SPL and IACC Fluctuations
- •11.2.2 Experiment 2: Effects of Sound Movement
- •11.3 Effects of Noise and Music on Children
- •12 Introduction to Visual Sensations
- •13 Temporal and Spatial Sensations in Vision
- •13.1 Temporal Sensations of Flickering Light
- •13.1.1 Conclusions
- •13.2 Spatial Sensations
- •14 Subjective Preferences in Vision
- •14.1 Subjective Preferences for Flickering Lights
- •14.2 Subjective Preferences for Oscillatory Movements
- •14.3 Subjective Preferences for Texture
- •14.3.1 Preferred Regularity of Texture
- •15.1 EEG Correlates of Preferences for Flickering Lights
- •15.1.1 Persistence of Alpha Rhythms
- •15.1.2 Spatial Extent of Alpha Rhythms
- •15.2 MEG Correlates of Preferences for Flickering Lights
- •15.2.1 MEG Correlates of Sinusoidal Flicker
- •15.2.2 MEG Correlates of Fluctuating Flicker Rates
- •15.3 EEG Correlates of Preferences for Oscillatory Movements
- •15.4 Hemispheric Specializations in Vision
- •16 Summary of Auditory and Visual Sensations
- •16.1 Auditory Sensations
- •16.1.1 Auditory Temporal Sensations
- •16.1.2 Auditory Spatial Sensations
- •16.1.3 Auditory Subjective Preferences
- •16.1.4 Effects of Noise on Tasks and Annoyance
- •16.2.1 Temporal and Spatial Sensations in Vision
- •16.2.2 Visual Subjective Preferences
- •References
- •Glossary of Symbols
- •Abbreviations
- •Author Index
- •Subject Index
30 |
3 Subjective Preferences for Sound Fields |
Fig. 3.4 Normalized preference score and interaural correlation magnitude IACC as a function of the horizontal angle of a single reflection for two extreme music motifs A and B. A1=1. Data for 6 sound fields and 13 subjects: ◦; Preference scores for music motif A, x: preference scores for music motif B. •: Values of IACC with music motif A, x: values of IACC with music motif B
3.2Preferred Conditions for Sound Fields with Multiple Reflections
We will now discuss the more general case of sound fields with multiple reflections. Subjective preference obtained by analysis of paired comparisons can be described effectively in terms of four orthogonal properties of the sound field: two monaural temporal factors and two binaural spatial factors [Table 3.1, (Ando, 1983, 1985, 1998)]. These factors are binaural listening level (LL), timing of early reflections ( ti), timing of subsequent reflections (Tsub), and the dissimilarity of the sounds presented to the two ears.
3.2.1 Optimal Listening Level (LL)
The binaural listening level (LL) is the average sound pressure level at the listener’s ears. This is the primary factor that influences listening preferences for sound fields in concert halls. The preferred listening level depends upon the music and the particular passage being performed.
Table 2.1 in Section 2.1.2 lists the two music sources that were used in these listening level experiments. Motif A is the slow and sombre Royal Pavane and Motif B is the fast and playful Sinfonietta. For these two musical extremes, the gross preferred levels obtained by 16 subjects were in the peak ranges of 77–80 dBA, 77–79 dBA for motif A and 79–80 dBA for motif B.
3.2 Preferred Conditions for Sound Fields with Multiple Reflections |
31 |
3.2.2 Optimal First Reflection Time ( t1)
The timing of early reflections relative to the direct sound is the second major factor that influences listening preference. An approximation for the most preferred delay time has been expressed in terms of the effective duration of the ACF of the source signal and the total amplitude of reflections (Ando, 1985). This approximation holds when the envelope of the ACF has an exponential decay:
1 |
− c log10 A)τe |
|
[ t1]p = (log10 k |
(3.1b) |
where k and c are constants that depend on the subjective attributes as listed in Table 3.2.
Here the total pressure amplitude of reflection is given by
A= [A12+A22+A32+ . . . ]1/2 |
(3.2) |
The relationship of Equation (3.1a) for a single reflection, where A = A1, k = 0.1, and c = 1, becomes
τp= [ t1]p ≈ (1 − log10A1)τe |
(3.3) |
Later, we found that the value of τ e in Equations (3.1b) and (3.3) can be replaced by (τ e)min of the running ACF (Ando et al., 1989; Mouri et al., 2000). The minimum value of τ e in a music piece is generally observed in its most active part, the part with the least redundancy, the sharpest musical contrasts, and the one that usually containing the most “artistic” expressive timing information such as vibrato or accelerando in the musical flow. Echo disturbances, therefore, are easily perceived in musical segments where (τ e)min occurs. Even for a long music composition, musical flow can be divided into short segments, so that the minimal values of (τ e)min of the ACF of the whole musical piece can be taken into consideration. This value is useful for matching musical programs to concert halls, for choosing those music programs that will sound best in a given concert hall. The same is true for preferred reverberation times, as in Equation (3.4).
Methods for controlling the (τ e)min for vocal music performances have been discussed (Taguti and Ando, 1997; Kato and Ando, 2002; Kato et al., 2004). If vibrato is included during singing, for example, we can effectively decrease the (τ e)min of the music to better match the acoustics of the hall.
3.2.3 Optimal Subsequent Reverberation Times (Tsub)
Later reverberations play a significant role in sound field preferences for concert halls. In our experiments, the total amplitude A of late reflections was in the range of 1.1 and 4.1, which covers the usual conditions of a sound field in a room. For flat frequency characteristics of reverberation, the times of later reflections (Tsub)
Table 3.2 Constants in Equation (3.1) related to the ACF envelope of source signals for calculating various subjective responses to the sound field with a single reflection
|
In Equation (3.1) |
|
|
|
|
||
|
|
|
|
Delay time to be |
|
|
|
|
|
|
|
|
|
|
|
Subjective attributes |
k |
c |
obtained |
Amplitude examined |
Source signals |
Authors investigated |
|
|
|
|
|
|
|
|
|
Preference of listeners |
0.1 |
1 |
Preferred delay time |
7.5 ≥ A1 ≥ −7.5 |
Speech and |
Ando (1977) |
|
|
|
|
|
|
−10.0 ≥ A1 ≥ −50.0 |
music |
|
Threshold of perception |
2 |
1 |
Critical delay time |
Speech |
Seraphim (1961) |
||
of reflection |
|
|
|
|
0 ≥ A1 ≥ −6.0 |
|
|
50%-echo disturbance |
0.01 |
4 |
Disturbed delay time |
Speech |
Haas (1951); Ando |
||
|
10−5/2 |
|
|
−7.0 ≥ A1 ≥ −27.0 |
|
et al. (1973) |
|
Coloration |
2 |
Critical delay time |
Gaussian noise |
Ando and Alrutz |
|||
|
|
|
|
|
−34.0 ≥ A1 ≥ −10.0 |
|
(1982) |
Preference of |
2/3 |
1/4 |
Preferred delay time |
Music |
Nakayama (1984) |
||
alto-recorder |
|
|
|
|
−21.0 ≥ A1 ≥ −15.0 |
|
|
Preference of |
1/2 |
1 |
Preferred delay time |
Music |
Sato et al. |
||
cello |
|
|
|
|
|
|
(2000) |
|
|
|
|
|
|
|
|
32
Fields Sound for Preferences Subjective 3
3.2 Preferred Conditions for Sound Fields with Multiple Reflections |
33 |
constitute one of the most important preferred conditions (Ando et al., 1989). The preferred subsequent reverberation time is expressed approximately by a constant multiple of the effective duration of the program material (Ando et al., 1982, 1983):
[Tsub]p ≈ 23τe |
(3.4) |
Considering the fact that the effective duration value of τ e is obtained at the tenth percentile (or −10 dB) delay of the ACF envelope of a source signal, the −60 dB decay time of the ACF envelope corresponds roughly to the “reverberation time” contained in the source signal itself, given by (6τ e). This means that the most preferred reverberation time of a sound field [Equation (3.4)] is about four times the “reverberation time” contained in the source signal itself.
The optimal design of a building must take into account its acoustical role. A lecture and conference room should be designed for speech; an opera house should be designed primarily for vocal music but also orchestral music. For orchestral music, it is recommended that a concert hall be selected from one of two or three types of concert halls according to the effective duration of the music programs that will be performed there. For example, Symphony No. 41 by Mozart, “Le Sacre du Printemps” by Stravinsky and Arnold’s Sinfonietta have short values of (τ e)min. On the other hand, Symphony No. 4 by Brahms and Symphony No. 7 by Buckner are more generally typical of orchestral music. Much longer values of (τ e)min are common for pipe organ music, for example, by Bach. Thus, the most preferred reverberation time for each sound source [Equation (3.4)] can potentially play an important role for the selection of the music program to be performed.
3.2.4 Optimal Magnitude of Interaural Crosscorrelation (IACC)
Binaural similarity or dissimilarity of the two signals arriving at the two ears influences subjective preference. All available data with listeners of normal hearing ability indicate a negative relationship between interaural crosscorrelation magnitude (IACC) and subjective preference (i.e., it has been reconfirmed that listeners prefer somewhat dissimilar signals arriving at their two ears). This relation holds under the condition that the maximum value of the binaural, interaural crosscorrelation function (IACF) is near zero interaural delay, with the sound image directly in front and an equal balance between the sound fields for the two ears. If not, then an image shift of the source may occur. To obtain a small magnitude of IACC in the most effective manner, the directions from which the early reflections arrive at the listener should be kept within a certain range of angles from the median plane centered on ξ = ± 55◦. It is obvious that the sound arriving from the median plane makes the IACC greater. Sound arriving from ξ = ± 90◦ in the horizontal plane is not always advantageous, because the similar “detour” paths around the head to both ears cannot decrease the IACC effectively, particularly for frequency ranges higher than 500 Hz. For example, the most effective angles for the frequency ranges of 1 and 2 kHz are roughly centered on ξ = ± 55◦ and ξ = ± 36◦, respectively (see
