- •Preface
- •Acknowledgments
- •Contents
- •1 Introduction
- •1.1 Auditory Temporal and Spatial Factors
- •1.2 Auditory System Model for Temporal and Spatial Information Processing
- •2.1 Analysis of Source Signals
- •2.1.1 Power Spectrum
- •2.1.2 Autocorrelation Function (ACF)
- •2.1.3 Running Autocorrelation
- •2.2 Physical Factors of Sound Fields
- •2.2.1 Sound Transmission from a Point Source through a Room to the Listener
- •2.2.2 Temporal-Monaural Factors
- •2.2.3 Spatial-Binaural Factors
- •2.3 Simulation of a Sound Field in an Anechoic Enclosure
- •3 Subjective Preferences for Sound Fields
- •3.2.1 Optimal Listening Level (LL)
- •3.2.4 Optimal Magnitude of Interaural Crosscorrelation (IACC)
- •3.3 Theory of Subjective Preferences for Sound Fields
- •3.4 Evaluation of Boston Symphony Hall Based on Temporal and Spatial Factors
- •4.1.1 Brainstem Response Correlates of Sound Direction in the Horizontal Plane
- •4.1.2 Brainstem Response Correlates of Listening Level (LL) and Interaural Crosscorrelation Magnitude (IACC)
- •4.1.3 Remarks
- •4.2.2 Hemispheric Lateralization Related to Spatial Aspects of Sound
- •4.2.3 Response Latency Correlates of Subjective Preference
- •4.3 Electroencephalographic (EEG) Correlates of Subjective Preference
- •4.3.3 EEG Correlates of Interaural Correlation Magnitude (IACC) Changes
- •4.4.1 Preferences and the Persistence of Alpha Rhythms
- •4.4.2 Preferences and the Spatial Extent of Alpha Rhythms
- •4.4.3 Alpha Rhythm Correlates of Annoyance
- •5.1 Signal Processing Model of the Human Auditory System
- •5.1.1 Summary of Neural Evidence
- •5.1.1.1 Physical Characteristics of the Ear
- •5.1.1.2 Left and Right Auditory Brainstem Responses (ABRs)
- •5.1.1.3 Left and Right Hemisphere Slow Vertex Responses (SVRs)
- •5.1.1.4 Left and Right Hemisphere EEG Responses
- •5.1.1.5 Left and Right Hemisphere MEG Responses
- •5.1.2 Auditory Signal Processing Model
- •5.2 Temporal Factors Extracted from Autocorrelations of Sound Signals
- •5.3 Auditory Temporal Window for Autocorrelation Processing
- •5.5 Auditory Temporal Window for Binaural Processing
- •5.6 Hemispheric Specialization for Spatial Attributes of Sound Fields
- •6 Temporal Sensations of the Sound Signal
- •6.1 Combinations of Temporal and Spatial Sensations
- •6.2 Pitch of Complex Tones and Multiband Noise
- •6.2.1 Perception of the Low Pitch of Complex Tones
- •6.2.3 Frequency Limits of Missing Fundamentals
- •6.3 Beats Induced by Dual Missing Fundamentals
- •6.4 Loudness
- •6.4.1 Loudness of Sharply Filtered Noise
- •6.4.2 Loudness of Complex Noise
- •6.6 Timbre of an Electric Guitar Sound with Distortion
- •6.6.3 Concluding Remarks
- •7 Spatial Sensations of Binaural Signals
- •7.1 Sound Localization
- •7.1.1 Cues of Localization in the Horizontal Plane
- •7.1.2 Cues of Localization in the Median Plane
- •7.2 Apparent Source Width (ASW)
- •7.2.1 Apparent Width of Bandpass Noise
- •7.2.2 Apparent Width of Multiband Noise
- •7.3 Subjective Diffuseness
- •8.1 Pitches of Piano Notes
- •8.2 Design Studies of Concert Halls as Public Spaces
- •8.2.1 Genetic Algorithms (GAs) for Shape Optimization
- •8.2.2 Two Actual Designs: Kirishima and Tsuyama
- •8.3 Individualized Seat Selection Systems for Enhancing Aural Experience
- •8.3.1 A Seat Selection System
- •8.3.2 Individual Subjective Preference
- •8.3.3 Distributions of Listener Preferences
- •8.5 Concert Hall as Musical Instrument
- •8.5.1 Composing with the Hall in Mind: Matching Music and Reverberation
- •8.5.2 Expanding the Musical Image: Spatial Expression and Apparent Source Width
- •8.5.3 Enveloping Music: Spatial Expression and Musical Dynamics
- •8.6 Performing in a Hall: Blending Musical Performances with Sound Fields
- •8.6.1 Choosing a Performing Position on the Stage
- •8.6.2 Performance Adjustments that Optimize Temporal Factors
- •8.6.3 Towards Future Integration of Composition, Performance and Hall Acoustics
- •9.1 Effects of Temporal Factors on Speech Reception
- •9.2 Effects of Spatial Factors on Speech Reception
- •9.3 Effects of Sound Fields on Perceptual Dissimilarity
- •9.3.1 Perceptual Distance due to Temporal Factors
- •9.3.2 Perceptual Distance due to Spatial Factors
- •10.1 Method of Noise Measurement
- •10.2 Aircraft Noise
- •10.3 Flushing Toilet Noise
- •11.1 Noise Annoyance in Relation to Temporal Factors
- •11.1.1 Annoyance of Band-Pass Noise
- •11.2.1 Experiment 1: Effects of SPL and IACC Fluctuations
- •11.2.2 Experiment 2: Effects of Sound Movement
- •11.3 Effects of Noise and Music on Children
- •12 Introduction to Visual Sensations
- •13 Temporal and Spatial Sensations in Vision
- •13.1 Temporal Sensations of Flickering Light
- •13.1.1 Conclusions
- •13.2 Spatial Sensations
- •14 Subjective Preferences in Vision
- •14.1 Subjective Preferences for Flickering Lights
- •14.2 Subjective Preferences for Oscillatory Movements
- •14.3 Subjective Preferences for Texture
- •14.3.1 Preferred Regularity of Texture
- •15.1 EEG Correlates of Preferences for Flickering Lights
- •15.1.1 Persistence of Alpha Rhythms
- •15.1.2 Spatial Extent of Alpha Rhythms
- •15.2 MEG Correlates of Preferences for Flickering Lights
- •15.2.1 MEG Correlates of Sinusoidal Flicker
- •15.2.2 MEG Correlates of Fluctuating Flicker Rates
- •15.3 EEG Correlates of Preferences for Oscillatory Movements
- •15.4 Hemispheric Specializations in Vision
- •16 Summary of Auditory and Visual Sensations
- •16.1 Auditory Sensations
- •16.1.1 Auditory Temporal Sensations
- •16.1.2 Auditory Spatial Sensations
- •16.1.3 Auditory Subjective Preferences
- •16.1.4 Effects of Noise on Tasks and Annoyance
- •16.2.1 Temporal and Spatial Sensations in Vision
- •16.2.2 Visual Subjective Preferences
- •References
- •Glossary of Symbols
- •Abbreviations
- •Author Index
- •Subject Index
7.2 Apparent Source Width (ASW) |
131 |
Results of the analysis of variance for the scale value SR indicate that both factors the IACC and WIACC contribute to SR independently (p < 0.01), so that
S = SR= fR(IACC) + fR(WIACC) ≈ α(IACC)3/2 + β(WIACC)1/2 |
(7.4) |
where coefficients α ≈ −1.64 and β ≈ 2.44 are obtained by regressions of the scale values with 10 subjects as shown in Fig. 7.5. This holds under the condition of τIACC = 0. Obviously, as shown in Fig. 7.6, the calculated scale values by Equation (7.4) and measured scale values are in good agreement (r = 0.97, p < 0.01).
Fig. 7.5 Scale values of apparent source width (ASW) for the 1/3-octave band-pass noises with 95% reliability as a function of (a) the IACC and (b) the WIACC. The regression curves are expressed by Equation (7.4) with α = 1.64 and β = 2.44
For each individual listener, the scale value can be calculated by Equation (7.4) in a similar manner. Coefficients α and β in the equation for each listener are obtained by the multiple regression analysis and are indicated in Table 7.1. Figure 7.7 shows the relationship between the measured scale values and the calculated scale values with the constants for each of 10 subjects. The different symbols indicate the scale values of different subjects. The correlation coefficient between the measured and calculated S(ASW) is 0.91 (p < 0.01).
7.2.2 Apparent Width of Multiband Noise
Keet (1968) showed that the apparent source width ASW of sounds depends on the amplitude of coherence between signals fed to the two loudspeakers and their listening level LL. On the basis of our central auditory signal processing model, however, source width may be described in terms of spatial factors extracted from the interaural correlation function IACF (Sato and Ando, 2002). It is assumed that this depends most directly on WIACC, rather than on the spectrum of the source signal per se, and also as well on the interaural correlation magnitude IACC and
132 |
7 Spatial Sensations of Binaural Signals |
Fig. 7.6 Relationship between the measured scale values of apparent source width (ASW) and the scale values of ASW calculated by Equation (7.4) with α = 1.64 and β = 2.44. Correlation coefficient r = 0.97 (p < 0.01)
Table 7.1 Coefficients α and β in Equation (7.4) for calculating ASW of each individual and the correlation coefficient between the measured and calculated scale values of the ASW
Individual |
α |
β |
Correlation coefficient |
|
|
|
|
S.H. |
−1.21 |
2.58 |
0.88 |
T.S. |
−1.50 |
3.18 |
0.97 |
C.C. |
−1.05 |
2.82 |
0.97 |
S.Y. |
−0.94 |
2.92 |
0.91 |
M.K. |
−2.21 |
2.09 |
0.92 |
S.T. |
−2.57 |
1.94 |
0.94 |
T.H. |
−2.04 |
1.32 |
0.87 |
F.K. |
−0.99 |
3.27 |
0.89 |
N.K. |
−1.79 |
2.14 |
0.80 |
O.S. |
−2.09 |
2.14 |
0.94 |
Average |
−1.64 |
2.44 |
0.97 |
binaural listening level LL. It has been reported that even if the IACC is constant, the ASW increases as low-frequency components increase (Morimoto and Maekawa, 1988; Hidaka et al., 1995). The fact that wider source widths are perceived for sound sources with predominately low-frequency components might have a correlate in the behavior of WIACC.
This study examines the apparent source widths of complex noise signals, which consist of several band-pass noises whose center passband frequencies are the harmonics of a fundamental frequency. The question here is whether pitches associated
7.2 Apparent Source Width (ASW) |
133 |
Fig. 7.7 Relationship between the measured individual scale values of apparent source width (ASW) and the scale values of ASW calculated by Equation (7.4) with each individual value of α and β as listed in Table 7.1, r = 0.90 (p < 0.01). Different symbols indicate data obtained by different individual subjects
with the fundamental frequencies of these noise complexes in some way influence or predict the apparent source widths ASWs of these sounds (Sato and Ando, 2002). Scale values of the ASW for these complex noise signals are compared with those of comparable noise signals with single passbands.
Single band-pass noises with center frequencies of 200, 400, and 800 Hz and multiband complex noises with center fundamental frequencies of 200, 400, and 800 Hz were used as source signals. Each complex noise stimulus consisted of three passbands, and the center frequencies of the lowest passbands were fixed at 1,600 Hz. When the fundamental frequency (F0) was 200 Hz, the three passbands were 1,600, 1,800, and 2,000 Hz. Similarly, for F0 = 400 Hz these were 1,600, 2,000, 2,400 Hz, and for F0 = 800 they were 1,600, 2,400, 3,200 Hz. The amplitudes of all passbands were adjusted to be the same by measuring Φ(0). Bandwidths of the noise passbands in all signals were 80 Hz with a cutoff slope of 2,068 dB/octave.
A single frontal loudspeaker for direct sound (ξ = 0◦) and two symmetrical loudspeakers (ξ = ±54◦) that added reflections were used simulate different source widths in an anechoic chamber (Section 2.3). To produce incoherent sound signals, time delays of reflections t1 and t2 were fixed at 20 and 40 ms, respectively. To reconfirm the effects of listening level LL on apparent source width ASW, the sound pressure level at the listener’s head position was also changed from 70 to 75 dB. The values of the IACC of all sound fields were adjusted to 0.90 ± 0.01 by controlling the identical amplitude of the reflections (A1 = A2). The IACF was measured with two 1/2-in. condenser-type microphones, placed at the ear entrances of a dummy head. The output from the microphones was passed through an A-weighting network and was digitized at a sampling frequency of 44.1 kHz.
134 |
7 Spatial Sensations of Binaural Signals |
Paired-comparisons tests (PCT) using 12 sound fields (6 × 2) were performed on five subjects with normal hearing ability in order to obtain scale values for perceived source width ASW. Subjects were seated in an anechoic chamber and asked to judge which of the two paired stimuli they perceived to be wider. The duration of each sound stimulus was 3 s, the rise and fall times were 50 ms, and the silent interval between stimuli was 1 s. Each stimulus pair was separated by an interval of 4 s and the pairs were presented in random order. Twenty-five responses (five subjects × five repeats) to each stimulus were obtained. Scale values of ASWs were obtained by applying the law of comparative judgment. The relationship between the scale values of source widths ASWs and WIACCs of the source signals is shown in Fig. 7.8. There are significant differences between the scale values of apparent widths of noises with single and multiple passbands (p < 0.01) even when they evoke the same pitch. However, the change in the factor WIACC explains these differences. The results of the analysis of variance for scale values S for apparent source width ASW revealed that the explanatory factors WIACC and LL are significant (p < 0.01) and independent, so that
Fig. 7.8 Average scale values of apparent source width (ASW) as a function of interaural correlation magnitude (IACC) WIACC and as a parameter of listening level (LL). •, band-pass noise, LL = 75 dB; , band-pass noise, LL = 70 dB; , complex noise, LL = 75 dB; , complex noise; LL = 70 dB. The regression curve is expressed by Equation (7.5) with a = 2.40 and b = 0.005
S = SR = fR (WIACC) + fR (LL) ≈ a (WIACC)1/2 + b (LL) 3/2 |
(7.5) |
where a and b are coefficients. The powers, 1/2 and 3/2 for the terms of WIACC and LL in Equation (7.5), respectively, were determined to obtain the best correlation between the scale values measured and its calculated value. The curves in Fig. 7.9 confirm the scale values calculated using Equation (7.5) with the coefficient values a ≈ 2.40 and b ≈ 0.005 that were obtained. The correlation coefficient between measured and calculated scale values was 0.97 (p < 0.01).
7.2 Apparent Source Width (ASW) |
135 |
Fig. 7.9 Relationship between the measured scale values of ASW and the scale values of ASW calculated by Equation (7.5) with a = 2.40 and b = 0.005. Correlation coefficient r = 0.97 (p < 0.01)
It is noteworthy that the above-mentioned scale value of ASW for 1/3-octave band-pass noise is expressed in terms of the 1/2 power of WIACC and that the coefficient for WIACC (β ≈ 2.44) is close to that of this study. The factor WIACC is determined by the frequency component of the source signal, thus the pitch or the fundamental frequency represented by the temporal factor τ 1 is not necessary to describe or predict apparent source width. Results of ANOVA for scale values of ASW indicated that the explanatory factor LL is also significant (p < 0.01). The scale values of ASW increase with an increase in binaural listening level LL, similar to Keet (1968).
Individual differences may be also expressed by the weighting coefficients in Equation (7.5), which are obtained by multiple regression analysis. For each individual, Fig. 7.10 shows the relationship between measured scale values of ASW and those calculated from Equation (7.5) using coefficients a and b (Table 7.2). The different symbols correspond to the different subjects. The correlation coefficient between the measured and the calculated scale values is 0.90 (p < 0.01).
Because the weighting coefficients of (WIACC)1/2 in Equations (7.4) and (7.5) are apparently similar, one can construct a common formula
S = SR = f (IACC) + f (WIACC) + f (LL) ≈ α(IACC)3/2 + β(WIACC)1/2 + γ (LL)3/2 (7.6)
where α ≈ −1.64, β ≈ 2.42, γ ≈ 0.005. It is worth noting that units of scale value of subjective preference even in different subjects and using different source signals appeared to be almost constant (Section 3.3).
