- •Preface
- •Acknowledgments
- •Contents
- •1 Introduction
- •1.1 Auditory Temporal and Spatial Factors
- •1.2 Auditory System Model for Temporal and Spatial Information Processing
- •2.1 Analysis of Source Signals
- •2.1.1 Power Spectrum
- •2.1.2 Autocorrelation Function (ACF)
- •2.1.3 Running Autocorrelation
- •2.2 Physical Factors of Sound Fields
- •2.2.1 Sound Transmission from a Point Source through a Room to the Listener
- •2.2.2 Temporal-Monaural Factors
- •2.2.3 Spatial-Binaural Factors
- •2.3 Simulation of a Sound Field in an Anechoic Enclosure
- •3 Subjective Preferences for Sound Fields
- •3.2.1 Optimal Listening Level (LL)
- •3.2.4 Optimal Magnitude of Interaural Crosscorrelation (IACC)
- •3.3 Theory of Subjective Preferences for Sound Fields
- •3.4 Evaluation of Boston Symphony Hall Based on Temporal and Spatial Factors
- •4.1.1 Brainstem Response Correlates of Sound Direction in the Horizontal Plane
- •4.1.2 Brainstem Response Correlates of Listening Level (LL) and Interaural Crosscorrelation Magnitude (IACC)
- •4.1.3 Remarks
- •4.2.2 Hemispheric Lateralization Related to Spatial Aspects of Sound
- •4.2.3 Response Latency Correlates of Subjective Preference
- •4.3 Electroencephalographic (EEG) Correlates of Subjective Preference
- •4.3.3 EEG Correlates of Interaural Correlation Magnitude (IACC) Changes
- •4.4.1 Preferences and the Persistence of Alpha Rhythms
- •4.4.2 Preferences and the Spatial Extent of Alpha Rhythms
- •4.4.3 Alpha Rhythm Correlates of Annoyance
- •5.1 Signal Processing Model of the Human Auditory System
- •5.1.1 Summary of Neural Evidence
- •5.1.1.1 Physical Characteristics of the Ear
- •5.1.1.2 Left and Right Auditory Brainstem Responses (ABRs)
- •5.1.1.3 Left and Right Hemisphere Slow Vertex Responses (SVRs)
- •5.1.1.4 Left and Right Hemisphere EEG Responses
- •5.1.1.5 Left and Right Hemisphere MEG Responses
- •5.1.2 Auditory Signal Processing Model
- •5.2 Temporal Factors Extracted from Autocorrelations of Sound Signals
- •5.3 Auditory Temporal Window for Autocorrelation Processing
- •5.5 Auditory Temporal Window for Binaural Processing
- •5.6 Hemispheric Specialization for Spatial Attributes of Sound Fields
- •6 Temporal Sensations of the Sound Signal
- •6.1 Combinations of Temporal and Spatial Sensations
- •6.2 Pitch of Complex Tones and Multiband Noise
- •6.2.1 Perception of the Low Pitch of Complex Tones
- •6.2.3 Frequency Limits of Missing Fundamentals
- •6.3 Beats Induced by Dual Missing Fundamentals
- •6.4 Loudness
- •6.4.1 Loudness of Sharply Filtered Noise
- •6.4.2 Loudness of Complex Noise
- •6.6 Timbre of an Electric Guitar Sound with Distortion
- •6.6.3 Concluding Remarks
- •7 Spatial Sensations of Binaural Signals
- •7.1 Sound Localization
- •7.1.1 Cues of Localization in the Horizontal Plane
- •7.1.2 Cues of Localization in the Median Plane
- •7.2 Apparent Source Width (ASW)
- •7.2.1 Apparent Width of Bandpass Noise
- •7.2.2 Apparent Width of Multiband Noise
- •7.3 Subjective Diffuseness
- •8.1 Pitches of Piano Notes
- •8.2 Design Studies of Concert Halls as Public Spaces
- •8.2.1 Genetic Algorithms (GAs) for Shape Optimization
- •8.2.2 Two Actual Designs: Kirishima and Tsuyama
- •8.3 Individualized Seat Selection Systems for Enhancing Aural Experience
- •8.3.1 A Seat Selection System
- •8.3.2 Individual Subjective Preference
- •8.3.3 Distributions of Listener Preferences
- •8.5 Concert Hall as Musical Instrument
- •8.5.1 Composing with the Hall in Mind: Matching Music and Reverberation
- •8.5.2 Expanding the Musical Image: Spatial Expression and Apparent Source Width
- •8.5.3 Enveloping Music: Spatial Expression and Musical Dynamics
- •8.6 Performing in a Hall: Blending Musical Performances with Sound Fields
- •8.6.1 Choosing a Performing Position on the Stage
- •8.6.2 Performance Adjustments that Optimize Temporal Factors
- •8.6.3 Towards Future Integration of Composition, Performance and Hall Acoustics
- •9.1 Effects of Temporal Factors on Speech Reception
- •9.2 Effects of Spatial Factors on Speech Reception
- •9.3 Effects of Sound Fields on Perceptual Dissimilarity
- •9.3.1 Perceptual Distance due to Temporal Factors
- •9.3.2 Perceptual Distance due to Spatial Factors
- •10.1 Method of Noise Measurement
- •10.2 Aircraft Noise
- •10.3 Flushing Toilet Noise
- •11.1 Noise Annoyance in Relation to Temporal Factors
- •11.1.1 Annoyance of Band-Pass Noise
- •11.2.1 Experiment 1: Effects of SPL and IACC Fluctuations
- •11.2.2 Experiment 2: Effects of Sound Movement
- •11.3 Effects of Noise and Music on Children
- •12 Introduction to Visual Sensations
- •13 Temporal and Spatial Sensations in Vision
- •13.1 Temporal Sensations of Flickering Light
- •13.1.1 Conclusions
- •13.2 Spatial Sensations
- •14 Subjective Preferences in Vision
- •14.1 Subjective Preferences for Flickering Lights
- •14.2 Subjective Preferences for Oscillatory Movements
- •14.3 Subjective Preferences for Texture
- •14.3.1 Preferred Regularity of Texture
- •15.1 EEG Correlates of Preferences for Flickering Lights
- •15.1.1 Persistence of Alpha Rhythms
- •15.1.2 Spatial Extent of Alpha Rhythms
- •15.2 MEG Correlates of Preferences for Flickering Lights
- •15.2.1 MEG Correlates of Sinusoidal Flicker
- •15.2.2 MEG Correlates of Fluctuating Flicker Rates
- •15.3 EEG Correlates of Preferences for Oscillatory Movements
- •15.4 Hemispheric Specializations in Vision
- •16 Summary of Auditory and Visual Sensations
- •16.1 Auditory Sensations
- •16.1.1 Auditory Temporal Sensations
- •16.1.2 Auditory Spatial Sensations
- •16.1.3 Auditory Subjective Preferences
- •16.1.4 Effects of Noise on Tasks and Annoyance
- •16.2.1 Temporal and Spatial Sensations in Vision
- •16.2.2 Visual Subjective Preferences
- •References
- •Glossary of Symbols
- •Abbreviations
- •Author Index
- •Subject Index
7.2 Apparent Source Width (ASW) |
127 |
We shall clearly show differences in the three temporal factors, τ e, τ 1, and φ1, extracted from the ACF as a function of the incident angle of elevation. According to the model described in Section 5.1, localization in the median plane Lmedian is grouped with monaural, temporal sensations because it appears to be based on factors extracted from the autocorrelation function:
Lmedian = SL = fL (te, t1, f1) |
(7.2) |
Like other perceptual attributes based on ACF factors, we expect sound localization in the median plane to exhibit “left hemisphere specialization.”
The amplitudes of the transfer functions for sound incident from the median plane to the ear entrances as measured by Mehrgardt and Mellert (1977) were transformed into equivalent autocorrelation functions (ACFs). The following steps obtain the ACFs from these transfer functions:
1.Data sets were obtained from the figures from the Mehrgardt and Mellert paper using an optical image reader (scanner), with 300 data points each.
2.Amplitude as a function of frequency in the logarithmic scale was obtained.
3.Each amplitude in decibel scale was converted to a corresponding real number, and the ACF was calculated by inverse Fourier transform after passage through an A-weighted filter.
Examples of the NACF are shown in Fig. 7.2. There is a certain degree of correlation between both τn and τn+i, φn and φn+i, where τn and φn are the delay time and amplitude, respectively, of the n-th peak of the NACF. Thus, τ1 and φ1 can be representatives for sets of τn and φn. Examples of plotting the amplitude of the ACF on a logarithmic scale as a function of the delay time are shown in Fig. 7.3. A straight line can fit the envelope of the decay of the ACF in the logarithmic scale, and τe was easily obtained from the delay at which the envelope drops below −10 dB. The value (0) is not considered as a cue for sound localization here. Three factors extracted from the NACF are shown in Fig. 7.4. The value of τ1 for incident angles from 0◦ to 45◦ is almost the same, but τe for incident angle of 45◦ (τe = 3.1 ms) is much larger than that at 0◦ (τe = 2.1 ms). The value of τ1 for incident angle of 180◦ is different from those for the above two angles, however, the φ1 is relatively small.
Obviously, the angle in the median plane can be distinguished by the three monaural temporal factors τ1, φ1, and τe. These factors, therefore, may play an important role in the perception of localization in the median plane.
7.2 Apparent Source Width (ASW)
We shall show that apparent source width (ASW) may be described in terms of factors extracted from the IACF.
128 |
7 Spatial Sensations of Binaural Signals |
Fig. 7.2 Examples of the NACF of different incident angles in the median plane, 0◦, 45◦, and 180◦
7.2 Apparent Source Width (ASW) |
129 |
Fig. 7.3 Examples of the effective duration extracted from the NACF envelope (in logarithm) for different incident angles in the median plane, 0◦, 45◦, and 180◦
130 |
7 Spatial Sensations of Binaural Signals |
Fig. 7.4 Three-dimensional illustration plotted for three factors, φ1, τ1, and τe, which may distinguish different incident angles in the median plane. Numbers in the circles indicate incident angle in degrees
7.2.1 Apparent Width of Bandpass Noise
For sound fields passing sounds with predominately low frequencies, the interaural correlation function IACF has no sharp peaks in the interaural delay range of less than 1 ms, and the width of the major IACF peak, WIACC, becomes “wider.” The WIACC of band-pass noise may calculated theoretically using the following equation (Ando, 1998),
WIACC(δ) |
≈ |
4 |
cos−1 |
1 − |
δ |
(7.3) |
ω |
IACC |
where ωc = 2π(f1 + f2), and f1 and f2 are the lower and upper frequencies of an ideal filter. For the sake of simplicity, δ is defined as 10% of the maximal value of the interaural correlation function IACF, i.e. δ = 0.1(IACC).
The scale value of the ASW was obtained by the PCT with 10 subjects (Sato and Ando, 1996). In order to control the value of WIACC, the center frequency of the 1/3-octave band-pass noise was changed as 250, 500, 1, and 2 kHz. The value of IACC was adjusted by controlling the sound pressure ratio between reflections (ξ =
±54◦) and the direct sound (ξ = 0◦). To avoid effects of the listening level on the ASW (Keet, 1968), the total SPL at the ear canal entrances of all sound fields was kept constant at a peak of 75 dBA. Subjects judged which of two sound sources they perceived to be wider.
