- •Preface
- •Acknowledgments
- •Contents
- •1 Introduction
- •1.1 Auditory Temporal and Spatial Factors
- •1.2 Auditory System Model for Temporal and Spatial Information Processing
- •2.1 Analysis of Source Signals
- •2.1.1 Power Spectrum
- •2.1.2 Autocorrelation Function (ACF)
- •2.1.3 Running Autocorrelation
- •2.2 Physical Factors of Sound Fields
- •2.2.1 Sound Transmission from a Point Source through a Room to the Listener
- •2.2.2 Temporal-Monaural Factors
- •2.2.3 Spatial-Binaural Factors
- •2.3 Simulation of a Sound Field in an Anechoic Enclosure
- •3 Subjective Preferences for Sound Fields
- •3.2.1 Optimal Listening Level (LL)
- •3.2.4 Optimal Magnitude of Interaural Crosscorrelation (IACC)
- •3.3 Theory of Subjective Preferences for Sound Fields
- •3.4 Evaluation of Boston Symphony Hall Based on Temporal and Spatial Factors
- •4.1.1 Brainstem Response Correlates of Sound Direction in the Horizontal Plane
- •4.1.2 Brainstem Response Correlates of Listening Level (LL) and Interaural Crosscorrelation Magnitude (IACC)
- •4.1.3 Remarks
- •4.2.2 Hemispheric Lateralization Related to Spatial Aspects of Sound
- •4.2.3 Response Latency Correlates of Subjective Preference
- •4.3 Electroencephalographic (EEG) Correlates of Subjective Preference
- •4.3.3 EEG Correlates of Interaural Correlation Magnitude (IACC) Changes
- •4.4.1 Preferences and the Persistence of Alpha Rhythms
- •4.4.2 Preferences and the Spatial Extent of Alpha Rhythms
- •4.4.3 Alpha Rhythm Correlates of Annoyance
- •5.1 Signal Processing Model of the Human Auditory System
- •5.1.1 Summary of Neural Evidence
- •5.1.1.1 Physical Characteristics of the Ear
- •5.1.1.2 Left and Right Auditory Brainstem Responses (ABRs)
- •5.1.1.3 Left and Right Hemisphere Slow Vertex Responses (SVRs)
- •5.1.1.4 Left and Right Hemisphere EEG Responses
- •5.1.1.5 Left and Right Hemisphere MEG Responses
- •5.1.2 Auditory Signal Processing Model
- •5.2 Temporal Factors Extracted from Autocorrelations of Sound Signals
- •5.3 Auditory Temporal Window for Autocorrelation Processing
- •5.5 Auditory Temporal Window for Binaural Processing
- •5.6 Hemispheric Specialization for Spatial Attributes of Sound Fields
- •6 Temporal Sensations of the Sound Signal
- •6.1 Combinations of Temporal and Spatial Sensations
- •6.2 Pitch of Complex Tones and Multiband Noise
- •6.2.1 Perception of the Low Pitch of Complex Tones
- •6.2.3 Frequency Limits of Missing Fundamentals
- •6.3 Beats Induced by Dual Missing Fundamentals
- •6.4 Loudness
- •6.4.1 Loudness of Sharply Filtered Noise
- •6.4.2 Loudness of Complex Noise
- •6.6 Timbre of an Electric Guitar Sound with Distortion
- •6.6.3 Concluding Remarks
- •7 Spatial Sensations of Binaural Signals
- •7.1 Sound Localization
- •7.1.1 Cues of Localization in the Horizontal Plane
- •7.1.2 Cues of Localization in the Median Plane
- •7.2 Apparent Source Width (ASW)
- •7.2.1 Apparent Width of Bandpass Noise
- •7.2.2 Apparent Width of Multiband Noise
- •7.3 Subjective Diffuseness
- •8.1 Pitches of Piano Notes
- •8.2 Design Studies of Concert Halls as Public Spaces
- •8.2.1 Genetic Algorithms (GAs) for Shape Optimization
- •8.2.2 Two Actual Designs: Kirishima and Tsuyama
- •8.3 Individualized Seat Selection Systems for Enhancing Aural Experience
- •8.3.1 A Seat Selection System
- •8.3.2 Individual Subjective Preference
- •8.3.3 Distributions of Listener Preferences
- •8.5 Concert Hall as Musical Instrument
- •8.5.1 Composing with the Hall in Mind: Matching Music and Reverberation
- •8.5.2 Expanding the Musical Image: Spatial Expression and Apparent Source Width
- •8.5.3 Enveloping Music: Spatial Expression and Musical Dynamics
- •8.6 Performing in a Hall: Blending Musical Performances with Sound Fields
- •8.6.1 Choosing a Performing Position on the Stage
- •8.6.2 Performance Adjustments that Optimize Temporal Factors
- •8.6.3 Towards Future Integration of Composition, Performance and Hall Acoustics
- •9.1 Effects of Temporal Factors on Speech Reception
- •9.2 Effects of Spatial Factors on Speech Reception
- •9.3 Effects of Sound Fields on Perceptual Dissimilarity
- •9.3.1 Perceptual Distance due to Temporal Factors
- •9.3.2 Perceptual Distance due to Spatial Factors
- •10.1 Method of Noise Measurement
- •10.2 Aircraft Noise
- •10.3 Flushing Toilet Noise
- •11.1 Noise Annoyance in Relation to Temporal Factors
- •11.1.1 Annoyance of Band-Pass Noise
- •11.2.1 Experiment 1: Effects of SPL and IACC Fluctuations
- •11.2.2 Experiment 2: Effects of Sound Movement
- •11.3 Effects of Noise and Music on Children
- •12 Introduction to Visual Sensations
- •13 Temporal and Spatial Sensations in Vision
- •13.1 Temporal Sensations of Flickering Light
- •13.1.1 Conclusions
- •13.2 Spatial Sensations
- •14 Subjective Preferences in Vision
- •14.1 Subjective Preferences for Flickering Lights
- •14.2 Subjective Preferences for Oscillatory Movements
- •14.3 Subjective Preferences for Texture
- •14.3.1 Preferred Regularity of Texture
- •15.1 EEG Correlates of Preferences for Flickering Lights
- •15.1.1 Persistence of Alpha Rhythms
- •15.1.2 Spatial Extent of Alpha Rhythms
- •15.2 MEG Correlates of Preferences for Flickering Lights
- •15.2.1 MEG Correlates of Sinusoidal Flicker
- •15.2.2 MEG Correlates of Fluctuating Flicker Rates
- •15.3 EEG Correlates of Preferences for Oscillatory Movements
- •15.4 Hemispheric Specializations in Vision
- •16 Summary of Auditory and Visual Sensations
- •16.1 Auditory Sensations
- •16.1.1 Auditory Temporal Sensations
- •16.1.2 Auditory Spatial Sensations
- •16.1.3 Auditory Subjective Preferences
- •16.1.4 Effects of Noise on Tasks and Annoyance
- •16.2.1 Temporal and Spatial Sensations in Vision
- •16.2.2 Visual Subjective Preferences
- •References
- •Glossary of Symbols
- •Abbreviations
- •Author Index
- •Subject Index
9.3 Effects of Sound Fields on Perceptual Dissimilarity |
189 |
Fig. 9.10 Relationship between the calculated percentile of nonidentified syllables and that obtained by listening tests (r = 0.86,
p < 0.01)
A remarkable finding was that the most significant factor in the previous section was the effective duration of the ACF, τe. In this study also, the most effective and significant temporal factor was the τe value in the temporal factors. In order to obtain effects of the different direction of the noise disturbance, the spatial factors may be taken into consideration. Conclusions are as follows:
1.The syllable identification (NI) may be calculated by both temporal factors extracted from the ACF and spatial factors extracted from the IACF.
2.Particularly in the condition of this experiment, the value of τe as the temporal factor is the most significant as is similar to previous results (Ando et al., 1999); in addition, the WIACC in the spatial factor contributes significantly to the speech identification.
9.3 Effects of Sound Fields on Perceptual Dissimilarity
Perceptual dissimilarity is the perceived difference between the same sounds when they are either produced or heard at different locations. In this section, we discuss dissimilarity for the real sound field of an existing hall in relation to all of the temporal and spatial factors extracted from the ACF and the IACF, respectively. To incorporate dissimilarity into the model, two temporal factors ( t1 and Tsub) of the sound field are added to these temporal and spatial factors. At a given fixed seating position in a real room, the overall psychological distance between sound fields changing source locations on the stage can be obtained by keeping other sensory effects in a room constant. We shall show that the scale value of dissimilarity may be described by the linear combination of all the temporal and spatial factors of the sound field.
190 |
9 Applications (II) – Speech Reception in Sound Fields |
Previously, Yamaguchi (1972) carried out an experiment to empirically measure the perceptual dissimilarity between the same sounds heard from different seats in an existing concert hall. He demonstrated that two significant factors associated with listening position affected subjective similarity: the sound-pressure level and the reverberation characteristics. Edward (1974) also tested dissimilarity by studying differences between different halls and reported that the early-echo pattern, the reverberation time, and the volume level were the significant factors. Cocchi et al. (1990) and Sato et al. (1997) confirmed the effectiveness of the theory of subjective preference through investigations in existing concert halls. Sato et al. (2002) reconfirmed the effectiveness of the theory in an existing opera house as well.
In the current study, a reverberation-free, 4-s fragment of recorded orchestral music (Water Music, Suite No. 2 – Alla Hornpipe, composed by Handel) was used as a source signal (Hotehama et al., 2002). The music source was characterized in terms of the running ACF of the source signal after passing through an A-weighted network. An ACF analysis was carried out with an integration interval 2T = 2.0 s and running step-size of 100 ms, and factors (0), τe, τ1, and φ1 were extracted. As shown in Fig. 9.11, the minimum value of the effective duration of the source signal, (τe)min = 46 ms. This value is obtained at the most active piece of music and thus strongly related to preferred values of the temporal factors ( t1 and Tsub) of the sound field as discussed in Chapter 3 (Ando et al. 1989; Mouri et al., 2000).
Dissimilarity judgments were performed in a multiple-purpose hall, the 400-seat ORBIS Hall in Kobe, Japan, which is shown in Fig. 9.12. Six loudspeakers with identical characteristics were placed on the stage. Twenty student subjects participated in the experiment. They were divided into four groups and seated at specific positions in seating locations A, B, C, and D. To avoid the effects of other environmental conditions, each dissimilarity judgment was conducted at a fixed seat, and sound sources only were switched between the six source locations. Subjects judged difference as an overall impression between the paired stimuli. They were asked to rate the pairs on a subjective linear scale that had two extreme ends: “no different,” and “extremely different.” The judgment was made for 15 pairs of the six sound fields at each listener’s location. The interval between paired stimuli was 1 s. Each pair of sound fields was separated by a silent interval of 5 s, and the pairs were arranged in random order. Each session was repeated 5 times. To obtain the scale value of dissimilarity between sound fields, the original data of dissimilarity judgment were categorized into seven categories by the method of successive categories (Torgerson, 1958). The scale value of dissimilarity for each pair of source locations at the seat positions was obtained with all listeners as listed in Table 9.5.
To measure the acoustical factors extracted from the ACF and IACF, the music signal used in the dissimilarity judgments was reproduced from each loudspeaker. The signal was recorded at each listening position, through two microphones placed at the two ear entrances of a real person facing the center of the stage. To obtain binaural impulse responses, a maximum-length sequence (MLS) signal was reproduced from each loudspeaker (Alrutz, 1981).
Considering the fact that dissimilarity as well as subjective preference judgments may be made at the most “active and informative” running music piece, which indi-
9.3 Effects of Sound Fields on Perceptual Dissimilarity |
191 |
Fig. 9.11 Factors of the running ACF of the source signal analyzed by 2T = 2.0 s with 100 ms of the running interval. (a) τe. (b) Relative(0), obtained as relative to the maximum value at
τ = 0.5 s. (c) τ1. (d) φ1
cates the minimum value of (τe)min extracted from the running ACF, values of τ1 and φ1 at the particular piece also were extracted. Values of LL, IACC, τIACC, and WIACC from the running IACF were also computed.
After obtaining the binaural impulse responses, values of t1 and Tsub were calculated. The value of t1 was defined by the time difference between the arrival time of the direct sound and that of the reflection, which is the maximum energy in the impulse responses. From the two measured values of t1 obtained at both ears, the one with the largest amplitude of the first reflection was selected as the t1 (Ando and Gottlob, 1979). The averaged value of Tsub of the 500-Hz and 1-kHz octave band center frequencies were applied here, because these frequency ranges are the dominant of the source signal. The measured temporal and spatial factors obtained by running the ACF, IACF, and binaural impulse response analysis are shown in Fig. 9.13. The factors extracted from the IACF were also chosen from
192 |
9 Applications (II) – Speech Reception in Sound Fields |
Fig. 9.12 Plan of the ORBIS Hall in which dissimilarity judgment was made. A–D: Locations of listeners. – : Source locations changed in the PCT conducted at fixed seating position during the judgment
Table 9.5 Scale values of dissimilarity judgments for each pair of source locations at seating positions of A, B, C, and D
|
Seat position |
|
|
|
Pair of source locations |
Position A |
Position B |
Position C |
Position D |
|
|
|
|
|
1–2 |
1.4 |
0.8 |
0.8 |
0.9 |
1–3 |
1.9 |
1.7 |
1.8 |
1.6 |
1–4 |
2.3 |
2.0 |
2.5 |
2.4 |
1–5 |
0.7 |
0.9 |
0.8 |
0.7 |
1–6 |
1.3 |
1.0 |
1.4 |
1.0 |
2–3 |
0.8 |
0.6 |
1.2 |
1.3 |
2–4 |
1.2 |
1.2 |
2.2 |
2.0 |
2–5 |
1.6 |
1.2 |
0.7 |
0.9 |
2–6 |
1.8 |
1.4 |
0.8 |
0.6 |
3–4 |
0.5 |
0.4 |
1.6 |
1.2 |
3–5 |
2.1 |
1.9 |
1.8 |
1.7 |
3–6 |
2.1 |
2.0 |
1.4 |
1.7 |
4–5 |
2.4 |
2.0 |
2.2 |
2.1 |
4–6 |
2.2 |
2.1 |
2.0 |
2.0 |
5–6 |
0.4 |
0.4 |
0.9 |
0.8 |
|
|
|
|
|
9.3 Effects of Sound Fields on Perceptual Dissimilarity |
193 |
Fig. 9.13 Measured physical factors at each listener’s locations measured. The location of sources and listeners are shown in Fig. 9.12. (a) τ1. (b) φ1. (c) Tsub. (d) t1. (e) LL. (f) τIACC. (g) IACC. (h) WIACC. •, values measured for source location ; ◦ , values measured for source location ; , values measured for source location ; , values measured for source location ; , values measured for source location ; and , values measured for source location
