- •Preface
- •Acknowledgments
- •Contents
- •1 Introduction
- •1.1 Auditory Temporal and Spatial Factors
- •1.2 Auditory System Model for Temporal and Spatial Information Processing
- •2.1 Analysis of Source Signals
- •2.1.1 Power Spectrum
- •2.1.2 Autocorrelation Function (ACF)
- •2.1.3 Running Autocorrelation
- •2.2 Physical Factors of Sound Fields
- •2.2.1 Sound Transmission from a Point Source through a Room to the Listener
- •2.2.2 Temporal-Monaural Factors
- •2.2.3 Spatial-Binaural Factors
- •2.3 Simulation of a Sound Field in an Anechoic Enclosure
- •3 Subjective Preferences for Sound Fields
- •3.2.1 Optimal Listening Level (LL)
- •3.2.4 Optimal Magnitude of Interaural Crosscorrelation (IACC)
- •3.3 Theory of Subjective Preferences for Sound Fields
- •3.4 Evaluation of Boston Symphony Hall Based on Temporal and Spatial Factors
- •4.1.1 Brainstem Response Correlates of Sound Direction in the Horizontal Plane
- •4.1.2 Brainstem Response Correlates of Listening Level (LL) and Interaural Crosscorrelation Magnitude (IACC)
- •4.1.3 Remarks
- •4.2.2 Hemispheric Lateralization Related to Spatial Aspects of Sound
- •4.2.3 Response Latency Correlates of Subjective Preference
- •4.3 Electroencephalographic (EEG) Correlates of Subjective Preference
- •4.3.3 EEG Correlates of Interaural Correlation Magnitude (IACC) Changes
- •4.4.1 Preferences and the Persistence of Alpha Rhythms
- •4.4.2 Preferences and the Spatial Extent of Alpha Rhythms
- •4.4.3 Alpha Rhythm Correlates of Annoyance
- •5.1 Signal Processing Model of the Human Auditory System
- •5.1.1 Summary of Neural Evidence
- •5.1.1.1 Physical Characteristics of the Ear
- •5.1.1.2 Left and Right Auditory Brainstem Responses (ABRs)
- •5.1.1.3 Left and Right Hemisphere Slow Vertex Responses (SVRs)
- •5.1.1.4 Left and Right Hemisphere EEG Responses
- •5.1.1.5 Left and Right Hemisphere MEG Responses
- •5.1.2 Auditory Signal Processing Model
- •5.2 Temporal Factors Extracted from Autocorrelations of Sound Signals
- •5.3 Auditory Temporal Window for Autocorrelation Processing
- •5.5 Auditory Temporal Window for Binaural Processing
- •5.6 Hemispheric Specialization for Spatial Attributes of Sound Fields
- •6 Temporal Sensations of the Sound Signal
- •6.1 Combinations of Temporal and Spatial Sensations
- •6.2 Pitch of Complex Tones and Multiband Noise
- •6.2.1 Perception of the Low Pitch of Complex Tones
- •6.2.3 Frequency Limits of Missing Fundamentals
- •6.3 Beats Induced by Dual Missing Fundamentals
- •6.4 Loudness
- •6.4.1 Loudness of Sharply Filtered Noise
- •6.4.2 Loudness of Complex Noise
- •6.6 Timbre of an Electric Guitar Sound with Distortion
- •6.6.3 Concluding Remarks
- •7 Spatial Sensations of Binaural Signals
- •7.1 Sound Localization
- •7.1.1 Cues of Localization in the Horizontal Plane
- •7.1.2 Cues of Localization in the Median Plane
- •7.2 Apparent Source Width (ASW)
- •7.2.1 Apparent Width of Bandpass Noise
- •7.2.2 Apparent Width of Multiband Noise
- •7.3 Subjective Diffuseness
- •8.1 Pitches of Piano Notes
- •8.2 Design Studies of Concert Halls as Public Spaces
- •8.2.1 Genetic Algorithms (GAs) for Shape Optimization
- •8.2.2 Two Actual Designs: Kirishima and Tsuyama
- •8.3 Individualized Seat Selection Systems for Enhancing Aural Experience
- •8.3.1 A Seat Selection System
- •8.3.2 Individual Subjective Preference
- •8.3.3 Distributions of Listener Preferences
- •8.5 Concert Hall as Musical Instrument
- •8.5.1 Composing with the Hall in Mind: Matching Music and Reverberation
- •8.5.2 Expanding the Musical Image: Spatial Expression and Apparent Source Width
- •8.5.3 Enveloping Music: Spatial Expression and Musical Dynamics
- •8.6 Performing in a Hall: Blending Musical Performances with Sound Fields
- •8.6.1 Choosing a Performing Position on the Stage
- •8.6.2 Performance Adjustments that Optimize Temporal Factors
- •8.6.3 Towards Future Integration of Composition, Performance and Hall Acoustics
- •9.1 Effects of Temporal Factors on Speech Reception
- •9.2 Effects of Spatial Factors on Speech Reception
- •9.3 Effects of Sound Fields on Perceptual Dissimilarity
- •9.3.1 Perceptual Distance due to Temporal Factors
- •9.3.2 Perceptual Distance due to Spatial Factors
- •10.1 Method of Noise Measurement
- •10.2 Aircraft Noise
- •10.3 Flushing Toilet Noise
- •11.1 Noise Annoyance in Relation to Temporal Factors
- •11.1.1 Annoyance of Band-Pass Noise
- •11.2.1 Experiment 1: Effects of SPL and IACC Fluctuations
- •11.2.2 Experiment 2: Effects of Sound Movement
- •11.3 Effects of Noise and Music on Children
- •12 Introduction to Visual Sensations
- •13 Temporal and Spatial Sensations in Vision
- •13.1 Temporal Sensations of Flickering Light
- •13.1.1 Conclusions
- •13.2 Spatial Sensations
- •14 Subjective Preferences in Vision
- •14.1 Subjective Preferences for Flickering Lights
- •14.2 Subjective Preferences for Oscillatory Movements
- •14.3 Subjective Preferences for Texture
- •14.3.1 Preferred Regularity of Texture
- •15.1 EEG Correlates of Preferences for Flickering Lights
- •15.1.1 Persistence of Alpha Rhythms
- •15.1.2 Spatial Extent of Alpha Rhythms
- •15.2 MEG Correlates of Preferences for Flickering Lights
- •15.2.1 MEG Correlates of Sinusoidal Flicker
- •15.2.2 MEG Correlates of Fluctuating Flicker Rates
- •15.3 EEG Correlates of Preferences for Oscillatory Movements
- •15.4 Hemispheric Specializations in Vision
- •16 Summary of Auditory and Visual Sensations
- •16.1 Auditory Sensations
- •16.1.1 Auditory Temporal Sensations
- •16.1.2 Auditory Spatial Sensations
- •16.1.3 Auditory Subjective Preferences
- •16.1.4 Effects of Noise on Tasks and Annoyance
- •16.2.1 Temporal and Spatial Sensations in Vision
- •16.2.2 Visual Subjective Preferences
- •References
- •Glossary of Symbols
- •Abbreviations
- •Author Index
- •Subject Index
2.1 Analysis of Source Signals |
13 |
Loundness also has ACF correlates. When p’(t) is measured in reference to the pressure 20 μPa leading to the level L(t) , the sound-pressure level Leq is defined by
1 |
|
T |
L(t) |
|
|
|
Leq = 10 log |
|
|
10 |
|
dt |
(2.8) |
T |
0 |
10 |
||||
corresponding to |
|
|
|
|
|
|
10 log p(0) |
|
|
(2.9) |
|||
Although SPL is an important factor related to loudness, it is not the whole story. The envelope of the normalized ACF is also related to important subjective attributes, as is detailed later.
2.1.3 Running Autocorrelation
Because a certain degree of coherence exists in the time sequence of the source signal, which may greatly influence subjective attributes of the sound field, use is made here of the short ACF as well as the long-time ACF.
The short-time moving ACF as a function of the time τ is calculated as (Taguti, and Ando, 1997)
φp(τ) = φp(τ;t,T) |
|
||||
= |
|
|
p(τ;t,T) |
(2.10) |
|
[ p(0;t,T) p(0;τ + t,T)]1/2 |
|
|
|||
where |
|
|
|
||
1 |
t+T |
|
|||
p(τ;t,T) = |
|
p (s)p (s + τ)ds |
(2.11) |
||
2T |
|||||
|
|
|
t−T |
|
|
The normalized ACF satisfies the condition that φp(0) = 1.
To demonstrate a procedure for obtaining the effective duration of the anaylzed short-time ACF analyzed, Fig. 2.3 shows the absolute value in the logarithmic form as a function of the delay time. The envelope decay of the initial and important part of the ACF may be fitted by a straight line in most cases. The effective duration of the ACF, defined by the delay τe at which the envelope of the ACF becomes –10 dB (or 0.1; the tenth-percentile delay), can be easily obtained by the decay rate extrapolated in the range from 0 dB at the origin to –5 dB, for example.
The effective duration of the ACF for various signal durations, 2T, with the moving interval are obtained in such a way. Examples of analyzing the moving ACF of Japanese Syaku-hachi music (Kare-Sansui composed by Hozan Yamamoto, which includes extremely dynamic movements with Ma and Fusi) are shown in Fig. 2.4a–f. The signal duration to be analyzed is discussed in Section 5.3.
14 |
2 Temporal and Spatial Aspects of Sounds and Sound Fields |
Fig. 2.4 Examples of measured effective durations of a 10 s segment of dynamic, Japanese Syaku-hachi music (Music motif K, Kare-Sansui, Yamamoto) computed from its running autocorrelation using different temporal integration windows (2T). Temporal stepsize was 100 ms. (a) 2T=100 ms. (b) 2T=200 ms. (c) 2T=500 ms. (d) 2T=1 s. (e) 2T=2 s. (f) 2T=5 s
Figure 2.5a–f show the moving τe for music motifs A and B, 2T = 2.0 s and 5.0 s. The recommended signal duration (2T)r is discussed in Section 5.3. The minimum value of a moving τe, the most active part of music, containing important informa-
2.1 Analysis of Source Signals |
15 |
Fig. 2.4 (continued)
tion and influencing subjective preferece, is discussed in Chapter 3. The value of (τe)min is plotted in Fig. 2.6 as a function of 2T. It is worth noting that stable values of (τe)min may be obtained in the range of 2T = 0.5 to 2.0 s for these extreme
16 |
2 Temporal and Spatial Aspects of Sounds and Sound Fields |
(a)
(b)
(c)
(d)
Fig. 2.5 (continued)
2.1 Analysis of Source Signals |
17 |
(e)
(f)
Fig. 2.5 Examples of measured running effective durations for two musical pieces and a spoke poem. Temporal stepsize was 100 ms with temporal integration time 2T = 2 or 5 s. (a) and (b) Slow musical piece, motif K, Kare-Sansui, (Yamamoto). (c) and (d) Slow musical piece, motif A (Gibbons) (e) and (f) Fast musical piece, motif B (Arnold)
Fig. 2.6 Minimum values of the running effective duration as a function of temporal integration time 2T. (circles, curve A) Slow musical piece, motif A (Gibbons). (triangles, curve B) Fast musical piece, motif B (Arnold). (filled squares, curve K) Kare-Sansui, music motif K (Yamamoto)
