- •Preface
- •Acknowledgments
- •Contents
- •1 Introduction
- •1.1 Auditory Temporal and Spatial Factors
- •1.2 Auditory System Model for Temporal and Spatial Information Processing
- •2.1 Analysis of Source Signals
- •2.1.1 Power Spectrum
- •2.1.2 Autocorrelation Function (ACF)
- •2.1.3 Running Autocorrelation
- •2.2 Physical Factors of Sound Fields
- •2.2.1 Sound Transmission from a Point Source through a Room to the Listener
- •2.2.2 Temporal-Monaural Factors
- •2.2.3 Spatial-Binaural Factors
- •2.3 Simulation of a Sound Field in an Anechoic Enclosure
- •3 Subjective Preferences for Sound Fields
- •3.2.1 Optimal Listening Level (LL)
- •3.2.4 Optimal Magnitude of Interaural Crosscorrelation (IACC)
- •3.3 Theory of Subjective Preferences for Sound Fields
- •3.4 Evaluation of Boston Symphony Hall Based on Temporal and Spatial Factors
- •4.1.1 Brainstem Response Correlates of Sound Direction in the Horizontal Plane
- •4.1.2 Brainstem Response Correlates of Listening Level (LL) and Interaural Crosscorrelation Magnitude (IACC)
- •4.1.3 Remarks
- •4.2.2 Hemispheric Lateralization Related to Spatial Aspects of Sound
- •4.2.3 Response Latency Correlates of Subjective Preference
- •4.3 Electroencephalographic (EEG) Correlates of Subjective Preference
- •4.3.3 EEG Correlates of Interaural Correlation Magnitude (IACC) Changes
- •4.4.1 Preferences and the Persistence of Alpha Rhythms
- •4.4.2 Preferences and the Spatial Extent of Alpha Rhythms
- •4.4.3 Alpha Rhythm Correlates of Annoyance
- •5.1 Signal Processing Model of the Human Auditory System
- •5.1.1 Summary of Neural Evidence
- •5.1.1.1 Physical Characteristics of the Ear
- •5.1.1.2 Left and Right Auditory Brainstem Responses (ABRs)
- •5.1.1.3 Left and Right Hemisphere Slow Vertex Responses (SVRs)
- •5.1.1.4 Left and Right Hemisphere EEG Responses
- •5.1.1.5 Left and Right Hemisphere MEG Responses
- •5.1.2 Auditory Signal Processing Model
- •5.2 Temporal Factors Extracted from Autocorrelations of Sound Signals
- •5.3 Auditory Temporal Window for Autocorrelation Processing
- •5.5 Auditory Temporal Window for Binaural Processing
- •5.6 Hemispheric Specialization for Spatial Attributes of Sound Fields
- •6 Temporal Sensations of the Sound Signal
- •6.1 Combinations of Temporal and Spatial Sensations
- •6.2 Pitch of Complex Tones and Multiband Noise
- •6.2.1 Perception of the Low Pitch of Complex Tones
- •6.2.3 Frequency Limits of Missing Fundamentals
- •6.3 Beats Induced by Dual Missing Fundamentals
- •6.4 Loudness
- •6.4.1 Loudness of Sharply Filtered Noise
- •6.4.2 Loudness of Complex Noise
- •6.6 Timbre of an Electric Guitar Sound with Distortion
- •6.6.3 Concluding Remarks
- •7 Spatial Sensations of Binaural Signals
- •7.1 Sound Localization
- •7.1.1 Cues of Localization in the Horizontal Plane
- •7.1.2 Cues of Localization in the Median Plane
- •7.2 Apparent Source Width (ASW)
- •7.2.1 Apparent Width of Bandpass Noise
- •7.2.2 Apparent Width of Multiband Noise
- •7.3 Subjective Diffuseness
- •8.1 Pitches of Piano Notes
- •8.2 Design Studies of Concert Halls as Public Spaces
- •8.2.1 Genetic Algorithms (GAs) for Shape Optimization
- •8.2.2 Two Actual Designs: Kirishima and Tsuyama
- •8.3 Individualized Seat Selection Systems for Enhancing Aural Experience
- •8.3.1 A Seat Selection System
- •8.3.2 Individual Subjective Preference
- •8.3.3 Distributions of Listener Preferences
- •8.5 Concert Hall as Musical Instrument
- •8.5.1 Composing with the Hall in Mind: Matching Music and Reverberation
- •8.5.2 Expanding the Musical Image: Spatial Expression and Apparent Source Width
- •8.5.3 Enveloping Music: Spatial Expression and Musical Dynamics
- •8.6 Performing in a Hall: Blending Musical Performances with Sound Fields
- •8.6.1 Choosing a Performing Position on the Stage
- •8.6.2 Performance Adjustments that Optimize Temporal Factors
- •8.6.3 Towards Future Integration of Composition, Performance and Hall Acoustics
- •9.1 Effects of Temporal Factors on Speech Reception
- •9.2 Effects of Spatial Factors on Speech Reception
- •9.3 Effects of Sound Fields on Perceptual Dissimilarity
- •9.3.1 Perceptual Distance due to Temporal Factors
- •9.3.2 Perceptual Distance due to Spatial Factors
- •10.1 Method of Noise Measurement
- •10.2 Aircraft Noise
- •10.3 Flushing Toilet Noise
- •11.1 Noise Annoyance in Relation to Temporal Factors
- •11.1.1 Annoyance of Band-Pass Noise
- •11.2.1 Experiment 1: Effects of SPL and IACC Fluctuations
- •11.2.2 Experiment 2: Effects of Sound Movement
- •11.3 Effects of Noise and Music on Children
- •12 Introduction to Visual Sensations
- •13 Temporal and Spatial Sensations in Vision
- •13.1 Temporal Sensations of Flickering Light
- •13.1.1 Conclusions
- •13.2 Spatial Sensations
- •14 Subjective Preferences in Vision
- •14.1 Subjective Preferences for Flickering Lights
- •14.2 Subjective Preferences for Oscillatory Movements
- •14.3 Subjective Preferences for Texture
- •14.3.1 Preferred Regularity of Texture
- •15.1 EEG Correlates of Preferences for Flickering Lights
- •15.1.1 Persistence of Alpha Rhythms
- •15.1.2 Spatial Extent of Alpha Rhythms
- •15.2 MEG Correlates of Preferences for Flickering Lights
- •15.2.1 MEG Correlates of Sinusoidal Flicker
- •15.2.2 MEG Correlates of Fluctuating Flicker Rates
- •15.3 EEG Correlates of Preferences for Oscillatory Movements
- •15.4 Hemispheric Specializations in Vision
- •16 Summary of Auditory and Visual Sensations
- •16.1 Auditory Sensations
- •16.1.1 Auditory Temporal Sensations
- •16.1.2 Auditory Spatial Sensations
- •16.1.3 Auditory Subjective Preferences
- •16.1.4 Effects of Noise on Tasks and Annoyance
- •16.2.1 Temporal and Spatial Sensations in Vision
- •16.2.2 Visual Subjective Preferences
- •References
- •Glossary of Symbols
- •Abbreviations
- •Author Index
- •Subject Index
6.3 Beats Induced by Dual Missing Fundamentals |
105 |
2.Within this periodicity range, even if a complex tone with a missing fundamental has only two harmonics (n=2,3), the pitch corresponds to the delay time of the first major peak in the NACF.
6.3 Beats Induced by Dual Missing Fundamentals
It has been observed that a monaural beat can be induced by two complex tones that have slightly different missing fundamentals. This is true even when the envelopebeat component is realized by random-phase components. The beat stimulus is constructed by mixing two complex tones, A and B, that have missing fundamental frequencies at F0a and F0b respectively. When we listen to these complex tones A and B together, a beat is heard that corresponds to the frequency difference between the fundamentals ( f = F0b–F0a). When all components are in phase, the composite tone (A + B) has a waveform repetition and envelope periodicity corresponding to f. When the components are in random phase, however, the envelope periodicity f disappears. Experimental results show that, in both cases, beats of f ≤ 4 Hz were perceived clearly for stimuli with missing fundamentals up to 256 Hz. These results show that beats that are independent of the envelope component can be detected. These phenomena can be explained in terms of the delay time of the maximum peak extracted from the ACF of the sound signal.
An experiment on monaural beats induced by two complex tones with missing fundamentals was conducted (Shimokura and Ando, 2004). Each stimulus signal consisted of two complex tones, A and B, mixed together. Let F0a and F0b be the fundamental frequencies of A and B, respectively, each consisting of upper harmonics (n ≥ 8). φa and φb are phases of complex frequency components. Amplitudes of all components were equal. Fundamental frequencies F0a of the first tone A were either 32, 64, 128, 256, or 512 Hz, while the fundamentals F0b of the second tone B differed from the first by 2, 4, 8, or 16 Hz, respectively, so that F0b = F0a + f. The lowest component of A was always fixed at 1,024 Hz. For example, when F0a = 128 Hz and f = 2 Hz, then the components of A consisted of three harmonics n = 8–10 of 128 Hz: 1,024, 1,152, and 1,280 Hz, while the components of B (F0b = 130 Hz) were harmonics n = 10–12 of 1,300, 1,430, and 1,560 Hz. The total peak sound pressure level measured at the center position of the center of the head was fixed at 74 dB SPL.
Figure 6.11a shows the waveforms of the two complex tones with F0a = 128 Hz and f = 2 Hz. When components are in-phase, the envelope has a periodicity corresponding to f. When phases are random, however, the envelope regularity disappears as shown in Fig. 6.11b. Figure 6.12 shows the results of the ACF analysis of the two complex tones. As is well known, the ACF is identical for in-phase and out-of-phase signals. The maximum peak, τ 1 = 0.5 s, corresponds to 2 Hz. In the ACF for τ < 10 ms shown in Fig. 6.12b, two initial fundamental frequencies (128 and 130 Hz) are indicated by arrows.
106 |
6 Temporal Sensations of the Sound Signal |
Fig. 6.11 Waveforms of two complex tones. (a) In-phase condition. (b) Random condition
Three 23to 24-year-old subjects participated in the beat matching test. Each subject was seated in the listening room, and the same sound signal was fed to the two ears via headphone (Sennheiser, HE60). First, subjects were presented with the two combined complex tones and were asked to listen for a single beat in the sound signal. Then, subjects were presented a train of pulse tones generated by an oscillator and asked to adjust the pulse rate to match the beat perceived for the combined complex tones. This process was repeated until subjects could match an identical beat.
6.3 Beats Induced by Dual Missing Fundamentals |
107 |
Fig. 6.12 NACF analyzed for both conditions. (a) τ ≥ 2.0 s. (b) τ≤ 0.01 s
Beat-matching tests for the set of five stimuli were presented in random sequence, with each composite complex tone being presented a total of 10 times.
Figure 6.13 shows the probability of the subjects correctly matching the perceived beat frequency by adjusting the pulse rate f within the one-third octave that was generated separately. This is shown as a function of the fundamental frequency F0a and as a parameter of the beat frequency f. What we find remarkable is that when f = 2 to 4 Hz, the probabilities for the frequency range of F0a = 32 to 256 Hz almost always exceeded 80% for both in-phase and out-of-phase conditions. When f = 8 to 16 Hz, the probabilities all decreased below 65%. For in-phase conditions, beat-matching probabilities were smaller than those for out-of- phase conditions, only except for one condition, F0a = 512 Hz (p < 0.025). This beat that is perceived is independent of the envelope of the waveform; consequently it was distinguished from an envelope beat.
As discussed previously, the pitch-matching test of the single complex tone shows that the listeners hear a pitch at the fundamental frequency, which can be described by the delay time of the first peak in the ACF below 1200 Hz (Inoue et al., 2001). However, the beat phenomenon induced by the dual missing fundamentals was observed in the range of 32 Hz < F0a < 256 Hz. The periodicity-limiting mechanism of this fundamental frequency range for F0a is unknown.
These experiments demonstrated that:
1.Fundamental frequencies of multiple complex tones can induce an additional secondary fundamental frequency that is perceived as a beat. The perceived beat rate corresponds to the delay time of the maximum peak of the ACF of the whole signal.
2.The perceived beat was independent of the existence of regularities and fluctuations in the waveform envelope of the two-tone stimulus. This rule holds for fundamental frequencies below 256 Hz.
