- •Preface
- •Acknowledgments
- •Contents
- •1 Introduction
- •1.1 Auditory Temporal and Spatial Factors
- •1.2 Auditory System Model for Temporal and Spatial Information Processing
- •2.1 Analysis of Source Signals
- •2.1.1 Power Spectrum
- •2.1.2 Autocorrelation Function (ACF)
- •2.1.3 Running Autocorrelation
- •2.2 Physical Factors of Sound Fields
- •2.2.1 Sound Transmission from a Point Source through a Room to the Listener
- •2.2.2 Temporal-Monaural Factors
- •2.2.3 Spatial-Binaural Factors
- •2.3 Simulation of a Sound Field in an Anechoic Enclosure
- •3 Subjective Preferences for Sound Fields
- •3.2.1 Optimal Listening Level (LL)
- •3.2.4 Optimal Magnitude of Interaural Crosscorrelation (IACC)
- •3.3 Theory of Subjective Preferences for Sound Fields
- •3.4 Evaluation of Boston Symphony Hall Based on Temporal and Spatial Factors
- •4.1.1 Brainstem Response Correlates of Sound Direction in the Horizontal Plane
- •4.1.2 Brainstem Response Correlates of Listening Level (LL) and Interaural Crosscorrelation Magnitude (IACC)
- •4.1.3 Remarks
- •4.2.2 Hemispheric Lateralization Related to Spatial Aspects of Sound
- •4.2.3 Response Latency Correlates of Subjective Preference
- •4.3 Electroencephalographic (EEG) Correlates of Subjective Preference
- •4.3.3 EEG Correlates of Interaural Correlation Magnitude (IACC) Changes
- •4.4.1 Preferences and the Persistence of Alpha Rhythms
- •4.4.2 Preferences and the Spatial Extent of Alpha Rhythms
- •4.4.3 Alpha Rhythm Correlates of Annoyance
- •5.1 Signal Processing Model of the Human Auditory System
- •5.1.1 Summary of Neural Evidence
- •5.1.1.1 Physical Characteristics of the Ear
- •5.1.1.2 Left and Right Auditory Brainstem Responses (ABRs)
- •5.1.1.3 Left and Right Hemisphere Slow Vertex Responses (SVRs)
- •5.1.1.4 Left and Right Hemisphere EEG Responses
- •5.1.1.5 Left and Right Hemisphere MEG Responses
- •5.1.2 Auditory Signal Processing Model
- •5.2 Temporal Factors Extracted from Autocorrelations of Sound Signals
- •5.3 Auditory Temporal Window for Autocorrelation Processing
- •5.5 Auditory Temporal Window for Binaural Processing
- •5.6 Hemispheric Specialization for Spatial Attributes of Sound Fields
- •6 Temporal Sensations of the Sound Signal
- •6.1 Combinations of Temporal and Spatial Sensations
- •6.2 Pitch of Complex Tones and Multiband Noise
- •6.2.1 Perception of the Low Pitch of Complex Tones
- •6.2.3 Frequency Limits of Missing Fundamentals
- •6.3 Beats Induced by Dual Missing Fundamentals
- •6.4 Loudness
- •6.4.1 Loudness of Sharply Filtered Noise
- •6.4.2 Loudness of Complex Noise
- •6.6 Timbre of an Electric Guitar Sound with Distortion
- •6.6.3 Concluding Remarks
- •7 Spatial Sensations of Binaural Signals
- •7.1 Sound Localization
- •7.1.1 Cues of Localization in the Horizontal Plane
- •7.1.2 Cues of Localization in the Median Plane
- •7.2 Apparent Source Width (ASW)
- •7.2.1 Apparent Width of Bandpass Noise
- •7.2.2 Apparent Width of Multiband Noise
- •7.3 Subjective Diffuseness
- •8.1 Pitches of Piano Notes
- •8.2 Design Studies of Concert Halls as Public Spaces
- •8.2.1 Genetic Algorithms (GAs) for Shape Optimization
- •8.2.2 Two Actual Designs: Kirishima and Tsuyama
- •8.3 Individualized Seat Selection Systems for Enhancing Aural Experience
- •8.3.1 A Seat Selection System
- •8.3.2 Individual Subjective Preference
- •8.3.3 Distributions of Listener Preferences
- •8.5 Concert Hall as Musical Instrument
- •8.5.1 Composing with the Hall in Mind: Matching Music and Reverberation
- •8.5.2 Expanding the Musical Image: Spatial Expression and Apparent Source Width
- •8.5.3 Enveloping Music: Spatial Expression and Musical Dynamics
- •8.6 Performing in a Hall: Blending Musical Performances with Sound Fields
- •8.6.1 Choosing a Performing Position on the Stage
- •8.6.2 Performance Adjustments that Optimize Temporal Factors
- •8.6.3 Towards Future Integration of Composition, Performance and Hall Acoustics
- •9.1 Effects of Temporal Factors on Speech Reception
- •9.2 Effects of Spatial Factors on Speech Reception
- •9.3 Effects of Sound Fields on Perceptual Dissimilarity
- •9.3.1 Perceptual Distance due to Temporal Factors
- •9.3.2 Perceptual Distance due to Spatial Factors
- •10.1 Method of Noise Measurement
- •10.2 Aircraft Noise
- •10.3 Flushing Toilet Noise
- •11.1 Noise Annoyance in Relation to Temporal Factors
- •11.1.1 Annoyance of Band-Pass Noise
- •11.2.1 Experiment 1: Effects of SPL and IACC Fluctuations
- •11.2.2 Experiment 2: Effects of Sound Movement
- •11.3 Effects of Noise and Music on Children
- •12 Introduction to Visual Sensations
- •13 Temporal and Spatial Sensations in Vision
- •13.1 Temporal Sensations of Flickering Light
- •13.1.1 Conclusions
- •13.2 Spatial Sensations
- •14 Subjective Preferences in Vision
- •14.1 Subjective Preferences for Flickering Lights
- •14.2 Subjective Preferences for Oscillatory Movements
- •14.3 Subjective Preferences for Texture
- •14.3.1 Preferred Regularity of Texture
- •15.1 EEG Correlates of Preferences for Flickering Lights
- •15.1.1 Persistence of Alpha Rhythms
- •15.1.2 Spatial Extent of Alpha Rhythms
- •15.2 MEG Correlates of Preferences for Flickering Lights
- •15.2.1 MEG Correlates of Sinusoidal Flicker
- •15.2.2 MEG Correlates of Fluctuating Flicker Rates
- •15.3 EEG Correlates of Preferences for Oscillatory Movements
- •15.4 Hemispheric Specializations in Vision
- •16 Summary of Auditory and Visual Sensations
- •16.1 Auditory Sensations
- •16.1.1 Auditory Temporal Sensations
- •16.1.2 Auditory Spatial Sensations
- •16.1.3 Auditory Subjective Preferences
- •16.1.4 Effects of Noise on Tasks and Annoyance
- •16.2.1 Temporal and Spatial Sensations in Vision
- •16.2.2 Visual Subjective Preferences
- •References
- •Glossary of Symbols
- •Abbreviations
- •Author Index
- •Subject Index
Chapter 14
Subjective Preferences in Vision
In order to rationally design visual objects and spaces with both temporal and spatial sensations in mind (“temporal design”), we want to understand subjective preferences for visual stimuli. A series of experiments was carried out to probe the subjective preferences for temporal and spatial factors in vision. This included preferences for rates of flickering lights, for frequencies of oscillating vertical and horizontal movements, and for texture-related spatial regularities. The temporal and spatial factors are extracted respectively from temjporal and spatial autocorrelation functions (ACFs).
14.1 Subjective Preferences for Flickering Lights
Subjective preference for a flickering light obtained using paired comparison tests (PCT) was investigated in terms of the ACF factors of the signal in the time domain. It has been found that the preferred period of the flickering light with a sinusoidal signal is approximately 1 Hz (Soeta et al., 2002a). The purpose of this study was to find more preferred conditions introducing a fluctuation to the flickering light. For this purpose, the bandwidth of the noise centered on 1 Hz was varied at five levels (1, 2, 4, 8, and 16 Hz) by use of the second-order Chebyshev filter. A remarkable finding is that the most preferred value of [φ1]p is about 0.46, where φ1 can be extracted from the temporal ACF of the stimulus signal. The scale value of preference is formulated approximately in terms of the 3/2 power of the normalized φ1 of a flickering light by the most preferred value, [φ1]p. This result may suggest a reason, for example, why one likes the twinkling star. Further, we may produce a visual light and even a music signal based on this temporal factor, and also blending visual light and music.
In the natural environment, for example, we have many visual aspects in the temporal fluctuation, such as leaves in the wind and clouds in the sky, twinking stars due to air currents, flames, and flows of water in a river. Flames in a bonfire and a glitter of sunlight reflected by the water surface provide us with a lively and splendid environment.
Although a number of studies have dealt with sensitivity to flickering stimuli (e.g., de Lange, 1952; Kelly, 1961; Mandler and Makous, 1984; Kremers et al.,
Y. Ando, P. Cariani (Guest ed.), Auditory and Visual Sensations, |
253 |
DOI 10.1007/b13253_14, C Springer Science+Business Media, LLC 2009 |
|
254 |
14 Subjective Preferences in Vision |
1993; Wu et al., 1996), this section is concerned with subjective preference in timevarying light. Subjective preference was initially chosen as a primitive and overall response relating to aesthetics. It would lead the individual away from inappropriate environments and toward desirable ones (Schroeder et al., 1974; Kaplan, 1987).
It has been found that the preferred period of a sinusoidally-flickering light is approximately 1 Hz. To obtain more preferred conditions, we introduced a fluctuation to the sinusoidal flickering light centered on 1 Hz (Soeta et al., 2005). The amplitude of the first maximum peak of the ACF, φ1, was controlled by changing the bandwidth of the noise (1, 2, 4, 8, and 16 Hz). This temporal factor that reflects flicker regularity is the visual analogue of auditory pitch strength. An 8-mm- diameter green LED, set at a distance of 1.0 m from the subject in dark surroundings, produced the light source. The stimulus field from the LED was spatially uniform, and its size corresponded with 0.46◦ of the visual angle. The mean luminance was set to 30 cd/m2 and kept constant during the sessions. Examples of the stimulus signal are shown in Fig. 14.1.
Fig. 14.1 Examples of stimuli used. (a) Sinusoidal wave. (b) f = 1 Hz. (c) f = 4 Hz
The source signal was characterized by the ACF factors, (0), φ1, τ1, and τe (Fig. 14.2, 2T = 4.0 s). The value of (0), representing average power, was set constant. The τ1 corresponding to the center frequency of the band-pass noise was fixed at 1.0 s (Fig. 14.3a). Note that the factor Wφ(0) is constant also. The values of φ1 and τe increase as the bandwidth of noise, f, decreases (Fig. 14.3b and c). Because
14.1 Subjective Preferences for Flickering Lights |
255 |
Fig. 14.2 An example of the normalized autocorrelation function NACF of a signal showing definitions of dominant periodicity τ 1 and its relative magnitude φ1. For acoustic signals these features correspond respectively to pitch and pitch strength or salience
there is a certain degree of coherence between φ1 and τe, subjective preference for a flickering light is discussed based only on φ1 in this study.
Ten 20to 23-year-old subjects participated in the experiment. All had normal or corrected-to-normal vision. They adapted to the dark and watched the LED stimulus. Then, the PCT was performed for all combinations of the pairs (i.e., 15 pairs of stimuli interchanging the order in each pair per session), and the pairs were presented in random order. A total of 10 sessions were conducted for each individual subject. The subjects were asked which stimulus they preferred to watch.
The most preferred, [φ1]p, for each subject was obtained by fitting a suitable polynomial curve to a graph on which scale values were plotted. Figure 14.4 shows an example of the method used for estimating [φ1]p. The peak of this curve denotes the subject’s most preferred value. Table 14.1 shows the results of [φ1]p for the 10 subjects. The averaged value of [φ1]p was 0.46.
We also attempted to determine the characteristics of the preference evaluation curve in more detail, in a similar manner to those described in Sections 3.2 and 7.3.
The preference evaluation curve can be expressed widely as |
|
S = SL ≈ −α|x|β |
(14.1) |
where α and β are the weighting coefficients, and x = logφ1 – log[φ1]p. After obtaining the most preferred value [φ1]p for each subject, values of α and β were obtained. To simplify Equation (14.1), the coefficient β may be fixed at a certain value so that the individual preference curve can be expressed by the sole coefficient α. The average value of β, estimated by a quasi-Newton numerical method, was approximately 1.47, as shown in Table 14.2. As discussed later, the scale value of preference for the period of a flickering light and the circular stimulus moving in vertical and horizontal directions is also formulated approximately in terms of the 3/2 power of the normalized period. Thus, the value of β was fixed at 3/2 here again, so that the coefficient α can represent individual difference in subjective preference. The
256 |
14 Subjective Preferences in Vision |
a |
b |
c
Fig. 14.3 Measured factors extracted from the temporal ACF of the visual source signal as a function of the bandwidth f. (a) Delay time of the first maximum peak of ACF (τ1). (b) Amplitude
of the first maximum peak of ACF (φ1). (c) Effective duration of ACF (τe). The τe of the pure tone is ∞
values obtained by a quasi-Newton numerical method are listed in Table 14.3. The weighting coefficient α describes the sharpness of the preference curve with respect to φ1. The large α value signifies that the subject clearly differentiates the level of preference.
Figure 14.5 shows scale values for all of the subjects and the preference evaluation curve calculated by Equation (14.1). The correlation coefficient between scale values of preference and calculated values by Equation (14.1) is 0.92 (p < 0.01). Remarkably, the resulting preferred fluctuation expressed by φ1 is an intermediate of the values between those of sine and a perfectly random wave.
The scale value of preference is formulated commonly in terms of the 3/2 power of x of a flickering light. This behavior is consistent with preference judgment for a flickering light without fluctuation (Soeta et al., 2001); a circular target
14.1 Subjective Preferences for Flickering Lights |
257 |
Fig. 14.4 An example of obtaining the most preferred flicker regularity value, [φ1]p (≈ 0.58), for a single subject. The scale value at φ1 = 1 is not taken into consideration in the curve fitted because the decline of preference saturated already at φ1 = 0.85 in this case
Table 14.1 The most preferred regularity value for flickering light [φ1]p for each observer and the averaged value
Table 14.2 Values of α and β for each observer obtained for Equation (14.1). It is worth noting that the average value of β = 1.47 ≈ 3/2. When β is fixed at 3/2, the individual differences may be expressed by the constant α (the averaged value of α is 10.98)
Observer |
[φ1]p |
A |
0.51 |
B |
0.50 |
C |
0.47 |
D |
0.58 |
E |
0.45 |
F |
0.90 |
G |
0.27 |
H |
0.33 |
I |
0.33 |
J |
0.31 |
Averaged |
0.46 |
|
|
Observer |
α |
β |
|
|
|
A |
9.19 |
1.17 |
B |
4.98 |
0.72 |
C |
7.98 |
1.28 |
D |
11.98 |
1.39 |
E |
11.42 |
1.36 |
F |
5.53 |
1.36 |
G |
6.94 |
1.46 |
H |
24.72 |
2.37 |
I |
14.93 |
2.41 |
J |
6.57 |
1.23 |
Averaged |
|
1.47 |
|
|
|
258
Table 14.3 The value of α obtained for each observer representing the individual difference and the averaged value. When the value of β is fixed at 3/2 in Equation (14.1), then individual differences may be expressed by the constant α
14 Subjective Preferences in Vision
Observer |
α |
|
|
A |
15.99 |
B |
18.00 |
C |
11.02 |
D |
14.45 |
E |
13.82 |
F |
6.49 |
G |
7.27 |
H |
8.32 |
I |
5.88 |
J |
8.53 |
Averaged |
10.98 |
|
|
moving in the vertical and horizontal directions (see Section 14.2) and matching the tonal tempo of windblown camphor leaves (see Section 16.1.1). Thus, Equation (14.1) represents the preference evaluation curve, as similar to that of the sound field.
It has been reported that the power spectrum of natural images tends to behave regularly, with values roughly corresponding to 1/fs2, where fs is the spatial frequency (Field, 1987; Runderman and Bialek, 1994; Schaaf and Hateren, 1996), and the temporal power spectrum of natural time-varying images is given by 1/ft2 on the temporal frequency ft (Dong and Atick, 1995). Previous studies showed that the luminous patterns of fireflies and candlelight have a 1/fn fluctuation mode (Doi et al., 1997; Inagaki et al., 2001). It has also been reported that the spectral density of fluctuations in the audio power of many musical selections and of English speech varies approximately as 1/f, where f is the frequency (Voss and Clarke, 1978a,b). Though spectral pattern may be one factor influencing subjective preference, the stimulus used in this study does not have any 1/fn fluctuation mode. Many repre-
Fig. 14.5 The normalized scale value of preference for all subjects. The solid curve is the value calculated by Equation (14.1) with constants α = 10.98 and β = 3/2 (Table 14.2). Different symbols indicate scale values obtained with different subjects. The abscissa is normalized by [φ1]p. The scale value at [φ1]p is adjusted to zero, without loss of any generality
