Добавил:
Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:
акустика / xie_bosun_spatial_sound_principles_and_applications.pdf
Скачиваний:
158
Добавлен:
04.05.2023
Размер:
28.62 Mб
Скачать

184  Spatial Sound

then the loudspeaker signals are given as

A

A

1 cos

 

S

 

Q .

(4.3.79)

i S

total

 

 

i

 

Similar to Equation (4.3.40) for the first-order Ambisonics, for arbitrary Q-order Ambisonic signals given in Equation (4.3.53), although the virtual source direction is independent from

Atotal, Atotal determines the overall sound pressure or power in reproduction. For the constant amplitude normalization similar to Equation (4.3.41), Atotal can be obtained from the first

equation in Equation (4.3.56)

Atotal =

1

.

(4.3.80a)

 

 

M

 

This equation is identical to the result of the first-order Ambisonics. For the constant power normalization similar to Equation (4.3.43), the following equation can be obtained:

Atotal

 

 

 

1

 

.

(4.3.80b)

 

 

 

 

 

 

 

 

Q

 

 

 

 

 

 

 

 

 

 

 

 

 

M

 

1

2

 

 

 

 

2 q

 

 

 

 

 

 

q 1

 

 

For κq = κ0 = 1 given in Equation (4.3.62),

Atotal

 

1

.

(4.3.81)

 

 

M

2Q 1

 

 

 

 

For κq expressed in Equation (4.3.77),

Atotal

 

1

 

.

(4.3.82)

 

 

 

M

Q 1

 

 

 

 

 

 

4.3.4  Discussion and implementation of the horizontal Ambisonics

From the point of multichannel loudspeaker signals, Ambisonics is a global amplitudebased mixing or panning method. From the point of information representation, Ambisonic encodes the spatial information of a sound field into a set of independent and universal signals, which are independent from the loudspeaker configuration in reproduction. Diverse and equivalent forms of independent signals are available. The number and configuration of loudspeakers for Ambisonic reproduction are flexible. Loudspeaker signals are derived from independent signals by decoding equations or matrices. Ambisonics can recreate the spatial information of sound field at the central listening position and up to a certain frequency limit. Therefore, Ambisonics is a universal and flexible system from the point of signal transmission and reproduction.

From the point of physics, Ambisonics is a series of systems with various orders that are based on the principle of spatial harmonics decomposition and each order approximation of the sound field. In Section 1.9.1, Ambisonics is a typical example of a gradual transition

Horizontal surround with regular loudspeaker configuration  185

from the approximate to the exact reconstruction of the target sound field. The lower-order spatial harmonics in Ambisonics represent the rough information of the spatial sound field, and the higher harmonics represent the detailed information of the spatial sound field. When the capacity of signal transmission (and storage) is limited, transmitting the information in the order from rough to detailed is reasonable. Ambisonics reduces the redundancy among the transmitted information and enables efficient use of the finite transmission capacity of the system because the independent information of the spatial sound field is extracted through spatial harmonic decomposition.

The lower-order Ambisonics can reconstruct the target sound field within a very small spatial region and limited frequency range; therefore, appropriate psychoacoustic methods should be applied to create a virtual source and other spatial auditory perceptions. The perception performance of the lower-order (e.g., the first-order) Ambisonics is limited. As the order increases, the spatial region and frequency range of the accurate transmission and reproduction of the information of the spatial sound field are extended gradually. Consequently, the spatial resolution and perceived performance of the reproduced sound field improve, and the listening region widens. However, the required number of independent signals and loudspeakers also increases as the order increases. Therefore, Ambisonics is a series of hierarchical systems in which the performance and complexity increase as the order increases. When the form of independent signals, such as the universal form of spatial harmonics given in Equation (4.3.49), is appropriately chosen, the HOA can be implemented by adding new independent signals to the lower-order Ambisonics and combining into the decoding equation. This characteristic enables the upward and downward compatibilities among different order Ambisonics. Previous studies also suggested some methods for the compatible transmission of mono, stereophonic, and first-order Ambisonic signals (Gerzon, 1985; Xie X.F., 1982).

The aforementioned features are held for horizontal and spatial Ambisonics (Section 6.4). Q-order horizontal Ambisonics involves an odd number of (2Q + 1) independent signals and requires Mmin = (2Q + 1) or more appropriately (2Q + 2) reproduction channels and loudspeakers; that is, the number of loudspeakers should be equal to or at least slightly larger than the number of independent signals. Further increasing the number of loudspeakers may improve the uniformity of the reproduced sound field but may cause new problems.

Regular horizontal loudspeaker configurations are discussed in the preceding discussion, which is true for most early domestic surround sounds. For irregular loudspeaker configurations, if loudspeakers are arranged front-back and left-right symmetrically so that each pair of loudspeakers is respectively located in two opposite directions alone the diameter of a circle, loudspeaker signals can also be derived easily. For example, the azimuths of four loudspeakers in a rectangular configuration are given as

LF 0 LB 1 180 0 RB 2 180 0 RF 3 0.

(4.3.83)

For the first-order Ambisonics, the signals or decoding equation of the four loudspeakers is derived similarly to that in Section 4.3.2 (Gerzon, 1985):

 

1

 

1

 

 

Ai S Atotal W

 

X

 

Y .

(4.3.84)

cos i

sin i

 

 

 

 

where Ai(θS) with i = 0, 1, 2, 3 denotes ALF, ALB, ARB, and ARF, respectively. When θ0 ≠ 45°, the overall power of the four loudspeaker signals is no longer a constant; instead, it depends on the target source azimuth.

186 Spatial Sound

The mathematical derivation of the decoding equation for other irregular loudspeaker configurations is complicated and may deal with the solution of nonlinear equations and numerical calculations. Moreover, some physical and psychoacoustic criteria for optimization may conflict one another; therefore, requirements from these criteria cannot be satisfied, or the solution of mathematical equations is physically inappropriate. This problem is addressed in Section 5.2.3.

In Sections 4.3.2 and 4.3.3, two methods are applied to improve the midand high-fre- quency localization performance in Ambisonic reproduction. The most effective method is to use the secondor higher-order Ambisonics. For example, a virtual source localization experiment on the Q = 1, 2, and 3-order horizontal Ambisonics indicated that the secondorder Ambisonic reproduction with six or eight loudspeakers exhibits a stable localization effect of full 360° horizontal virtual sources in the central listening position and for speech and music stimuli (Xie and Xie, 1996). This result is due to the dominant role of interaural phase delay difference and its dynamic variation with head rotation in horizontal localization (Section 1.6.5). Compared with the lower-order Ambisonics, higher-order Ambisonics further improve the perceived performance, but it is more complicated. Another method is to decode the lowand mid-to-high-frequency components with different optimizing criteria, such as the criteria of velocity localization vector and energy localization vector. In practice, a set of shelf filters is used to decode the lowand mid-to-high-frequency components of independent signals with different optimizing parameters. The crossover frequency of shelf filters is chosen on the basis of psychoacoustic consideration. Gerzon (1985) as well as Gerzon and Barton (1992) suggested a crossover frequency of 0.7 kHz for the first-order Ambisonics. Daniel et al. (1998) suggested a higher crossover frequency for the 2- and higherorder Ambisonics, such as 1.2 kHz for the second-order Ambisonics. As the order increases, the upper frequency limit for the exact reconstruction of the sound field increases. In this case, it may be unnecessary to optimize high-frequency decoding.

In Section 4.3.1, independent signals for Ambisonics have various forms corresponding to various formats of Ambisonics. Choosing W, X, and Y given in Equation (4.3.3) as the independent signals of the first-order horizontal Ambisonics is convenient. In practice, the levels of X and Y are enhanced by 3 dB to ensure a diffused field power response identical to that of M. The independent signals are written as

W W 1 X 2X 2 cos S Y 2Y 2 sin S .

(4.3.85)

The decoding equation should be appropriately changed to make adjustments for the independent signals in Equation (4.3.85). The Ambisonic system with independent signals given by (4.3.85) is termed the first-order B-format horizontal Ambisonics (Gerzon, 1985). Figure 4.19 is the block diagram of the B-format first-order horizontal Ambisonics with six

Figure 4.19 Block diagram of the B-format first-order horizontal Ambisonics with six reproduction channels.