References

Aarts R.M. (1993): Enlarging the sweet spot for stereophony by time/intensity trading, at the AES 94th Convention, Berlin, Germany, Paper 3473.

Aarts R.M. (2000): Phantom sources applied to stereo-base widening, J. Audio Eng. Soc., 48(3), 181–189.

Adami A., Haberts E.A.P., and Herre J. (2014): Perceptual evaluation of a coherence suppressing downmix method, at the AES 55th International Conference, Helsinki, Finland.

AES Staff Technical Writer (2006a): Binaural technology for mobile applications, J. Audio Eng. Soc., 54(10), 990–995.

AES Staff Technical Writer (2006b): Digital radio broadcasting, J. Audio Eng. Soc., 54(7/8), 771–774.

AES Staff Writer (2004): The world of digital radio, J. Audio Eng. Soc., 52(12), 1272–1278.

AES Technical Council (2001): Multichannel surround sound systems and operations, AES Technical Council Document, AESTD1001.1.01-10.

Ahnert W., and Feistel R. (1993): EARS auralization software, J. Audio Eng. Soc., 41(11), 894–904.

Ahrens J. (2012): Analytic methods of sound field synthesis, Springer-Verlag, Berlin, Germany.

Ahrens J., and Spors S. (2008a): Reproduction of moving virtual sound sources with special attention to the Doppler effect, at the AES 124th Convention, Amsterdam, The Netherlands, Paper 7363.

Ahrens J., and Spors S. (2008b): An analytical approach to sound field reproduction using circular and spherical loudspeaker distributions, Acta Acust. United Ac., 94(6), 988–999.

Ahrens J., and Spors S. (2008c): Focusing of virtual sound sources in higher order Ambisonics, at the AES 124th Convention, Amsterdam, The Netherlands, Paper 7378.

Ahrens J., and Spors S. (2009): Sound field reproduction employing non-omnidirectional loudspeakers, at the AES 126th Convention, Munich, Germany, Paper 7741.

Ahrens J., and Spors S. (2010): Sound field reproduction using planar and linear arrays of loudspeakers, IEEE Trans. Audio, Speech, Language Process., 18(8), 2038–2050.

Ahrens J., and Spors S. (2011): Wave field synthesis of moving virtual sound sources with complex radiation properties, J. Acoust. Soc. Am., 130(5), 2807–2816.

Ahrens J., and Spors S. (2012): Wave field synthesis of a sound field described by spherical harmonics expansion coefficients, J. Acoust. Soc. Am., 131(3), 2190–2199.

Ahveninen J., Kopco N., and Jaaskelainen I.P. (2014): Psychophysics and neuronal bases of sound localization in humans, Hearing Res., 307, 86–97.

Akeroyd M.A., and Whitmer W.M. (2011): Spatial hearing and hearing aids, ENT Audiol News, 20(5), 76–79.

Algazi V.R., Avendano C., and Duda R.O. (2001b): Elevation localization and head-related transfer function analysis at low frequencies, J. Acoust. Soc. Am., 109(3), 1110–1122.

Algazi V.R., Duda R.O., Duraiswami R., et al. (2002): Approximating the head-related transfer function using simple geometric models of the head and torso, J. Acoust. Soc. Am., 112(5), 2053–2064.

Algazi V.R., Duda R.O., Thompson D.M., et al. (2001a): The CIPIC HRTF database, in Proceeding of 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics, New York, 99–102.

Allen I. (2006): The X-Curve, its origins and history, electro-acoustic characteristics in the cinema and the mix-room, the large room and the small, SMPTE Motion Imag. J., 115(7/8), 264–275.

American Standards Association (1960): Acoustical terminology SI, New York, USA.

Ando Y. (1985): Concert hall acoustics, Springer-Verlag Press, Berlin, Germany.

Ando Y. (1998): Architectural acoustics, blending sound sources, sound fields, and listeners, Springer-Verlag, New York, USA.

Ando Y. (2009): Auditory and visual sensations, Springer-Verlag, New York, USA.

ANSI S3.25/ASA80 (1989): Occluded ear simulator, American National Standard, American National Standards Institute, New York, USA.

ANSI S3.36 (1985): Manikin for simulated in-situ airborne acoustic measurements, American National Standard, American National Standards Institute, New York, USA.

ANSI S3.4 (2007): Procedure for the computation of loudness of steady sounds, American National Standard, American National Standards Institute, New York, NY.

Arteaga D. (2013): An Ambisonics decoder for irregular 3D loudspeaker array, at the AES 134th Convention, Rome, Italy, Paper 8918.

Asano F., and Swanson D.C. (1995): Sound equalization in enclosures using modal reconstruction, J. Acoust. Soc. Am., 98(4), 2062–2069.

Ashby T., Mason R., and Brookes T. (2013): Head movements in three-dimensional localisation, at the AES 134th Convention, Rome, Italy, Paper 8881.

Ashby T., Mason R., and Brookes T. (2014): Elevation localisation response accuracy on vertical planes of differing azimuth, at the AES 136th Convention, Berlin, Germany, Paper 9046.

ATSC Standard Doc. A/342-1 (2017): Audio common elements, Advanced Television Systems Committee, Washington, USA.

ATSC Standard Doc. A52 (2012): Digital audio compression (AC-3, E-AC-3), Advanced Television Systems Committee, Washington, USA.

Auro Technologies, and Barco Audio Technologies (2015): AuroMax, next-generation immersive sound system, www.auro-3D.com.

Avendano C., and Jot J.M. (2004): A frequency-domain approach to multichannel upmix, J. Audio Eng. Soc., 52(7/8), 740–749.

Backman J. (2009): Subwoofers in symmetrical and asymmetrical rooms, at the AES 126th Convention, Munich, Germany, Paper 7748.

Backman J. (2010): Subwoofers in rooms: experimental modal analysis, at the AES 128th Convention, London, UK, Paper 7970.

Backman J. (2011): Subwoofers in rooms: modal analysis for loudspeaker placement, at the AES 130th Convention, London, UK, Paper 8323.

Baek Y.H., Jeon S.W., Park Y.C., et al. (2012): Efficient primary-ambient decomposition algorithm for audio upmix, at the AES 133rd Convention, San Francisco, CA, Paper 8754.

Bagousse S.L., Colomes C., and Paquier M. (2010): State of the art on subjective assessment of spatial sound quality, at the AES 38th International Conference, Piteå, Sweden.

Bagousse S.L., Paquier M., and Colomes C. (2014): Categorization of sound attributes for audio quality assessment—a lexical study, J. Audio Eng. Soc., 62(11), 736–747.

Bai M.R., and Bai G. (2005): Optimal design and synthesis of reverberators with a fuzzy user interface for spatial audio, J. Audio Eng. Soc., 53(9), 812–825.

Bai M.R., Hsu H., and Wen J.C. (2014): Spatial sound field synthesis and upmixing based on the equivalent source method, J. Acoust. Soc. Am., 135(1), 269–282.

Bai M.R., Kuo M.C., and Hua Y.H. (2015): An application of miniature microphone array to stereophonic recording compatible to conventional practice, J. Audio Eng. Soc., 63(4), 267–279.

Bai M.R., and Lee C.C. (2006): Objective and subjective analysis of effects of listening angle on crosstalk cancellation in spatial sound reproduction, J. Acoust. Soc. Am., 120(4), 1976–1989.

Bai M.R., and Lee C.C. (2010): Comparative study of design and implementation strategies of automotive virtual surround audio systems, J. Audio Eng. Soc., 58(3), 141–159.

Bai M.R., and Shih G.Y. (2007): Upmixing and downmixing two-channel stereo audio for consumer electronics, IEEE Trans. Consumer Electron., 53(3), 1011–1019.

Balmages I., and Rafaely B. (2007): Open sphere designs for spherical microphone arrays, IEEE Trans. Audio, Speech, Language Process., 15(2), 727–732.

Bamford J.S., and Vanderkooy J. (1995): Ambisonic sound for us, at the AES 99th Convention, New York, USA, Paper 4138.

Barbar S. (2015): Surround sound for cinema, in Handbook for sound engineers (5th edition, edited by Ballou G.), Focal Press, Burlington, USA.

Barbour J.L. (2003): Elevation perception: phantom images in the vertical hemi-sphere, at the AES 24th International Conference, Banff, Canada.

Barrass S. (2012): Digital fabrication of acoustic sonifications, J. Audio Eng. Soc., 60(9), 709–715.

Barron M. (2000): Measured early lateral energy fractions in concert halls and opera houses, J. Sound Vib., 232(1), 79–100.

Barron M., and Marshall A.H. (1981): Spatial impression due to early lateral reflections in concert halls: the derivation of a physical measure, J. Sound Vib., 77(2), 211–232.

Barry D., and Kearney G. (2009): Localisation quality assessment in source separation-based upmixing algorithms, at the AES 35th International Conference, London, UK.

Batke J.M., and Keiler F. (2010): Using VBAP-derived panning functions for 3D Ambisonics decoding, at the 2nd International Symposium on Ambisonics and Spherical Acoustics, Paris, France.

Batteau D.W. (1967): The role of the pinna in human localization, Proc. Royal Soc. London, 168(Ser. B), 158–180.

Bauck J., and Cooper D.H. (1996): Generalized transaural stereo and applications, J. Audio Eng. Soc., 44(9), 683–705.

Bauer B.B. (1960): Broadening the area of stereophonic perception, J. Audio Eng. Soc., 8(2), 91–94.

Bauer B.B. (1961a): Phasor analysis of some stereophonic phenomena, J. Acoust. Soc. Am., 33(11), 1536–1539.

Bauer B.B. (1961b): Stereophonic earphones and binaural loudspeakers, J. Audio Eng. Soc., 9(2), 148–151.

Bauer B.B. (1979): A unified 4-4-4, 4-3-4, 4-2-4 SQ®-compatible system of recording and FM broadcasting (USQ™), J. Audio Eng. Soc., 27(11), 866–880.

Bauer B.B., Allen R.G., and Budelman G.A. (1973b): Quadraphonic matrix perspective-advances in SQ encoding and decoding technology, J. Audio Eng. Soc., 21(5), 342–350.

Bauer B.B., Budelman G.A., and Gravereaux D.W. (1973a): Recording techniques for SQ matrix quadraphonic discs, J. Audio Eng. Soc., 21(1), 19–26.

Bauer B.B., Dimattia A.L., and Rosenheck A.J. (1965): Transmission of directional perception, IEEE Trans. Audio, 13(1), 5–8.

Bauer B.B., Gravereaux D.W., and Gust A.J. (1971): A compatible stereo-quadraphonic (SQ) record system, J. Audio Eng. Soc., 19(8), 638–646.

Baumgarte F., and Faller C. (2003): Binaural cue coding part I, Psychoacoustic fundamentals and design principles, IEEE Trans. Speech Audio Process., 11(6), 509–519.

Baumgartner R., and Majdak P. (2015): Modeling localization of amplitude-panned virtual sources in sagittal planes, J. Audio Eng. Soc., 63(7/8), 562–569.

Baumgartner R., Majdak P., and Laback B. (2013): Assessment of sagittal-plane sound-localization performance in spatial-audio applications, in the Technology of binaural listening (edited by Blauert J.), Springer-Verlag, Berlin Heidelberg.

Bech S. (1995): Perception of reproduced sound: audibility of individual reflections in a complete sound field, II, at the AES 99th Convention, New York, USA, Paper 4093.

Bech S., and Zacharov N. (1999): Multichannel level alignment, part III: the influence of loudspeaker directivity and reproduction bandwidth, at the AES 106th Convention, Munich, Germany, Paper 4909.

Bech S., and Zacharov N. (2006): Perceptual audio evaluation – theory, method and application, John Wiley & Sons, West Sussex, UK.

Begault D.R. (1994): 3-D Sound for virtual reality and multimedia, Academic Press Professional, Cambridge, MA.

Begault D.R. (1998): Virtual acoustics, aeronautics, and communications, J. Audio Eng. Soc., 46(6), 520–530.

Begault D.R. (1999): Virtual acoustic displays for teleconferencing: intelligibility advantage for “telephone-grade audio”, J. Audio Eng. Soc., 47(10), 824–828.

Begault D.R., and Erbe T. (1994): Multichannel spatial auditory display for speech communications, J. Audio Eng. Soc., 42(10), 819–826.

Begault D.R., Wenzel E.M., and Anderson M.R. (2001): Direct comparison of the impact of head tracking, reverberation, and individualized head-related transfer functions on the spatial perception of a virtual speech source, J. Audio Eng. Soc., 49(10), 904–916.

Begault D.R., Wenzel E.M., Godfroy M., et al. (2010): Applying spatial audio to human interfaces: 25 years of NASA experience, at the AES 40th International Conference, Tokyo, Japan.

Bekesy G.V. (1960): Experiments in hearing, McGraw-Hill, New York, USA.

Bell D. (2000): Surround sound studio design, Studio Sound, 42(7), 55–58.

Benjamin E., Lee R., and Heller A. (2010): Why Ambisonics does work? at the AES 129th Convention, San Francisco, USA, Paper 8242.

Bennett J.C., Barker K., and Edeko F.O. (1985): A new approach for the assessment of stereophonic sound system performance, J. Audio Eng. Soc., 33(5), 314–321.

Beranek L. (1996): Concert halls and opera houses, Acoustical Society of America, USA.

Berg J. (2009): The contrasting and conflicting definitions of envelopment, at the AES 126th Convention, Munich, Germany, Paper 7808.

Berg J., and Rumsey F. (2001): Verification and correlation of attributes used for describing the spatial quality of reproduced sound, at the AES 19th International Conference, Schloss Elmau, Germany.

Berg J., and Rumsey F. (2002): Validity of selected spatial attributes in the evaluation of 5-channel microphone techniques, at the AES 112th Convention, Munich, Germany, Paper 5593.

Berg J., and Rumsey F. (2003): Systematic evaluation of perceived spatial quality, at the AES 24th International Conference, Banff, Alberta, Canada.

Berg J., and Rumsey F. (2006): Identification of quality attributes of spatial audio by repertory grid technique, J. Audio Eng. Soc., 54(5), 365–379.

Berkhout A.J. (1988): A holographic approach to acoustic control, J. Audio Eng. Soc., 36(12), 977–995.

Berkhout A.J., Vries D.D., and Vogel P. (1993): Acoustic control by wave field synthesis, J. Acoust. Soc. Am., 93(5), 2764–2778.

Bernfeld B. (1975): Simple equations for multichannel stereophonic sound localization, J. Audio Eng. Soc., 23(7), 553–557.

Betlehem T., and Abhayapala T.D. (2005): Theory and design of sound field reproduction in reverberant rooms, J. Acoust. Soc. Am., 117(4), 2100–2111.

Betlehem T., and Poletti M.A. (2014): Two dimensional sound field reproduction using higher order sources to exploit room reflections, J. Acoust. Soc. Am., 135(4), 1820–1833.

Blanco-Martin E., Casajús-Quirós F.J., Gómez-Alfageme J.J., et al. (2011): Objective measurement of sound event localization in horizontal and median planes, J. Audio Eng. Soc., 59(3), 124–136.

Blauert J. (1997): Spatial hearing: the psychophysics of human sound localization (Revised edition), MIT Press, Cambridge, MA.

Blauert J. (2012): Modeling binaural processing: what next? (abstract), J. Acoust. Soc. Am., 132(3, Pt. 2), 1911.

Blauert J., Brueggen M., Bronkhorst A.W., et al. (1998): The AUDIS catalog of human HRTFs, J. Acoust. Soc. Am., 103(5), 3082.

Blauert J., Lehnert H., Sahrhage J., et al. (2000): An interactive virtual-environment generator for psychoacoustic research I: architecture and implementation, Acta Acust. United Ac., 86(1), 94–102.

Blauert J., and Lindemann W. (1986): Auditory spaciousness: some further psychoacoustic analyses, J. Acoust. Soc. Am., 80(2), 533–542.

Blauert J., and Rabenstein R. (2012): Providing surround sound with loudspeakers: a synopsis of current methods, Arch. Acoust., 37(1), 5–18.

Blauert J., and Xiang N. (2008): Acoustics for engineers, Springer-Verlag, Berlin, Germany.

Blommer M.A., and Wakefield G.H. (1997): Pole-zero approximations for head-related transfer functions using a logarithmic error criterion, IEEE Trans. Speech Audio Process., 5(3), 278–287.

Bloom P.J. (1977): Determination of monaural sensitivity changes due to the pinna by use of minimum-audible-field measurements in the lateral vertical plane, J. Acoust. Soc. Am., 61(3), 820–828.

Blumlein A.D. (1931): Improvements in and relating to sound transmission, sound recording and sound reproducing systems, British Patent Specification 394,325. Reprint in J. Audio Eng. Soc., 6(2), 91–98.

Boehm J. (2011): Decoding for 3D, at the AES 130th Convention, London, UK, Paper 8426.

Boer K.D. (1940): Stereophonic sound reproduction, Philips Tech. Rev., 1940(5), 107–114.

Boer K.D. (1946): The formation of stereophonic image, Philips Tech. Rev., 1946(8), 51–56.

Boer K.D. (1947): A remarkable phenomenon with stereophonic sound reproduction, Philips Tech. Rev., 1947(9), 8–13.

Bogaert T.V.D., Doclo S., Wouters J., et al. (2008): The effect of multimicrophone noise reduction systems on sound localization by users of binaural hearing aids, J. Acoust. Soc. Am., 124(1), 484–497.

Bogaert T.V.D., Klasen T.J., Moonen M., et al. (2006): Horizontal localization with bilateral hearing aids: without is better than with, J. Acoust. Soc. Am., 119(1), 515–526.

Bolia R.S., D’Angelo W.R., and McKinley R.L. (1999): Aurally aided visual search in three-dimensional space, Human Factors, 41(4), 664–669.

Boone M.M. (2004): Multi-actuator panels (MAPs) as loudspeaker arrays for wave field synthesis, J. Audio Eng. Soc., 52(7/8), 712–723.

Boone M.M., and Bruijn W.P.J.D. (2003): Improving speech intelligibility in teleconferencing by using wave field synthesis, at the AES 114th Convention, Amsterdam, The Netherlands, Paper 5800.

Boone M.M., Bruijn W.P.J.D., and Horbach U. (1999): Virtual surround speakers with wave field synthesis, at the AES 106th Convention, Munich, Germany, Paper 4928.

Boone M.M., and Verheijen E.N.G. (1998): Sound reproduction applications with wave-field synthesis, at the AES 104th Convention, Amsterdam, The Netherlands, Paper 4689.

Boone M.M., Verheijen E.N.G., and Van Tol P.F. (1995): Spatial sound field reproduction by wave field synthesis, J. Audio Eng. Soc., 43(12), 1003–1012.

Borenius J. (1985): Perceptibility of direction and time delay errors in subwoofer reproduction, at the AES 79th Convention, New York, USA, Paper 2290.

Bosi M., Brandenburg K., Quackenbush S., et al. (1997): ISO/IEC MPEG-2 advanced audio coding, J. Audio Eng. Soc., 45(10), 789–814.

Bosi M., and Goldberg R.E. (2003): Introduction to digital audio coding and standards, Springer Science+Business Media, New York, USA.

Bouéri M., and Kyriakakis C. (2004): Audio signal decorrelation based on a critical band approach, at the AES 117th Convention, San Francisco, USA, Paper 6291.

Bovbjerg B.P., Christensen F., Minnaar P., et al. (2000): Measuring the head-related transfer functions of an artificial head with high directional resolution, at the AES 109th Convention, Los Angeles, USA, Paper 5264.

Braasch J., Clapp S., Parks A., et al. (2013): A binaural model that analyses acoustic spaces and stereophonic reproduction systems by utilizing head rotations, in the Technology of binaural listening (edited by Blauert J.), Springer-Verlag, Berlin, Germany.

Bradley J.S., and Soulodre G.A. (1995): The influence of late arriving energy on spatial impression, J. Acoust. Soc. Am., 97(4), 2263–2271.

Bradley J.S., and Soulodre G.A. (1996): Listener envelopment: an essential part of good concert hall acoustics, J. Acoust. Soc. Am., 99(1), 22–23.

Brandenburg K., and Bosi M. (1997): Overview of MPEG Audio: current and future standards for low-bit-rate audio coding, J. Audio Eng. Soc., 45(1/2), 4–21.

Brandenburg K., and Stoll G. (1994): ISO/MPEG-1 audio: a generic standard for coding of high-quality digital audio, J. Audio Eng. Soc., 42(10), 780–792.

Breebaart J. (2013): Comparison of interaural intensity differences evoked by real and phantom sources, J. Audio Eng. Soc., 61(11), 850–859.

Breebaart J., and Faller C. (2007): Spatial audio processing: MPEG surround and other applications, John Wiley & Sons, West Sussex, UK.

Breebaart J., Herre J., Villemoes L., et al. (2006): Multi-channel goes mobile: MPEG surround binaural rendering, at the AES 29th International Conference, Seoul, Korea.

Breebaart J., Hotho G., Koppens J., et al. (2007): Background, concept, and architecture for the recent MPEG surround standard on multichannel audio compression, J. Audio Eng. Soc., 55(5), 331–351.

Breebaart J., Par S.V.D., Kohlrausch A., et al. (2005): Parametric coding of stereo audio, EURASIP J. Appl. Signal Process., 2005(9), 1305–1322.

Bregman A.S. (1990): Demonstrations of auditory scene analysis: the perceptual organization, MIT Press, Cambridge, MA.

Briand M., Virette D., and Martin N. (2006): Parametric representation of multichannel audio based on principal component analysis, at the AES 120th Convention, Paris, France, Paper 6813.

Brimijoin W.O., and Akeroyd M.A. (2012): The role of head movements and signal spectrum in an auditory front/back illusion, i-Perception, 3(3), 179–182.

Brix S., Sporer T., and Plogsties J. (2001): CARROUSO-An European approach to 3D-audio, at the AES 110th Convention, Amsterdam, The Netherlands, Paper 5314.

Bronkhorst A.W. (2000): The cocktail party phenomenon: a review of research on speech intelligibility in multiple-talker conditions, Acta Acust. United Ac., 86(1), 117–128.

Bronkhorst A.W., and Houtgast T. (1999): Auditory distance perception in rooms, Nature, 397, 517–520.

Brungart D.S. (1999): Auditory localization of nearby sources. III. Stimulus effects, J. Acoust. Soc. Am., 106(6), 3589–3602.

Brungart D.S., Durlach N.I., and Rabinowitz W.M. (1999): Auditory localization of nearby sources. II. Localization of a broadband source, J. Acoust. Soc. Am., 106(4), 1956–1968.

Brungart D.S., Kordik A.J., and Simpson B.D. (2006): Effects of headtracker latency in virtual audio displays, J. Audio Eng. Soc., 54(1/2), 32–44.

Brungart D.S., and Rabinowitz W.M. (1999): Auditory localization of nearby sources. Head-related transfer functions, J. Acoust. Soc. Am., 106(3), 1465–1479.

Bujacz M., Skulimowski P., and Strumillo P. (2012): Naviton—a prototype mobility aid for auditory presentation of three-dimensional scenes to the visually impaired, J. Audio Eng. Soc., 60(9), 696–708.

Burkhard M.D., and Sachs R.M. (1975): Anthropometric manikin for acoustic research, J. Acoust. Soc. Am., 58(1), 214–222.

Butler R.A., and Belendiuk K. (1977): Spectral cues utilized in the localization of sound in the median sagittal plane, J. Acoust. Soc. Am., 61(5), 1264–1269.

Capra A., Fontana S., Adriaensen F., et al. (2007): Listening tests of the localization performance of stereodipole and Ambisonic systems, at the AES 123rd Convention, New York, USA, Paper 7187.

Cengarle G., Mateos T., and Bonsi D. (2011): A second-order Ambisonics device using velocity transducers, J. Audio Eng. Soc., 59(9), 656–668.

Chang J.H., and Jacobsen F. (2012): Sound field control with a circular double-layer array of loudspeakers, J. Acoust. Soc. Am., 131(6), 4518–4525.

Charpentier T. (2017): Normalization schemes in Ambisonic: does it matter? at the AES 142nd Convention, Berlin, Germany, Paper 9769.

Cheer J., Elliott S.J., and Gálvez M.F.S. (2013): Design and implementation of a car cabin personal audio system, J. Audio Eng. Soc., 61(6), 412–424.

Chen J., Veen B.D.V., and Hecox K.E. (1995): A spatial feature extraction and regularization model of the head-related transfer function, J. Acoust. Soc. Am., 97(1), 439–452.

Chernyak R.I., and Dubrovsky N.A. (1968): Pattern of the noise image and binaural summation of loudness for different interaural correlation of noise, Proceeding of the 6th International Congress on Acoustics, Tokyo, Japan, 53–56.

Cherry E.C. (1953): Some experiments on the recognition of speech, with one and with two ears, J. Acoust. Soc. Am., 25(5), 975–979.

Chétry N., Pallone G., Emerit M., et al. (2007): A discussion about subjective methods for evaluating blind upmix algorithms, at the AES 31st International Conference, London, UK.

Chi S., Xie B.S., and Rao D. (2009): Effect of mismatched loudspeaker pair on virtual sound image (in Chinese), Appl. Acoust., 28(4), 291–299.

Choi I., Shinn-Cunningham B.G., Chon S.B., et al. (2008): Objective measurement of perceived auditory quality in multichannel audio compression coding systems, J. Audio Eng. Soc., 56(1/2), 3–17.

Choi J., and Chang J.H. (2020): Exploiting deep neural networks for two-to-five channel surround decoder, J. Audio Eng. Soc., 68(12), 938–949.

Choi J.W., and Kim Y.H. (2012): Integral approach for reproduction of virtual sound source surrounded by loudspeaker array, IEEE Trans. Audio, Speech, Language Process., 20(7), 1976–1989.

Choi T., Park Y.C., and Youn D.H. (2006): Efficient out of head localization system for mobile applications, at the AES 120th Convention, Paris, France, Paper 6758.

Choisel S., and Wickelmaier F. (2006): Extraction of auditory features and elicitation of attributes for the assessment of multichannel reproduced sound, J. Audio Eng. Soc., 54(9), 815–826.

Chowning J.M. (1971): The simulation of moving sound sources, J. Audio Eng. Soc., 19(1), 2–6.

Chung H., Shim H., Nahn N., et al. (2012): Sound reproduction method by front loudspeaker array for home theater applications, IEEE Trans. Consum. Electron., 58(2), 528–534.

Clack H.A.M., Dutton G.F., and Vanderlyn P.B. (1957): The “stereosonic” recording and reproduction system, IRE Trans. Audio, 5(4), 96–111.

Cobos M., and Lopez J.J. (2009): Resynthesis of sound scenes on wave-field synthesis from stereo mixtures using sound source separation algorithms, J. Audio Eng. Soc., 57(3), 91–110.

Cobos M., and Lopez J.J. (2010): Interactive enhancement of stereo recordings using time-frequency selective panning, at the AES 40th International Conference, Tokyo, Japan.

Cohen E., and Eargle J. (1995): Audio in a 5.1 channel environment, at the AES 99th Convention, New York, USA, Paper 4071.

Cohn H., and Kumar A. (2007): Universally optimal distribution of points on spheres, J. Amer. Math. Soc., 20(1), 99–148.

Conetta R., Rumsey F., Zielinski S., et al. (2008): QESTRAL (part 2): Calibrating the QESTRAL model using listening test data, at the AES 125th Convention, San Francisco, USA, Paper 7596.

Cook R.K., Waterhouse R.V., Berendt R.D., et al. (1955): Measurement of correlation coefficients in reverberant sound fields, J. Acoust. Soc. Am., 27(6), 1072–1077.

Cooper D.H. (1982): Calculator program for head-related transfer function, J. Audio Eng. Soc., 30(1/2), 34–38.

Cooper D.H. (1987): Problems with shadowless stereo theory: asymptotic spectral status, J. Audio Eng. Soc., 35(9), 629–642.

Cooper D.H., and Bauck J.L. (1989): Prospects for transaural recording, J. Audio Eng. Soc., 37(1/2), 3–19.

Cooper D.H., and Shiga T. (1972): Discrete matrix multichannel stereo, J. Audio Eng. Soc., 20(5), 346–360.

Cooper D.H. (1974): QFMX-quadruplex FM transmission using the 4-4-4 QMX matrix system, J. Audio Eng. Soc., 22(2), 82–87.

Cooper D.H., Shiga T., and Takagi T. (1973): QMX carrier channel disc, J. Audio Eng. Soc., 21(8), 614–624.

Corteel E. (2006): Equalization in an extended area using multichannel inversion and wave field synthesis, J. Audio Eng. Soc., 54(12), 1140–1161.

Corteel E., and Nicol R. (2003): Listening room compensation for wave field synthesis. What can be done? at the AES 23rd International Conference, Helsingør, Denmark.

Craven P.G. (2003): Continuous surround panning for 5-speaker reproduction, at the AES 24th International Conference, Banff, Canada.

Crispien K., and Ehrenberg T. (1995): Evaluation of the “cocktail-party effect” for multiple speech stimuli within a spatial auditory display, J. Audio Eng. Soc., 43(11), 932–941.

Damaske P. (1967/1968): Subjective investigation of sound fields, Acta Acust. United Ac., 19(4), 199–213.

Damaske P. (1969/1970): Directional dependence of spectrum and correlation functions of the signals received at the ears, Acta Acust. United Ac., 22(4), 191–204.

Damaske P., and Ando Y. (1972): Interaural crosscorrelation for multichannel loudspeaker reproduction, Acta Acust. United Ac., 27(4), 232–238.

Daniel J. (2000): Acoustic field representation, application to the transmission and the reproduction of complex sound environments in a multimedia context (in French), PhD thesis, University of Paris 6, France.

Daniel J. (2003): Spatial sound encoding including near field effect: introducing distance coding filters and a viable, new Ambisonic format, at the AES 23rd International Conference, Copenhagen, Denmark.

Daniel J., and Moreau S. (2004): Further study of sound field coding with higher order Ambisonics, at the AES 116th Convention, Berlin, Germany, Paper 6017.

Daniel J., Nicol R., and Moreau S. (2003): Further investigations of high-order Ambisonics and wavefield synthesis for holophonic sound imaging, at the AES 114th Convention, Amsterdam, The Netherlands, Paper 5788.

Daniel J., Rault J.B., and Polack J.D. (1998): Ambisonics encoding of other audio formats for multiple listening conditions, at the AES 105th Convention, San Francisco, USA, Paper 4795.

Dantele A., Reiter U., Schuldt M., et al. (2003): Implementation of MPEG-4 audio nodes in an interactive virtual 3D environment, at the AES 114th Convention, Amsterdam, The Netherlands, Paper 5820.

Dattorro J. (1997): Effect design: part 1: Reverberator and other filters, J. Audio Eng. Soc., 45(9), 660–684.

Davis D., and Davis C. (1980): The LEDE™ concept for the control of acoustic and psychoacoustic parameters in recording control rooms, J. Audio Eng. Soc., 28(9), 585–595.

Davis M.F. (1987): Loudspeaker systems with optimized wide-listening-area imaging, J. Audio Eng. Soc., 35(11), 888–896.

Davis M.F. (1993): The AC-3 multichannel coder, at the AES 95th Convention, San Francisco, USA, Paper 3774.

Davis M.F., and Fellers M.C. (1997): Virtual surround presentation of Dolby AC-3 and Pro Logic signal, at the AES 103rd Convention, New York, USA, Paper 4542.

Davis M.F., and Todd C.C. (1994): AC-3 operation, bitstream syntax, and features, at the AES 97th Convention, San Francisco, USA, Paper 3910.

DeFanti T.A., Dawe G., Sandin D.J., et al. (2009): The StarCAVE, a third-generation CAVE and virtual reality OptIPortal, Future Gener. Comput. Syst., 25(2), 169–178.

Djelani T., Porschmann C., Sahrhage J., et al. (2000): An interactive virtual-environment generator for psychoacoustic research. II: collection of head-related impulse responses and evaluation of auditory localization, Acta Acust. United Ac., 86(6), 1046–1053.

Doerr K.U., Rademacher H., Huesgen S., et al. (2007): Evaluation of a low-cost 3D sound system for immersive virtual reality training systems, IEEE Trans. Vis. Comput. Graph., 13(2), 204–212.

Dolby Laboratories (1997): Dolby professional encoding manual, http://www.dolby.com.

Dolby Laboratories (1998): Dolby surround mixing manual, http://www.dolby.com.

Dolby Laboratories (2000): 5.1 channel production guidelines, http://www.dolby.com.

Dolby Laboratories (2002): Standards and practices for authoring Dolby digital and Dolby E bitstreams, http://www.dolby.com.

Dolby Laboratories (2012): Dolby Atmos, next-generation audio for cinema, http://www.dolby.com.

Dolby Laboratories (2015): Dolby Atmos specifications, http://www.dolby.com.

Dolby Laboratories (2016): Dolby Atmos for the home theater, http://www.dolby.com.

Dooley W.L., and Streicher R.D. (1982): M-S stereo: a powerful technique for working in stereo, J. Audio Eng. Soc., 30(10), 707–718.

Dragnev P.D., Legg D.A., and Townsend D.W. (2002): Discrete logarithmic energy on the sphere, Pacific J. Math., 207(2), 345–358.

Dressler R. (1996): A step toward improved surround sound: making the 5.1 channel format reality, at the AES 100th Convention, Copenhagen, Denmark, Paper 4287.

Dressler R. (2000): Dolby surround Pro Logic II decoder principles of operation, http://www.dolby.com.

Dressler R. (2006): Audio coding for future entertainment formats, at the AES 21st UK Conference, Cambridge, UK.

Drullman R., and Bronkhorst A.W. (2000): Multichannel speech intelligibility and talker recognition using monaural, binaural, and three-dimensional auditory presentation, J. Acoust. Soc. Am., 107(4), 2224–2235.

DTS Inc. (2006): DTS-HD Audio, consumer white paper for blu-ray disc and HD DVD applications, http://www.dts.com.

Du G.H., Zhu Z.M., and Gong X.F. (2001): Fundamental acoustics (2nd edition, in Chinese), Nanjing University Press, Nanjing, China.

Duda R.O., and Martens W.L. (1998): Range dependence of the response of a spherical head model, J. Acoust. Soc. Am., 104(5), 3048–3058.

Duraiswami R., Zotkin D.N., and Gumerov N.A. (2004): Interpolation and range extrapolation of HRTFs, Proceedings of 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, Montreal, Canada, Vol. 4, 45–48.

Duraiswami R., Zotkin D.N., Li Z.Y., et al. (2005): High order spatial audio capture and its binaural head-tracked playback over headphones with HRTF cues, at the AES 119th Convention, New York, USA, Paper 6540.

Durbin H.M. (1972): Playback effects from matrix recordings, J. Audio Eng. Soc., 20(9), 729–733.

Durlach N.I., and Colburn H.S. (1978): Binaural phenomena, in Handbook of perception, Vol. IV, Academic Press, New York, USA.

Durlach N.I., Rigopulos A., Pang X.D., et al. (1992): On the externalization of auditory images, Presence, 1(2), 251–257.

DVD Forum (1997): DVD specifications for read-only disc Part 3: video specifications, Version 1.1, Tokyo, Japan.

Eargle J.M. (1971a): On the processing of two- and three-channel program material for four-channel playback, J. Audio Eng. Soc., 19(4), 262–266.

Eargle J.M. (1971b): Multichannel stereo matrix systems: an overview, J. Audio Eng. Soc., 19(7), 552–559.

Eargle J.M. (1972): 4-2-4 Matrix systems: standards, practice, and interchangeability, J. Audio Eng. Soc., 20(10), 809–815.

Eargle J.M. (2006): Handbook of recording engineering (4th edition), Springer Science+Business Media Inc., New York, USA.

EBU-Tech 3324 (2007): EBU evaluations of multichannel audio codecs, European Broadcasting Union, Geneva, Switzerland.

Economou E.N. (2006): Green’s function in quantum physics (3rd edition), Springer-Verlag, New York, USA.

Edwin P.C. (2002): In the light of 5.1 channel surround, “why A-B polycardiod centerfill” (AB-PC) is superior for symphony-orchestra recording, at the AES 112th Convention, Munich, Germany, Paper 5565.

Ehmer R.H. (1959a): Masking patterns of tones, J. Acoust. Soc. Am., 31(8), 1115–1120.

Ehmer R.H. (1959b): Masking by tones vs noise bands, J. Acoust. Soc. Am., 31(9), 1253–1256.

Ehret A., Groschel A., Purnhagen H., et al. (2007): Coding of “2+2+2” surround sound content using the MPEG surround standard, at the AES 122nd Convention, Vienna, Austria, Paper 6992.

Eilers C.G. (1961): Stereophonic FM broadcasting, IRE Trans. Broadcasting TV Rec. BTR, 7(2), 73–80.

Enzner G., Weinert M., Abeling S., et al. (2013): Advanced system options for binaural rendering of Ambisonic format, in Proceeding of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, Canada, 251–255.

Epain N., Jin C.T., and Zotter F. (2014): Ambisonic decoding with constant angular spread, Acta Acust. United Ac., 100(5), 928–936.

Erber T., and Hockney G.M. (2007): Complex systems: equilibrium configurations of N equal charges on a sphere (2 ≤ N ≤ 112), Adv. Chem. Phys., 98, 495–594.

Ericson M.A. (2011): Multichannel sound reproduction in the environment for auditory research, at the AES 131st Convention, New York, USA, Paper 8513.

ETSI ES 201 980 V3.2.1 (2012): Digital radio mondiale (DRM); system specification, European Telecommunications Standards Institute, Sophia-Antipolis Cedex, France.

ETSI TS 102 366 V1.4.1 (2017): Digital audio compression (AC-3, Enhanced AC-3) standard, European Telecommunications Standards Institute, Sophia-Antipolis Cedex, France.

ETSI TS 103 190-1 V1.3.1 (2018): Digital audio compression (AC-4) standard, part 1: channel based coding, European Telecommunications Standards Institute, Sophia-Antipolis Cedex, France.

ETSI TS 103 190-2 V1.2.1 (2018): Digital audio compression (AC-4) standard, part 2: Immersive and personalized audio, European Telecommunications Standards Institute, Sophia-Antipolis Cedex, France.

ETSI TS 103 223 V1.1.1 (2015): MDA: object-based audio immersive sound metadata and bitstream, European Telecommunications Standards Institute, Sophia-Antipolis Cedex, France.

Evans M.J., Angus J.A.S., and Tew A.I. (1998): Analyzing head-related transfer function measurements using surface spherical harmonics, J. Acoust. Soc. Am., 104(4), 2400–2411.

Evans M.J., Tew A.I., and Angus J.A.S. (1997): Spatial audio teleconferencing – which way is better?, in Proceedings of the Fourth International Conference on Auditory Displays (ICAD 97), Palo Alto, California, USA, 29–37.

Evjen P., Bradley J.S., and Norcross S.G. (2001): The effect of late reflections from above and behind on listener envelopment, Appl. Acoust., 62(2), 137–153.

Faller C. (2004): Coding of spatial audio compatible with different playback formats, at the AES 117th Convention, San Francisco, USA, Paper 6187.

Faller C. (2006): Multiple-loudspeakers playback of stereo signals, J. Audio Eng. Soc., 54(11), 1051–1064.

Faller C. (2007): Matrix surround revised, at the AES 30th International Conference, Saariselka, Finland.

Faller C. (2010): Conversion of two closely spaced omnidirectional microphone signals to an XY stereo signal, at the AES 129th Convention, San Francisco, USA, Paper 8188.

Faller C., Altmann L., Levison J., et al. (2013): Multi-channel ring upmix, at the AES 134th Convention, Rome, Italy, Paper 8908.

Faller C., and Baumgarte F. (2003): Binaural cue coding part II, scheme and applications, IEEE Trans. Speech Audio Process., 11(6), 520–531.

Faller C., and Schillebeeckx P. (2011): Improved ITU and matrix surround downmixing, at the AES 130th Convention, London, UK, Paper 8339.

Farina A., Armelloni E., and Martignon P. (2004): An experimental comparative study of 20 Italian opera houses: measurement techniques, J. Acoust. Soc. Am., 115(5), 2475.

Farina A., and Ayalon R. (2003): Recording concert acoustics for posterity, at the AES 24th International Conference, Banff, Canada.

Farina A., and Ugolotti E. (1998): Numerical model of the sound field inside cars for the creation of virtual audible reconstructions, First COST-G6 Workshop on Digital Audio Effects (DAFX98), Barcelona, Spain.

Favrot S., and Buchholz J.M. (2012): Reproduction of nearby sound sources using higher-order Ambisonics with practical loudspeaker arrays, Acta Acust. United Ac., 98(1), 48–60.

Favrot A., and Faller C. (2020): Wiener-based spatial B-format equalization, J. Audio Eng. Soc., 68(7/8), 488–494.

Favrot S., Marschall M., Kasbach J., et al. (2011): Mixed-order ambisonics recording and playback for improving horizontal directionality, at the AES 131st Convention, New York, USA, Paper 8528.

Fazi F.M., and Nelson P.A. (2010): The relation between sound field reproduction and near-field acoustical holography, at the AES 129th Convention, San Francisco, USA, Paper 8247.

Fazi F.M., and Nelson P.A. (2013): Sound field reproduction as an equivalent acoustical scattering problem, J. Acoust. Soc. Am., 134(5), 3721–3729.

Feige F., and Kirby D.G. (1994): Report on the MPEG/Audio multichannel formal subjective listening tests, MPEG document ISO/IEC JTC1/SC29/WG11/N0685, International Organization for Standardization, Geneva, Switzerland.

Fejzo Z., Kramer L., McDowell K., et al. (2005): DTS-HD: technical overview of lossless mode of operation, at the AES 118th Convention, Barcelona, Spain, Paper 6445.

Fernando L.L. (2014): An architecture for reverberation in high order Ambisonics, at the AES 137th Convention, Los Angeles, USA, Paper 9109.

Fielder L.D., Andersen R.L., Crockett B.G., et al. (2004): Introduction to Dolby digital plus, an enhancement to the Dolby digital coding system, at the AES 117th Convention, San Francisco, USA, Paper 6196.

Fielder L.D., and Robinson D.P. (1995): AC-2 and AC-3: the technology and its application, at the AES 5th Australian Regional Convention, Sydney, Australia, Paper 4022.

Firtha G., and Fiala P. (2015a): Sound field synthesis of uniformly moving virtual monopoles, J. Audio Eng. Soc., 63(1/2), 46–53.

Firtha G., and Fiala P. (2015b): Wave field synthesis of moving sources with retarded stationary phase approximation, J. Audio Eng. Soc., 63(12), 958–965.

Fletcher H. (1940): Auditory patterns, Rev. Mod. Phys., 12(1), 47–65.

Florentine M., Buus S., and Mason C.R. (1987): Level discrimination as a function of level for tones from 0.25 to 16 kHz, J. Acoust. Soc. Am., 81(5), 1528–1541.

Franck A., Graefe A., Korn T., et al. (2007): Reproduction of moving sound sources by wave field synthesis: an analysis of artifacts, at the AES 32nd International Conference, Hillerød, Denmark.

Francombe J., Brookes T., and Mason R. (2017a): Evaluation of spatial audio reproduction methods (part 1): elicitation of perceptual differences, J. Audio Eng. Soc., 65(3), 198–211.

Francombe J., Brookes T., Mason R., et al. (2017b): Evaluation of spatial audio reproduction methods (part 2): analysis of listener preference, J. Audio Eng. Soc., 65(3), 212–225.

Freeland F.P., Biscainho L.W.P., and Diniz P.S.R. (2004): Interpositional transfer function for 3D-sound generation, J. Audio Eng. Soc., 52(9), 915–930.

Fuchigami N., Kuroiwa T., and Suzuki B.H. (2000): DVD-Audio specifications, J. Audio Eng. Soc., 48(12), 1228–1240.

Fug S., Holzer A., Borb C., et al. (2014): Design, coding and processing of metadata for object-based interactive audio, at the AES 137th Convention, Los Angeles, Paper 9097.

Fukada A. (2001): A challenge in multichannel sound recording, at the AES 19th International Conference, Bavaria, Germany.

Fukada A., Tsujimoto K., and Akita S. (1997): Microphone techniques for ambient sound on a music recording, at the AES 103rd Convention, New York, USA, Paper 4540.

Furui S. (2000): Digital speech processing, synthesis, and recognition (2nd edition), Marcel Dekker, New York, USA.

Furuya H., Fujimoto K., Choi Y.J., and Higa N. (2001): Arrival direction of late sound and listener envelopment, Appl. Acoust., 62(2), 125–136.

Furuya H., Fujimoto K., and Wakuda A. (2008): Psychological experiments on listener envelopment when both the early-to-late sound level and directional late energy ratios are varied, and consideration of calculated LEV in actual halls, Appl. Acoust., 69(11), 1085–1095.

Furuya H., Fujimoto K., Wakuda A., et al. (2005): The influence of total and directional energy of late sound on listener envelopment, Acoust. Sci. Tech., 26(2), 208–211.

Fuster L., Lopez J.J., and Gonzalez A. (2005): Room compensation using multichannel inverse filters for wave field synthesis system, at the AES 118th Convention, Barcelona, Spain, Paper 6401.

Gardner W.G. (1995): Efficient convolution without input-output delay, J. Audio Eng. Soc., 43(3), 127–136.

Gardner W.G. (1997): 3-D audio using loudspeakers, Doctoral thesis, Massachusetts Institute of Technology, Massachusetts, USA.

Gardner W.G. (2002): Reverberation algorithms, in Applications of digital signal processing to audio and acoustics (edited by Brandenburg K.), The International Series in Engineering and Computer Science, vol. 437, Springer, Boston, MA.

Gardner W.G., and Martin K.D. (1995): HRTF measurements of a KEMAR, J. Acoust. Soc. Am., 97(6), 3907–3908.

Gaston L., and Sanders R. (2008): Evaluation of HE-AAC, AC-3 and E-AC-3 codecs, J. Audio Eng. Soc., 56(3), 140–155.

Gauthier P.A., and Berry A. (2006): Adaptive wave field synthesis with independent radiation mode control for active sound field reproduction: theory, J. Acoust. Soc. Am., 119(5), 2721–2737.

Gauthier P.A., and Berry A. (2007): Adaptive wave field synthesis for sound field reproduction, theory, experiment and future perspectives, J. Audio Eng. Soc., 55(12), 1107–1124.

Gauthier P.A., and Berry A. (2008): Adaptive wave field synthesis with independent radiation mode control for active sound field reproduction: experimental results, J. Acoust. Soc. Am., 123(4), 1991–2002.

Gauthier P.A., Berry A., and Wieslaw W. (2005): Sound field reproduction in-room using optimal control techniques: simulations in the frequency domain, J. Acoust. Soc. Am., 117(2), 662–678.

Gauthier P.A., Camier C., Padois T., et al. (2015): Sound field reproduction of real flight recordings in aircraft cabin mock-up, J. Audio Eng. Soc., 63(1/2), 6–20.

Gauthier P.A., Chambatte É., Camier C., et al. (2014a): Beamforming regularization, scaling matrices, and inverse problems for sound field extrapolation and characterization: part I– theory, J. Audio Eng. Soc., 62(3), 77–98.

Gauthier P.A., Chambatte É.C., Camier C., et al. (2014b): Beamforming regularization, scaling matrices, and inverse problems for sound field extrapolation and characterization: part II– experiments, J. Audio Eng. Soc., 62(4), 207–219.

GB/T22726-2008 (2008): Specification for multichannel digital audio coding technique (in Chinese), National Standard of the P.R. China, National Institute of Standards of the People’s Republic of China, Beijing, China.

GB/T26686-2017 (2017): General specification for digital terrestrial television receiver (in Chinese), National Standard of the P.R. China, National Institute of Standards of the People’s Republic of China, Beijing, China.

Geier M., Wierstorf H., and Ahrens J. (2010): Perceptual assessment of focused sources in wave field synthesis, at the AES 128th Convention, London, UK, Paper 8069.

Geisler C.D. (1998): From sound to synapse: physiology of the mammalian ear, Oxford University Press, New York, USA.

Gelfand S.A. (2010): Hearing: An Introduction to psychological and physiological acoustics (5th edition), Informa Healthcare, London, UK.

Geluso P. (2012): Capturing Height: the addition of Z microphones to stereo and surround microphone arrays, at the AES 132nd Convention, Budapest, Hungary, Paper 8595.

Genuit K., and Xiang N. (1995): Measurements of artificial head transfer functions for auralization and virtual auditory environment, in Proceedings of 15th International Congress on Acoustics (invited paper), Trondheim, Norway, II 469–472.

George S., Zielinski S., and Rumsey F. (2006): Feature extraction for prediction of multichannel spatial audio fidelity, IEEE Trans. Audio, Speech, Language Process., 14(6), 1994–2005.

George S., Zielinski S., and Rumsey F. (2010): Development and validation of an unintrusive model for predicting the sensation of envelopment arising from surround sound recordings, J. Audio Eng. Soc., 58(12), 1013–1031.

Germanenn A. (1998): The arrangements of microphones using three front channels, a systematic approach (in German), in the Proceeding of Tonmeistertagung, 518–542.

Gersho A., and Gray R.M. (1992): Vector quantization and signal compression, Springer, Boston, MA.

Gerzon M.A. (1973): Periphony: with height sound reproduction, J. Audio Eng. Soc., 21(1), 2–10.

Gerzon M.A. (1975a): Recording concert hall acoustics for posterity, J. Audio Eng. Soc., 23(7), 569–571.

Gerzon M.A. (1975b): A geometric model for two-channel four-speaker matrix stereo system, J. Audio Eng. Soc., 23(2), 98–106.

Gerzon M.A. (1985): Ambisonics in multichannel broadcasting and video, J. Audio Eng. Soc., 33(11), 859–871.

Gerzon M.A. (1986): Stereo shuffling: new approach-old technique, Studio Sound, 28(7), 122–130.

Gerzon M.A. (1990): Three channels, the future of stereo? Studio Sound, 32(6), 112–125.

Gerzon M.A. (1992a): General metatheory of auditory localisation, at the AES 92nd Convention, Vienna, Austria, Paper 3306.

Gerzon M.A. (1992b): Optimum reproduction matrices for multispeaker stereo, J. Audio Eng. Soc., 40(7/8), 571–589.

Gerzon M.A. (1992c): Panpot laws for multispeaker stereo, at the AES 92nd Convention, Vienna, Austria, Paper 3309.

Gerzon M.A. (1992d): Hierarchical transmission system for multispeaker stereo, J. Audio Eng. Soc., 40(9), 692–705.

Gerzon M.A. (1992e): The design of distance panpots, at the AES 92nd Convention, Vienna, Austria, Paper 3308.

Gerzon M.A. (1992f): Compatibility of and conversion between multispeaker systems, at the AES 93rd Convention, San Francisco, USA, Paper 3405.

Gerzon M.A. (1994): Applications of Blumlein shuffling to stereo microphone techniques, J. Audio Eng. Soc., 42(6), 435–453.

Gerzon M.A., and Barton G.J. (1992): Ambisonic decoder for HDTV, at the AES 92nd Convention, Vienna, Austria, Paper 3345.

Gerzon M.A., Craven P.G., Stuart J.R., et al. (2004): The MLP lossless compression system for PCM audio, J. Audio Eng. Soc., 52(3), 243–260.

Gibson J.J., Christensen R.M., and Limberg A.L.R. (1972): Compatible FM broadcasting of Panoramic sound, J. Audio Eng. Soc., 20(10), 816–822.

Gierlich H.W. (1992): The application of binaural technology, Appl. Acoust., 36(3/4), 219–243.

Gnann V., and Spiertz M. (2008): Comb-filter free audio mixing using STFT magnitude spectra and phase estimation, in the Proceeding of 11th International Conference of Digital Audio Effect (DAFx08), Espoo, Finland.

Goldmark P.C., Bauer B.B., and Bachman W.S. (1958): The Columbia compatible stereophonic record, IRE Trans. Audio, 6(2), 25–28.

Goldstein H. (1980): Classical mechanics (2nd edition), Addison-Wesley Publishing Company Inc., Massachusetts, USA.

Gong M., Xiao Z., Qu T.S., et al. (2007): Measurement and analysis of near-field head-related transfer function, Appl. Acoust. (in Chinese), 26(6), 326–334.

Gonot A., Chateau N., and Emerit M. (2006): Usability of 3D-sound for navigation in a constrained virtual environment, at the AES 120th Convention, Paris, France, Paper 6800.

Goodwin M.M. (2008a): Primary-ambient decomposition and dereverberation of two-channel and multi-channel audio, in Proceeding of IEEE 42nd Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, 797–800.

Goodwin M.M. (2008b): Geometric signal decomposition for spatial audio enhancement, in Proceeding of IEEE 2008 International Conference on Acoustics, Speech and Signal Processing, Las Vegas, NV, 409–412.

Goodwin M.M., and Jot J.M. (2007): Primary-ambient decomposition and vector-based localization for spatial audio coding and enhancement, in Proceeding of IEEE 2007 International Conference on Acoustics, Speech and Signal Processing, Honolulu, HI, Vol. I, 9–12.

Gorzel M., Kearney G., and Boland F. (2014): Investigation of Ambisonic rendering of elevated sound source, at the AES 55th International Conference, Helsinki, Finland.

Grandjean P., Berry A., and Gauthier P.A. (2021a): Sound field reproduction by combination of circular and spherical higher-order Ambisonics: part I–a new 2.5-D driving function for circular arrays, J. Audio Eng. Soc., 69(3), 152–165.

Grandjean P., Berry A., and Gauthier P.A. (2021b): Sound field reproduction by combination of circular and spherical higher-order ambisonics: part II—hybrid system, J. Audio Eng. Soc., 69(3), 166–181.

Grantham D.W., and Wightman F.L. (1978): Detectability of varying interaural temporal differences, J. Acoust. Soc. Am., 63(2), 511–523.

Grassi E., Tulsi J., and Shamma S. (2003): Measurement of head-related transfer functions based on the empirical transfer function estimate, in Proceedings of the 2003 International Conference on Auditory Display, Boston, MA, 119–122.

Gribben C., and Lee H. (2014): The perceptual effects of horizontal and vertical interchannel decorrelation using the Lauridsen decorrelator, at the AES 136th Convention, Berlin, Germany, Paper 9027.

Gribben C., and Lee H. (2017): The perceptual effect of vertical interchannel decorrelation on vertical image spread at different azimuth positions, at the AES 142nd Convention, Berlin, Germany, Paper 9747.

Gribben C., and Lee H. (2018): The frequency and loudspeaker-azimuth dependencies of vertical interchannel decorrelation on the vertical spread of an auditory image, J. Audio Eng. Soc., 66(7/8), 537–555.

Griesinger D. (1986): Spaciousness and localization in listening rooms and their effects on the recording technique, J. Audio Eng. Soc., 34(4), 255–268.

Griesinger D. (1992a): IALF-binaural measures of spatial impression and running reverberance, at the AES 92nd Convention, Vienna, Austria, 1992, Paper 3292.

Griesinger D. (1992b): Measures of spatial impression and reverberance based on the physiology of human hearing, at the AES 11th International Conference, Portland, USA.

Griesinger D. (1996): Multichannel matrix surround decoder for two-eared listeners, at the AES 101st Convention, Los Angeles, USA, Paper 4402.

Griesinger D. (1997a): Progress in 5-2-5 matrix systems, at the AES 103rd Convention, New York, USA, Paper 4625.

Griesinger D. (1997b): Spatial impression and envelopment in small rooms, at the AES 103rd Convention, New York, USA, Paper 4638.

Griesinger D. (1998): Multichannel sound systems and their interaction with the room, at the AES 15th International Conference, Copenhagen, Denmark.

Grignon L.D. (1949): Experiments in stereophonic sound, J. SMPTE, 52(3), 280–292.

Groschel A., Schug M., Beer M., et al. (2003): Enhancing audio coding efficiency of MPEG Layer-2 with spectral band replication (SBR) for digital radio (EUREKA 147/DAB) in a backwards compatible way, at the AES 114th Convention, Amsterdam, The Netherlands, Paper 5850.

Guan S.Q. (1988): Fundamental electroacoustic technology (revised edition, in Chinese), Posts and Telecommunications Press, Beijing, China.

Guan S.Q. (1995): Some thoughts on Stereophonic (in Chinese), Appl. Acoust., 14(6), 6–11.

Guastavino C., and Katz B.F.G. (2004): Perceptual evaluation of multi-dimensional spatial audio reproduction, J. Acoust. Soc. Am., 116(2), 1105–1115.

Gumerov N.A., O’Donovan A.E., Duraiswami R., et al. (2010): Computation of the head-related transfer function via the fast multipole accelerated boundary element method and its spherical harmonic representation, J. Acoust. Soc. Am., 127(1), 370–386.

Gundry K. (2001): A new active matrix decoder for surround sound, at the AES 19th International Conference, Schloss, Elmau, Germany.

Hahn N., Winter F., and Spors S. (2016): Local wave field synthesis by spatial band-limitation in the circular/spherical harmonics domain, at the AES 140th Convention, Paris, France, Paper 9596.

Hamasaki K. (2011): The 22.2 multichannel sounds and its reproduction at home and personal environment, at the AES 43rd International Conference, Pohang, Korea.

Hamasaki K., and Hiyama K. (2003): Reproduction spatial impression with multichannel audio, at the AES 24th International Conference, Banff, Canada.

Hamasaki K., Hiyama K., Nishiguchi T., et al. (2004): Advanced multichannel audio systems with superior impression of presence and reality, at the AES 116th Convention, Berlin, Germany, Paper 6053.

Hamasaki K., Nishiguchi T., Okumura R., et al. (2007): Wide listening area with exceptional spatial sound quality of a 22.2 multichannel sound system, at the AES 122nd Convention, Vienna, Austria, Paper 7037.

Hamdan E.C., and Fazi F.M. (2021): A modal analysis of multichannel crosstalk cancellation systems and their relationship to amplitude panning, J. Sound Vib., 490, 115743.

Hammershøi D., and Møller H. (1996): Sound transmission to and within the human ear canal, J. Acoust. Soc. Am., 100(1), 408–427.

Han H.L. (1994): Measuring a dummy head in search of pinna cues, J. Audio Eng. Soc., 42(1/2), 15–37.

Haneda Y., Makino S., Kaneda Y., et al. (1999): Common acoustical pole and zero modeling of room transfer functions, IEEE Trans. Speech Audio Process., 7(2), 188–196.

Härmä A. (2010): Classification of time-frequency regions in stereo audio, at the AES 128th Convention, London, UK, Paper 7980.

Härmä A., Karjalainen M., Savioja L., et al. (2000): Frequency-warped signal processing for audio applications, J. Audio Eng. Soc., 48(11), 1011–1031.

Hartmann W.M., and Wittenberg A. (1996): On the externalization of sound images, J. Acoust. Soc. Am., 99(6), 3678–3688.

Hartung K., Sterbing S.J., Keller C.H., et al. (1999): Applications of virtual auditory space in psychoacoustics and neurophysiology, J. Acoust. Soc. Am., 105(2), 1164.

Harvey F.K., and Uecke E.H. (1962): Compatibility problem in two-channel stereophonic recordings, J. Audio Eng. Soc., 10(1), 8–12.

Harwood H.D. (1968): Stereophonic image sharpness, Wireless World, 74(July), 207–211.

Hawksford M.O.J. (2002): Scalable multichannel coding with HRTF enhancement for DVD and virtual sound systems, J. Audio Eng. Soc., 50(11), 894–913.

He J.J., Gan W.S., and Tan E.L. (2015): Time shifting based primary-ambient extraction for spatial audio reproduction, IEEE Trans. Audio, Speech, Language Process., 23(10), 1576–1588.

He J.J., Tan E.L., and Gan W.S. (2014): Linear estimation based primary-ambient extraction for stereo audio signals, IEEE Trans. Audio, Speech, Language Process., 22(2), 505–517.

He P., Xie B.S., and Rao D. (2006): Subjective and objective analyses of timbre equalized algorithms for virtual sound reproduction by loudspeakers (in Chinese), Appl. Acoust., 25(1), 4–12.

He P., Xie B.S., and Zhong X.L. (2007): Virtual sound signal processing using HRTF without pinnae (in Chinese), Appl. Acoust., 26(2), 100–106.

He Y.J., Xie B.S., and Liang S.J. (1993): Extension of localization equation for stereophonic sound image (in Chinese), Audio Eng., 17(10), 2–4.

Hebrank J., and Wright D. (1974): Spectral cues used in the localization of sound sources on the median plane, J. Acoust. Soc. Am., 56(6), 1829–1834.

Heller A.J., Benjamin E., and Lee R. (2010): Design of ambisonic decoders for irregular arrays of loudspeakers by non-linear optimization, at the AES 129th Convention, San Francisco, CA, Paper 8243.

Henning G.B. (1974): Detectability of interaural delay in high-frequency complex waveforms, J. Acoust. Soc. Am., 55(1), 84–90.

Herre J., Brandenburg K., and Lederer D. (1994): Intensity stereo coding, at the AES 96th Convention, Amsterdam, The Netherlands, Paper 3799.

Herre J., and Dietz M. (2008): MPEG-4 high-efficiency AAC coding (Standards in a Nutshell), IEEE Signal Process. Mag., 25(3), 137–142.

Herre J., Falch C., Mahne D., et al. (2011): Interactive teleconferencing combining spatial audio object coding and DiRAC technology, J. Audio Eng. Soc., 59(12), 924–935.

Herre J., Faller C., Disch S., et al. (2004): Spatial audio coding: next generation efficient and compatible coding of multichannel audio, at the AES 117th Convention, San Francisco, CA, USA, Paper 6186.

Herre J., Hilpert J., Kuntz A., et al. (2014): MPEG-H audio—The new standard for universal spatial/3D audio coding, J. Audio Eng. Soc., 62(12), 821–830.

Herre J., Hilpert J., Kuntz A., et al. (2015): MPEG-H audio—The new standard for coding of immersive spatial audio, IEEE J. Selected Topics Signal Process., 9(5), 770–779.

Herre J., Kjorling K., Breebaart H., et al. (2008): MPEG surround-The ISO/MPEG standard for efficient and compatible multichannel audio coding, J. Audio Eng. Soc., 56(11), 932–955.

Herre J., Purnhagen H., Koppens J., et al. (2012): MPEG spatial audio object coding—The ISO/MPEG standard for efficient coding of interactive audio scenes, J. Audio Eng. Soc., 60(9), 655–673.

Herrmann U., Henkels V., and Braun D. (1998): Comparison of 5 surround microphone method (in German), in the Proceeding of Tonmeistertagung, 508–517.

Hertz B.F. (1981): 100 years with stereo: the beginning, J. Audio Eng. Soc., 29(5), 368–370.

Hibbing M. (1989): XY and MS microphone techniques in comparison, J. Audio Eng. Soc., 37(10), 823–831.

Hidaka T., and Beranek L.L. (2000): Objective and subjective evaluations of twenty-three opera houses in Europe, Japan, and the Americas, J. Acoust. Soc. Am., 107(1), 368–383.

Hidaka T., Beranek L.L., and Okano T. (1995): Interaural cross-correlation (IACC), lateral fraction (LF), and low- and high-frequency sound levels (G) as measures of acoustical quality in concert halls, J. Acoust. Soc. Am., 98(2), 988–1007.

Hiekkanen T., Makivirta A., and Karjalainen M. (2009): Virtualized listening tests for loudspeakers, J. Audio Eng. Soc., 57(4), 237–251.

Hill P.A., Nelson P.A., Kirkeby O., et al. (2000): Resolution of front–back confusion in virtual acoustic imaging systems, J. Acoust. Soc. Am., 108(6), 2901–2910.

Hilpert J., and Disch S. (2009): The MPEG surround coding standard (Standards in a Nutshell), IEEE Signal Process. Mag., 26(1), 148–152.

Hirvonen T., and Robinson C.Q. (2016): Extended bass management methods for cost-efficient immersive audio reproduction in digital cinema, at the AES 140th Convention, Paris, France, Paper 9595.

Hiyama K., Komiyama S., and Hamasaki K. (2002): The minimum number of loudspeakers and its arrangement for reproducing the spatial impression of diffuse sound field, at the AES 113th Convention, Los Angeles, USA, Paper 5674.

Hoang T.M.N., Ragot S., Kövesi B., et al. (2010): Parametric stereo extension of ITU-T G.722 based on a new downmixing scheme, in Proceedings of the 2010 IEEE International Workshop on Multimedia Signal Processing, Saint-Malo, France.

Hoesel R.J.M.V., and Tyler R.S. (2003): Speech perception, localization, and lateralization with bilateral cochlear implants, J. Acoust. Soc. Am., 113(3), 1617–1630.

Hoffmann P.F., and Møller H. (2006): Audibility of spectral differences in head-related transfer functions, at the AES 120th Convention, Paris, France, Paper 6652.

Hollerweger F. (2006): Periphonic sound spatialization in multi-user virtual environment, Master’s thesis at Graz University of Music and Dramatic art, Graz, Austria.

Hollier M.P., Rimell A.N., and Burraston D. (1997): Spatial audio technology for telepresence, BT Technology J., 15(4), 33–41.

Holman T. (1991): New factors in sound for cinema and television, J. Audio Eng. Soc., 39(7/8), 529–539.

Holman T. (1996): The number of audio channels, at the AES 100th Convention, Copenhagen, Denmark, Paper 4292.

Holman T. (2000): Comments on “subjective appraisal of loudspeaker directivity for multi-channel reproduction”, and Zacharov N., Author’s reply, J. Audio Eng. Soc., 48(4), 314–321.

Holman T. (2001): The number of loudspeaker channels, at the AES 19th International Conference, Schloss, Elmau, Germany.

Holman T. (2008): Surround sound, up and running (2nd edition), Focal Press, Burlington, MA.

Hosoe S., Nishino T., Itou K., et al. (2005): Measurement of head-related transfer functions in the proximal region, in Proceeding of Forum Acusticum 2005, Budapest, Hungary, 2539–2542.

Howie W., King R., and Martin D. (2016): A three-dimensional orchestral music recording technique, optimized for 22.2 multichannel sound, at the AES 141st Convention, Los Angeles, USA, Paper 9612.

Howie W., King R., and Martin D. (2017): Listener discrimination between common channel-based 3D audio reproduction formats, J. Audio Eng. Soc., 65(10), 796–805.

Hull J. (1999): Surround sound past, present and future, Dolby Laboratories, www.dolby.com.

Hulsebos E., Schuurmans T., Vries D.D., et al. (2003): Circular microphone array for discrete multichannel audio recording, at the AES 114th Convention, Amsterdam, The Netherlands, Paper 5716.

Hulsebos E., and Vries D.D. (2002): Parameterization and reproduction of concert hall acoustics measured with a circular microphone array, at the AES 112th Convention, Munich, Germany, Paper 5579.

Hulsebos E., Vries D.D., and Bourdillat E. (2002): Improved microphone array configurations for auralization of sound fields by wave-field synthesis, J. Audio Eng. Soc., 50(10), 779–790.

Huopaniemi J., Zacharov N., and Karjalainen M. (1999): Objective and subjective evaluation of head-related transfer function filter design, J. Audio Eng. Soc., 47(4), 218–239.

IEC 60268 (1998): Sound system equipment-part 13: listening tests on loudspeakers, International Electrotechnical Commission, Geneva, Switzerland.

IEC 60906 (1999): Audio recording – compact disc digital audio system, International Electrotechnical Commission, Geneva, Switzerland.

IEC 60959 (1990): Provisional head and torso simulator for acoustic measurement on air conduction hearing aids, International Electrotechnical Commission, Geneva, Switzerland.

IEC 62574 (2011): Audio, video and multimedia systems – general channel assignment of multichannel audio, International Electrotechnical Commission, Geneva, Switzerland.

IEEE Computer Society (2020): IEEE standard for second generation audio coding, The Institute of Electrical and Electronics Engineers, New York, USA.

Inoue T., Takahashi N., and Owaki I. (1971): A discrete four-channel disc and its reproducing system (CD-4 system), J. Audio Eng. Soc., 19(7), 576–583.

IRCAM Lab (2003): Listen HRTF database, http://recherche.ircam.fr/equipes/salles/listen/

Irwan R., and Aarts R.M. (2002): Two-to-five channel processing, J. Audio Eng. Soc., 50(11), 914–926.

Ise S. (1999): A principle of sound field control based on the Kirchhoff-Helmholtz integral equation and the theory of inverse systems, Acta Acust. United Ac., 85(1), 78–87.

ISO 1999 (1975): Acoustics-assessment of occupational noise exposure for hearing conservation purposes, International Organization for Standardization, Geneva, Switzerland.

ISO 22234 (2005): Cinematography – relative and absolute sound pressure levels for motion-picture multi-channel sound systems—measurement methods and levels applicable to analog photographic film audio, digital photographic film audio and D-cinema audio, International Organization for Standardization, Geneva, Switzerland.

ISO 226 (2003): Acoustics – normal equal-loudness-level contours, International Organization for Standardization, Geneva, Switzerland.

ISO 2969 (2015): Cinematography – B-chain electroacoustic response of motion-picture control rooms and indoor theatres-specifications and measurements, International Organization for Standardization, Geneva, Switzerland.

ISO 3382-1 (2009): Acoustics – measurement of room acoustic parameters, part 1: performance spaces, International Organization for Standardization, Geneva, Switzerland.

ISO 389-1 (1998): Acoustics – reference zero for the calibration of audiometric equipment, part 1: reference equivalent threshold sound pressure levels for pure tones and supra-aural earphones, International Organization for Standardization, Geneva, Switzerland.

ISO 532-1 (2017): Acoustics – methods for calculating loudness – part 1: Zwicker method, International Organization for Standardization, Geneva, Switzerland.

ISO 532-2 (2017): Acoustics – methods for calculating loudness – part 2: Moore-Glasberg method, International Organization for Standardization, Geneva, Switzerland.

ISO 9568 (1993): Cinematography-background acoustic noise levels in theatres, review rooms and dubbing rooms, International Organization for Standardization, Geneva, Switzerland.

ISO/IEC 11172-3 (1993): Information technology – coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s, part 3: audio, International Organization for Standardization, Geneva, Switzerland.

ISO/IEC 13818-3 (1998): Information technology – generic coding of moving pictures and associated audio, part 3: audio, International Organization for Standardization, Geneva, Switzerland.

ISO/IEC 13818-7 (1997): Information technology – generic coding of moving pictures and associated audio, advanced audio coding-part 7: advanced audio coding (AAC), International Organization for Standardization, Geneva, Switzerland.

ISO/IEC 16488 (2002): Information technology – 120 mm DVD – read-only disk, International Organization for Standardization, Geneva, Switzerland.

ISO/IEC 23001-8 (2015): Information technology – MPEG systems technologies – part 8: coding-independent code points, International Organization for Standardization, Geneva, Switzerland.

ISO/IEC 23003-1 (2007): Information technology – MPEG audio technologies – part 1: MPEG surround, International Organization for Standardization, Geneva, Switzerland.

ISO/IEC 23003-3 (2012): Information technology – MPEG audio technologies – part 3: Unified speech and audio coding, International Organization for Standardization, Geneva, Switzerland.

ISO/IEC 23003-2 (2010): Information technology – MPEG audio technologies – part 2: Spatial audio object coding, International Organization for Standardization, Geneva, Switzerland.

ISO/IEC 23008-3 (2015): Information technology – high efficiency coding and media delivery in heterogeneous environments, part 3: 3D audio, International Organization for Standardization, Geneva, Switzerland.

ISO/IEC 30190 (2021): Information technology – digitally recorded media for information interchange and storage – 120 mm, single layer (25,0 Gbytes per disk) and dual layer (50,0 Gbytes per disk) BD recordable disk, International Organization for Standardization, Geneva, Switzerland.

Itho R. (1972): Proposed universal encoding standards for compatible four-channel matrixing, J. Audio Eng. Soc., 20(3), 167–173.

ITU-R BS 1116-1 (1997): Methods for the subjective assessment of small impairments in audio systems including multichannel sound system, International Telecommunication Union, Geneva, Switzerland.

ITU-R BS 1116-3 (2015): Methods for the subjective assessment of small impairments in audio systems, International Telecommunication Union, Geneva, Switzerland.

ITU-R BS 1284-1 (2003): General methods for the subjective assessment of sound quality, International Telecommunication Union, Geneva, Switzerland.

ITU-R BS 1387-1 (1999): Method for objective measurement of perceived audio quality, International Telecommunication Union, Geneva, Switzerland.

ITU-R BS 1534-3 (2015): Method for the subjective assessment of intermediate quality level of audio systems, International Telecommunication Union, Geneva, Switzerland.

ITU-R BS 1909 (2012): Performance requirements for an advanced multichannel stereophonic sound system for use with or without accompanying picture, International Telecommunication Union, Geneva, Switzerland.

ITU-R BS 2051-2 (2018): Advanced sound system for programme production, International Telecommunication Union, Geneva, Switzerland.

ITU-R BS 2126-0 (2019): Methods for the subjective assessment of sound system with accompanying picture, International Telecommunication Union, Geneva, Switzerland.

ITU-R BS 2132-0 (2019): Method for subjective quality assessment of audio differences of sound systems using multiple stimuli without a given reference, International Telecommunication Union, Geneva, Switzerland.

ITU-R Report BS 2159-7 (2015): Multichannel sound technology in home and broadcasting applications, International Telecommunication Union, Geneva, Switzerland.

ITU-R BS 2399-0 (2017): Methods for selecting and describing attributes and terms, in the preparation of subjective tests, Geneva, Switzerland.

ITU-R BS 708 (1990): Determination of the electro-acoustical properties of studio monitor headphones, International Telecommunication Union, Geneva, Switzerland.

ITU-R BS 775-1 (1994): Multichannel stereophonic sound system with and without accompanying picture, Doc 10/63, International Telecommunication Union, Geneva, Switzerland.

ITU-R BS 775-3 (2012): Multichannel stereophonic sound system with and without accompanying picture, International Telecommunication Union, Geneva, Switzerland.

ITU-R Doc.10/51-E (1995): Low bit rate multichannel audio coder test results, Geneva, Switzerland.

Iwahara M., and Mori T. (1978): Stereophonic sound reproduction system, United States Patent: 4,118,599.

Jackson J.D. (1999): Classical electrodynamics (3rd Edition), John Wiley & Sons, New York, USA.

Jackson P.J.B., Dewhirst M., Conetta R., et al. (2008): QESTRAL (part 3): system and metrics for spatial quality prediction, at the AES 125th Convention, San Francisco, USA, Paper 7597.

JBL Professional (1998): Cinema sound system design, https://www.jblpro.com/

Jecklin J. (1981): A different way to record classical music, J. Audio Eng. Soc., 29(5), 329–332.

Jeffress L.A. (1948): A place theory of sound localization, J. Comp. Physiol. Psych., 41(1), 35–39.

Jiang J.L., Xie B.S., and Mai H.M. (2018): The influence of the number of loudspeakers on the pressure error in ambisonics reproduction (in Chinese), J. South China Univ. Technol., 46(3), 119–126.

Jiang J.L., Xie B.S., Mai H.M., et al. (2019): The role of dynamic cue in auditory vertical localization, Appl. Acoust., 146, 398–408.

Jin C., Corderoy A., Carlile S., et al. (2004): Contrasting monaural and interaural spectral cues for human sound localization, J. Acoust. Soc. Am., 115(6), 3124–3141.

Jin C., Epain N., and Parthy A. (2014): Design, optimization and evaluation of a dual-radius spherical microphone array, IEEE Trans. Audio, Speech, Language Process., 22(1), 193–204.

Jin C., Leong P., Leung J., et al. (2000): Enabling individualized virtual auditory space using morphological measurements, in Proceedings of the First IEEE Pacific-Rim Conference on Multimedia, Sydney, Australia, 235–238.

Jin C., Tan T., Kan A., et al. (2005): Real-time, head-tracked 3D audio with unlimited simultaneous sounds, in Proceedings of Eleventh Meeting of the International Conference on Auditory Display (ICAD 05), Limerick, Ireland.

Joshi A.W. (1977): Elements of group theory for physicists (2nd edition), John Wiley & Sons, New York, USA.

Jot J.M., and Chaigne A. (1991): Digital delay networks for designing artificial reverberators, at the AES 90th Convention, Paris, France, Paper 3030.

Jot J.M., Larcher V., and Pernaux J.M. (1999): A comparative study of 3D audio encoding and rendering techniques, at the AES 16th International Conference, Rovaniemi, Finland.

Jot J.M., and Trivi J.M. (2006): Scene description model and rendering engine for interactive virtual acoustics, at the AES 120th Convention, Paris, France, Paper 6660.

Jot J.M., Wardle S., and Larcher V. (1998): Approaches to binaural synthesis, at the AES 105th Convention, San Francisco, California, USA, Paper 4861.

Juhasz G., and Piret E. (1980): Compatible correcting-matrix quadraphonic transmission system, J. Audio Eng. Soc., 28(9), 596–600.

Julstrom S. (1987): A high-performance surround process for home video, J. Audio Eng. Soc., 35(7/8), 536–549.

Julstrom S. (1991): An intuitive view of coincident stereo microphones, J. Audio Eng. Soc., 39(9), 632–649.

Kahana Y., and Nelson P.A. (2007): Boundary element simulations of the transfer function of human heads and baffled pinnae using accurate geometric models, J. Sound Vib., 300(3/5), 552–579.

Kan A., Jin C., Tan T., et al. (2005): 3DApe: a real-time 3D audio playback engine, AES 118th Convention, Barcelona, Spain, Preprint 6343.

Kan A., and Litovsky R.Y. (2015): Binaural hearing with electrical stimulation, Hearing Res., 322, 127–137.

Kan A., Pope G., Jin C., and Schaik A.V. (2004): Mobile spatial audio communication system, in Proceedings of Tenth Meeting of the International Conference on Auditory Display (ICAD 04), Sydney, Australia.

Kang S.H., and Kim S.H. (1996): Realistic audio teleconferencing using binaural and auralization techniques, ETRI J., 18(1), 41–51.

Karamustafaoglu A., Horbach U., Pellegrini R., et al. (1999): Design and applications of a data-based auralization system for surround sound, at the AES 106th Convention, Munich, Germany, Paper 4976.

Karjalainen M., and Järveläinen H. (2007): Reverberation modeling using velvet noise, at the AES 30th International Conference, Saariselkä, Finland.

Kassier R., Lee H.K., Brookes T., et al. (2005): An informal comparison between surround sound microphone techniques, at the AES 118th Convention, Barcelona, Spain, Paper 6429.

Kates J.M. (1980): Optimum loudspeaker directional patterns, J. Audio Eng. Soc., 28(11), 787–794.

Katz B.F.G. (2001): Boundary element method calculation of individual head-related transfer function. I. Rigid model calculation, J. Acoust. Soc. Am., 110(5), 2440–2448.

Kawano S., Taira M., Matsudaira M., et al. (1998): Development of the virtual sound algorithm, IEEE Trans. Consumer Electron., 44(3), 1189–1194.

Kearney G., and Doyle T. (2015): Height perception in Ambisonic based binaural decoding, at the AES 139th Convention, New York, USA, Paper 9423.

Keller A.C. (1981): Early Hi-Fi and stereo recording at Bell Laboratories (1931–1932), J. Audio Eng. Soc., 29(4), 274–280.

Kendall G.S. (1995): The decorrelation of audio signals and its impact on spatial imagery, Comput. Music J., 19(4), 71–87.

Kessler R. (2005): An optimized method for capturing multidimensional “acoustic fingerprints”, at the AES 118th Convention, Barcelona, Spain, Paper 6342.

Kim C., Mason R., and Brookes T. (2013): Head movements made by listeners in experimental and real-life listening activities, J. Audio Eng. Soc., 61(6), 425–438.

Kim S. (Sungyoung), Ikeda M., and Martens W.L. (2014): Reproducing virtually elevated sound via a conventional home-theater audio system, J. Audio Eng. Soc., 62(5), 337–344.

Kim S. (Sunmin), Lee Y.W., and Pulkki V. (2010): New 10.2-channel vertical surround system (10.2-VSS); comparison study of perceived audio quality in various multichannel sound systems with height loudspeakers, at the AES 129th Convention, San Francisco, USA, Paper 8296.

Kim Y.H., and Choi J.W. (2013): Sound visualization and manipulation, John Wiley & Sons, Singapore.

Kirby D.G. (1995): ISO/MPEG subjective tests on multichannel audio systems, at the AES 99th Convention, New York, USA, Paper 4066.

Kirby D.G., Cutmore N.A.F., and Fletcher J.A. (1998): Program origination of five-channel surround sound, J. Audio Eng. Soc., 46(4), 323–330.

Kirby D.G., Warren K., and Watanabe K. (1996): Report on the formal subjective listening tests of MPEG-2 NBC multichannel audio coding, ISO/IEC JTC1/SC29/WG11 Nov.N1419, International Organization for Standardization, Geneva, Switzerland.

Kirkeby O. (2002): A balanced stereo widening network for headphones, AES 22nd International Conference, Espoo, Finland.

Kirkeby O., and Nelson P.A. (1993): Reproduction of plane wave sound fields, J. Acoust. Soc. Am., 94(5), 2992–3000.

Kirkeby O., and Nelson P.A. (1999): Digital filter design for inversion problems in sound reproduction, J. Audio Eng. Soc., 47(7/8), 583–595.

Kirkeby O., Nelson P.A., and Hamada H. (1998a): The “stereo dipole” – a virtual source imaging system using two closely spaced loudspeakers, J. Audio Eng. Soc., 46(5), 387–395.

Kirkeby O., Nelson P.A., and Hamada H. (1998b): Local sound field reproduction using two closely spaced loudspeakers, J. Acoust. Soc. Am., 104(4), 1973–1981.

Kirkeby O., Nelson P.A., and Orduna-Bustamante F. (1996): Local sound field reproduction using digital signal processing, J. Acoust. Soc. Am., 100(3), 1584–1593.

Kistler D.J., and Wightman F.L. (1992): A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction, J. Acoust. Soc. Am., 91(3), 1637–1647.

Kjörling K., Rödén J., Wolters M., et al. (2016): AC-4 – the next generation audio codec, at the AES 140th Convention, Paris, France, Paper 9491.

Kleczkowski P., Król A., and Malecki P. (2015): Multichannel sound reproduction quality improves with angular separation of direct and reflected sounds, J. Audio Eng. Soc., 63(6), 427–442.

Kleijn W.B. (2018): Directional emphasis in Ambisonics, IEEE Signal Process. Lett., 25(7), 1079–1083.

Kleiner M., Dalenbäck B.I., and Svensson P. (1993): Auralization – an overview, J. Audio Eng. Soc., 41(11), 861–875.

Klepko J. (1997): 5-channel microphone array with binaural head for multichannel reproduction, at the AES 103rd Convention, New York, USA, Paper 4541.

Klipsch P.W. (1958): Stereophonic sound with two tracks, three channels by means of a phantom circuit (2PH3), J. Audio Eng. Soc., 6(2), 118–123.

Kohsaka O., Satoh E., and Nakayama T. (1972): Sound image localization in multichannel matrix reproduction, J. Audio Eng. Soc., 20(7), 542–548.

Kolundžija M., Faller C., and Vetterli M. (2011): Reproducing sound fields using MIMO acoustic channel inversion, J. Audio Eng. Soc., 59(10), 721–734.

Komiyama S. (1989): Subjective evaluation of angular displacement between picture and sound directions for HDTV sound systems, J. Audio Eng. Soc., 37(4), 210–214.

Kopčo N., and Shinn-Cunningham B.G. (2003): Spatial unmasking of nearby pure-tone targets in a simulated anechoic environment, J. Acoust. Soc. Am., 114(5), 2856–2870.

Koyama S., Furuya K., Wakayama K., et al. (2016): Analytical approach to transforming filter design for sound field recording and reproduction using circular arrays with a spherical baffle, J. Acoust. Soc. Am., 139(3), 1024–1036.

Kozamernik F. (1995): Digital audio broadcasting – radio now and for the future, EBU Tech. Rev., 1995(autumn), 2–27.

Kraft S., and Zölzer U. (2016): Low-complexity stereo signal decomposition and source separation for application in stereo to 3D upmixing, at the AES 140th Convention, Paris, France, Paper 9586.

Krebber W., Gierlich H.W., and Genuit K. (2000): Auditory virtual environments: basics and applications for interactive simulations, Signal Process., 80(11), 2307–2322.

Kügler C., and Thiele G. (1992): Loudspeaker reproduction: study on the subwoofer concept, at the AES 92nd Convention, Vienna, Austria, Paper 3335.

Kuhn C., Pellegrini R., Leckschat D., et al. (2003): An approach to miking and mixing of music ensembles using wave field synthesis, at the AES 115th Convention, New York, Paper 5929.

Kuhn G.F. (1977): Model for the interaural time differences in the azimuthal plane, J. Acoust. Soc. Am., 62(1), 157–167.

Kulkarni A. (1997): Sound localization in real and virtual acoustical environments, Doctor dissertation of Boston University, Boston, USA.

Kulkarni A., and Colburn H.S. (1998): Role of spectral detail in sound-source localization, Nature, 396, 747–749.

Kulkarni A., and Colburn H.S. (2000): Variability in the characterization of the headphone transfer-function, J. Acoust. Soc. Am., 107(2), 1071–1074.

Kulkarni A., and Colburn H.S. (2004): Infinite-impulse-response models of the head-related transfer function, J. Acoust. Soc. Am., 115(4), 1714–1728.

Kulkarni A., Isabelle S.K., and Colburn H.S. (1999): Sensitivity of human subjects to head-related transfer-function phase spectra, J. Acoust. Soc. Am., 105(5), 2821–2840.

Kuo S.M., and Morgan D.R. (1999): Active noise control: a tutorial review, Proceedings of the IEEE, 87(6), 943–973.

Kurozumi K., and Ohgushi K. (1983): The relationship between the cross-correlation coefficient of two-channel acoustic signals and sound image quality, J. Acoust. Soc. Am., 74(6), 1726–1733.

Kuttruff H. (2009): Room acoustics (5th edition), Spon Press, Abingdon, UK.

Kyriakakis C. (1998): Fundamental and technological limitations of immersive audio systems, Proc. IEEE, 86(5), 941–951.

Kyriakakis C., Holman T., Lim J.S., et al. (1998): Signal processing, acoustics, and psychoacoustics for high quality desktop audio, J. Vis. Commun. Image Represent, 9(1), 51–61.

Laback B., Egger K., and Majdak P. (2015): Perception and coding of interaural time differences with bilateral cochlear implants, Hearing Res., 322, 138–150.

Laitinen M.V., Kuech F., Disch S., et al. (2011): Reproducing applause-type signals with directional audio coding, J. Audio Eng. Soc., 59(1/2), 29–43.

Laitinen M.V., Vilkamo J., Jussila K., et al. (2014): Gain normalization in amplitude panning as a function of frequency and room reverberance, at the AES 55th International Conference, Helsinki, Finland.

Langendijk E.H.A., and Bronkhorst A.W. (2002): Contribution of spectral cues to human sound localization, J. Acoust. Soc. Am., 112(4), 1583–1596.

Larcher V., Jot J.M., Guyard J., et al. (2000): Study and comparison of efficient methods for 3D audio spatialization based on linear decomposition of HRTF data, at the AES 108th Convention, Paris, France, Paper 5097.

Leakey D.M. (1959): Some measurements on the effects of interchannel intensity and time differences in two channel sound systems, J. Acoust. Soc. Am., 31(7), 977–986.

Leakey D.M. (1960): Further thoughts on stereophonic sound systems, Wireless World, 66, 154–160.

Lecomte P., Gauthier P.A., Langrenne C., et al. (2015): On the use of a Lebedev grid for ambisonics, at the AES 139th Convention, New York, USA, Paper 9433.

Lecomte P., Gauthier P.A., Langrenne C., et al. (2018): Cancellation of room reflections over an extended area using ambisonics, J. Acoust. Soc. Am., 143(2), 811–828.

Lee H. (2010): A new time and intensity trade-off function for localisation of natural sound sources, at the AES 128th Convention, London, UK, Paper 8149.

Lee H. (2011): A new multichannel microphone technique for effective perspective control, at the AES 130th Convention, London, UK, Paper 8337.

Lee H. (2014): The relationship between interchannel time difference and level difference in vertical sound localization and masking, at the AES 131st Convention, New York, USA, Paper 8556.

Lee H. (2017): Sound source and loudspeaker base angle dependency of phantom image elevation effect, J. Audio Eng. Soc., 65(9), 733–748.

Lee H. (2021): Multichannel 3D microphone arrays: a review, J. Audio Eng. Soc., 69(1/2), 5–26.

Lee H., and Gribben C. (2014): Effect of vertical microphone layer spacing for a 3D microphone array, J. Audio Eng. Soc., 62(12), 870–884.

Lee H., and Rumsey F. (2013): Level and time panning of phantom images for musical sources, J. Audio Eng. Soc., 61(12), 978–988.

Lee J.M., Choi J.W., and Kim Y.H. (2013): Wave field synthesis of a virtual source located in proximity to a loudspeaker array, J. Acoust. Soc. Am., 134(3), 2106–2117.

Lee K.S., Abel J.S., Välimäki V., et al. (2009): The switched convolution reverberator, at the AES 127th Convention, New York, USA, Paper 7927.

Lee Y.W., Kim S., Jo H., et al. (2011): Virtual height speaker rendering for Samsung 10.2-channel vertical surround system, at the AES 131st Convention, New York, USA, Paper 8523.

Lehnert H., and Blauert J. (1992): Principles of binaural room simulation, Appl. Acoust., 36(3/4), 259–291.

Lehtonen H.M., Purnhagen H., Villemoes L., et al. (2017): Parametric joint channel coding of immersive audio, at the AES 142nd Convention, Berlin, Germany, Paper 9740.

Leitner S., Sontacchi A., and Höldrich R. (2000): Multichannel sound reproduction system for binaural signals – the Ambisonic approach, in Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-00), Verona, Italy.

Lentz T., Assenmacher I., Sokoll J., et al. (2005): Performance of spatial audio using dynamic cross-talk cancellation, AES 119th Convention, New York, USA, Paper 6541.

Lentz T., and Schmitz O. (2002): Realisation of an adaptive cross-talk cancellation system for a moving listener, at the AES 21st International Conference, St. Petersburg, Russia.

Leong P., and Carlile S. (1998): Methods for spherical data analysis and visualization, J. Neurosci Met., 80(2), 191–200.

Li Z., and Duraiswami R. (2006): Headphone-based reproduction of 3D auditory scenes captured by spherical/hemispherical microphone arrays, in Proceedings of IEEE 2006 International Conference on Acoustics, Speech and Signal Processing, Toulouse, France, Vol 5, 337–340.

Lin H.B., and Xie B.S. (2018): Dynamic binaural reproduction of multichannel surround sound based on mobile phone (in Chinese), Appl. Acoust., 37(2), 187–195.

Lipshitz S.P. (1986): Stereo microphone techniques, are the purists wrong?, J. Audio Eng. Soc., 34(9), 716–744.

Litovsky R.Y., Colburn H.S., Yost W.A., et al. (1999): The precedence effect, J. Acoust. Soc. Am., 106(4), 1633–1654.

Liu L.L., and Xie B.S. (2021): Analysis and experiment on the limitations of static and dynamic transaural reproduction with two frontal loudspeakers, Arch. Acoust., 46(2), 213–228.

Liu L.L., and Xie B.S. (2022): A high-frequency–band timbre equalization method for transaural reproduction with two frontal loudspeakers, J. Audio Eng. Soc., 70(1/2), 36–49.

Liu Y. (2014): Research on the stability and timbre of Ambisonics reproduction system (in Chinese), Dissertation of doctor degree, South China University of Technology, Guangzhou, China.

Liu Y. (2015): Research on spherical microphone array recording and binaural virtual rendering system (in Chinese), Dissertation of doctor degree, South China University of Technology, Guangzhou, China.

Liu Y., and Xie B.S. (2013a): Analysis on the stability of high-order Ambisonics system (in Chinese), Tech. Acoust., 32(6), pt. 2, 247–248.

Liu Y., and Xie B.S. (2013b): Analysis on the stability of spatial interpolation of head-related transfer function and reproduction of multi-channel sound (in Chinese), J. South China Univ. Technol., 41(8), 131–138.

Liu Y., and Xie B.S. (2015): Analysis with binaural auditory model and experiment on the timbre of Ambisonics recording and reproduction, Chin. J. Acoustics, 34(4), 337–356.

Liu Y., and Xie B.S. (2016): Analysis on the timbre of horizontal Ambisonics with different decoding methods, at the AES 141st Convention, Los Angeles, USA, Paper 9677.

Liu Y., Xie B.S., Yu G.Z., et al. (2016): Analysis on spatial discrimination threshold of head-related transfer function magnitude, Chin. J. Acoust., 35(1), 1–17.

Lokki T., and Gröhn M. (2005): Navigation with auditory cues in a virtual environment, IEEE Multimedia, 12(2), 80–86.

Loomis J.M., Golledge R.G., Klatzky R.L., et al. (1998): Navigation system for the blind: auditory display modes and guidance, Presence, 7(2), 193–203.

Loomis J.M., Hebert C., and Cicinelli J.G. (1990): Active localization of virtual sounds, J. Acoust. Soc. Am., 88(4), 1757–1764.

López J.J., and González A. (1999): 3-D audio with dynamic tracking for multimedia environments, at the 2nd COST-G6 Workshop on Digital Audio Effects (DAFx-1999), Trondheim, Norway.

Lopez-Poveda E.A., and Meddis R. (1996): A physical model of sound diffraction and reflections in the human concha, J. Acoust. Soc. Am., 100(5), 3248–3259.

Lorho G., Isherwood D., Zacharov N., et al. (2002): Round robin subjective evaluation of stereo enhancement system for headphones, at the AES 22nd International Conference, Espoo, Finland.

Maa D.Y., and Shen H. (2004): The handbook of acoustics (Revised edition, in Chinese), Science Press, Beijing, China.

Mac Cabe C.J., and Furlong D.J. (1994): Virtual imaging capabilities of surround sound systems, J. Audio Eng. Soc., 42(1/2), 38–49.

Mackenzie J., Huopaniemi J., Välimäki V., et al. (1997): Low-order modeling of head-related transfer functions using balanced model truncation, IEEE Signal Process. Lett., 4(2), 39–41.

Macpherson E.A. (1991): A computer model of binaural localization for stereo imaging measurement, J. Audio Eng. Soc., 39(9), 604–622.

Macpherson E.A. (2011): Head motion, spectral cues, and Wallach’s “principle of least displacement” in sound localization, in Principles and applications of spatial hearing (Edited by Suzuki Y., et al.), 103–120, World Scientific Publishing Co. Pte. Ltd., Singapore.

Macpherson E.A. (2013): Cue weighting and vestibular mediation of temporal dynamics in sound localization via head rotation, at the 21st International Congress on Acoustics, Montreal, Canada.

Maher R.C. (1997): Single-ended spatial enhancement using a cross-coupled lattice equalizer, at the 1997 IEEE Workshop on Application of Signal Processing to Audio and Acoustics, New Paltz, NY, USA.

Mai H.M., Xie B.S., and Jiang J.L. (2018): Analysis and experimental validation of the mixed-order Ambisonics Reproduction (in Chinese), J. South China Univ. Technol., 46(3), 108–118.

Majdak P., Balazs P., and Laback B. (2007): Multiple exponential sweep method for fast measurement of head-related transfer functions, at the AES 122nd Convention, Vienna, Austria, Paper 7019.

Makita Y. (1962): On the directional localization of sound in the stereophonic sound field, EBU Rev. Pt. A, 73(6), 102–108.

Malham D.G., and Myatt A. (1995): 3-D sound spatialization using Ambisonic technique, Comput. Music J., 19(4), 58–70.

Marques de Sá J.P. (2007): Applied statistics using SPSS, STATISTICA, MATLAB and R, Springer-Verlag, Berlin, Heidelberg, New York.

Márschall M., Favrot S., and Buchholz J. (2012): Robustness of a mixed-order Ambisonics microphone array for sound field reproduction, at the AES 132nd Convention, Budapest, Hungary, Paper 8645.

Marshall A.H. (1967): A note on the importance of room cross-section in concert halls, J. Sound Vib., 5(1), 100–112.

Marshall A.H., and Barron M. (2001): Spatial responsiveness in concert halls and the origins of spatial impression, Appl. Acoust., 62(2), 91–108.

Marston D. (2011): Assessment of stereo to surround upmixers for broadcasting, at the AES 130th Convention, London, UK, Paper 8448.

Martens W.L. (1987): Principal component analysis and resynthesis of spectral cues to perceived direction, in Proceeding of the International computer Music Conference, San Francisco, CA, USA, 274–281.

Martens W.L. (2001): Two-subwoofer reproduction enables increased variation in auditory spatial imagery, in Proceedings of the 2nd International Workshop on Spatial Media, Aizu-Wakamatsu, Japan, 86–97.

Martin G. (2005): A new microphone technique for five-channel recording, at the AES 118th Convention, Barcelona, Spain, Paper 6427.

Martin G., Woszczyk W., Corey J., et al. (1999): Sound source localization in a five-channel surround sound reproduction system, at the AES 107th Convention, New York, USA, Paper 4994.

Mason R. (2002): Elicitation and measurement of auditory spatial attributes in reproduced sound, Doctor dissertation of Philosophy, Surrey University, Guildford, UK.

Mason R., Ford N., Rumsey F., et al. (2001): Verbal and nonverbal elicitation techniques in the subjective assessment of spatial sound reproduction, J. Audio Eng. Soc., 49(5), 366–384.

Matsudaira T.K., and Fukami T. (1973): Phase difference and sound image localization, J. Audio Eng. Soc., 21(10), 792–797.

Matsui K., and Ando A. (2010): Binaural reproduction of 22.2 multichannel sound over loudspeakers, at the AES 129th Convention, San Francisco, CA, USA, Paper 8272.

Matsui K., and Ando A. (2013): Binaural reproduction of 22.2 multichannel sound with loudspeaker array frame, at the AES 135th Convention, New York, USA, Paper 8954.

Matsumoto M., Yamanaka S., and Tohyama M. (2004): Effect of arrival time correction on the accuracy of binaural impulse response interpolation, interpolation methods of binaural response, J. Audio Eng. Soc., 52(1/2), 56–61.

McKinnie D., and Rumsey F. (1997): Coincident microphone techniques for three-channel stereophonic reproduction, at the AES 102nd Convention, Munich, Germany, Paper 4429.

Meares D.J. (1991): Sound system for high definition television, Appl. Acoust., 33(3), 229–243.

Meares D.J. (1992): Multichannel sound system for HDTV, Appl. Acoust., 36(3/4), 245–257.

Meares D.J., and Ratliff P.A. (1976): The development of compatible 4-2-4 Quadraphonic Matrix system: B.B.C Matrix H, EBU. Review-Tech., Pt. 159(1976 Oct.), 208–217.

Melchior F., Thiergart O., Galdo G.D., et al. (2009): Dual radius spherical cardioid microphone arrays for binaural auralization, at the AES 127th Convention, New York, USA, Paper 7855.

Mennie D. (1978): AM stereo: five competing options, IEEE J. Mag., 15(6), 24–31.

Menzies D. (2002): W-panning and O-format, tools for object spatialisation, at the AES 22nd Conference, Espoo, Finland.

Menzies D., and Al-Akaidi M. (2007): Ambisonic synthesis of complex sources, J. Audio Eng. Soc., 55(10), 864–876.

Menzies D., and Marwan A.A. (2007): Nearfield binaural synthesis and ambisonics, J. Acoust. Soc. Am., 121(3), 1559–1563.

Merchel S., and Groth S. (2010): Adaptively adjusting the stereophonic sweet spot to the listener’s position, J. Audio Eng. Soc., 58(10), 809–817.

Merimaa J. (2009): Modification of HRTF filters to reduce timbral effects in binaural synthesis, at the AES 127th Convention, New York, NY, USA, Paper 7912.

Merimaa J. (2010): Modification of HRTF filters to reduce timbral effects in binaural synthesis, part 2: individual HRTFs, in AES 129th Convention, San Francisco, CA, USA, Paper 8265.

Merimaa J., Goodwin M.M., and Jot J.M. (2007): Correlation-based Ambience extraction from stereo recordings, at the AES 123rd Convention, New York, USA, Paper 8265.

Merimaa J., and Pulkki V. (2005): Spatial impulse response rendering I: analysis and synthesis, J. Audio Eng. Soc., 53(12), 1115–1127.

Mertens H. (1965): Directional hearing in stereophony theory and experimental verification, EBU Rev., Part A, 92(Aug.), 146–158.

Meyer E., and Schodder G.R. (1952): On the influence of reflected sound on directional localization and loudness of speech (in German), Nachr. Akad. Wiss, Göttingen, Math. Phys. Klasse IIa, 6, 31–42.

Meyer E., and Thiele R. (1956): Room-acoustical investigations in numerous concert halls and radio studios by means of novel measuring technique (in German), Acustica, 6, 425–444.

Meyer J., and Elko G.W. (2004): Spherical microphone arrays for 3D sound recording, in Audio signal processing for the next-generation multimedia communication systems (edited by Huang Y. and Benesty J.), Kluwer Academic Publishers, Boston, USA, 67–89.

Middlebrooks J.C. (1992): Narrow-band sound localization related to external ear acoustics, J. Acoust. Soc. Am., 92(5), 2607–2624.

Middlebrooks J.C. (1999a): Individual differences in external-ear transfer functions reduced by scaling in frequency, J. Acoust. Soc. Am., 106(3), 1480–1492.

Middlebrooks J.C. (1999b): Virtual localization improved by scaling nonindividualized external-ear transfer functions in frequency, J. Acoust. Soc. Am., 106(3), 1493–1510.

Middlebrooks J.C., and Green D.M. (1992): Observations on a principal components analysis of head-related transfer functions, J. Acoust. Soc. Am., 92(1), 597–599.

Middlebrooks J.C., Makous J.C., and Green D.M. (1989): Directional sensitivity of sound-pressure levels in the human ear canal, J. Acoust. Soc. Am., 86(1), 89–108.

Mills A.W. (1958): On the minimum audible angle, J. Acoust. Soc. Am., 30(4), 237–246.

Miyasaka E. (1989): A sound reproduction system and transmission system for HDTV, at the AES 7th Conference, Toronto, Canada.

Momose T., Otani M., Hashimoto M., et al. (2015): Adaptive amplitude and delay control for stereophonic reproduction that is robust against listener position variations, J. Audio Eng. Soc., 63(1/2), 90–98.

Monro G. (2000): In-phase corrections for Ambisonics, in Proceedings of International Computer Music Conference, Berlin, Germany, 292–295.

Moore B.C.J. (2012): An introduction to the psychology of hearing (6th edition), Emerald Group Publishing Limited, UK.

Moore B.C.J., and Glasberg B.R. (2007): Modeling binaural loudness, J. Acoust. Soc. Am., 121(3), 1604–1612.

Moore B.C.J., Glasberg B.R., and Baer T. (1997): A model for the prediction of thresholds, loudness, and partial loudness, J. Audio Eng. Soc., 45(4), 224–240.

Moore B.C.J., Oldfield S.R., and Dooley G.J. (1989): Detection and discrimination of spectral peaks and notches at 1 and 8 kHz, J. Acoust. Soc. Am., 85(2), 820–836.

Moore D., and Wakefield J. (2008): The design of Ambisonic decoders for the ITU 5.1 layout with even performance characteristics, at the AES 124th Convention, Paper 7473.

Moorer J.A. (1979): About this reverberation business, Comput. Music J., 3(2), 13–28.

Moreau S., Daniel J., and Bertet S. (2006): 3D sound field recording with higher order Ambisonics – objective measurements and validation of spherical microphone, at the AES 120th Convention, Paris, France, Paper 6857.

Morimoto M., Fujimori H., and Maekawa Z. (1990): Discrimination between auditory source width and envelopment, J. Acoust. Soc. Japan, 46(6), 448–457.

Morimoto M., and Iida K. (1993): A new physical measure for psychological evaluation of a sound field: front/back energy ratio as a measure for envelopment, J. Acoust. Soc. Am., 93(4), 2282.

Morimoto M., and Iida K. (1995): A practical evaluation method of auditory source width in concert halls, J. Acoust. Soc. Japan, 16(2), 59–69.

Morimoto M., Iida K., and Sakagami K. (2001): The role of reflections from behind the listener in spatial impression, Appl. Acoust., 62(2), 109–124.

Morrell M.J., and Reiss J.D. (2009): A comparative approach to sound localization within a 3-D sound field, at the AES 126th Convention, Munich, Germany, Paper 7663.

Morse P.M., and Ingard K.U. (1968): Theoretical acoustics, McGraw-Hill, New York, USA.

Mourjopoulos J.N. (1994): Digital equalization of room acoustics, J. Audio Eng. Soc., 42(11), 884–900.

Muraoka T., and Nakazato T. (2007): Examination of multichannel sound-field recomposition utilizing frequency-dependent interaural cross correlation (FIACC), J. Audio Eng. Soc., 55(4), 236–256.

Murtaza A., Herre J., and Paulus J. (2015): ISO/MPEG-H 3D audio: SAOC-3D decoding and rendering, at the AES 139th Convention, New York, USA, Paper 9434.

Møller H. (1992): Fundamentals of binaural technology, Appl. Acoust., 36(3/4), 171–218.

Møller H., Hammershøi D., and Jensen C.B., et al. (1995b): Transfer characteristics of headphones measured on human ears, J. Audio Eng. Soc., 43(4), 203–217.

Møller H., Hammershøi D., Jensen C.B., et al. (1999): Evaluation of artificial heads in listening tests, J. Audio Eng. Soc., 47(3), 83–100.

Møller H., Sørensen M.F., Hammershøi D., et al. (1995a): Head-related transfer functions of human subjects, J. Audio Eng. Soc., 43(5), 300–321.

Møller H., Sørensen M.F., Jensen C.B., et al. (1996): Binaural technique: do we need individual recordings? J. Audio Eng. Soc., 44(6), 451–469.

Nakabayashi K. (1975): A method of analyzing the quadraphonic sound field, J. Audio Eng. Soc., 23(3), 187–193.

Nakabayashi K., Kurozumi K., and Miyasaka E., et al. (1991): Three-one quadraphonic sound system for high definition television, at the AES 10th International Conference, London, UK.

Naylor G.M. (1993): ODEON – another hybrid room acoustical model, Appl. Acoust., 38(2–4), 131–143.

Nelson P.A., and Elliott S.J. (1992): Active control of sound, Academic Press Inc., San Diego, USA.

Nelson P.A., and Kahana Y. (2001): Spherical harmonics, singular-value decomposition and the head-related transfer function, J. Sound Vib., 239(4), 607–637.

Nelson P.A., Orduña-Bustamante F., and Engler E., et al. (1996): Experiments on a system for synthesis of virtual acoustic sources, J. Audio Eng. Soc., 44(11), 990–1007.

Neuendorf M., Multrus M., and Rettelbach N., et al. (2013): The ISO/MPEG unified speech and audio coding standard—consistent high quality for all content types and at all bit rates, J. Audio Eng. Soc., 61(12), 956–977.

Neukom M. (2006): Decoding second order Ambisonics to 5.1 surround systems, at the AES 121st Convention, San Francisco, CA, Paper 6980.

Neukom M. (2007): Ambisonic panning, at the AES 123rd Convention, New York, USA, Paper 7297.

Nicol R., and Emerit M. (1999): 3D-sound reproduction over an extensive listening area: a hybrid method derived from holophony and ambisonic, at the AES 16th International Conference, Rovaniemi, Finland.

Nielsen S.H. (1993): Auditory distance perception in different rooms, J. Audio Eng. Soc., 41(10), 755–770.

Nikolic I. (2002): Improvements of artificial reverberation by use of subband feedback delay networks, at the AES 112th Convention, Munich, Germany, Paper 5630.

Nishino T., Inoue N., Takeda K., et al. (2007): Estimation of HRTFs on the horizontal plane using physical features, Appl. Acoust., 68(8), 897–908.

Noisternig M., Sontacchi A., Musil T., et al. (2003): A 3D Ambisonic based binaural sound reproduction system, at the AES 24th International Conference, Banff, Canada.

Noll P. (1997): MPEG digital audio coding, IEEE Signal Process. Mag., 14(5), 59–81.

Nousaine T. (1997): Multiple subwoofers for home theater, at the AES 103rd Convention, New York, USA, Paper 4558.

Nymand M. (2003): Introduction to microphone technique for 5.1 surround sound, at the DPA microphone workshop on mic techniques for multichannel audio, the AES 24th International Conference, Banff, Canada.

Ohgushi K., Komiyama S., Kurozumi K., et al. (1987): Subjective evaluation of multi-channel stereophony for HDTV, IEEE Trans. Broadcast., 33(4), 197–202.

Okano T., Beranek L.L., and Hidaka T. (1998): Relations among interaural cross-correlation coefficient (IACCE), lateral fraction (LFE), and apparent source width (ASW) in concert halls, J. Acoust. Soc. Am., 104(1), 255–265.

Okubo H., Sugimoto T., Oishi S., et al. (2012): A method for reproducing frontal sound field of 22.2 multichannel sound utilizing a loudspeaker array frame, at the AES 133rd Convention, San Francisco, USA, Paper 8714.

Olive S. (2001): Evaluation of five commercial stereo enhancement 3D audio software plug-ins, at the AES 110th Convention, Amsterdam, The Netherlands, Paper 5386.

Olive S.E., and Toole F.E. (1989): The detection of reflections in typical rooms, J. Audio Eng. Soc., 37(7/8), 539–553.

Olson H.F. (1969): Home entertainment: audio 1988, J. Audio Eng. Soc., 17(4), 390–404.

Ono K., Nishiguchi T., Matsui K., et al. (2013): Portable spherical microphone for super hi-vision 22.2 multichannel audio, New York, USA, Paper 8922.

Oppenheim A.V., Schafer R.W., and Buck J.R. (1999): Discrete-time signal processing (2nd edition), Prentice-Hall, Upper Saddle River, NJ.

Orban R. (1970): A rational technique for synthesizing pseudo-stereo from monophonic sources, J. Audio Eng. Soc., 18(2), 157–164.

Otani M., and Ise S. (2006): Fast calculation system specialized for head-related transfer function based on boundary element method, J. Acoust. Soc. Am., 119(5), 2589–2598.

Ottens L.F. (1967): The compact-cassette system for audio tape recorders, J. Audio Eng. Soc., 15(1), 26–28.

Paavola M., Karlsson E., and Page J. (2005): 3D audio for mobile devices via Java, at the AES 118th Convention, Barcelona, Spain, Paper 6472.

Park J.Y., Chang J.H., and Kim Y.H. (2010): Generation of independent bright zones for a two-channel private audio system, J. Audio Eng. Soc., 58(5), 382–393.

Park Y.C., Chio T.S., and Jung J.W., et al. (2006): Low complexity 3D audio algorithms for handheld devices, at the AES 29th International Conference, Seoul, Korea.

Paul S. (2009): Binaural recording technology: a historical review and possible future developments, Acta Acust. United Ac., 95(5), 767–788.

Perrett S., and Noble W. (1997): The effect of head rotations on vertical plane sound localization, J. Acoust. Soc. Am., 102(4), 2325–2332.

Pierce A.D. (2019): Acoustics, an introduction to its physical principles and applications (3rd edition), Springer, Cham, Switzerland.

Pihlajamaki T., Santala O., and Pulkki V. (2014): Synthesis of spatially extended virtual source with time-frequency decomposition of mono signals, J. Audio Eng. Soc., 62(7/8), 467–484.

Plenge G. (1972): On the problem of inside-the-head locatedness, Acustica, 26(5), 241–252.

Plenge G. (1974): On the differences between localization and lateralization, J. Acoust. Soc. Am., 56(3), 944–951.

Pohlmann K.C. (2011): Principles of digital audio (6th edition), McGraw-Hill Companies, Inc., New York, USA.

Poletti M.A. (1996): The design of encoding functions for stereophonic and polyphonic sound systems, J. Audio Eng. Soc., 44(11), 948–963.

Poletti M.A. (2000): A unified theory of horizontal holographic sound systems, J. Audio Eng. Soc., 48(12), 1155–1182.

Poletti M.A. (2005a): Effect of noise and transducer variability on the performance of circular microphone arrays, J. Audio Eng. Soc., 53(5), 371–384.

Poletti M.A. (2005b): Three-dimensional surround sound systems based on spherical harmonics, J. Audio Eng. Soc., 53(11), 1004–1025.

Poletti M.A. (2007): Robust two-dimensional surround sound reproduction for nonuniform loudspeaker layouts, J. Audio Eng. Soc., 55(7/8), 598–610.

Poletti M.A. (2008): An investigation of 2D multizone surround sound systems, at the AES 125th Convention, San Francisco, USA, Paper 7551.

Poletti M.A., and Abhayapala T.D. (2011): Interior and exterior sound field control using general two-dimensional first order source, J. Acoust. Soc. Am., 129(1), 234–244.

Poletti M.A., and Betlehem T. (2014): Creation of a single sound field for multiple listeners, in Internoise 2014, Melbourne, Australia.

Poletti M.A., Fazi F.M., and Nelson P.A. (2010a): Sound-field reproduction systems using fixed-directivity loudspeakers, J. Acoust. Soc. Am., 127(6), 3590–3601.

Poletti M.A., Fazi F.M., and Nelson P.A. (2010b): Sound reproduction systems using variable-directivity loudspeakers, J. Acoust. Soc. Am., 129(3), 1429–1438.

Politis A., Laitinen M.V., Ahonen J., et al. (2015): Parametric spatial audio processing of spaced microphone array recordings for multichannel reproduction, J. Audio Eng. Soc., 63(4), 216–227.

Pollow M., Nguyen K.V., Warusfel O., et al. (2012): Calculation of head-related transfer functions for arbitrary field points using spherical harmonics decomposition, Acta Acust. United Ac., 98(1), 72–82.

Pöntynen H., Santala O., and Pulkki V. (2016): Conflicting dynamic and spectral directional cues form separate auditory images, at the AES 140th Convention, Paris, France, Paper 9582.

Pörschmann C. (2007): 3-D audio in mobile communication devices: methods for mobile head-tracking, J. Virtual Real. Broadcast., 4(13), 0009-6-11833.

Potard G., and Burnett I. (2004): Decorrelation techniques for the rendering of apparent sound source width in 3D audio display, in Proceedings of the 7th International Conference on Digital Audio Effect, Naples, Italy, 280–284.

Power P., Davies W.J., Hirst J., et al. (2012): Localisation of elevated virtual sources in higher order Ambisonics sound fields, in Proceedings Institute of Acoustics, 34(Pt.4), Brighton, UK.

Pralong D., and Carlile S. (1996): The role of individualized headphone calibration for the generation of high fidelity virtual auditory space, J. Acoust. Soc. Am., 100(6), 3785–3793.

Pueo B., López J., Escolano J., et al. (2010): Multiactuator panels for wave field synthesis: evolution and present developments, J. Audio Eng. Soc., 58(12), 1045–1063.

Pulkki V. (1997): Virtual sound source positioning using vector base amplitude panning, J. Audio Eng. Soc., 45(6), 456–466.

Pulkki V. (2001a): Localization of amplitude-panned virtual sources II: two- and three-dimensional panning, J. Audio Eng. Soc., 49(9), 753–767.

Pulkki V. (2001b): Coloration of amplitude-panned virtual sources, at the AES 110th Convention, Amsterdam, The Netherlands, Paper 5402.

Pulkki V. (2002): Microphone techniques and directional quality of sound reproduction, at the AES 112th Convention, Munich, Germany, Paper 5500.

Pulkki V. (2007): Spatial sound reproduction with directional audio coding, J. Audio Eng. Soc., 55(6), 503–516.

Pulkki V., and Hirvonen T. (2005): Localization of virtual sources in multichannel audio reproduction, IEEE Trans. Speech, Audio Process., 13(1), 105–119.

Pulkki V., and Karjalainen M. (2001): Localization of amplitude-panned virtual sources I: stereophonic panning, J. Audio Eng. Soc., 49(9), 739–752.

Pulkki V., and Karjalainen M. (2015): Communication acoustics: an introduction to speech, audio and psychoacoustics, John Wiley & Sons Ltd, West Sussex, UK.

Pulkki V., Karjalainen M., and Huopaniemi J. (1999): Analyzing virtual sound source attributes using a binaural auditory model, J. Audio Eng. Soc., 47(4), 203–217.

Pulkki V., and Merimaa J. (2006): Spatial impulse response rendering II: reproduction of diffuse sound and listening tests, J. Audio Eng. Soc., 54(1/2), 3–20.

Pulkki V., Politis A., and Galdo G.D., et al. (2013): Parametric spatial audio reproduction with higher-order B-format microphone input, at the AES 134th Convention, Rome, Italy, Paper 8920.

Pulkki V., Pöntynen H., and Santala O. (2019): Spatial perception of sound source distribution in the median plane, J. Audio Eng. Soc., 67(11), 855–870.

Purnhagen H., Hirvonen T., and Villemoes L., et al. (2016): Immersive audio delivery using joint object coding, at the AES 140th Convention, Paris, France, Paper 9587.

Rafaely B. (2004): Plane-wave decomposition of the sound field on a sphere by spherical convolution, J. Acoust. Soc. Am., 116(4), 2149–2157.

Rafaely B. (2005): Analysis and design of spherical microphone arrays, IEEE Trans. Speech, Audio Process., 13(1), 135–143.

Rafaely B. (2015): Fundamentals of spherical array processing, Springer-Verlag, Berlin Heidelberg.

Rao D., and Xie B.S. (2004): Multichannel spatial surround sound system, Chin. J. Acoust., 23(2), 153–166.

Rao D., and Xie B.S. (2005): Head rotation and sound image localization in the median plane, Chin. Sci. Bull., 50(5), 412–416.

Rao D., and Xie B.S. (2006): Repeatability analysis on headphone transfer function measurement (in Chinese), Tech. Acoust., 25(supplement), 441–442.

Rao D., and Xie B.S. (2007): Influence of sound source directivity on binaural auralization quality (in Chinese), Tech. Acoust., 26(5), 899–903.

Ratliff P.A. (1974): Properties of hearing related to quadraphonic reproduction, BBC RD38.

Riederer K.A.J. (1998): Head-related transfer function measurement, Master's thesis, Helsinki University of Technology, Finland.

Rohr L., Corteel E., and Nguyen K.V., et al. (2013): Vertical localization performance in a practical 3-D WFS formulation, J. Audio Eng. Soc., 61(12), 1001–1014.

Rubak P., and Johansen L.G. (1998): Artificial reverberation based on a pseudo-random impulse response, at the AES 104th Convention, Amsterdam, The Netherlands, Paper 4725.

Rubak P., and Johansen L.G. (1999): Artificial reverberation based on a pseudo-random impulse response II, at the AES 106th Convention, Munich, Germany, Paper 4900.

Rui Y.Q., Yu G.Z., and Xie B.S., et al. (2013): Calculation of individualized near-field head-related transfer function database using boundary element method, at the AES 134th Convention, Rome, Italy, Paper 8901.

Rumsey F. (1998): Subjective assessment of the spatial attributes of reproduced sound, at the AES 15th International Conference, Copenhagen, Denmark.

Rumsey F. (1999): Controlled subjective assessments of two-to-five channel surround sound processing algorithms, J. Audio Eng. Soc., 47(7/8), 563–582.

Rumsey F. (2001): Spatial audio, Focal Press, Oxford, England.

Rumsey F. (2002): Spatial quality evaluation for reproduced sound: terminology, meaning, and a scene-based paradigm, J. Audio Eng. Soc., 50(9), 651–666.

Rumsey F. (2013): Cinema sound in the 3D era, J. Audio Eng. Soc., 61(5), 340–344.

Rumsey F. (2016): Automotive audio: they know where you sit, J. Audio Eng. Soc., 64(9), 705–708.

Rumsey F. (2017): Broadcast and streaming: immersive audio, objects and OTT TV, J. Audio Eng. Soc., 65(4), 338–341.

Rumsey F., Zielinski S., Jackson P., et al. (2008): QESTRAL (part 1): Quality evaluation of spatial transmission and reproduction using an artificial listener, at the AES 125th Convention, San Francisco, USA, Paper 7595.

Rumsey F., Zieliński S., and Kassier R. (2005): On the relative importance of spatial and timbral fidelities in judgments of degraded multichannel audio quality, J. Acoust. Soc. Am., 118(2), 968–976.

Sakamoto N., Gotoh T., Kogure T., et al. (1981): Controlling sound-image localization in stereophonic sound reproduction, part 1, J. Audio Eng. Soc., 29(11), 794–799.

Sakamoto N., Gotoh T., Kogure T., et al. (1982): Controlling sound-image localization in stereophonic sound reproduction, part 2, J. Audio Eng. Soc., 30(10), 719–722.

Samsudin, Kurniawati E., Ng B.H., et al. (2006): A stereo to mono downmixing scheme for MPEG-4 parametric stereo encoder, in Proceedings of 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Toulouse, France.

Sander C., Wefers F., and Leckschat D. (2012): Scalable binaural synthesis on mobile devices, at the AES 133rd Convention, San Francisco, USA, Paper 8783.

Sandvad J. (1996): Dynamic aspects of auditory virtual environments, at the AES 100th Convention, Copenhagen, Denmark, Paper 4226.

Savioja L., Huopaniemi J., Lokki T., et al. (1999): Creating interactive virtual acoustic environments, J. Audio Eng. Soc., 47(9), 675–705.

Sawaguchi M. (editor) (2001): Surround production handbook (in Japanese), Kenrokukan Publishing, Japan.

Sawaya I., Sasaki K., Mikami S., et al. (2015): Dubbing studio for 22.2 multichannel sound system in NHK broadcasting center, at the AES 138th Convention, Warsaw, Poland, Paper 9327.

Scaini D., and Arteaga D. (2014): Decoding higher order Ambisonics to irregular periphonic loudspeaker arrays, at the AES 55th International Conference, Helsinki, Finland.

Scaini D., and Arteaga D. (2020): Wavelet-based spatial audio format, J. Audio Eng. Soc., 68(9), 613–627.

Scheiber P. (1971): Four channels and compatibility, J. Audio Eng. Soc., 19(4), 267–279.

Scheirer E.D., Väänänen R., and Huopaniemi J. (1999): AudioBIFS: describing audio scenes with the MPEG-4 multimedia standard, IEEE Trans. Multimedia, 1(3), 237–250.

Schoeffler M., Adami A., and Herre J. (2014): The influence of up- and down-mixes on the overall listening experience, at the AES 137th Convention, Los Angeles, USA, Paper 9140.

Schroeder M.R. (1958): An artificial stereophonic effect obtained from a single audio signal, J. Audio Eng. Soc., 6(2), 74–79.

Schroeder M.R. (1962): Natural sounding artificial reverberation, J. Audio Eng. Soc., 10(3), 219–223.

Schroeder M.R. (1965): New method of measuring reverberation time, J. Acoust. Soc. Am., 37(3), 409–412.

Schroeder M.R. (1987): Statistical parameters of the frequency response curves of large rooms, J. Audio Eng. Soc., 35(5), 299–306.

Schroeder M.R. (1989): Self-similarity and fractals in science and art, J. Audio Eng. Soc., 37(10), 795–808.

Schroeder M.R., and Atal B.S. (1963): Computer simulation of sound transmission in rooms, Proceedings of the IEEE, 51(3), 536–537.

Seeber B.U., Baumann U., and Fastl H. (2004): Localization ability with bimodal hearing aids and bilateral cochlear implants, J. Acoust. Soc. Am., 116(3), 1698–1709.

Seo J., Park G.Y., Jang D.Y., et al. (2003): Implementation of interactive 3D audio using MPEG-4 multimedia standards, at the AES 115th Convention, New York, USA, Paper 5980.

Seo J.H., Chon S.B., Sung K.M., et al. (2013): Perceptual objective quality evaluation method for high-quality multichannel audio codecs, J. Audio Eng. Soc., 61(7/8), 535–545.

Shaw E.A.G. (1974): Transformation of sound pressure level from the free field to the eardrum in the horizontal plane, J. Acoust. Soc. Am., 56(6), 1848–1861.

Shaw E.A.G., and Teranishi R. (1968): Sound pressure generated in an external ear replica and real human ears by nearby point source, J. Acoust. Soc. Am., 44(1), 240–249.

Shi B., and Xie B.S. (2008): Auditory spatial impression and some psychoacoustic problems in electroacoustic reproduction (in Chinese), Audio Eng., 32(9), 34–45.

Shi B., and Xie B.S. (2010): The cross-correlation of signals and spatial impression in surround sound reproduction, Chin. J. Acoust., 29(3), 308–320.

Shinn-Cunningham B.G. (1998): Applications of virtual auditory displays, in Proceedings of the 20th International Conference of the IEEE Engineering in Biology and Medicine Society, Hong Kong, China, 20(3), 1105–1108.

Shinn-Cunningham B.G., Schickler J., Kopčo N., et al. (2001): Spatial unmasking of nearby speech sources in a simulated anechoic environment, J. Acoust. Soc. Am., 110(2), 1118–1129.

Shively R. (2000): Automotive audio design (a tutorial), at the AES 109th Convention, Los Angeles, USA, Paper 5276.

Short K.M., Garcia R.A., and Daniels M.L. (2007): Multichannel audio processing using a unified-domain representation, J. Audio Eng. Soc., 55(3), 156–165.

Silzle A., Novo P., and Strauss H. (2004): IKA-SIM: A system to generate auditory virtual environments, at the AES 116th Convention, Berlin, Germany, Paper 6016.

Simon L.S.R., and Mason R. (2010): Time and level localization curves for a regularly-spaced octagon loudspeaker array, at the AES 128th Convention, London, UK, Paper 8079.

Simonson G. (1984): Master’s Thesis at the Technical University of Lyngby.

Sivonen V.P., and Ellermeier W. (2008): Binaural loudness for artificial-head measurements in directional sound fields, J. Audio Eng. Soc., 56(6), 452–461.

SMPTE 320M (1999): Television-channel assignments and levels on multichannel audio media, Proposed standard for Television, ITU information doc.ITU-R 10C/11 and 10-11R/2, Society of Moving Picture and Television Engineers, NY, USA.

SMPTE RP 200 (2012): Relative and absolute sound pressure levels for motion-picture multichannel sound systems – applicable for analog photographic film audio, digital photographic film audio and D-cinema, Society of Moving Picture and Television Engineers, NY, USA.

SMPTE RP 2096-1 (2017): Cinema sound system baseline setup and calibration, Society of Moving Picture and Television Engineers, NY, USA.

SMPTE RP 2096-2 (2017): Cinema sound system maintenance calibration, Society of Moving Picture and Television Engineers, NY, USA.

SMPTE ST 202 (2010): Motion-pictures – dubbing theaters, review rooms and indoor theaters – B-Chain electroacoustic response, Society of Moving Picture and Television Engineers, NY, USA.

SMPTE ST 2036-2-2008 (2008): Ultra high definition television–audio characteristics and audio channel mapping of program production, Society of Moving Picture and Television Engineers, NY, USA.

SMPTE ST 2098-5 (2018): D-Cinema immersive audio channels and soundfield groups, Society of Moving Picture and Television Engineers, NY, USA.

SMPTE 428-3 (2006): D-Cinema distribution master, audio channel mapping and channel labeling, Society of Moving Picture and Television Engineers, NY, USA.

Smyth M. (1999): White paper, an overview of the coherent acoustics coding system, www.dts.com.

Smyth S.M.F., Smyth W.P., and Smyth M.H.C., et al. (1996): DTS coherent acoustics, delivering high quality multichannel sound to the consumer, at the AES 100th Convention, Copenhagen, Denmark, Paper 4293.

Snow W. (1953): Basic principles of stereophonic sound, J. SMPTE, 61(5), 567–589.

So R.H.Y., Ngan B., Horner A., et al. (2010): Toward orthogonal non-individualised head-related transfer functions for forward and backward directional sound: cluster analysis and an experimental study, Ergonomics, 53(6), 767–781.

Solvang A. (2008): Spectral impairment of two-dimensional higher order ambisonics, J. Audio Eng. Soc., 56(4), 267–279.

Søndergaard P.L., and Majdak P. (2013): The auditory modeling toolbox, in The technology of binaural listening (edited by Blauert J.), Springer-Verlag, Berlin Heidelberg.

Song M.H., Choi J.W., and Kim Y.H. (2012): A selective array activation method for the generation of a focused source considering listening position, J. Acoust. Soc. Am., 131(2), EL156–162.

Song W., Ellermeier W., and Hald J. (2008): Using beamforming and binaural synthesis for the psychoacoustical evaluation of target sources in noise, J. Acoust. Soc. Am., 123(2), 910–924.

Sonke J.J., Labeeuw J., and Vries D.D.E. (1998): Variable acoustics by wavefield synthesis: a closer look at amplitude effects, at the AES 104th Convention, Amsterdam, The Netherlands, Paper 4712.

Sonke J.J., and Vries D.D.E. (1997): Generation of diffuse reverberation by plane wave synthesis, at the AES 102nd Convention, Munich, Germany, Paper 4455.

Sontacchi A. (2003): Dreidimensionale schallfeldreproduktion fuer lautsprecher-und kopfhoereranwendungen, PhD thesis, Graz University of Technology, Styria, Austria.

Sontacchi A., and Hoeldrich R. (2000): Enhanced 3D sound field synthesis and reproduction using system by compensating interfering reflexions, in Proceedings of DAFX-00, Verona, Italy.

Sontacchi A., and Holdrich R. (2002): Distance coding in 3D sound fields, at the AES 21st International Conference, St. Petersburg, Russia.

Soulodre G.A., Grusec T., and Lavoie M., et al. (1998): Subjective evaluation of state-of-the-art two-channel audio codecs, J. Audio Eng. Soc., 46(3), 164–177.

Spanias A., Painter T., and Atti V. (2007): Audio signal processing and coding, John Wiley & Sons, Hoboken, New Jersey.

Sporer T., Walther A., Liebetrau J., et al. (2006): Perceptual evaluation of algorithms for blind up-mix, at the AES 121st Convention, San Francisco, USA, Paper 6915.

Spors S., and Ahrens J. (2009): Spatial sampling artifacts of wave field synthesis for the reproduction of virtual point sources, at the AES 126th Convention, Munich, Germany, Paper 7744.

Spors S., and Ahrens J. (2010a): Local sound field synthesis by virtual secondary sources, at the AES 40th International Conference, Tokyo, Japan.

Spors S., and Ahrens J. (2010b): Reproduction of focused sources by spectral division method, in Proceedings of the 4th IEEE International Symposium on Communications, Control and Signal Processing, Limassol, Cyprus.

Spors S., and Ahrens J. (2010c): Analysis and improvement of pre-equalization in 2.5-dimensional wave field synthesis, at the AES 128th Convention, London, UK, Paper 8121.

Spors S., Buchner H., and Rabenstein R. (2004): Efficient active listening room compensation for wave field synthesis, at the AES 116th Convention, Berlin, Germany, Paper 6119.

Spors S., Buchner H., Rabenstein R., et al. (2007): Active listening room compensation for massive multichannel reproduction systems using wave-domain adaptive filtering, J. Acoust. Soc. Am., 122(1), 354–369.

Spors S., Kuntz A., and Rabenstein R. (2003): An approach to listening room compensation with the wave synthesis, at the AES 24th International Conference, Banff, Canada.

Spors S., and Rabenstein R. (2006): Spatial aliasing artifacts produced by linear and circular loudspeaker arrays used for wave field synthesis, at the AES 120th Convention, Paris, France, Paper 6711.

Spors S., Rabenstein R., and Ahrens J. (2008): The theory of wave field synthesis revisited, at the AES 124th Convention, Amsterdam, The Netherlands, Paper 7358.

Spors S., and Wierstorf H. (2008): Comparison of higher order ambisonics and wave field synthesis with respect to spatial discretization artifacts properties and spatial sampling, at the AES 125th Convention, San Francisco, USA, Paper 7556.

Spors S., Wierstorf H., and Ahrens J. (2011): Interpolation and range extrapolation of head-related transfer functions using virtual local wave field synthesis, at the AES 130th Convention, London, UK, Paper 8392.

Spors S., Wierstorf H., Geier M., et al. (2009): Physical and perceptual properties of focused virtual sources in wave field synthesis, at the AES 127th Convention, New York, USA, Paper 7914.

Spors S., Renk M., and Rabenstein R. (2005): Limiting effect of active room compensation using wave field synthesis, at the AES 118th Convention, Barcelona, Spain, Paper 6400.

Stan G.B., Embrechts J.J., and Archambeau D. (2002): Comparison of different impulse response measurement techniques, J. Audio Eng. Soc., 50(4), 249–262.

Stautner J., and Puckette M. (1982): Designing multi-channel reverberators, Comput. Music J., 6(1), 52–65.

Start E.W. (1996): Application of curve array in wave field synthesis, at the AES 100th Convention, Copenhagen, Denmark, Paper 4143.

Start E.W., Valstar V.G., and Vries D.D. (1995): Application of spatial bandwidth reduction in wave field synthesis, at the AES 98th Convention, Paris, France, Paper 3972.

Steinberg J.C., and Snow W.B. (1934): Auditory perspective-physical factors, In Stereophonic Techniques, 3-7, Audio Engineering Society.

Steinke G. (1996): Surround sound – the new phase: an overview, at the AES 100th Convention, Copenhagen, Denmark, Paper 4286.

Støfringsdal B., and Svensson P. (2006): Conversion of discretely sampled sound field data to auralization formats, J. Audio Eng. Soc., 54(5), 380–400.

Streicher R., and Dooley W. (1985): Basic stereo microphone perspectives – a review, J. Audio Eng. Soc., 33(7/8), 548–556.

Sugimoto T., Oode S., and Nakayama Y. (2015): Downmixing method for 22.2 channel sound signals in 8K super high-vision broadcasting, J. Audio Eng. Soc., 63(7/8), 590–599.

Sun H., and Svensson U.P. (2011): Design 3-D high order Ambisonics encoding matrices using convex optimization, at the AES 130th Convention, London, UK, Paper 8402.

Suokuisma P., Zacharov N., and Bech S. (1998): Multichannel level alignment, part I: signals and methods, at the AES 105th Convention, San Francisco, USA, Paper 4815.

Suzuki H., Shinbara H., and Toyoshima S.M. (1993): Study on optimum rear loudspeaker height for 3-1 reproduction of HDTV audio, at the AES 95th Convention, New York, USA, Paper 3722.

Svensson U.P., Botts J., and Savioja L. (2017a): Computational modeling of room acoustics I: wave-based modeling, in Architectural acoustics handbook (edited by Xiang N.), J. Ross Publishing, USA.

Svensson U.P., Botts J., Savioja L., et al. (2017b): Computational modeling of room acoustics II: geometrical acoustics, in Architectural acoustics handbook (edited by Xiang N.), J. Ross Publishing, USA.

Svensson U.P., and Kristiansen U.R. (2002): Computational modeling and simulation of acoustic spaces, at the AES 22nd International Conference, Espoo, Finland.

Takane S., Arai D., Miyajima T., et al. (2002): A database of head-related transfer functions in whole directions on upper hemisphere, Acoust. Sci. Tech., 23(3), 160–162.

Takanen M., and Lorho G. (2012): A binaural auditory model for the evaluation of reproduced stereophonic sound, at the AES 45th International Conference, Helsinki, Finland.

Takeuchi T., and Nelson P.A. (2002): Optimal source distribution for binaural synthesis over loudspeakers, J. Acoust. Soc. Am., 112(6), 2786–2797.

Takeuchi T., Nelson P.A., Kirkeby O., et al. (1998): Influence of individual head-related transfer function on the performance of virtual acoustic imaging systems, at the AES 104th Convention, Amsterdam, The Netherlands, Preprint 4700.

Tervo S., Pätynen J., Kuusinen A., et al. (2013): Spatial decomposition method for room impulse responses, J. Audio Eng. Soc., 61(1/2), 17–28.

Theile G. (1990): Further developments of loudspeaker stereophony, at the AES 89th Convention, Los Angeles, USA, Paper 2947.

Theile G. (1991a): HDTV sound systems: how many channels? at the AES 9th International Conference, Detroit, Michigan, USA.

Theile G. (1991b): On the naturalness of two-channel stereo sound, J. Audio Eng. Soc., 39(10), 761–767.

Theile G. (1993): Trends and activities in the development of multichannel sound systems, at the AES 12th Conference, Copenhagen, Denmark.

Theile G. (2001): Natural 5.1 channel recording based on psychoacoustic principles, at the AES 19th International Conference, Schloss Elmau, Germany.

Theile G., and Plenge G. (1977): Localization of lateral phantom sources, J. Audio Eng. Soc., 25(4), 196–200.

Theile G., and Steinke G. (1999): Surround sound guidelines for operational practice, at the AES UK 14th Conference: Audio-The Second Century, London, UK.

Theile G., and Wittek H. (2011): Principles in surround recordings with height, at the AES 130th Convention, London, UK, Paper 8403.

Thompson J., Smith B., Warner A., et al. (2012): Direct-diffuse decomposition of multichannel signals using a system of pairwise correlations, at the AES 133rd Convention, San Francisco, USA, Paper 8807.

Thompson J., Warner A., and Smith B. (2009): An active multichannel downmix enhancement for minimizing spatial and spectral distortions, at the AES 127th Convention, New York, USA, Paper 7913.

Todd C.C., Davidson G.A., and Davis M.F. (1994): AC-3, Flexible perceptual coding for audio transmission and storage, at the AES 96th Convention, Amsterdam, The Netherlands, Paper 3796.

Toh C.W., and Gan W.S. (1999): A real-time virtual surround sound system with bass enhancement, at the AES 107th Convention, New York, USA, Paper 5052.

Tohyama M., and Suzuki A. (1989): Interaural cross-correlation coefficients in stereo-reproduced sound fields, J. Acoust. Soc. Am., 85(2), 780–786.

Toole F.E. (1985): Subjective measurements of loudspeaker sound quality and listener performance, J. Audio Eng. Soc., 33(1/2), 2–32.

Toole F.E. (1991): Binaural record/reproduction systems and their use in psychoacoustic investigations, AES 91st Convention, New York, USA, Preprint 3179.

Tregonning A., and Martin B. (2015): The vertical precedence effect: utilizing delay panning for height channel mixing in 3D audio, at the AES 139th Convention, New York, USA, Paper 9469.

Tsang P.W.M., and Cheung W.K. (2009): Development of a re-configurable ambisonic decoder for irregular loudspeaker configuration, IET Circ. Dev. Syst., 3(4), 197–203.

Tsang P.W.M., Cheung W.K., and Leung C.S. (2009): Decoding Ambisonic signals to irregular loudspeaker configuration based on artificial neural networks, in Proceedings of ICONIP, part II, LNCS 5864, 273–280, Springer-Verlag, Berlin, Germany.

Tsingos N., Chabanne C., Robinson C., et al. (2010): Surround sound with height in games using Dolby Pro Logic IIz, at the AES 129th Convention, San Francisco, USA, Paper 8248.

Tsutsui K., Suzuki H., Shimoyoshi O., et al. (1992): ATRAC: adaptive transform acoustic coding for MiniDisc, at the AES 93rd Convention, San Francisco, USA, Paper 3456.

Uhle C., and Gampp P. (2016): Mono-to-stereo upmixing, at the AES 140th Convention, Paris, France, Paper 9528.

Uncini A. (2015): Fundamentals of adaptive signal processing, Springer International Publishing, Switzerland.

Usher J., and Benesty J. (2007): Enhancement of spatial sound quality: a new reverberation-extraction audio upmixer, IEEE Trans. Audio, Speech, Language Process., 15(7), 2141–2150.

Väänänen R., and Huopaniemi J. (2004): Advanced audio BIFS: virtual acoustics modeling in MPEG-4 scene description, IEEE Trans. Multimedia, 6(5), 661–675.

Välimäki V., Parker J.D., and Savioja L. (2012): Fifty years of artificial reverberation, IEEE Trans. Audio, Speech, Language Process., 20(5), 1421–1447.

Vanderkooy J. (1994): Aspects of MLS measuring systems, J. Audio Eng. Soc., 42(4), 219–231.

Vanderlyn P.B. (1954): British Patent Application. No. 23989.

Verbakel J., Kerkhof L.Van De., Maeda M., et al. (1998): Super audio CD format, at the AES 104th Convention, Amsterdam, The Netherlands, Paper 4705.

Vilkamo J., Backstrom T., and Kuntz A. (2013): Optimized covariance domain framework for time-frequency processing of spatial audio, J. Audio Eng. Soc., 61(6), 403–411.

Vilkamo J., Kuntz A., and Füg S. (2014): Reduction of spectral artifacts in multichannel downmixing with adaptive phase alignment, J. Audio Eng. Soc., 62(7/8), 516–526.

Vilkamo J., Neugebauer B., and Plogsties J. (2011): Sparse frequency-domain reverberator, J. Audio Eng. Soc., 59(12), 936–943.

Vilkamo J., and Pulkki V. (2014): Adaptive optimization of interchannel coherence with stereo and surround sound audio content, J. Audio Eng. Soc., 62(12), 861–869.

Vilkamo J., Lokki T., and Pulkki V. (2009): Directional audio coding: virtual microphone-based synthesis and subjective evaluation, J. Audio Eng. Soc., 57(9), 709–724.

Villemoes L., Herre J., Breebaart J., et al. (2006): MPEG surround: the forthcoming ISO standard for spatial audio coding, at the AES 28th International Conference, Pitea, Sweden.

Vinton M., McGrath D., Robinson C., et al. (2015): Next generation surround decoding and upmixing for consumer and professional applications, at the AES 57th Convention, Hollywood, CA, USA.

Vorländer M. (2004): Past, present and future of dummy head, at the 2004 conference of the Federation of the Ibero-American acoustical societies, Guimaraes, Portugal.

Vorländer M. (2008): Auralization, fundamentals of acoustics, modelling, simulation, algorithms and acoustic virtual reality, Springer-Verlag, Berlin, Germany.

Vries D.D.E. (1996): Sound reinforcement by wave field synthesis: adaptation of the synthesis operator to the loudspeaker directivity characteristics, J. Audio Eng. Soc., 44(12), 1120–1131.

Vries D.D.E. (2009): Wave field synthesis, Audio Engineering Society, New York, USA.

Vries D.D.E., Reijnen A.J., and Schonewille M.A. (1994b): The wave-field synthesis concept applied to generation of reflections and reverberation, at the AES 96th Convention, Amsterdam, The Netherlands, Paper 3813.

Vries D.D.E., Start E.W., and Valstar V.G. (1994a): The wave-field synthesis concept applied to sound reinforcement restrictions and solutions, at the AES 96th Convention, Amsterdam, The Netherlands, Paper 3812.

Vries D.D.E., and Vogel P. (1993): Experience with a sound enhancement system based on wavefront synthesis, at the AES 95th Convention, New York, USA, Paper 3748.

Walker R. (1994): Early reflections in studio control rooms: the results from the first controlled image design installations, at the AES 96th Convention, Amsterdam, The Netherlands, Paper 3853.

Wallach H. (1940): The role of head movement and vestibular and visual cue in sound localization, J. Exp. Psychol., 27(4), 339–368.

Waller J. K. (1996): The circle surround 5.2.5 5-channel surround system, Rocktron Corporation / RSP Technologies, White Paper.

Wallis R., and Lee H. (2014): Investigation into vertical stereophonic localisation in the presence of interchannel crosstalk, at the AES 136th Convention, Berlin, Germany, Paper 9026.

Walther A., and Faller C. (2011): Assessing diffuse sound field reproduction capabilities of multichannel playback systems, at the AES 130th Convention, London, UK, Paper 8428.

Ward D.B., and Abhayapala T.D. (2001): Reproduction of a plane-wave sound field using an array of loudspeakers, IEEE Trans. Speech, Audio Process., 9(6), 697–707.

Watkins A.J. (1978): Psychoacoustical aspects of synthesized vertical locale cues, J. Acoust. Soc. Am., 63(4), 1152–1165.

Weller T., Buchholz J.M., and Oreinos C. (2014): Frequency dependent regularization of a mixed-order ambisonics encoding system using psychoacoustically motivated metrics, at the AES 55th Conference, Helsinki, Finland.

Welti T. (2002): How many subwoofers are enough?, at the AES 112th Convention, Munich, Germany, Paper 5602.

Welti T. (2012): Optimal configurations for subwoofers in rooms considering seat to seat variation and low frequency efficiency, at the AES 133rd Convention, San Francisco, USA, Paper 8748.

Welti T., and Devantier A. (2006): Low-frequency optimization using multiple subwoofers, J. Audio Eng. Soc., 54(5), 347–364.

Wendt F., Frank M., and Zotter F. (2014): Panning with height on 2, 3, and 4 loudspeakers, at the 2nd International Conference on Spatial Audio, Erlangen, Germany.

Wenzel E.M. (1996): What perception implies about implementation of interactive virtual acoustic environments, AES 101st Convention, Los Angeles, CA, Paper 4353.

Wenzel E.M., Arruda M., Kistler D.J., et al. (1993): Localization using nonindividualized head-related transfer functions, J. Acoust. Soc. Am., 94(1), 111–123.

Wenzel E.M., Miller D.J., and Abel J.S. (2000): Sound lab: a real-time, software-based system for the study of spatial hearing, AES 108th Convention, Paris, France, Paper 5140.

White J.V. (1976): Synthesis of 4-2-4 matrix recording systems, J. Audio Eng. Soc., 24(4), 250–257.

Wierstorf H., Hohnerlein C., Spors S., et al. (2014): Coloration in wave field synthesis, at the AES 55th International Conference, Helsinki, Finland.

Wierstorf H., Raake A., Geier M., et al. (2013): Perception of focused sources in wave field synthesis, J. Audio Eng. Soc., 61(1/2), 5–16.

Wiggins B. (2007): The generation of panning laws for irregular speaker arrays using heuristic methods, at the AES 31st International Conference, London, UK.

Wightman F.L., and Kistler D.J. (1989a): Headphone simulation of free-field listening, I: stimulus synthesis, J. Acoust. Soc. Am., 85(2), 858–867.

Wightman F.L., and Kistler D.J. (1989b): Headphone simulation of free-field listening, II: psychophysical validation, J. Acoust. Soc. Am., 85(2), 868–878.

Wightman F.L., and Kistler D.J. (1992): The dominant role of low-frequency interaural time difference in sound localization, J. Acoust. Soc. Am., 91(3), 1648–1661.

Wightman F.L., and Kistler D.J. (1997): Monaural sound localization revisited, J. Acoust. Soc. Am., 101(2), 1050–1063.

Wightman F.L., and Kistler D.J. (1999): Resolution of front-back ambiguity in spatial hearing by listener and source movement, J. Acoust. Soc. Am., 105(5), 2841–2853.

Wightman F.L., and Kistler D.J. (2005): Measurement and validation of human HRTFs for use in hearing research, Acta Acust. United Ac., 91(3), 429–439.

Wightman F.L., Kistler D.J., and Arruda M. (1992): Perceptual consequences of engineering compromises in synthesis of virtual auditory objects, J. Acoust. Soc. Am., 92(4), 2332.

Williams E.G. (1999): Fourier acoustics, sound radiation and near-field acoustical holography, Academic Press, London, UK.

Williams M. (1987): Unified theory of microphone systems for stereophonic sound recording, at the AES 82nd Convention, London, UK, Paper 2466.

Williams M. (2002): Multichannel microphone array design, segment coverage analysis above and below the horizontal reference plane, at the AES 112th Convention, Munich, Germany, Paper 5567.

Williams M. (2003): Multichannel sound recording practice using microphone arrays, at the AES 24th International Conference, Banff, Canada.

Williams M. (2004): Multichannel sound recording using 3, 4 and 5 channel arrays for front sound stage coverage, at the AES 117th Convention, San Francisco, USA, Paper 6230.

Williams M. (2007): Magic arrays, multichannel microphone array design applied to multi-format compatibility, at the AES 122nd Convention, Vienna, Austria, Paper 7057.

Williams M. (2008): Migration of 5.0 multichannel microphone array design to higher order MMAD (6.0, 7.0 & 8.0) with or without the inter-format compatibility criteria, at the AES 124th Convention, Amsterdam, The Netherlands, Paper 7480.

Williams M. (2012): Microphone array design for localization with elevation cues, at the AES 132nd Convention, Budapest, Hungary, Paper 8601.

Williams M. (2013): Microphone arrays for stereo and multichannel sound recordings (Vols I and II), Editrice Il Rostro, Milano, Italy.

Williams M., and Du G.L. (1999): Microphone array analysis for multichannel sound recording, at the AES 107th Convention, New York, USA, Paper 4997.

Williams M., and Du G.L. (2000): Multichannel microphone array design, at the AES 108th Convention, Paris, France, Paper 5157.

Williams M., and Du G.L. (2001): The quick reference guide to multichannel microphone array, part I: using cardioid microphones, at the AES 110th Convention, Amsterdam, The Netherlands, Paper 5336.

Williams M., and Du G.L. (2004): The quick reference guide to multichannel microphone array, part II: using supercardioid and hypocardioid microphones, at the AES 116th Convention, Berlin, Germany, Paper 6059.

Wittek H. (2007): Perceptual differences between wavefield synthesis and stereophony, PhD thesis, University of Surrey, UK.

Wittek H., Rumsey F., and Theile G. (2007): Perceptual enhancement of wavefield synthesis by stereophonic means, J. Audio Eng. Soc., 55(9), 723–751.

Wittek H., and Theile G. (2002): The recording angle – based on localization curves, at the AES 112th Convention, Munich, Germany, Paper 5568.

Wittek H., and Theile G. (2017): Development and application of a stereophonic multichannel recording technique for 3D audio and VR, at the AES 143rd Convention, New York, USA, Paper 9869.

Woodward J.G. (1975a): NQRC measurement of subjective aspects of quadraphonic sound reproduction, part I, J. Audio Eng. Soc., 23(1), 2–13.

Woodward J.G. (1975b): NQRC measurement of subjective aspects of quadraphonic sound reproduction, part II, J. Audio Eng. Soc., 23(2), 128–130.

Woodward J.G. (1977): Quadraphony – a review, J. Audio Eng. Soc., 25(10/11), 843–854.

Woodworth R.S., and Schlosberg H. (1954): Experimental psychology (edited by Holt H.), New York, USA.

Woszczyk W., Beghin T., De Francisco M., et al. (2009): Recording multichannel sound within virtual acoustics, at the AES 127th Convention, New York, USA, Paper 7856.

Woszczyk W., Leonard B., and Ko D. (2010): Space builder: an impulse response-based tool for immersive 22.2 ambiance design, at the AES 40th International Conference, Tokyo, Japan.

Wu S.X., and Zhao Y.Z. (2003): Room and environmental acoustics (in Chinese), Guangdong Science and Technology Press, Guangzhou, China.

Wu Y.J., and Abhayapala T.D. (2009): Theory and design of sound field reproduction using continuous loudspeaker concept, IEEE Trans. Audio, Speech, Language Process., 17(1), 107–116.

Wu Y.J., and Abhayapala T.D. (2011): Spatial multizone soundfield reproduction, theory and design, IEEE Trans. Audio, Speech, Language Process., 19(6), 1711–1720.

Wu Z., Chan F.H.Y., Lam F.K., et al. (1997): A time domain binaural model based on spatial feature extraction for the head-related transfer function, J. Acoust. Soc. Am., 102(4), 2211–2218.

Wüstenhagen U., Feiten B., and Hoeg W. (1998): Subjective listening test of multichannel audio codecs, at the AES 105th Convention, San Francisco, USA, Paper 4813.

Xiang N., and Blauert J. (1991): A miniature dummy head for binaural evaluation of tenth-scale acoustic models, Appl. Acoust., 33(2), 123–140.

Xiang N., and Blauert J. (1993): Binaural scale modeling for auralisation and prediction of acoustics in auditoria, Appl. Acoust., 38(2/4), 267–290.

Xiang N., and Schroeder M.R. (2003): Reciprocal maximum-length sequence pairs for acoustical dual source measurements, J. Acoust. Soc. Am., 113(5), 2754–2761.

Xiang N., Trivedi U., and Xie B.S. (2019): Artificial enveloping reverberation for binaural auralization using reciprocal maximum-length sequences, J. Acoust. Soc. Am., 145(4), 2691–2702.

Xie B.S. (1992): Surround sound reproduction with N+1 channel cone array of loudspeakers (in Chinese), Audio Eng., 16(6), 2–6.

Xie B.S. (1995): The advance in stereo technology in recent year (in Chinese), Appl. Acoust., 14(4), 1–6.

Xie B.S. (1997): Analysis on a defect of the 5 channel 3/2 surround sound system (in Chinese), Appl. Acoust., 16(5), 1–7.

Xie B.S. (1998): Interchannel phase difference and stereo sound image localization, Chin. J. Acoust., 17(1), 85–93.

Xie B.S. (1999a): Problem with multichannel and virtual surround sound (in Chinese), Audio Eng., 23(5), 17–25.

Xie B.S. (1999b): Design consideration and quality assessment of multichannel surround sound (in Chinese), Audio Eng., 23(8), 12–15.

Xie B.S. (1999c): Cross-correlation analysis on stereophonic and surround sound image (in Chinese), J. Tongji Univ., 27(3), 361–365.

Xie B.S. (2001a): Signal mixing for a 5.1 channel surround sound system – analysis and experiment, J. Audio Eng. Soc., 49(4), 263–274.

Xie B.S. (2001b): 6.1 channel general planar surround sound system, Chin. J. Acoust., 20(2), 170–183.

Xie B.S. (2002a): Interchannel time difference and stereophonic sound image localization (in Chinese), Acta Acust., 27(4), 332–338.

Xie B.S. (2002b): Effect of head size on virtual sound image localization (in Chinese), Appl. Acoust., 21(5), 1–7.

Xie B.S. (2005): Rotation of head and stability of virtual sound image (in Chinese), Audio Eng., 29(6), 56–59.

Xie B.S. (2006a): The meaning of the phase character of HRTF and interaural time difference (in Chinese), Audio Eng., 30(11), 40–45.

Xie B.S. (2006b): Spatial interpolation of HRTFs and signal mixing for multichannel surround sound, Chin. J. Acoust., 25(4), 330–341.

Xie B.S. (2008a): Head-related transfer function and virtual auditory (1st edition, in Chinese), National Defense Industry Press, Beijing, China.

Xie B.S. (2008b): The applications of virtual auditory to virtual reality, communications and information systems (in Chinese), Audio Eng., 32(1), 70–75.

Xie B.S. (2008c): Principle, progress and problems of virtual auditory environment (in Chinese), Audio Eng., 32(11), 39–44.

Xie B.S. (2009a): Head-related transfer function and virtual auditory display (in Chinese), Sci. Sin.: Phys. Mech. Astron., 39(9), 1268–1285.

Xie B.S. (2009b): On the low frequency characteristics of head-related transfer function, Chin. J. Acoust., 28(2), 116–128.

Xie B.S. (2012): Recovery of individual head-related transfer functions from a small set of measurements, J. Acoust. Soc. Am., 132(1), 282–294.

Xie B.S. (2013): Head-related transfer function and virtual auditory display (2nd edition), J. Ross Publishing, USA.

Xie B.S. (2014): Head-related transfer functions of typical subjects from Chinese-based database, at the 21st International Congress on Sound and Vibration, Beijing, China.

Xie B.S. (2020): Spatial sound – history, principle, progress and challenge, Chin. J. Electronics, 29(3), 397–416.

Xie B.S., and Guan S.Q. (2002): Development and psychoacoustic principle of multichannel surround sound (in Chinese), Audio Eng., 26(2), 11–18.

Xie B.S., and Guan S.Q. (2004): Virtual sound and its application (in Chinese), Appl. Acoust., 23(4), 43–47.

Xie B.S., and Guan S.Q. (2012): Research and application of spatial sound – history, development and state of the art (in Chinese), Appl. Acoust., 31(1), 18–27.

Xie B.S., and Guo T.K. (2004): Analysis on stereophonic sound image for an off center listener (in Chinese), Acta Acust., 29(5), 445–452.

Xie B.S., and Liang S.J. (1995): The effect of frequency on image localization in the surround sound systems (in Chinese), Appl. Acoust., 14(3), 22–29.

Xie B.S., Liu L.L., and Jiang J.L. (2021b): Dynamic binaural Ambisonics scheme for rendering distance information of free-field virtual sources (in Chinese), Acta Acust., 46(6), 1223–1233.

Xie B.S., Liu L.L., and Zhang C.Y. (2021c): Virtual reproduction of surround sound in frontal space using four loudspeakers, Chin. J. Acoust., 40(2), 155–174.

Xie B.S., Mai H.M., Liu Y., et al. (2015b): Analysis on the timbre coloration of wave field synthesis using a binaural loudness model, at the AES 138th Convention, Warsaw, Poland, Paper 9320.

Xie B.S., Mai H.M., Rao D., et al. (2019): Analysis of and experiments on vertical summing localization of multichannel sound reproduction with amplitude panning, J. Audio Eng. Soc., 67(6), 382–399.

Xie B.S., Mai H.M., and Zhong X.L. (2017a): The median-plane summing localization in ambisonics reproduction, at the AES 142nd Convention, Berlin, Germany, Paper 9726.

Xie B.S., Mai H.M., and Zhong X.L. (2017b): Analysis on summing virtual source localization in different sagittal planes, at the Inter-noise 2017, Hong Kong, China.

Xie B.S., and Rao D. (2015): Analysis and experiment on summing localization of two loudspeakers in the median plane, at the AES 139th Convention, New York, USA, Paper 9452.

Xie B.S., Shi B., and Xiang N. (2012): Audio signal decorrelation based on reciprocal-maximal length sequence filters and its applications to spatial sound, at the AES 133rd Convention, San Francisco, USA, Paper 8805.

Xie B.S., Shi Y., Xie Z.W., et al. (2005c): Virtual reproduction system for 5.1 channel surround sound, Chin. J. Acoust., 24(1), 76–88.

Xie B.S., Shi Y., Xie Z.W., et al. (2005d): Two-loudspeaker virtual 5.1 channel surround sound signal processing method, China patent No.ZL02134416.7.

Xie B.S., and Tian Z.J. (2014): Improving binaural reproduction of 5.1 channel surround sound using individualized HRTF cluster in the wavelet domain, at the AES 55th International Conference, Helsinki, Finland.

Xie B.S., Wang J., and Guan S.Q. (2001): A simplified way to simulate 3D virtual sound image (in Chinese), Audio Eng., 25(7), 10–14.

Xie B.S., Wang J., Guan S.Q., et al. (2005a): Virtual reproduction of 5.1 channel surround sound by headphone, Chin. J. Acoust., 24(1), 63–75.

Xie B.S., Wang J., Guan S.Q., et al. (2005b): Headphone virtual 5.1 channel surround sound signal processing method, China patent No.ZL02134415.9.

Xie B.S., and Xie X.F. (1992b): The study of planar surround sound field (in Chinese), Acta Acust., 17(3), 225–231.

Xie B.S., and Xie X.F. (1996): Analyse and sound image localization experiment study on multi-channel planar surround sound system, Chin. J. Acoust., 15(1), 52–64.

Xie B.S., and Yu G.Z. (2021): Psychoacoustic principle, methods, and problems with perceived distance control in spatial audio, Appl. Sci., 11(23), 11242.

Xie B.S., and Zhang C.Y. (1999): A simple method for stereophonic image stage extension (in Chinese), Tech. Acoust., 18(supplement), 187–188.

Xie B.S., and Zhang C.Y. (2012): An algorithm for efficiently synthesizing multiple near-field virtual sources in dynamic virtual auditory display, at the AES 132nd Convention, Budapest, Hungary, Paper 8646.

Xie B.S., Zhang C.Y., and Zhong X.L. (2013c): A cluster and subjective selection-based HRTF customization scheme for improving binaural reproduction of 5.1 channel surround sound, at the AES 134th Convention, Rome, Italy, Paper 8879.

Xie B.S., Zhang L.S., Guan S.Q., et al. (2006): Simplification and subjective evaluation of filters for virtual sound using loudspeakers (in Chinese), Tech. Acoust., 25(6), 547–554.

Xie B.S., and Zhang T.T. (2010): The audibility of spectral detail of head-related transfer functions at high frequency, Acta Acust. United Ac., 96(2), 328–339.

Xie B.S., and Zhong X.L. (2012): Similarity and cluster analysis on magnitudes of individual head-related transfer functions (abstract), J. Acoust. Soc. Am., 131(4, Pt.2), 3305.

Xie B.S., Zhong X.L., and He N.N. (2015a): Typical data and cluster analysis on head-related transfer functions from Chinese subjects, Appl. Acoust., 94(1), 1–13.

Xie B.S., Zhong X.L., Rao D., et al. (2007): Head-related transfer function database and its analyses, Sci. China: Phys. Mech. Astron., 50(3), 267–280.

Xie B.S., Zhong X.L., Yu G.Z., et al. (2013b): Report on research projects on head-related transfer functions and virtual auditory displays in China, J. Audio Eng. Soc., 61(5), 314–326.

Xie X.F. (1964a): A novel pseudo-stereophonic sound system (in Chinese), at the 1st Acoustic meeting of China, Paper C3.3, Beijing, China.

Xie X.F. (1964b): Several simple circuits of pseudo-stereophonic sound, at the 1st Acoustic meeting of China, Paper C3.4, Beijing, China.

Xie X.F. (1964c): A pseudo-stereophonic sound system with single input and two outputs (in Chinese), J. South China Inst. Technol., 2(2), 62–73.

Xie X.F. (1977): The 4-3-4 (four-channel) stereophonic system (in Chinese), J. South China Inst. Technol., 5(1), 40–48.

Xie X.F. (1978a): Stereophonic sound, development and state of arts (in Chinese), Audio Eng., 2(3), 1–11.

Xie X.F. (1978b): 4-3-4 transformation and N(≥3) channel reproduction of panoramic (stereophonic) sound (in Chinese), J. South China Inst. Technol., 6(2), 54–70.

Xie X.F. (1981): The principle of stereo (in Chinese), Science Press, Beijing, China.

Xie X.F. (1982): The 4-3-N matrix multi-channel sound system, Chin. J. Acoust., 1(2), 201–218.

Xie X.F. (1987): The researches on stereophonic sound (in Chinese), South China Institute of Technology Press, Guangzhou, China.

Xie X.F. (1988): A mathematical analysis of three dimensional surrounding sound field (in Chinese), Acta Acust., 13(5), 321–328.

Xie X.F., and Xie B.S. (1992): Surround sound reproduction with folding loudspeaker arrangement (in Chinese), Appl. Acoust., 11(5), 5–9.

Yamamoto T. (1973): Quadraphonic one point pickup microphone, J. Audio Eng. Soc., 21(4), 256–261.

Yang J., and Gan W.S. (2008): Technique of creation of object-based radiation in sound field control and its applications, in Innovation and harmony – Progress on acoustics in China (edited by Cheng J.C. and Tian J.), Science Press, Beijing, China.

Yasuda Y., Ohya T., McGrath D., et al. (2003): 3-D audio communications services for future mobile networks, at the AES 23rd International Conference, Copenhagen, Denmark.

Yi K.L., and Xie B.S. (2020): Local Ambisonics panning method for creating virtual source in the vertical plane of the frontal hemisphere, Appl. Acoust., 165, 107319.

Yin T.C.T. (1994): Physiological correlates of the precedence effect and summing localization in the inferior colliculus of the cat, J. Neurosci., 14(9), 5170–5186.

Yoshikawa S., Noge S., and Funaki Y. (1993): Monitor levels and quality evaluation of HDTV 3-1 multichannel sound, at the AES 95th Convention, New York, USA, Paper 3723.

Yost W.A., and Sheft S. (1993): Auditory perception, in Human psychophysics (edited by Yost W.A., Popper A.N., Fay R.R.), Springer-Verlag, New York, USA.

Yu G.Z., Liu Y., and Xie B.S. (2018a): Design and validation on a multiple sound source fast-measurement system of near-field head-related transfer functions, Chin. J. Acoust., 37(2), 219–240.

Yu G.Z., Wu R.X., Liu Y., et al. (2018b): Near-field head-related transfer-function measurement and database of human subjects, J. Acoust. Soc. Am., 143(3), EL194–198.

Yu G.Z., and Xie B.S. (2007): Head-related transfer function for nearby sources and its applications (in Chinese), Audio Eng., 31(7), 45–50.

Yu G.Z., Xie B.S., Chen Z.W., et al. (2012a): Analysis on multiple scattering between the rigid-spherical microphone array and nearby surface in sound field recording, at the AES 133rd Convention, San Francisco, USA, Paper 8710.

Yu G.Z., Xie B.S., and Rao D. (2012b): Near-field head-related transfer functions of an artificial head and its characteristics (in Chinese), Acta Acust., 37(4), 378–385.

Zacharov N. (1998a): Subjective appraisal of loudspeaker directivity for multi-channel reproduction, J. Audio Eng. Soc., 46(4), 288–303.

Zacharov N. (1998b): An overview of multichannel level alignment, at the AES 15th International Conference, Copenhagen, Denmark.

Zacharov N., and Bech S. (2000): Multichannel level alignment, part IV: the correlation between physical measures and subjective level calibration, at the AES 109th Convention, Los Angeles, Paper 5241.

Zacharov N., Bech S., and Meares D. (1998a): The use of subwoofers in the context of surround sound program reproduction, J. Audio Eng. Soc., 46(4), 276–287.

Zacharov N., Bech S., and Suokuisma P. (1998b): Multichannel level alignment, part II: the influence of signals and loudspeaker placement, at the AES 105th Convention, San Francisco, USA, Paper 4816.

Zacharov N., and Koivuniemi K. (2001a): Unravelling the perception of spatial sound reproduction: technique and experimental design, at the AES 19th International Conference, Schloss Elmau, Germany.

Zacharov N., and Koivuniemi K. (2001b): Unravelling the perception of spatial sound reproduction: analysis & external preference mapping, at the AES 111th Convention, New York, USA, Paper 5423.

Zacharov N., and Koivuniemi K. (2001c): Unravelling the perception of spatial sound reproduction: language development, verbal protocol analysis and listener training, at the AES 111th Convention, New York, USA, Paper 5424.

Zacharov N., and Pedersen T.H. (2015): Spatial sound attributes – development of a common lexicon, at the AES 139th Convention, New York, USA, Paper 9436.

Zahorik P. (2002a): Assessing auditory distance perception using virtual acoustics, J. Acoust. Soc. Am., 111(4), 1832–1846.

Zahorik P. (2002b): Auditory display of sound source distance, in Proceedings of the 2002 International Conference on Auditory Display, Kyoto, Japan, 326–332.

Zahorik P., Brungart D.S., and Bronkhorst A.W. (2005): Auditory distance perception in humans: a summary of past and present research, Acta Acust. United Ac., 91(3), 409–420.

Zeng J.Y. (2007): Quantum mechanics (4th edition, Vol. II, in Chinese), Science Press, Beijing, China.

Zhang C.Y., and Xie B.S. (2013): Platform for dynamic virtual auditory environment real-time rendering system, Chin. Sci. Bull., 58(3), 316–327.

Zhang C.Y., and Xie B.S. (2014): Dynamic binaural reproduction of 5.1 channel surround sound with low cost head-tracking device, at the AES 55th International Conference, Helsinki, Finland.

Zhang C.Y., Xie B.S., and Xie Z.W. (2000): Elimination of effect of inside-the-head localization in sound reproduction by stereophonic earphone (in Chinese), Audio Eng., 24(8), 4–6.

Zhang C.Y., Xie B.S., and Yu G.Z. (2014): A scheme of stereophonic expansion for handheld sound reproduction devices (in Chinese), Appl. Acoust., 33(4), 324–329.

Zhang T., Zhang C.X., and Zhao X. (2016): Review of AVS audio coding standard, ZTE Commun., 14(2), 56–62.

Zhang W., Abhayapala T.D., Kennedy R.A., et al. (2010): Insights into head-related transfer function: spatial dimensionality and continuous representation, J. Acoust. Soc. Am., 127(4), 2347–2357.

Zhong X.L., and Xie B.S. (2004): Progress in the research of head-related transfer function (in Chinese), Audio Eng., 28(12), 44–46.

Zhong X.L., and Xie B.S. (2005): Spatial characteristics of head related transfer function, Chin. Phys. Lett., 22(5), 1166–1169.

Zhong X.L., and Xie B.S. (2007): Spatial symmetry of head-related transfer function, Chin. J. Acoust., 26(1), 73–84.

Zhong X.L., and Xie B.S. (2009): Maximal azimuthal resolution needed in measurements of head-related transfer functions, J. Acoust. Soc. Am., 125(4), 2209–2220.

Zhong X.L., and Xie B.S. (2012): Approximation of individualized head-related transfer function – state of the art and problems (in Chinese), Appl. Acoust., 31(6), 410–415.

Zhong X.L., Zhang F.C., and Xie B.S. (2013): On the spatial symmetry of head-related transfer functions, Appl. Acoust., 74(6), 856–864.

Zhu Y. (2000): Experimental psychology (in Chinese), Peking University Press, Beijing, China.

Zielinski S.K., Rumsey F., and Bech S. (2003): Effects of the down-mix algorithms on quality of surround sound, J. Audio Eng. Soc., 51(9), 780–798.

Zotkin D.N., Duraiswami R., and Davis L.S. (2004): Rendering localized spatial audio in a virtual auditory space, IEEE Trans. Multimedia, 6(4), 553–564.

Zotkin D.N., Duraiswami R., and Gumerov N.A. (2010): Plane-wave decomposition of acoustical scenes via spherical and cylindrical microphone arrays, IEEE Trans. Audio, Speech, Language Process., 18(1), 2–16.

Zotter F., and Frank M. (2012): All-round Ambisonic panning and decoding, J. Audio Eng. Soc., 60(10), 807–820.

Zotter F., and Frank M. (2019): Ambisonics: a practical 3D audio theory for recording, studio production, sound reinforcement, and virtual reality, Springer Open, Springer Nature, Cham, Switzerland.

Zotter F., Frank M., Kronlachner M., et al. (2014): Efficient phantom source widening and diffuseness in ambisonics, in the Proceedings of the EAA Joint Symposium on Auralization and Ambisonics, Berlin, Germany.

Zotter F., Pomberger H., and Noisternig M. (2012): Energy-preserving ambisonic decoding, Acta Acust. United Ac., 98(1), 37–47.

Zurek P.M. (1987): The precedence effect, in Directional hearing (edited by Yost W.A. and Gourevitch G.), Springer-Verlag, New York, USA.

Zwicker E., and Fastl H. (1999): Psychoacoustics: facts and models (2nd edition), Springer, Berlin, Germany.