how to find future audio formats?

15
© Fraunhofer IIS VDT-Symposium, 2009 Hohenkammer How to Find Future Audio Formats? Andreas Silzle, Thomas Bachmann Fraunhofer-Institut für Integrierte Schaltungen IIS [email protected]

Upload: others

Post on 11-Feb-2022

5 views

Category:

Documents


0 download

TRANSCRIPT

© Fraunhofer IIS

VDT-Symposium, 2009 Hohenkammer

How to Find Future Audio Formats?

Andreas Silzle, Thomas BachmannFraunhofer-Institut für Integrierte Schaltungen IIS

[email protected]

© Fraunhofer IIS

How to Find Future Audio Formats?

1.

General classification of audio reproduction systems

2.

What is important for future audio formats?

3.

Why height sound-reproduction?

4.

Questionnaire for a future audio format

5.

Taxonomy for audio formats

Content

© Fraunhofer IIS

1.

General Classification of Audio Reproduction Systems

Reproduction systemsPhysically based

Wave-Field Synthesis (<2 kHz for a loudspeaker distance of 17 cm, microphone array, only 2D reproduction) [Berkhout 1988, Fraunhofer IDMT 2009]

Perceptually based

Mono

2 channel Stereo

5.1 Format [Rumsey 2001]

Both categories (dependent on number of loudspeakers)

Ambisonics [Gerzon 1973]

© Fraunhofer IIS

1.

Transport Format versus Reproduction System

Introduction Transport Format Reproduction

System

~1923 Mono Mono

~1961 2 ch. Stereo 2 ch. Stereo

19935.1 PCM, MPEG Layer II, AAC, Dolby Digital, DTS

5.1 Multichannel [ITU-R BS.775 1994]

2007 MPEG Surround

(32 ch.)Dolby 7.1DTS 7.1TH 10.2NHK 22.2?

2010 MPEG SAOC (Spatial

Audio Object

Coding)

? ?

© Fraunhofer IIS

2.

What is Important for Future Audio Formats or Systems?

What is present in current systems?

Speaker/instrument recognition (timbre)Room recognition (reverberance)Localization information in StereoEnvelopment in 5.1 (in 2D)

What should be present in future systems?

Localization information in 3DRoom perception in 3D?

What is not subject of this investigation

Meta data [Fraunhofer IDMT 2009b]Interactivity, e.g. [Silzle 2004]Multi-modal systems, e.g. [Altinsoy 2006]Bit resolution, sampling frequencyNHK 22.2 [Hamasaki

2004]

© Fraunhofer IIS

2.

Current Attempts for Future Audio Systems with Height Information

Ambisonics (Gerzon, 1973) [Ambisonic 2007]

Ambiophonics (Ralph Glasgal, 1995) [Glasgal 1995]

10.2 System (Tomlinson Holman, 2000) [Holman 2000]

2+2+2 System (Dabringhaus, 2001) [Dabringhaus 2009]

22.2 System (NHK, 2004) [Hamasaki 2004]

DTS-HD, 7.1 (DTS, 2006) [DTS 2009]

Dolby Pro Logic IIz, 9.1 (Dolby Laboratories, 2009) [Dolby 2009]

© Fraunhofer IIS

3.

Why Do We Need Height Sound-Reproduction?

Direct sound sources from above(static or slow moving)

Singing bird

God’s channel (speaker from above)

?

Screen related sound sources from different heights

Recording room reflections from above

Height is the last missing dimension in reproduction

© Fraunhofer IIS

3.

What Do We Like to Achieve with Height Sound- Reproduction?

Same step size of improvement in Overall Quality like Mono – Stereo – 5.1 – n.m

Precise localization of height from the front and abovePlausible auditory width or spaciousness for elevated auditory events

Plausible reverberance (room perception)

?

Constraints

Number of reproduction loudspeaker n between 5 and 22System should work with different numbers of loudspeakers

© Fraunhofer IIS

4. Questionnaire for a Future Audio Format

1.

For which applications is a new audio format with height reproduction

1.1 Radio

1.2 TV

1.3 Cinema

1.4 Gaming consoles

1.5 Internet

1.6 Car

1.7 …

Scale 0 –

not necessary

1 –

less important

2 –

important

3 –

very important2.

Number of loudspeakers and positions (azimuth and elevation in degree)3.

What else is important?

Thank you very much for participation!

© Fraunhofer IIS

Qua

Timbre

Adapted

from

[Silzle

2007]

© Fraunhofer IIS

5. Benefits from the Taxonomy

Necessity to collect the influencing factors

Overview of influencing factors

Structure of the whole problem

Multivariate optimization problem

Possibility to quantify the problem

© Fraunhofer IIS [Silzle

2009]

© Fraunhofer IIS

6. Summary

General classification: Physically and perceptually based reproduction system

What is important: Last missing dimension – height

Questionnaire for a future audio format

Taxonomy for audio formats to get an overview

© Fraunhofer IIS

Thank you very much for your attention!

Please return the filled questionnaire.

Any questions?

© Fraunhofer IIS

7. Literature

[Berkhout

1988] Berkhout, A. J. (1988). "A Holographic

Approach to Acoustic

Control." J. Audio Eng. Soc. 36(12): 977-995.[Fraunhofer IDMT 2009] http://www.idmt.fraunhofer.de/de/presse_medien/download/produktinformation/09_wfs_de.pdf[Rumsey

2001] Rumsey, F. (2001). Spatial

Audio. Oxford, Focal

Press.[Gerzon

1973] Gerzon, M. A. (1973). "Perophony: With-Height

Sound Reproduction." J. Audio Eng. Soc. 21(1): 3-10.[ITU-R BS.775 1994] ITU-R Recommendation-BS.775-1 (1994). Multichannel

stereophonic

sound

system

with

and without

accompanying

picture. Geneva, Suisse, Intern. Telecom Union.[Fraunhofer IDMT 2009b] http://www.idmt.fraunhofer.de/eng/business%20areas/ba_metadata.htm[Silzle

2004] Silzle, A., P. Novo, et al. (2004). IKA-SIM: A system

to generate

auditory

virtual

environments. 116th AES Convention, Berlin, Germany.

[Altinsoy

2006] Altinsoy, M. E. (2006). Auditory-Tactile

Interaction in Virtual

Environments. (Doctoral

dissertation). Institut für Kommunikationsakustik, Ruhr-Universität Bochum.

[Hamasaki

2004] Kimio

Hamasaki, S. K., Hiroyuki

Okubo, Koichiro

Hiyama, and Wataru

Hatano

(2004). 5.1 and 22.2 Multichannel

Sound Productions

Using

an Integrated

Surround

Sound Panning

System. 117th AES Convention, San Francisco, USA.

[Ambisonic

2007] http://www.ambisonic.net/[Holman

2000] Holman, T. (2000). 5.1 Surround

Sound Up and Running, Focal

Press.[Glasgal

1995] Glasgal, R. and K. Yates (1995). Ambiophonics

-

Beyond

Surround

Sound to Virtual

Sonic

Reality, Ambiophonics

Institute.

[Darbringhaus 2009] http://www.mdg.de/frame2.htm[DTS 2009] http://de.wikipedia.org/wiki/Digital_Theater_Systems_High_Definition[Dolby 2009] http://www.dolby.com/consumer/technology/prologic-IIz.html[Silzle

2007] Silzle, A. (2007). Generation of quality

taxonomies

for

auditory

virtual

environments

by

means

of systematic

expert

surveys. (Doctoral

dissertation). Institut für Kommunikationsakustik, Ruhr-Universität Bochum.[Silzle

2009] Silzle, A., S. Geyersberger, et al. (2009). Vision and Technique

behind

the

New Studios and Listening

Rooms

of the

Fraunhofer IIS Audio Laboratory. 126th AES Convention, Munich, Germany.