Jump to content

Draft:Object-based audio

From Wikipedia, the free encyclopedia


Introduction

[edit]

Object-based audio (sometimes called immersive sound or 3D audio) is an advanced surround sound multimedia technology. Its key concept is immersing the listener in the multimedia experience with surround sound enhanced with an overhead speakers and precise sound direction metadata. The speakers are assessed around the subject and their exact location is inputted into a mixing software. Then a 3D matrix is created, where each sound is encoded with metadata as fluidly moving objects, that are seamlessly continuous.[1][2] This method enhances the listening experience with dynamic, immersive sound that surpasses traditional stereo, mono, and channel-based surround sound systems. Object-based audio has been in the consumer market since around 2012 when Dolby Atmos was introduced in Los Angeles for the first time with the movie Brave.[3] Many competitors created their versions in the following years, such as DTS, MPEG-H or Auro-3D. Today, we can find object-based audio in cinemas, home entertainment devices, gaming consoles, headphones, mobile phones, home cinemas, music, virtual reality AR, podcasts, and e-books.

History

[edit]

Object-based audio builds on top of popular surround sound technology. Even though the first immersive sound format was introduced in 2006 by Auro-3D, it added an overhead speaker but used channel-based approach.[4] The first object-based audio technology for the public was introduced in 2012 with the movie Brave from Disney and Pixar.[3] Other audio format manufacturers followed in the next years with their competing formats. MPEG-H from Fraunhofer was demonstrated for the first time in 2014 with the main focus on broadcasting[5], DTS launched its object-based audio format DTS:X in cinemas in 2015[6] with the movie Monk comes down a mountain[7]. Auro-3D later presented its version AuroMax. The first consumer object-based audiovisual receiver was unveiled in 2015, created by Sony with enabled Dolby Atmos. Later on, in the same year, the first soundbar with Dolby Atmos was created by Yamaha[8]. In 2017, MPEG-H format was adapted into ATSC 3.0television broadcasting in South Korea and in 2019 to existing HDTV services in Brazil.[5] DTS furthermore expanded their line-up of products with DTS:X Pro in 2019[9]

Technology

[edit]
5.1 Surround sound - Channel-based audio

Object-based audio is a modern audio technology, offering a truly immersive experience. It adds a depth layer to a widely popular surround sound, that has certain limitations. A surround sound audio technique uses multiple speakers to enhance played digital multimedia content with improving spatial sound accuracy over regular mono or stereo playback. Traditional and the most common surround sound setup is called channel-based audio. This setup has a precise location for each speaker surrounding the listener. Channel-based audio (CBA) typically consists of 5 speakers with a subwoofer (5.1) or seven outputs with a subwoofer (7.1), where CBA 5.1. setup has a subwoofer, a center speaker, two rear and two back speakers. CBA 7.1. adds two speakers, one speaker on each side from the listener respectively.[2][10][11]

However channel-based audio has its limitations. The main flaw is audio fixed to each channel or speaker. Even though the audio has been accurately mastered, the speakers might not be distributed around the room properly, which leads to inconsistency, and the listening experience will not be as intended by the creator.

Object-based audio creates a solution for this problem. It encodes the sound as dynamic objects with positional metadata containing a particular location and movement of the object. The renderer renders the audio content precisely, as the artist intended. Any object-based audio contains two types of audio content: Beds and Objects. Beds are channels from channel-based audio, where static audio comes from, typically dialogue, non-diegetic music, or ambiance. Objects use a Cartesian coordinate system (x,y,z) for placing audio tracks in the 3D matrix. Metadata then plays the desired dynamic audio relative to the listener and the speaker setup. They contain information about the audio file such as information about the file, location of the sound in the matrix, levels, panning, filters, fading, and much more. Object-based audio also enables interactive and personalised audio experience for each viewer in comfort of their home.[12][13]

Use cases/applications

[edit]

Cinema

[edit]

Object-based audio is a technique in cinema sound design, enhancing the audio experience. Formats such as Dolby Atmos, MPEG-H or DTS:X use this technique to immerse the audience and to precisely reproduce the intended sound by the director and sound designer. Sounds are not limited to speakers like channel-based audio, they are joined together in a technique called Vector based amplitude panning. It positions virtual sources of sound into a 3D space, acting as an object. Then forms a triangle from the three closest speakers around the virtual object and adjusts the volume relative to its location in the 3D space. Object-based audio also uses Wave Field Synthesis and High Order Ambisonics (HOA). This enables the sound engineer to place the objects above, next, or underneath the viewers, creating an illusion of the exact location of each subject. [14][13]

Television, broadcast

[edit]

After the breakthrough of the object-based audio market in commercial cinemas, it shifted to consumer-oriented technology and content.  The leader in broadcasting-oriented object-based audio is the MPEG-H format invented by the Fraunhofer Institute[5]. Although many consumers do not own the technology for reproducing this audio format yet, many audiovisual manufacturers include it in their hardware. MPEG-H allows the viewer to experience immersive and personalised broadcasting thanks to the ability to change volume presets, adjust dialogue volume, and more.[15] Consumers can also access object-based audio in VOD applications, for example Netflix, Disney+, Max.[8] Most of movies created after 2015 are mixed in object-based audio such as Dolby Atmos or DTS:X. Also, many older movies are remixed in newer formats, for example Jaws Remastered.[16]

Music

[edit]

Object-based audio is another tool for music producers to create unique sounding content. It adapts the concepts of object-based audio and applies them to music, enabling producers to immerse the listening audience. Many streaming music providers offer object-based audio music, for example Apple Music, Amazon Music, or Tidal. Music can be rendered from playback or live musical performance. Either listening to music on speakers, using technology like in cinema/home cinema, or with headphones.[17] Headphones have two output sources, and have limitations. Therefore a new method to achieve the required 3D effect had to be developed. This method is called binaural audio, which achieves 3D sound effects by simulating spatial differences between two channels to mimic real-life sound directions.[10]

Types

[edit]
Dolby Atmos logo

Dolby Atmos

[edit]

Dolby Atmos has been the market leader in immersive sound technologies since its unveiling in 2012. This technology is being used in around 6100 cinemas worldwide, with more than 6000 movies mixed in Dolby Atmos format. Dolby Atmos builds on Dolby Surround 7.1, expanding it to a 7.1.2 three-dimensional format. Upgrading from Dolby's AC-3 format to AC-4. Dolby Atmos supports up to 118 objects and 10 sound beds (channels) for the creative freedom of artists. It is available for theatres, home cinemas, soundbars, cars, and phone devices.[10][18]

DTS:X logo

DTS:X

[edit]

The main competitor of Dolby Atmos is DTS:X created by DTS in 2015. A key component of DTS:X is MDA, a free, open platform license for immersive audio creation.[19] In comparison, Dolby Atmos had a headstart with disc media (DVD, Blu-Ray) and until 2023 DTS:X was not available on streaming services. Disney+ was the first to offer streaming content with DTS:X with the addition of IMAX Enhanced movies.[8] The cinema version has a limitation of 64 uniquely rendered channels, and the home theatre version has a limit of 11.1 channels.[20]

MPEG-H

[edit]

Created by the Moving Pictures Expert Group, MPEG-H is a standard for 3D sound. Use cases are mainly in broadcasting, but also in streaming and VR/AR. It enables personalised audio: adjusting the volume for dialogues and switching between many audio channels with different languages.[21][15]

Conclusion

[edit]

Object-based audio is positioned to play a significant role in the future of immersive audio in home entertainment, cinema, gaming and music.[13]

References

[edit]
  1. ^ Shepard, Jeff (2023-11-29). "What is the difference between channel-based audio and object-based audio?". Analog IC Tips. Retrieved 2024-11-10.
  2. ^ a b Santos, Iolanda. "What is Object-based Audio?". blog.soundparticles.com. Retrieved 2024-11-10.
  3. ^ a b "World Premiere of Disney•Pixar's "Brave" Heads to Hollywood for Grand Opening of the Dolby Theatre". Film Independent. Retrieved 2024-11-10.
  4. ^ Hiddenwires. "History of 3D Audio Innovation Defines Auro Technologies". www.hiddenwires.co.uk. Retrieved 2024-11-10.
  5. ^ a b c "Why Is MPEG-H Audio Gaining Momentum?". 2023-08-07. Retrieved 2024-11-10.
  6. ^ 5ivecanons (2015-04-09). "Welcome To DTS:X - Open, Immersive And Flexible Object-Based Audio Coming To Cinema And Home". DTS. Retrieved 2024-11-10.{{cite web}}: CS1 maint: numeric names: authors list (link)
  7. ^ 5ivecanons (2015-07-10). "First Theatrical Release Featuring DTS:X Audio Premieres". DTS. Retrieved 2024-11-10.{{cite web}}: CS1 maint: numeric names: authors list (link)
  8. ^ a b c "Ultimate surround sound guide: DTS, Dolby Atmos, and more explained". Digital Trends. 2020-03-31. Retrieved 2024-11-10.
  9. ^ "Trinnov Set to Demo 11.4.6 DTS:X Pro Immersive Audio System at CEDIA 2019 | High-Def Digest". www.highdefdigest.com. Retrieved 2024-11-10.
  10. ^ a b c "spatial overview". ARVUS.COM. Retrieved 2024-11-10.
  11. ^ "2.1, 5.1, 7.1.2—What Do Surround Sound Numbers Mean?". Ultimea. 2022-11-25. Retrieved 2024-11-10.
  12. ^ "AURO-CX" (PDF). 2023-07-14.
  13. ^ a b c Bleidt, Robert. Retrieved 2020. FRAUNHOFER EXPLAINS: MPEG-H IMMERSIVE SOUND FOR BROADCAST, STREAMING, AND MUSIC.https://www.iis.fraunhofer.de/content/dam/iis/de/doc/ame/AES-LA-Webinar-April2020-release.pdf
  14. ^ Pulkki, Ville; Lokki, Tapio (2024-10-11). Creating Auditory Displays with Multiple Loudspeakers Using VBAP: A Case Study with DIVA Project. https://www.icad.org/websiteV2.0/Conferences/ICAD98/papers/PULKKI.PDF
  15. ^ a b "MPEG-H Audio". Fraunhofer Institute for Integrated Circuits IIS. Retrieved 2024-11-10.
  16. ^ Jaws 4K Blu-ray (45th Anniversary Edition). Retrieved 2024-11-10 – via www.blu-ray.com.
  17. ^ "What Is Immersive Audio?: How Engineers, Artists & Industry Are Changing The State Of Sound | GRAMMY.com". grammy.com. Retrieved 2024-11-10.
  18. ^ Svetlik, Becky RobertsContributions from Joe; updated, Tom Parsons last (2022-02-09). "Dolby Atmos: what is it? How can you get it?". whathifi. Retrieved 2024-11-10.
  19. ^ 5ivecanons (2015-04-09). "Welcome To DTS:X - Open, Immersive And Flexible Object-Based Audio Coming To Cinema And Home". DTS. Retrieved 2024-11-10.{{cite web}}: CS1 maint: numeric names: authors list (link)
  20. ^ "Trinnov Set to Demo 11.4.6 DTS:X Pro Immersive Audio System at CEDIA 2019 | High-Def Digest". www.highdefdigest.com. Retrieved 2024-11-10.
  21. ^ "What is MPEG-H? The burgeoning 3D audio standard explained". Digital Trends. 2022-10-18. Retrieved 2024-11-10.