overview of mpeg work for immersive media · internet of media things descriptors for video...

21

Upload: others

Post on 29-Jul-2020

7 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media
Page 2: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

• Overview of MPEG work for Immersive Media • Rob Koenen, Co-Founder and CBO Tiledmedia

• Co-Founder and first President of VRIF

• Zooming in on Point Cloud Compression • Danillo Graziosi

• Manager of Next-Generation Codec at US Research Center, Sony Corporation of America

Page 3: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

1990 2000 20101995 2005 2015

MP3MPEG-2 Video & Systems AVC HEVC

MP4 FF MMTDASH

AAC

InternetAudio

Digital Television

Music Distribution

Media Storage and Streaming

UHD & ImmersiveServices

New Forms of Digital Television

Unified MediaStreaming

OFF

HD DistributionMobile Video

Custom Fonts on Web & DigiPub

MPEG-4 Video

CMAF

3D Audio

2020

Major MPEG Standards (For Reference)

Page 4: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

Video with 6 DoF

Geometry Point Cloud Compression

OMAF v.2

Jan 2018 20202019 2021

Internet of Media Things

Descriptors for Video Analysis (CDVA)

6 DoF Audio

Video Point Cloud Compression

MediaCoding

2022

Immersive Media Scene Description Interface

Versatile Video Coding

2023

Systems and Tools

Web Resource Tracks

Dense Representation of Light Field Video

3DoF+ Video

Neural Network Compression for Multimedia

Essential Video Coding

Low Complexity Enhancement Video Coding

PCC Systems Support

Media Orchestration

Network-Based Media Processing

BeyondMedia

Genome Compression ExtensionsGenome Compression

Point Cloud Compression v.2

CMAF v.2Multi-Image Application Format

Jan 2024

Color Support in Open Font Format

Partial File Format

Page 5: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

Video with 6 DoF

Geometry Point Cloud Compression

OMAF v.2

Jan 2018 20202019 2021

Internet of Media Things

Descriptors for Video Analysis (CDVA)

6 DoF Audio

Video Point Cloud Compression

MediaCoding

2022

Immersive Media Scene Description Interface

Versatile Video Coding

2023

Systems and Tools

Web Resource Tracks

Dense Representation of Light Field Video

3DoF+ Video

Neural Network Compression for Multimedia

Essential Video Coding

Low Complexity Enhancement Video Coding

PCC Systems Support

Media Orchestration

Network-Based Media Processing

BeyondMedia

Genome Compression ExtensionsGenome Compression

Point Cloud Compression v.2

CMAF v.2Multi-Image Application Format

Jan 2024

Color Support in Open Font Format

Partial File Format

VR360 (3 DoF/+)

Immersive Media with 6 Degrees of Freedom

Combining Natural and Synthetic content

Page 6: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

Video with 6 DoF

Geometry Point Cloud Compression

OMAF v.2

Jan 2018 20202019 2021

Internet of Media Things

Descriptors for Video Analysis (CDVA)

6 DoF Audio

Video Point Cloud Compression

MediaCoding

2022

Immersive Media Scene Description Interface

Versatile Video Coding

2023

Systems and Tools

Web Resource Tracks

Dense Representation of Light Field Video

3DoF+ Video

Neural Network Compression for Multimedia

Essential Video Coding

Low Complexity Enhancement Video Coding

PCC Systems Support

Media Orchestration

Network-Based Media Processing

BeyondMedia

Genome Compression ExtensionsGenome Compression

Point Cloud Compression v.2

CMAF v.2Multi-Image Application Format

Jan 2024

Color Support in Open Font Format

Partial File Format

Media Coding & Systems for 5G

Page 7: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

Coded Representation of Immersive Media

11 Parts (and Counting …)1. Architectures for Immersive Media (Technical Report)2. Omnidirectional Media Application Format3. Versatile Video Coding4. 6 Degrees of Freedom Audio (name t.b.d.)5. Video-Based Point Cloud Coding (V-PCC)6. Immersive Services and Applications7. Immersive Media Metadata8. Network-Based Media Processing9. Geometry Point Cloud Coding (G-PCC)10. Carriage of Point Cloud Data11. Implementation Guidelines for Network-based Media Processing

Page 8: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media
Page 9: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

• Encapsulation in ISO Base Media File Format • Adding timed text

• Transport • DASH and MMT

• Viewport-independent (or -agnostic) streaming • just send everything, no matter where the viewer looks

• Viewport-dependent streaming• Send viewport with better quality

Page 10: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

• 3 DoF+: allowing a bit of lateral head motion

• Interactivity and Overlays

• Better viewport-adaptive streaming• “Late Binding” – allows for more

flexibility and more sophisticated clients

Page 11: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

• Next Generation Video Coding Standard

• Ready mid-2020

• 40% - 50% better than HEVC

• Useful for ever-larger media, including point clouds

Page 12: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

• Streaming Audio Objects + Model ofEnvironment

• Goal: Define Immersive Rendering of Audio

• Thanks to Philips for graphic

Page 13: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

• 5. Video-based• Dense / continuous point

clouds

• See Danillo’s Talk!

• 9. Geometry-based• Sparse point clouds (e.g., Lidar

scans)

• 10:1 for lossless 30:1 acceptable lossy

Page 14: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

• 5. Video-based• Dense / continuous point

clouds

• See Danillo’s Talk!

• 9. Geometry-based• Sparse point clouds (e.g., Lidar

scans)

• 10:1 for lossless 30:1 acceptable lossy

Page 15: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

• Using the network and the edge to support media processing• Network-based, last-second media personalization• Video Stitching• AR rendering• Viewport extraction and encoding• Social VR Support

• Functions that could be carried out in a 5G Edge server

• Example: SK Telecom’s 5G AR (23 March 19, Korean Baseball Game) • https://m.sports.naver.com/video.nhn?id=523565

Page 16: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

• “Exploration” – Design Considerations

• Larger media sizes lead to smaller access units!

• Partial retrieval of partially visible and audible scenes• Requires ultra-fast network response

• Edge Rendering, or Split Rendering• Heavy lifting in network, save on processing in mobile devices

• Very sophisticated media with reasonable device complexity and power use

• New Decoder Architectures for Immersive Media• VR, Point Clouds, Hybrid Scenes

Page 17: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media
Page 18: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media
Page 19: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media

• Also “Exploration”

• Considering Existing Scene Description Formats

• Ideally merge these with timed media• Merging Web and AV Distribution paradigms proves hard

Page 20: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media
Page 21: Overview of MPEG work for Immersive Media · Internet of Media Things Descriptors for Video Analysis (CDVA) 6 DoF Audio Video Point Cloud Compression Media Coding 2022 Immersive Media