3-d direction aligned wavelet transform for scalable video coding

20
3-D Direction Aligned Wavelet Transform for Scalable Video Coding Yu Liu 1 , King Ngi Ngan 1 , and Feng Wu 2 1 Department of Electronic Engineering The Chinese University of Hong Kong 2 Internet Media Group, Microsoft Research Asia, Beijing, China ISCAS2008, Seattle, USA, May 18-21, 2008 Y. Liu, K.N. Ngan and F. Wu 1 3-D Direction Aligned Wavelet Transform for SVC

Upload: adanna

Post on 08-Jan-2016

54 views

Category:

Documents


0 download

DESCRIPTION

3-D Direction Aligned Wavelet Transform for Scalable Video Coding. Yu Liu 1 , King Ngi Ngan 1 , and Feng Wu 2 1 Department of Electronic Engineering The Chinese University of Hong Kong 2 Internet Media Group, Microsoft Research Asia, Beijing, China ISCAS2008, Seattle, USA, May 18-21, 2008. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

3-D Direction Aligned Wavelet Transform for Scalable Video Coding

Yu Liu1, King Ngi Ngan1, and Feng Wu2

1Department of Electronic Engineering The Chinese University of Hong Kong

2Internet Media Group, Microsoft Research Asia, Beijing, China

ISCAS2008, Seattle, USA, May 18-21, 2008

Y. Liu, K.N. Ngan and F. Wu 13-D Direction Aligned Wavelet Transform for SVC

Page 2: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• Introduction• 3-D Directional Threading• 3-D Extension of Weighted Adaptive Lifting• Experimental Results• Conclusion

Y. Liu, K.N. Ngan and F. Wu 23-D Direction Aligned Wavelet Transform for SVC

Outline

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 3: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• 3-D Wavelet-based Scalable Video Coding– full spatio-temporal-quality scalability– non-redundant 3-D subband decomposition– comparable with H.264-based JSVM scheme

• In temporal domain– Motion Aligned Temporal Filtering (MATF)

• motion compensation is incorporated into temporal wavelet transform

• In spatial domain– Conventional 2-D lifting-based wavelet transform

• uses the elements in neighbor horizontal or vertical direction

– However, richly directional attributes in natural image/video• such as linear edges, in neither horizontal nor vertical direction

Y. Liu, K.N. Ngan and F. Wu 33-D Direction Aligned Wavelet Transform for SVC

Introduction

3-D Wavelet–based Scalable Video CodingDirectionally Spatial Wavelet Transform for Image Coding

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 4: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• 2-D DWT with directionally spatial prediction for image coding– Adaptive Directional Lifting (ADL)-based DWT [Ding2007]

– Direction-Adaptive (DA) DWT [Chang2007]

– Weighted Adaptive Lifting (WAL)-based DWT [Liu2007a]

• There is no literature incorporating directionally spatial wavelet transform into the framework of video coding, not to mention scalable video coding

Y. Liu, K.N. Ngan and F. Wu 43-D Direction Aligned Wavelet Transform for SVC

Introduction

3-D Wavelet–based Scalable Video CodingDirectionally Spatial Wavelet Transform for Image Coding

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 5: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• Temporal Motion Threading (MTh) [Liu2007b]– an efficient implementation of Motion Aligned Temporal Filtering (MATF)

– Direction Aligned Spatial Filtering (DASF) vs. MATF• by aligning the direction of the spatial wavelet filtering to the direction of the edges• 2-D Spatial Directional Threading vs. Temporal Motion Threading

– two separable 1-D threading:» horizontal directional threading » vertical directional threading

Y. Liu, K.N. Ngan and F. Wu 53-D Direction Aligned Wavelet Transform for SVC

3-D Directional Threading

Temporal Motion Threading 3-D Direction Coordinate SystemGeneralized Separable 3-D Directional Threading

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

VOP 0 VOP1 VOP 2 … ... ... VOP n-1 VOP n

L0 L1 L2 L3

H0 H1 H2 H3

many-to-one mapping

non-referred pixel

normal pixel

terminatingpixel

ME ME ME ME

Page 6: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• 3-D Direction Coordinate System– 3-D direction coordinate system in a unified framework

• where x, y, and z denote the horizontal, vertical, and temporal direction, respectively

– 3-D direction vector, dv={dx,dy,dz}• dz =-1, dz = 1, and dz = 0 indicate that the current block is forward, backward and

not temporal direction compensated, respectively.• {dx, dy} denote displacements in horizontal and vertical direction

Y. Liu, K.N. Ngan and F. Wu 63-D Direction Aligned Wavelet Transform for SVC

3-D Directional Threading

Temporal Motion Threading 3-D Direction Coordinate SystemGeneralized Separable 3-D Directional Threading

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

z

y

x

Page 7: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• Generalized separable 3-D directional threading– to unify the concepts of temporal motion threading and 2-D spatial

directional threading

– in each direction axis, pixels along the same directional trajectory are linked to form a directional thread according to direction vectors of blocks they belong to.

Y. Liu, K.N. Ngan and F. Wu 73-D Direction Aligned Wavelet Transform for SVC

3-D Directional Threading

Temporal Motion Threading 3-D Direction Coordinate SystemGeneralized Separable 3-D Directional Threading

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

1 2 … ... ... 2n 2n+1

H0 ... ... Hn

DE DE DEDE DE DE DE

Frame/Row/Column

0

L0 L1 ... Ln

Page 8: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

3-D Extension of Weighted Adaptive Lifting

• Original Weighted Adaptive Lifting (WAL) [Liu2007a]– Weighted Function

• Integer Pixel precision:

• Sub-Pixel precision:

where is the coefficient factor of a certain interpolation filter.

• Weighted Lifting

8

)1(;1 :constraint under the,],[)( ,,,, k kiik kikiekiei wWnnmmxwxf

)4(,],[],[],[

)3(,],[],[],[0

1 ,,,

1

0 ,,,

j l ljljeljje

i k kikiekiio

nnmmxwunmxnmc

nnmmxwpnmxnmd

)0,1(],,[)( ,, akiakieei wwanimxxf

)(,],[)( ,,,,, kikik kikiekiei wnnmmxxf ki ,

)2(

xn-1

xn

xn+1

(b)(a)

xn-1

xn

xn+1

ColumnRow /

2m 2m+1 2m+2

ColumnRow /

2m 2m+1 2m+2

xn

xn+1

(d) (e)

xn

xn+1

ColumnRow /

2m 2m+1 2m+2

ColumnRow /

2m 2m+1 2m+2

xn-1 xn-1

(c)

xn-1

xn

xn+1

ColumnRow /

2m 2m+1 2m+2

Y. Liu, K.N. Ngan and F. Wu 3-D Direction Aligned Wavelet Transform for SVC

Original Weighted Adaptive LiftingImproved Weighted Lifting for 3-D TransformsDirectional Adaptive Interpolation for 3-D Transforms3-D WAL-based Direction Aligned Wavelet Transform

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 9: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

ColumnRow /

2m 2m+1 2m+22m-12m-2 2m+3 2m+4

xn-1

xn

xn+1

xn-3

xn-2

xn+2

xn+3

h2

h1

h0

h-1

h-2

h-3

ColumnRow /

2m 2m+1 2m+22m-12m-2 2m+3 2m+4

xn-1

xn

xn+1

xn-3

xn-2

xn+2

xn+3

h2h1

h0

h-1

h-2

h-3

3-D Extension of Weighted Adaptive Lifting

• Original Weighted Adaptive Lifting (WAL)– Directional Interpolation

– Adaptive Interpolation Filter

• To find the optimal filter,minimize the energy of the high subband

by using the Wiener-Hopf equation

9

)5(])],[],[[(minarg 21

0 ,,,

i k kikiekiiol nnmmxwpnmxEw

ColumnRow /

2m 2m+1 2m+22m-12m-2 2m+3 2m+4

xn-1

xn

xn+1

xn-3

xn-2

xn+2

xn+3

h2

h1

h0

h-1

h-2

h-3

Telenor 4-tap

bilinear2-tap

(h-2, h-1, h0, h1)

(h-3, h2)

Y. Liu, K.N. Ngan and F. Wu 3-D Direction Aligned Wavelet Transform for SVC

Original Weighted Adaptive LiftingImproved Weighted Lifting for 3-D TransformsDirectional Adaptive Interpolation for 3-D Transforms3-D WAL-based Direction Aligned Wavelet Transform

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 10: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• Improved Weighted Lifting for 3-D Transforms– Weighted Lifting

• works well for single lifting stage, such as (6,6), 5/3-tap filters– not only in spatial transform [Chang2007,Liu2007a]– but also in temporal transform [Xiong2004]

• over/under-weighted update problems for multiple lifting stages, such as 9/7-tap– reason: update equation doesn’t fulfill the constraint condition in Eq.(1)

– Solutions to over/under-weighted update problems

• Case 1:

• Case 2:

• Case 3:

Y. Liu, K.N. Ngan and F. Wu 103-D Direction Aligned Wavelet Transform for SVC

3-D Extension of Weighted Adaptive Lifting

Original Weighted Adaptive LiftingImproved Weighted Lifting for 3-D TransformsDirectional Adaptive Interpolation for 3-D Transforms3-D WAL-based Direction Aligned Wavelet Transform

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

1, k kiw

)6(],[],[],[0

1 ,,,

j jl ljljeljjje nnmmxwunmxnmc

1 0 1 0 1 01 0

1 1, , 0, 1 and 1if W W

W W

01 0 1 0 1 0

1

11 0 1 0 1 0

0

2, 1, 0, 1 and 1

21, , 0, 1 and 1

Wif W W

W

Wif W W

W

1 0 1 02 2 11, (1 ) ( )[ , ], 1 and 1j j m m jW DA d m j n if W W

Page 11: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• Directional Adaptive Interpolation for 3-D Transforms– Directional interpolation

• extended to temporal domain– To simplify the explanations, the example is

restricted to one spatial coordinate x.– 1-D image lines instead of 2-D images– 3-D filter is restricted to a 2-D filter

• to interpolate the sub-pixel si,2,(i=0,1) in Frame2m+1+(-1)i+1, the integer pixels include

– not only {hi,-2,hi,-1,hi,0,hi,1} in Frame 2m+1+(-1)i+1

– but also {hi,-3,hi,2} in Frame 2m+1+3(-1)i+1

– Adaptive Interpolation Filter• In order to find adaptive filter coefficients for temporal domain, the minimization

problem can also be solved by Wiener-Hopf Eq. (5)

Y. Liu, K.N. Ngan and F. Wu 113-D Direction Aligned Wavelet Transform for SVC

3-D Extension of Weighted Adaptive Lifting

Original Weighted Adaptive LiftingImproved Weighted Lifting for 3-D TransformsDirectional Adaptive Interpolation for 3-D Transforms3-D WAL-based Direction Aligned Wavelet Transform

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Frame/Row/Column

2m 2m+1 2m+22m-12m-2 2m+3 2m+4

xn-1

xn

xn+1

xn-3

xn-2

xn+2

xn+3

h1,2h1,1

h1,0

h1,-1

h1,-2

h1,-3

s1,2

h0,-2

h0,-1

s0,2

h0,0

h0,1h0,2

h0,-3

Page 12: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• 3-D WAL-based Direction Aligned Wavelet Transform– Apply the 3-D directional threading technique to align a series of video

frames to form a totally direction-aligned 3-D video cube– Within each GOP, apply temporal WAL with 5/3-tap filter to each

temporal directional thread• Perform the above operation to the low-pass temporal bands until the desired level

of temporal wavelet decomposition is reached

– Within each frame, apply the spatial WAL with 5/3-tap or 9/7-tap filters to the 2-D spatial directional thread

• Apply the WAL to each horizontal directional thread• Apply the WAL to each vertical directional thread• Perform the above operation to the low-low-pass spatial bands until the desired

level of spatial wavelet decomposition is reached

Y. Liu, K.N. Ngan and F. Wu 123-D Direction Aligned Wavelet Transform for SVC

3-D Extension of Weighted Adaptive Lifting

Original Weighted Adaptive LiftingImproved Weighted Lifting for 3-D TransformsDirectional Adaptive Interpolation for 3-D Transforms3-D WAL-based Direction Aligned Wavelet Transform

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 13: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• MSRA 3-D wavelet video coder VIDWAV 2.0 is used as the reference software

– The 3-D DWT and MATF modules are replaced with the proposed 3-D WAL-based DAWT– Other modules, such as bit-plane coding, entropy coding, etc., keep unchanged.

• Two MPEG standard test sequences: (a) Carphone and (b) Foreman

Y. Liu, K.N. Ngan and F. Wu 133-D Direction Aligned Wavelet Transform for SVC

Experimental Results

Experimental ResultsConclusion

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 14: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• The performance comparison of 3-D WAL-based and 3-D DWT-based SVC for the Y component of decoded Carphone and Foreman at CIF 30 Hz with 5/3 and 9/7-tap spatial filter

Y. Liu, K.N. Ngan and F. Wu 143-D Direction Aligned Wavelet Transform for SVC

Experimental Results

Experimental ResultsConclusion

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 15: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• The performance comparison of 3-D WAL-based and 3-D DWT-based SVC for the Y component of decoded Carphone and Foreman at CIF 15 Hz with 5/3 and 9/7-tap spatial filter

Y. Liu, K.N. Ngan and F. Wu 153-D Direction Aligned Wavelet Transform for SVC

Experimental Results

Experimental ResultsConclusion

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 16: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• The performance comparison of 3-D WAL-based and 3-D DWT-based SVC for the Y component of decoded Carphone and Foreman at QCIF 15 Hz with 5/3 and 9/7-tap spatial filter

Y. Liu, K.N. Ngan and F. Wu 163-D Direction Aligned Wavelet Transform for SVC

Experimental Results

Experimental ResultsConclusion

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 17: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• The performance comparison of 3-D WAL-based and 3-D DWT-based SVC for the Y component of decoded Carphone and Foreman at QCIF 7.5 Hz with 5/3 and 9/7-tap spatial filter

Y. Liu, K.N. Ngan and F. Wu 173-D Direction Aligned Wavelet Transform for SVC

Experimental Results

Experimental ResultsConclusion

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 18: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• Coding Performance Comparison (3-D WAL vs. 3-D DWT)– The highest PSNR gain can be up to 1.62 dB– For 5/3-tap spatial filter

• average PSNR gains are 0.89 dB for Carphone and 1.12 dB for Foreman, respectively

– For 9/7-tap spatial filter• average PSNR gains are 0.69 dB for Carphone and 1.00 dB for Foreman, respectively.

• Complexity Comparison (3-D WAL vs. 3-D DWT)– on encoder side

• Increases considerably complexity • due to 3-D direction estimation process

– on decoder side• has similar complexity• due to asymmetric design

Y. Liu, K.N. Ngan and F. Wu 183-D Direction Aligned Wavelet Transform for SVC

Experimental Results

Experimental ResultsConclusion

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 19: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

• 3-D Direction Aligned Wavelet Transform for Scalable Video Coding– 3-D generalized directional threading– 3-D extension of weighted adaptive lifting

• References– [Ding2007] W. Ding, F. Wu, X. Wu, S. Li, and H. Li, ”Adaptive directional lifting-based wavelet transform for

image coding,” IEEE Trans. Image Process., vol.16, no.2, pp.416-427, Feb. 2007

– [Chang2007] C.-L. Chang and B. Girod, ”Direction-adaptive discrete wavelet transform for image compress,” IEEE Trans. Image Process., vol.16, no.5, pp.1289-1302, May 2007

– [Liu2007a] Y. Liu and K.N. Ngan, ”Weighted adaptive lifting-based wavelet transform,” 2007 IEEE Int. Conf. Image Process. (ICIP2007), San Antonio, USA, Sept. 2007

– [Liu2007b] Y. Liu, F. Wu, and K.N. Ngan, ”3-D object-based scalable wavelet video coding with boundary effect suppression”, IEEE Trans. Circuits Syst. Video Technol., vol.17, no. 5, pp.639-644, May 2007

– [Xiong2004] R. Xiong, F. Wu, J.Xu, S. Li and Y.-Q. Zhang, ”Barbell lifting wavelet transform for highly scalable video coding,” Picture Coding Symposium 2004, USA, Dec 2004

Y. Liu, K.N. Ngan and F. Wu 193-D Direction Aligned Wavelet Transform for SVC

Conclusion

Experimental ResultsConclusion

Introduction3-D Directional Threading

3-D Extension of Weighted Adaptive LiftingExperimental Results and Conclusion

Page 20: 3-D Direction Aligned Wavelet Transform for Scalable Video Coding

Thank You !

Q&AQ&A

3-D Direction Aligned Wavelet Transform for SVC 20Y. Liu, K.N. Ngan and F. Wu