image compression: coding and decoding

EE4328, Section 005 Introduction to Digital Image

Processing

Image Compression

Zhou Wang

Dept. of Electrical EngineeringThe Univ. of Texas at Arlington

Fall 2006

Image Compression: Coding and Decoding

compressed bitstream

00111000001001101…

(2428 Bytes)

imageencoder

imagedecoder

original image

262144 Bytes

compression ratio (CR) = 108:1

From [Gonzalez &

Woods]

From [Gonzalez &

Woods]

Some General Concepts

• How Can an Image be Compressed, AT ALL!?– If images are random matrices, better not to try any

compression– Image pixels are highly correlated redundant information

• Information, Uncertainty and Redundancy– Information is uncertainty (to be) resolved– Redundancy is repetition of the information we have– Compression is about removing redundancy because

• Entropy and Entropy Coding– Entropy is a statistical measure of uncertainty, or a

measure of the amount of information to be resolved– Entropy coding: approaching the entropy (no-redundancy)

• Bit Rate and Compression Ratio– Bit rate: bits/pixel, sometimes written as bpp– Compression ratio (CR):

• Binary, Gray-Scale, Color Image Compression– Original binary image: 1 bit/pixel– Original gray-scale image: typically 8bits/pixel– Original Color image: typically 24bits/pixel

• Lossless, Nearly lossless and Lossy Compression– Lossless: original image can be exactly reconstructed– Nearly lossless: reconstructed image nearly (visually) lossless– Lossy: reconstructed image with loss of quality (but higher

More Concepts

number of bits to represent the original image

number of bits in compressed bit streamCR =

Binary Image Compression

Run Length Coding

what'sstored: '1' 7 5 8 3 1

what's stored: '1'1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

• Run Length– The length of consecutively identical symbols

• Run length Coding Example

• When Does it Work?– Images containing many runs of 1’s and 0’s

• When Does it Not Work?

Run Length Coding

CCITT test image No. 1Size: 172823761 bit/pixel (bpp)

original: 513216 bytes

compressed: 37588 bytes

CR = 13.65

Run Length Coding

• Decoding Example

Row 1: “0”, 16Row 2: “0”, 16Row 3: “0”, 7, 2, 7Row 4: “0”, 4, 8, 4Row 5: “0”, 3, 2, 6, 3, 2Row 6: “0”, 2, 2, 8, 2, 2Row 7: “0”, 2, 1, 10, 1, 2Row 8: “1”, 3, 10, 3Row 9: “1”, 3, 10, 3Row 10: “0”, 2, 1, 10, 1, 2Row 11: “0”, 2, 2, 8, 2, 2Row 12: “0”, 3, 2, 6, 3, 2Row 13: “0”, 4, 8, 4Row 14: “0”, 7, 2, 7Row 15: “0”, 16Row 16: “0”, 16

A binary image is encoded using run length code row by row, with “0” represents white, and “1” represents black. The code is given by

Decode the imagedecode

Run Length Coding

A binary image is encoded using run length code row by row, with “0” represents white, and “1” represents black. The code is given by

decode

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

123456

123456789

1 2 3 4 5 6 7 8 9 11 12 13 14 15 16

Row 1: “0”, 16Row 2: “0”, 16Row 3: “0”, 7, 2, 7Row 4: “0”, 4, 8, 4Row 5: “0”, 3, 2, 6, 3, 2Row 6: “0”, 2, 2, 8, 2, 2Row 7: “0”, 2, 1, 10, 1, 2Row 8: “1”, 3, 10, 3Row 9: “1”, 3, 10, 3Row 10: “0”, 2, 1, 10, 1, 2Row 11: “0”, 2, 2, 8, 2, 2Row 12: “0”, 3, 2, 6, 3, 2Row 13: “0”, 4, 8, 4Row 14: “0”, 7, 2, 7Row 15: “0”, 16Row 16: “0”, 16

Chain Coding

Assume the image contains only single-pixel-wide contours,like this, not this

After the initial point position, code direction only (3bits/step)

From Prof. Al Bovikcontour image

region image

= initial point

Code Stream:(3, 2), 1, 0, 1, 1, 1, 1, 3, 3, 3, 4, 4,

initial point position chain code

Chain Coding

The chain code for a 8x8 binary image is given by:

Decode the image

(1, 6), 7, 7, 0, 1, 1, 3, 3, 3, 1, 1, 0, 7, 7

column row

decode

Chain Coding

The chain code for a 8x8 binary image is given by:

(1, 6), 7, 7, 0, 1, 1, 3, 3, 3, 1, 1, 0, 7, 7

column row

1 2 3 4 5 6 7 8123456781 2 3 4 5 6 7 8

12345678

decode

Lossless Gray-Scale Image Compression

Variable Word Length Coding

• Intuitive Idea– Assign short words to gray levels that occur

frequently– Assign long words to gray levels that occur

infrequently

• How Much Can Be Compressed?– Theoretical limit: entropy of the histogram– Practical algorithms (approach entropy):

Huffman coding, arithmetic coding

H I(k)

0 K-1gray level k

02 )(log)(

Maximum entropy: Uniform distribution

Minimum entropy:Impulse (delta) distribution

typically in-between

From Prof. Al Bovik

Variable Word Length Coding: Example

• A 4x4 4bits/pixel original image is given by

0010 1000 0110 0110

0110 1000 1000 1000

1000 1000 1010 1010

1001 1010 1010 1110

Bit rate = 4bits/pixel

Total # of bits used to represent the

image:

4x16 = 64 bits

0: 00001: 00012: 00103: 00114: 01005: 01016: 01107: 01118: 10009: 100110: 101011: 101112: 110013: 110114: 111015: 1111

Default Code Book

encode

Variable Word Length Coding: Example

• Encode the original image with a CODE BOOK given left

0: 00000001: 00000012: 00013: 00000104: 00000115: 00001006: 017: 00001018: 109: 0010010: 1111: 000011012: 000011113: 00101014: 001115: 001011

0001 10 01 01

01 10 10 10

10 10 11 11

00100 11 11 0011

encode

Total # of bits used to represent the

image:

4+2+2+2+2+2+2+2+2+2+2+2+5+2+

2+4= 39 bits

Bit rate = 39/16= 2.4375 bits/pixel

CR = 64/39 = 1.6410

Huffman Code Book

Predictive Coding

• Intuitive Idea– Image pixels are highly correlated (dependent)– Predict the image pixels to be coded from those already

• Differential Pulse-Code Modulation (DPCM)– Simplest form: code the difference between pixels

– Key features: Invertible, and lower entropy (why?)

H I(k) HD(k)

high entropy image reduced entropy imageK-10 K-11-K

image histogram (high entropy)

DPCM histogram (low entropy)

From Prof. Al Bovik

Original pixels:82, 83, 86, 88, 56, 55, 56, 60, 58,

55, 50, ……

DPCM:82, 1, 3, 2, -32, -1, 1, 4, -2, -3, -

5, ……

• Higher Order (Pattern) Prediction– Use both 1D and 2D patterns for prediction

• Apply Image Transforms before Predictive Coding– Decouple dependencies between image pixels

• Use Advanced Statistical Image Models– Better understanding of “the nature” of image structures

implies potentials of better prediction

Advanced Predictive Coding

1D Causal:

1D Non-causal:

2D Causal:

2D Non-Causal:

Lossy Gray-Scale Image Compression

Quantization

24 408 ... 248

21 3 4

Uniform scalar quantization:

Non-uniform scalar

quantization:

• Quantization: Widely Used in Lossy Compression– Represent certain image components with fewer bits

(compression)– With unavoidable distortions (lossy)

• Quantizer Design– Find the best tradeoff between

maximal compression minimal distortion

• Scalar Quantization

Quantization

Gray-level 1

Gray-level 2Vector quantization:

image component 2

image component 1

From Prof. Al Bovik

• Vector Quantization– Group multiple image components together form a

vector– Quantize the vector in a higher dimensional space– More efficient than scalar quantization (in terms of

compression)

• Block-Based Image Compression

• Transform-Domain Compression– Scalar or vector quantization of transform coefficients

(instead of image pixels)

Ideas on Lossy Image Compression

From Prof. Al Bovik

M x M M x M M x M

code each block

independently

imagePartition

Discrete Cosine Transform (DCT)

)12(cos

)12(cos),(

)()(4),(

vCuCvuX

)12(cos

)12(cos),()()(),(

umvuXvCuCnmx

periodic extension by

reflected periodic

extension by DFT

• 2D-DCT:

• Inverse 2D-DCT:

• DFT vs. DCT

discontinuities: high frequencies continuou

From Prof. Al Bovik

2D-DCT

5151414

5050393

4051412

17201024

2D-DCT

image block

DCT block

low frequency

high frequency

low frequency

high frequency

DC component

JPEG Compression

169130

173129

170181

170183

179181

182180

179180

179179169132

171130

169183

164182

179180

176179

180179

178178167131

167131

165179

170179

177179

182171

177177

168179169130

165132

166187

163194

176116

153183

160183

5151414

5050393

4051412

17201024

scalar quantization

zig-zag scan

– Partition the image into 8x8 blocks, for each block

– Artifacts:Inside blocks: blurring (why?); Across blocks: blocking (why?)

Original: 100KB JPEG: 9KB JPEG: 5KB

JPEG Compression

– Adjust Quantization Step to Achieve Tradeoff between CR and distortion

Wavelet and JPEG2000 Compression• Wavelet Transform

Energy CompactionLower Entropy

• Bitplane Coding– Scan bitplanes

from MSB to LSB– Progressive (scalable)

JPEG (64:1) JPEG2000 (64:1)

s ssssss s ssssss s ssssss

0111 0 000000 0 000000

1 000011 0 000000 0 000000

1 1 000111 0 000000

111 1 000111

..........

Wavelet and JPEG2000 Compression

image compression: coding and decoding

reconstructed image

run length code row

bitspixeloriginal color

resolvedentropy coding

compression ratiobit

original imagenumber

bytescompression ratio

bppcompression ratio

Documents

questions on verbal & non verbal reasoning (coding decoding)

coding/decoding concepts and block coding

perspectives ofoptical coding/decoding techniques in...

jpeg compression/decompression using...

coding and decoding

unit ii text compression. a. outline compression techniques...

variable length coding, dictionary-based coding, lzw...

1 data compression: source coding: part ii data compression:...

dc coding and decoding with convolutional codes

coding & decoding reasoning tricks

near shannon limit error - correcting coding and decoding

image compression coding schemes

2.coding decoding

lecture 7: image coding and compression -...

coding/decoding for data compression and error control...

ekt 357 digital communications. properties of coding basic...

1.4. coding and decoding of airlines 1.4.1. coding of

coding decoding questions for ibps - interviewkiller

lecture 9b convolutional coding/decoding and trellis code...

coding and decoding with convolutional codes