Arithmetic Coding
Arithmetic coding is among the most efficient methods for coding symbols according to the probability of their occurrence. Its average code length comes arbitrarily close to the minimum given by information theory, the entropy of the source: the deviations caused by the bit resolution of binary code trees do not occur.
 
In contrast to a binary Huffman code tree, arithmetic coding therefore offers a noticeably better compression rate. Its implementation, on the other hand, is more complex.
 
In arithmetic coding, a message is encoded as a real number in an interval from zero to one. Arithmetic coding typically achieves a better compression ratio than Huffman coding because it produces a single codeword for the whole message rather than several separate codewords.
 
Arithmetic coding differs from other forms of entropy encoding such as Huffman coding in that, rather than separating the input into component symbols and replacing each with a code, arithmetic coding encodes the entire message into a single number: a fraction n where 0.0 ≤ n < 1.0.
 
Arithmetic coding is a lossless coding technique. It has a few disadvantages. One is that the whole codeword must be received before decoding of the symbols can start, and a single corrupt bit in the codeword can corrupt the entire message. Another is that there is a limit to the precision of the number that can be encoded, which limits the number of symbols that can be encoded within a codeword. There also exist many patents on arithmetic coding, so the use of some of the algorithms may require royalty fees.
 
Arithmetic coding is part of the JPEG data format, where it can be used as an alternative to Huffman coding for the final entropy coding. Despite its lower efficiency, Huffman coding remains the standard due to the legal restrictions mentioned above.
 
Arithmetic Coding Algorithm: 
Whereas Huffman coding builds a code tree from the leaves to the root, arithmetic coding works in the opposite fashion, by successively narrowing an interval:
1.  Start with an interval [0, 1), divided into subintervals for all possible symbols that may appear within a message. Make the size of each subinterval proportional to the frequency with which its symbol appears in the message.
2.  When encoding a symbol, "zoom" into the current interval, and divide it into subintervals as in step one with the new range.
3.  Repeat the process until the maximum precision of the machine is reached, or all symbols are encoded.
4.  Transmit some number within the latest interval to send the codeword. The number of symbols encoded will be stated in the protocol of the image format.
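These steps can be sketched in a few lines of Python. This is a minimal illustration, not a production coder: real implementations use integer arithmetic and renormalization to work around the floating-point precision limit of step 3, and the function name arithmetic_encode is chosen here for illustration only.

    def arithmetic_encode(message, probabilities):
        """Return the final interval [low, high) for the message.

        probabilities maps each symbol to its probability; subintervals
        are assigned in the mapping's iteration order.
        """
        low, high = 0.0, 1.0                  # step 1: start with [0, 1)
        for symbol in message:                # steps 2 and 3: zoom per symbol
            width = high - low
            cumulative = 0.0
            for s, p in probabilities.items():
                if s == symbol:
                    high = low + (cumulative + p) * width   # upper end of subinterval
                    low = low + cumulative * width          # lower end of subinterval
                    break
                cumulative += p
        return low, high                      # step 4: transmit any number in [low, high)

With the probabilities of Example 1 below and the message ["A0", "A0", "A3", "A1", "A2"], this returns approximately (0.15376, 0.15472), matching the walkthrough that follows.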
Example 1: 
The source of information A generates the symbols {A0, A1, A2, A3} with the corresponding probabilities {0.4, 0.3, 0.2, 0.1}. Encoding the source symbols using a Huffman encoder gives:

Source Symbol    Pi     Binary Code    Huffman
A0               0.4    00             0
A1               0.3    01             10
A2               0.2    10             110
A3               0.1    11             111
L_avg                   2              1.9
(Entropy H = 1.846 bits/symbol)
 
The entropy of the source is

H = -\sum_i p_i \log_2 p_i = -(0.4 \log_2 0.4 + 0.3 \log_2 0.3 + 0.2 \log_2 0.2 + 0.1 \log_2 0.1) \approx 1.846 bits/symbol
Since we have 4 symbols (4 = 2^2), we need at least 2 bits to represent each symbol in binary (fixed-length code). Hence the average length of the binary code is

L_{avg} = \sum_i p_i l_i = 2 bits/symbol
Thus the efficiency of the binary code is

\eta = H / L_{avg} = 1.846 / 2 \approx 92.3\%
The average length of the Huffman code is

L_{avg} = \sum_i p_i l_i = 0.4 \cdot 1 + 0.3 \cdot 2 + 0.2 \cdot 3 + 0.1 \cdot 3 = 1.9 bits/symbol
Thus the efficiency of the Huffman code is

\eta = H / L_{avg} = 1.846 / 1.9 \approx 97.2\%
The Huffman encoder has the closest efficiency to the entropy that can be obtained using a prefix code. Higher efficiency can be achieved with arithmetic coding.
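The figures above can be checked with a short Python calculation (probabilities and Huffman codeword lengths are taken from the table of Example 1):

    import math

    p = [0.4, 0.3, 0.2, 0.1]        # symbol probabilities of Example 1
    huffman_len = [1, 2, 3, 3]      # lengths of the codewords 0, 10, 110, 111

    H = -sum(pi * math.log2(pi) for pi in p)                    # entropy
    L_binary = 2                                                # fixed-length code
    L_huffman = sum(pi * li for pi, li in zip(p, huffman_len))  # 1.9

    print(round(H, 3))              # 1.846 bits/symbol
    print(round(H / L_binary, 3))   # 0.923, binary-code efficiency
    print(round(H / L_huffman, 3))  # 0.972, Huffman efficiency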
 
Dividing into Intervals 
For a known alphabet, the probability of each symbol has to be determined and converted into an interval. The size of the interval depends linearly on the symbol's probability. If this is 50%, for example, then the associated subinterval covers half of the current interval. Usually the initial interval is [0; 1) for the encoding of the first symbol.
Source Symbol    Pi     Sub-interval
A0               0.4    [0.0;0.4)
A1               0.3    [0.4;0.7)
A2               0.2    [0.7;0.9)
A3               0.1    [0.9;1.0)
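Each of the zoom steps below can be reproduced with a small helper that splits the current interval in proportion to the probabilities (a sketch; subdivide is an illustrative name):

    def subdivide(low, high, probabilities):
        """Split [low, high) into one subinterval per symbol, in table order."""
        intervals = {}
        width = high - low
        cumulative = 0.0
        for symbol, p in probabilities.items():
            intervals[symbol] = (low + cumulative * width,
                                 low + (cumulative + p) * width)
            cumulative += p
        return intervals

    probs = {"A0": 0.4, "A1": 0.3, "A2": 0.2, "A3": 0.1}
    print(subdivide(0.0, 1.0, probs))   # the initial table above
    print(subdivide(0.0, 0.4, probs))   # the first table of the walkthrough below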
 
Assume that the message to be encoded is A0A0A3A1A2. The first symbol to be encoded is 
A0.  We  "zoom"  into  the  interval  corresponding  to  "A0",  and  divide  up  that  interval  into 
smaller  subintervals  like  before.   We  now  use  this  new  interval  as  the  basis  of  the  next 
symbol encoding step. 
 
Source Symbol  New A0 Interval 
A0  [0.0;0.16) 
A1  [0.16;0.28) 
A2  [0.28;0.36) 
A3  [0.36;0.4) 
 
To  encode  the  next  character  "A0",  we  use  the  "A0"  interval  created  before,  and  zoom  into 
the subinterval "A0", and use that for the next step.  This produces 
 
Source Symbol  New A0 Interval 
A0  [0.0;0.064) 
A1  [0.064;0.112) 
A2  [0.112;0.144) 
A3  [0.144;0.16) 
 
To  encode  the  next  character  "A3",  we  use  the  "A0"  interval  created  before,  and  zoom  into 
the subinterval "A3", and use that for the next step.  This produces 
 
Source Symbol  New A3 Interval 
A0  [0.144;0.1504) 
A1  [0.1504;0.1552) 
A2  [0.1552;0.1584) 
A3  [0.1584;0.16) 
 
To encode the next character "A1", we zoom into the subinterval "A1", and use that for the last step.  This produces

Source Symbol  New A1 Interval
A0  [0.1504;0.15232) 
A1  [0.15232;0.15376) 
A2  [0.15376;0.15472) 
A3  [0.15472;0.1552) 
 
Finally, the last character "A2" selects its subinterval [0.15376;0.15472) as the final interval. Transmit some number within this latest interval to send the codeword. The number of symbols encoded will be stated in the protocol of the image format, so any number within [0.15376; 0.15472) will be acceptable.
 
Let's choose the number 0.1543. The binary representation of this number is 0.001001111. We need 10 bits to encode the message (9 bits for the fraction plus one for the binary point). The minimum number of bits needed to fully encode the message is

H \cdot N = 1.846 \cdot 5 \approx 9.23 bits
 
Using the Huffman code, the message is encoded as 0 0 111 10 110, which also needs 10 bits. The larger the number of symbols, the wider this gap in efficiency becomes.
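The binary expansion of the chosen number can be verified with the repeated-doubling method, sketched here in Python:

    def to_binary_fraction(x, bits):
        """Expand 0 <= x < 1 into a binary fraction with the given number of digits."""
        digits = []
        for _ in range(bits):
            x *= 2                   # shift the next binary digit in front of the point
            digit = int(x)
            digits.append(str(digit))
            x -= digit
        return "0." + "".join(digits)

    print(to_binary_fraction(0.1543, 9))       # 0.001001111
    # 0.001001111 stands for 0.154296875, which lies in [0.15376, 0.15472)
    print(0.15376 <= 0.154296875 < 0.15472)    # True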
 
Decoding is the reverse process. Let's assume the number 0.1543 has been received at the decoder. Since 0.1543 lies in the interval [0; 0.4), the first symbol of the message is A0.
 
Then,  we  "zoom"  into  the  interval  corresponding  to  "A0",  and  divide  up  that  interval  into 
smaller subintervals.  We now use this new interval as the basis of the next symbol decoding 
step. 
 
Source Symbol  New A0 Interval 
A0  [0.0;0.16) 
A1  [0.16;0.28) 
A2  [0.28;0.36) 
A3  [0.36;0.4) 
 
0.1543 lies in the interval [0; 0.16), so the second symbol of the message is A0. Zoom into the subinterval "A0", and use that for the next step.  This produces
 
Source Symbol  New A0 Interval 
A0  [0.0;0.064) 
A1  [0.064;0.112) 
A2  [0.112;0.144) 
A3  [0.144;0.16) 
 
0.1543 lies in the interval [0.144; 0.16), so the third symbol of the message is A3. Zoom into the subinterval "A3", and use that for the next step.  This produces
 
Source Symbol  New A3 Interval 
A0  [0.144;0.1504) 
A1  [0.1504;0.1552) 
A2  [0.1552;0.1584) 
A3  [0.1584;0.16) 
 
0.1543 lies in the interval [0.1504; 0.1552), so the fourth symbol of the message is A1. Zoom into the subinterval "A1", and use that for the next step.  This produces
 
Source Symbol  New A1 Interval
A0  [0.1504;0.15232) 
A1  [0.15232;0.15376) 
A2  [0.15376;0.15472) 
A3  [0.15472;0.1552) 
 
0.1543 lies in the interval [0.15376; 0.15472), so the last symbol of the message is A2. The decoder stops here, since the number of symbols encoded is stated in the protocol of the message.
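The whole decoding walkthrough condenses into a sketch that mirrors the encoder; as noted above, the decoder must be told the number of symbols (five here):

    def arithmetic_decode(value, probabilities, n_symbols):
        """Recover n_symbols symbols from a number inside the final interval."""
        low, high = 0.0, 1.0
        message = []
        for _ in range(n_symbols):
            width = high - low
            cumulative = 0.0
            for symbol, p in probabilities.items():
                sub_low = low + cumulative * width
                sub_high = low + (cumulative + p) * width
                if sub_low <= value < sub_high:     # which subinterval holds the value?
                    message.append(symbol)
                    low, high = sub_low, sub_high   # zoom in and continue
                    break
                cumulative += p
        return "".join(message)

    probs = {"A0": 0.4, "A1": 0.3, "A2": 0.2, "A3": 0.1}
    print(arithmetic_decode(0.1543, probs, 5))      # A0A0A3A1A2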
 
Exercise 1: 
The source of information A generates the symbols {A0, A1, A2, A3 and A4} with the probabilities shown in the table below. Encode the source symbols using an arithmetic encoder and a Huffman encoder. The message is A4A1A0A3A2.

Source Symbol    Pi
A0               0.4
A1               0.4
A2               0.12
A3               0.06
A4               0.02
 
Compare the efficiency of both codes and comment on the results. 
 
Exercise 2: 
The source of information A generates the symbols {A0, A1, A2 and A3} with the probabilities shown in the table below. Encode the source symbols using an arithmetic encoder if the message is A2A0A2A3.

Source Symbol    Pi
A0               0.5
A1               0.3
A2               0.2
 
Compare the efficiency of both codes and comment on the results.