DSP: Huffman coding for data compression
1. Introduction
Data compression is a crucial aspect of modern computing, enabling efficient storage and
transmission of information. Huffman coding is a widely used technique for lossless data
compression. In this report, we explore Huffman coding, covering its algorithm, its MATLAB
implementation, experimental results, and observations.
2. Problem Description
The goal of Huffman coding is to efficiently represent data by assigning variable-length codes
to input symbols based on their frequencies. Symbols with higher frequencies are assigned
shorter codes, reducing the overall size of the encoded data.
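For example (an illustrative alphabet, not taken from the report's data): a source with four symbols of probabilities 0.5, 0.25, 0.125, and 0.125 could receive the codes 0, 10, 110, and 111. The average code length is then 0.5(1) + 0.25(2) + 0.125(3) + 0.125(3) = 1.75 bits per symbol, versus the 2 bits per symbol a fixed-length code would require.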
3. Algorithm (Pseudocode)
1. Read the number of symbols and their corresponding probabilities.
2. Sort the probabilities in descending order.
3. Create a priority queue and insert the symbols with their probabilities.
4. Repeat the following steps until the priority queue contains only one element (a MATLAB sketch of this merge loop follows the list):
a. Remove the two elements with the lowest probabilities from the priority queue.
b. Create a new node with the sum of the two probabilities and the concatenation of their symbols.
c. Insert the new node into the priority queue.
5. The remaining element in the priority queue is the root of the Huffman tree.
6. Generate the Huffman codebook by traversing the Huffman tree.
7. Encode the input signal using the Huffman codebook.
8. Decode the encoded signal using the Huffman tree.
9. Check if the decoded signal is equal to the original input signal.
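Below is a minimal MATLAB sketch of the merge loop in steps 4-6, using example probabilities chosen purely for illustration. Instead of building an explicit tree, it prepends one bit to every symbol under each of the two lightest nodes at every merge, which yields the same codeword lengths as a tree traversal.

p = [0.4 0.3 0.15 0.1 0.05];        % example probabilities (assumed)
codes = repmat({''}, 1, numel(p));  % one growing bit string per symbol
nodes = num2cell(1:numel(p));       % each node starts as a single symbol
probs = p;
while numel(probs) > 1
    [probs, order] = sort(probs, 'ascend');   % lightest nodes first
    nodes = nodes(order);
    for k = nodes{1}, codes{k} = ['0' codes{k}]; end   % left branch bit
    for k = nodes{2}, codes{k} = ['1' codes{k}]; end   % right branch bit
    nodes = [{[nodes{1}, nodes{2}]}, nodes(3:end)];    % merge the two nodes
    probs = [probs(1) + probs(2), probs(3:end)];       % sum their probabilities
end
disp(codes)   % codeword lengths 1, 2, 3, 4, 4 for these probabilities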
Algorithm 1: User-Interactive Huffman Encoding
1. Prompt the user to enter the number of symbols (x).
2. Create a vector n with values from 1 to x (symbols).
3. Prompt the user to enter the probabilities for each symbol (p).
4. Sort probabilities in descending order (optional, for visualization).
5. Create a Huffman code dictionary (dict) using `huffmandict(n, p)`.
6. (Optional) Calculate the minimum bits per symbol (bps).
7. Generate a random symbol sequence (inputsig) with a defined length.
8. Encode the symbol sequence using the dictionary to get encoded data (code) with
`huffmanenco(inputsig, dict)`.
9. (Optional) Display the encoded data.
This pseudocode highlights the user interaction for defining symbols and probabilities.
Algorithm 2: Core Huffman Encoding Process
1. Input: Number of symbols (x), symbol probabilities (p).
2. Create a vector n containing symbols from 1 to x.
3. Build a Huffman tree using the probabilities (internal process of
`huffmandict`).
4. Generate Huffman codewords for each symbol based on the tree (internal process
of `huffmandict`).
5. Create a dictionary (dict) to map symbols to codewords.
6. Output: Huffman code dictionary (dict).
This focuses on the core logic of Huffman coding without user interaction details.
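In MATLAB, this entire algorithm is a single call. A brief sketch (the inputs here are assumed for illustration):

n = 1:5;  p = [0.4 0.3 0.15 0.1 0.05];   % assumed example inputs
dict = huffmandict(n, p);                % steps 3-5 in one call
cellfun(@numel, dict(:, 2)).'            % codeword lengths, here 1 2 3 4 4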
Algorithm 3: Encoding a Symbol Sequence with Huffman Coding
1. Input: Symbol sequence (inputsig), Huffman code dictionary (dict).
2. Loop through each symbol in inputsig:
- Find the corresponding Huffman codeword in dict.
- Append the codeword to encoded data (code).
3. Output: Encoded data (code) as a sequence of 0s and 1s.
This pseudocode outlines the process of encoding a symbol sequence using a predefined Huffman
code dictionary.
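A minimal sketch of this loop, assuming n = 1:x so that row s of dict corresponds to symbol s (huffmandict keeps the symbols in the order they were supplied):

code = [];                      % encoded bit stream
for s = inputsig(:).'           % visit each symbol in turn
    code = [code, dict{s, 2}];  % append that symbol's codeword
end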
Flowchart (START to End):
1. Enter number of symbols and probabilities
2. Display symbols and probabilities
3. Sort probabilities in descending order
4. Generate Huffman dictionary
5. Generate random input signal
6. Encode the signal
7. Decode the signal
8. Check if input equals decoded signal
9. Convert input and encoded signals to binary
10. Calculate sequence length
11. Calculate encoded length
12. Display results
4. MATLAB Implementation
x = input('enter the number of symbols ');
n = 1:x;                                 % symbols are numbered 1..x
disp('the number of symbols are n:')
disp(n)
p = input('enter the probabilities ');   % row vector that must sum to 1
disp(p)
s = sort(p,'descend');                   % for display only; huffmandict does not need sorted input
disp('the sorted probabilities are ')
disp(s)
dict = huffmandict(n,p);                 % build the Huffman code dictionary
bps = ceil(log2(max(n)));                % fixed-length bits per symbol
                                         % (use ceil(log2(max(n)+1)) if max(n) is an exact power of two)
inputsig = randsrc(100,1,[n;p]);         % random test signal drawn with probabilities p
code = huffmanenco(inputsig,dict)        % encode; no semicolon, so the bits are displayed
sig = huffmandeco(code,dict);            % decode with the same dictionary
isequal(inputsig,sig)                    % logical 1 confirms a lossless round trip
binarySig = int2bit(inputsig,bps);       % fixed-length binary form of the input
seqLen = numel(binarySig)
binaryComp = int2bit(code,1);            % code is already bits, so 1 bit per element
encodedLen = numel(binaryComp)
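As an optional follow-up (these lines are an assumed addition, not part of the original script), the compression achieved by the run can be summarized from the two lengths computed above:

ratio = seqLen / encodedLen;   % greater than 1 means the Huffman stream is shorter
fprintf('compressed to %.0f%% of the fixed-length size\n', 100 / ratio);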
5. Experimental Results and Observations
Sample Input: Assume 5 symbols with probabilities [0.4, 0.3, 0.15, 0.1, 0.05], each positive and summing to 1 (a zero-probability symbol would never occur and contributes nothing to the code).
Experimental Results: After applying Huffman coding, the average code length for these probabilities drops from 3 bits (fixed-length) to 2.05 bits per symbol, shrinking the encoded data by roughly a third.
Observations: Huffman coding effectively reduces redundancy in the data by assigning
shorter codes to more probable symbols and longer codes to less probable symbols.
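These observations can be checked numerically. A hedged sketch using the sample probabilities above (huffmandict's second output is the average codeword length):

p = [0.4 0.3 0.15 0.1 0.05];
[~, avglen] = huffmandict(1:5, p);   % avglen is 2.05 bits/symbol here
H = -sum(p .* log2(p));              % source entropy, about 2.01 bits/symbol
fprintf('avg %.2f bits/symbol, entropy %.2f, fixed-length 3\n', avglen, H)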
THEORY
1. x = input('enter the number of symbols ');
This prompts the user to enter the number of symbols in the data. input evaluates the typed
expression and stores the resulting number in the variable x (to capture raw text instead, the
's' option would be needed).
It is mandatory because the number of symbols defines how large the Huffman code dictionary
must be.
2. n = 1:x;
This creates a row vector n containing sequential integers from 1 to x; for x = 5, n is
[1 2 3 4 5]. It represents the symbols themselves, assuming they are simply numbered from
1 to x.
3. disp('the number of symbols are n:');
disp(n);
These lines use the disp function to display informative messages to the user.
The first line displays "the number of symbols are n:" on the console.
The second line displays the contents of the vector n, showing the user the defined symbols.
These are not mandatory for functionality but improve clarity and user interaction.
4. p = input('enter the probabilities ');
disp(p);
Like the earlier input call, this prompts the user to enter the probability associated with each
symbol and stores the values as a row vector in p.
The values in p should sum to 1 and represent the likelihood of each symbol appearing in your
data.
This is mandatory because Huffman coding uses probabilities to create optimal codewords.
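Because huffmandict expects the probabilities to sum to 1, a defensive check (an assumed addition, not in the original script) can catch input typos early:

assert(abs(sum(p) - 1) < 1e-6, 'probabilities must sum to 1');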
5. s = sort(p,'descend'); disp('the sorted probabilities are ');
disp(s);
sort(p,'descend') sorts the elements in p in descending order and stores the result in s.
disp functions again display messages and the sorted probabilities for user reference.
Sorting probabilities can be helpful for visualization and understanding the distribution of symbol
occurrences, but it's not strictly necessary for Huffman encoding itself.
6. dict = huffmandict(n, p);
This line is where the core Huffman coding happens.
huffmandict is a MATLAB function that creates a Huffman code dictionary based on the
symbols (n) and their probabilities (p).
The internal process (not directly accessible) involves building a Huffman tree using the
probabilities. The tree structure determines the optimal codewords (binary representations)
for each symbol to minimize the average codeword length.
The output, dict, is a cell array with two columns:
The first column lists the symbols (values from n).
The second column contains the corresponding Huffman codewords as row vectors of 0s
and 1s.
This is mandatory because the Huffman code dictionary is essential for encoding and
decoding your data.
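An illustrative look at that structure for a three-symbol alphabet (the exact bit patterns can differ between MATLAB versions, but the codeword lengths are optimal either way):

[dict, avglen] = huffmandict(1:3, [0.5 0.25 0.25]);
dict{1, 2}   % codeword for symbol 1: a 1-bit row vector, e.g. 0
dict{3, 2}   % codeword for symbol 3: 2 bits
avglen       % 1.5 bits/symbol for these probabilities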
7. bps = ceil(log2(max(n)));
This estimates the number of bits needed to represent the symbols in fixed-length binary
form: log2 takes the base-2 logarithm and ceil rounds the result up to the nearest integer,
so for 5 symbols bps = ceil(log2(5)) = 3.
(Strictly, representing the value max(n) itself takes ceil(log2(max(n)+1)) bits when max(n)
is an exact power of two.)
Knowing this fixed-length baseline is informative, but not crucial for encoding/decoding.
8. inputsig = randsrc(100, 1, [n; p]);
This generates a random sequence of 100 symbols (inputsig).
randsrc draws random symbols from a given alphabet.
The first argument (100) is the number of rows and the second (1) the number of columns, so
the output is a 100-by-1 column vector.
The third argument is a two-row matrix [n; p]:
The first row (n) lists the possible symbols (same as before).
The second row (p) gives the probability of drawing each symbol.
This step is not mandatory if you have your own data sequence, but it is useful for
demonstration and testing.
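A quick sanity check (an assumed addition) that randsrc really draws symbols with roughly the requested probabilities:

n = 1:5;  p = [0.4 0.3 0.15 0.1 0.05];
inputsig = randsrc(1000, 1, [n; p]);            % 1000 draws for a stabler estimate
freq = accumarray(inputsig, 1, [numel(n) 1]);   % count occurrences of each symbol
disp(freq.' / 1000)                             % should be close to p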
9. code = huffmanenco(inputsig, dict);
This encodes the random symbol sequence (inputsig) using the Huffman code dictionary
(dict).
huffmanenco is another MATLAB function that performs the encoding based on the
codewords in dict.
The internal process (not directly accessible) replaces each symbol in inputsig with its
corresponding Huffman codeword from dict.
The output, code, is a numeric vector of 0s and 1s containing the encoded data.
This is mandatory for encoding data with Huffman coding.
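The script in Section 4 completes the round trip with huffmandeco; a minimal sketch of that final check:

sig = huffmandeco(code, dict);   % invert the encoding using the same dictionary
isequal(inputsig, sig)           % logical 1 confirms the compression is lossless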
Example 1 and Example 2: sample program runs (output screenshots not reproduced here).
6. Conclusion
Huffman coding is a powerful technique for lossless data compression. This report presented an
overview of Huffman coding, its algorithm, implementation, and experimental results. Huffman coding
efficiently reduces the size of data by assigning shorter codes to frequently occurring symbols, thus
improving storage and transmission efficiency.
7. References
Huffman, D. A. (1952). "A Method for the Construction of Minimum-Redundancy Codes."
Proceedings of the IRE, 40(9), 1098-1101.
Sayood, K. (2017). Introduction to Data Compression (5th ed.). Morgan Kaufmann.
MATLAB Documentation: Huffman coding (huffmandict, huffmanenco, huffmandeco).