Information theory lecture-4
Source coding
An information source is a device which delivers symbols (or letters) randomly from a
set of symbols (or letters) called an alphabet. A code is a set of words called codewords.
A codeword is a sequence of symbols from the code alphabet assigned to a source output.
The most common codes are binary codes, i.e. codes whose code alphabet is {0, 1}.
The ASCII code: stands for American Standard Code for Information Interchange.
Originally intended to represent the full character set of a typewriter, it consists of
128 binary codewords, all of the same length (7 bits). Later on, additional and
non-printing characters were added to meet new demands. This gave birth to the
extended ASCII code, a fixed-length binary code of 256 codewords of 8 bits each.
Nowadays, keyboards still communicate with computers using ASCII codes, and when
a document is saved as "plain text", its characters are encoded with ASCII codes.
The Morse code. Invented by Samuel Morse in the 1840s, it allows letters of the
alphabet {a, b, …, z, "space", "full stop", "comma", …} to be sent as short
electrical signals (dots) and long electrical signals (dashes).
Morse code differs from ASCII code in that shorter codewords are assigned to
more frequent letters.
Coding Definitions
1. A code is said to be uniquely decodable if any sequence of codewords can be
interpreted in only one way.
- The code {1, 10, 11} is not uniquely decodable, as the sequence "1111" can be
interpreted as "1", "11", "1" or "1", "1", "11" or …
- The code {1, 10} is uniquely decodable. In the sequence 11011, the first codeword
is "1" since the following symbol is "1", whereas the second codeword is "10"
since the third symbol is "0", and so on …
2. An instantaneous code is a code in which any sequence of codewords can be
interpreted codeword by codeword, as soon as they are received.
- The code {0, 10} is instantaneous.
- The code {1, 10} is not instantaneous. For instance, in the sequence 1110, we
need to know whether the second symbol is "0" or "1" before interpreting the
first symbol. This is because the codeword "1" is the beginning of another
codeword, "10". An instantaneous code is also known as a prefix code.
3. A code is a prefix code if and only if no codeword is the beginning of another
codeword.
- The code {1, 01, 000, 001} is a prefix code.
An instantaneous (or prefix) code is uniquely decodable but some uniquely decodable
codes do not have the prefix property.
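The prefix property is easy to test mechanically: compare every pair of codewords and check that none begins another. The following Python sketch (the function name is an illustrative choice, not part of the lecture) verifies the two codes discussed above:

def is_prefix_code(codewords):
    # True if and only if no codeword is the beginning of another codeword
    for w in codewords:
        for v in codewords:
            if w != v and v.startswith(w):
                return False
    return True

print(is_prefix_code(["1", "10"]))                # False: "1" begins "10"
print(is_prefix_code(["1", "01", "000", "001"]))  # True: a prefix code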
Example-1
An information source outputs four symbols u1, u2, u3 and u4. The code alphabet is
{0, 1}. Using the coding definitions above, discuss the coding systems A, B, C and D
given in the table below.
Symbol   A    B    C     D
u1       0    00   0     0
u2       11   01   10    01
u3       00   10   110   011
u4       01   11   1110  0111
Solution
- Code A is not uniquely decodable because
u1u1u2 = 0011 and u3u2 = 0011
- Codes B, C and D are uniquely decodable codes.
Code B is instantaneous because it is a fixed-length code: the decoder simply
reads two bits at a time.
Code C is instantaneous because each codeword ends with 0, so no codeword is
the beginning of another.
Code D is not instantaneous because one must always wait for the first symbol
of the next codeword before the current codeword can be decoded: every codeword
of D is the beginning of another codeword. These observations are checked
programmatically in the sketch below.
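As a quick check (a minimal sketch; the helper repeats the prefix test given earlier and is not part of the lecture), we can test which of the four codes have the prefix property, i.e. which are instantaneous:

def is_prefix_code(codewords):
    # True if and only if no codeword is the beginning of another codeword
    return not any(w != v and v.startswith(w)
                   for w in codewords for v in codewords)

codes = {
    "A": ["0", "11", "00", "01"],
    "B": ["00", "01", "10", "11"],
    "C": ["0", "10", "110", "1110"],
    "D": ["0", "01", "011", "0111"],
}
for name, words in codes.items():
    print(name, is_prefix_code(words))
# Prints: A False, B True, C True, D False. Note the test only detects the
# prefix property: D fails it yet is still uniquely decodable, while A fails
# it and is not even uniquely decodable.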
The source coding problem:
Let U be a source generating values {A, B, C, D} with probabilities 1/2, 1/4, 1/8 and 1/8.
1,000 outputs of U are to be stored in the form of a binary file, and one seeks to
reduce the file to its smallest possible size.
Solution
Using the probability of the symbols, we deduce that in the sequence of 1,000 symbols,
there are roughly:
symbols of type A = 1000 ∗ 1/2 = 500
symbols of type B = 1000 ∗ 1/4 = 250
symbols of type C = 1000 ∗ 1/8 = 125
symbols of type D = 1000 ∗ 1/8 = 125
- First coding solution
There are 4 symbols to be encoded. Thus, each of them can be associated with a word
of two binary digits (2 bits) as follows:
A → 00, B → 01, C → 10, D → 11
Since all the codewords have the same length, this code is known as a fixed-length
code.
Hence,
file size = 500∗2 + 250∗2 + 125∗2 + 125∗2 = 2000 bits
- Second coding solution
The different symbols do not occur with the same probabilities. Therefore, we can think
of a code which assigns shorter words to more frequent symbols, as:
A → 1, B → 01, C → 000, D → 001
This code is said to be a variable-length code, as the codewords do not all have the
same length.
Hence,
file size = 500∗1 + 250∗2 + 125∗3 + 125∗3 = 1750 bits
The size of the file is reduced: on average, 1750/1000 = 1.75 bits are necessary to
represent one symbol.
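The arithmetic above can be reproduced directly. Below is a minimal Python sketch (variable names are illustrative choices) computing the expected file size and the average number of bits per symbol for both codes:

probs = {"A": 1/2, "B": 1/4, "C": 1/8, "D": 1/8}
fixed_code = {"A": "00", "B": "01", "C": "10", "D": "11"}
variable_code = {"A": "1", "B": "01", "C": "000", "D": "001"}

n = 1000  # number of source outputs
for label, code in [("fixed", fixed_code), ("variable", variable_code)]:
    # expected total bits = sum over symbols of (count * codeword length)
    bits = sum(n * probs[s] * len(code[s]) for s in probs)
    print(label, int(bits), "bits,", bits / n, "bits per symbol")
# Prints: fixed 2000 bits, 2.0 bits per symbol
#         variable 1750 bits, 1.75 bits per symbol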
We are now faced with three questions:
Given an information source,
1. Is it possible to compress its data?
2. What is the minimum average number of bits necessary to represent one
symbol?
3. How do we design algorithms to achieve effective compression of the data?
These three questions constitute the source coding problem.