Problem sheet 1, Information Theory, MT 2022
Designed for the first tutorial class
Question 1 We are given a deck of n cards in order 1, 2, · · · , n. Then a randomly chosen card
is removed and placed at a random position in the deck. What is the entropy of the resulting
deck of cards?
Answer 1 There are n cards that can be picked up, each equally likely, and n positions at which the chosen card can be re-inserted, each equally likely. So there are n² different actions, each with probability 1/n², and several of them can result in the same final order:
(a) The original order results from any action in which the chosen card is placed back at its original position, so the probability of the original order is n · 1/n² = 1/n;
(b) a swap of two adjacent cards results from exactly two different actions (move the left card one place to the right, or the right card one place to the left). There are n − 1 such orders, each with probability 2/n²;
(c) a card is moved at least 2 positions away: there are n(n − 3) + 2 = n² − 3n + 2 such orders, each arising from exactly one action and hence with probability 1/n².
So there are 1 + (n − 1) + (n² − 3n + 2) different results with the probabilities above, and the entropy (with logarithms to base 2) is
\[
H(\text{deck}) = \frac{1}{n}\log(n) + (n-1)\,\frac{2}{n^2}\log\!\Bigl(\frac{n^2}{2}\Bigr) + (n^2-3n+2)\,\frac{1}{n^2}\log(n^2)
\]
\[
= \frac{n\log(n) + 2(n-1)\bigl(2\log(n)-1\bigr) + (n^2-3n+2)\,2\log(n)}{n^2}
= \frac{2n-1}{n}\log(n) - \frac{2n-2}{n^2}.
\]
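As a sanity check on the counting above (not part of the original solution; the function names and the choice of base-2 logarithms are assumptions of this sketch), one can enumerate all n² actions by brute force and compare the resulting entropy with the closed form:

```python
import math
from collections import Counter

def deck_entropy_bruteforce(n):
    """Enumerate all n^2 (removed card, insertion position) actions and
    return the entropy (in bits) of the distribution over resulting orders."""
    counts = Counter()
    deck = list(range(n))
    for i in range(n):            # remove the card at position i ...
        for j in range(n):        # ... and insert it at position j
            rest = deck[:i] + deck[i + 1:]
            counts[tuple(rest[:j] + [deck[i]] + rest[j:])] += 1
    total = n * n
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def deck_entropy_formula(n):
    """Closed form derived above: (2n-1)/n * log2(n) - (2n-2)/n^2."""
    return (2 * n - 1) / n * math.log2(n) - (2 * n - 2) / n ** 2

for n in range(3, 8):
    print(n, deck_entropy_bruteforce(n), deck_entropy_formula(n))
```

For n = 3, for example, both computations give about 2.197 bits.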
Question 2 (Pooling inequalities) Let a ≥ 0, b ≥ 0 be given with a + b > 0. Show that
\[
-(a+b)\log(a+b) \;\le\; -a\log(a) - b\log(b) \;\le\; -(a+b)\log\!\Bigl(\frac{a+b}{2}\Bigr),
\]
and show that the first inequality becomes an equality iff ab = 0, and the second inequality becomes an equality iff a = b.
Answer 2 Denote p = a/(a + b). Dividing all three terms by a + b and then adding log(a + b) to each (for the middle term, split log(a + b) = p log(a + b) + (1 − p) log(a + b) and combine it with −p log(a) − (1 − p) log(b)), the inequalities are equivalent to
\[
0 \;\le\; -p\log(p) - (1-p)\log(1-p) \;\le\; -\log\!\Bigl(\frac{1}{2}\Bigr) = \log 2,
\]
which is the basic property of the binary entropy: it is non-negative, with equality iff p ∈ {0, 1} (i.e. ab = 0), and it is at most log 2, with equality iff p = 1/2 (i.e. a = b).
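A small numerical check of both inequalities and of the equality cases (an illustrative sketch only, using natural logarithms and the convention 0 log 0 = 0):

```python
import math

def pooling_bounds(a, b):
    """Return the (lower, middle, upper) terms of the pooling inequality,
    using natural logs and the convention 0*log(0) = 0."""
    xlogx = lambda t: t * math.log(t) if t > 0 else 0.0
    lower = -xlogx(a + b)
    middle = -xlogx(a) - xlogx(b)
    upper = -(a + b) * math.log((a + b) / 2)
    return lower, middle, upper

print(pooling_bounds(0.3, 0.0))   # left equality: ab = 0
print(pooling_bounds(0.2, 0.2))   # right equality: a = b
print(pooling_bounds(0.1, 0.4))   # both inequalities strict
```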
Question 3 Let X, Y, Z be discrete random variables. Prove or provide a counterexample to
the following statements:
(a) H(X) = H(42X);
(b) H(X|Y ) ≥ H(X|Y, Z);
(c) H(X, Y ) = H(X) + H(Y ).
Answer 3 The first one is true: f(x) = 42x is a bijection, and applying a bijection to X permutes the outcomes without changing their probabilities, so H(42X) = H(X).
The second is true: H(X|Y) − H(X|Y, Z) = I(X; Z|Y) ≥ 0, since conditional mutual information is non-negative; in terms of information/surprise, conditioning on the additional variable Z cannot increase the remaining uncertainty about X.
The third is wrong in general: by the chain rule, H(X, Y) = H(Y|X) + H(X), and H(Y|X) = H(Y) if and only if X and Y are independent. An easy counterexample: take Y = X with H(X) > 0; then H(X, Y) = H(X, X) = H(X) < H(X) + H(Y).
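The counterexample in the third part can be made concrete with a fair bit; the short check below (the helper H and the chosen distribution are assumptions of this sketch) computes both sides numerically:

```python
import math

def H(pmf):
    """Shannon entropy in bits of a dict mapping outcomes to probabilities."""
    return -sum(p * math.log2(p) for p in pmf.values() if p > 0)

# Counterexample to (c): X a fair bit and Y = X.
pX = {0: 0.5, 1: 0.5}
pXY = {(0, 0): 0.5, (1, 1): 0.5}   # joint law of (X, X)
print(H(pXY), H(pX) + H(pX))       # 1.0 versus 2.0
```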
Question 4 Does there exist a discrete random variable X with a distribution such that
H(X) = +∞? If so, describe it as explicitly as possible.
Answer 4 Obviously H(X) < +∞ whenever X takes finitely many values, so we take the image space to be the natural numbers. Here is an example: P(X = n) = c/(n log²(n)) for n ≥ 2, where c = (Σ_{n≥2} 1/(n log²(n)))^{−1} > 0 is a normalising constant (the series converges). Then
\[
H(X) = \sum_{n\ge 2} \frac{c}{n\log^2(n)}\Bigl[\log\bigl(n\log^2(n)\bigr) - \log(c)\Bigr]
= \sum_{n\ge 2}\Bigl[\frac{c}{n\log(n)} + \frac{2c\log(\log(n))}{n\log^2(n)} - \frac{c\log(c)}{n\log^2(n)}\Bigr] = +\infty,
\]
since the last two terms give convergent series while Σ_{n≥2} 1/(n log(n)) = +∞.
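A numerical illustration of the divergence (not a proof, and the cutoff below is an arbitrary assumption): truncate the convergent normalising sum to approximate c, then watch the partial sums of the entropy series keep growing, roughly like c log(log(n)).

```python
import math

# Illustration for P(X = n) = c / (n * log(n)^2), n >= 2: the entropy's
# partial sums grow without bound.  c is approximated by truncating the
# convergent normalising series at an arbitrary cutoff.
CUTOFF = 10**6
c = 1 / sum(1 / (n * math.log(n) ** 2) for n in range(2, CUTOFF))

H, checkpoints = 0.0, {10**3, 10**4, 10**5, CUTOFF - 1}
for n in range(2, CUTOFF):
    p = c / (n * math.log(n) ** 2)
    H -= p * math.log(p)
    if n in checkpoints:
        print(n, H)   # partial sums keep increasing with the cutoff
```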
Question 5 Let 𝒳 be a finite set, f : 𝒳 → R a real-valued function, and fix α ∈ R. We want to maximise the entropy H(X) of a random variable X taking values in 𝒳 subject to the constraint
\[
E[f(X)] \le \alpha. \tag{1}
\]
Denote by U a uniformly distributed random variable over 𝒳. Prove the following optimal solutions for the maximisation.
(a) If α ∈ [E[f(U)], max_{x∈𝒳} f(x)], then the entropy is maximised subject to (1) by the uniformly distributed random variable U.
(b) If f is non-constant and α ∈ [min_{x∈𝒳} f(x), E[f(U)]], then the entropy is maximised subject to (1) by the random variable Z given by
\[
P(Z = x) = \frac{e^{\lambda f(x)}}{\sum_{y\in\mathcal{X}} e^{\lambda f(y)}} \quad \text{for } x \in \mathcal{X},
\]
where λ < 0 is chosen such that E[f(Z)] = α.
(c) (Optional) Prove that under the assumptions of (b), the choice of λ is unique and we have λ < 0.
Answer 5 (a) Since the uniform distribution achieves the maximal entropy over a finite set without any constraint, we just need to verify that U satisfies the constraint (1), which is immediate because α ≥ E[f(U)].
(b) Recall Gibbs' inequality: for any pmfs p and q,
\[
-\sum_{x\in\mathcal{X}} p(x)\log(p(x)) \;\le\; -\sum_{x\in\mathcal{X}} p(x)\log(q(x)).
\]
So, with p(·) the pmf of X, we try to relate E[f(X)] to −Σ_{x∈𝒳} p(x) log(q(x)) for a suitable pmf q. For this we look for q satisfying
\[
\lambda f(x) = \log(q(x)) + c(\lambda) \iff e^{\lambda f(x)}\, e^{-c(\lambda)} = q(x)
\]
for some λ < 0 and some constant c(λ). Using the fact that q is a pmf, we get
\[
c(\lambda) = \log\Bigl(\sum_{x\in\mathcal{X}} e^{\lambda f(x)}\Bigr), \qquad q(x) = \frac{e^{\lambda f(x)}}{\sum_{y\in\mathcal{X}} e^{\lambda f(y)}}.
\]
So for any λ < 0, define the pmf q(x) := e^{λf(x)} / Σ_{y∈𝒳} e^{λf(y)}; then
\[
E[f(X)] = \frac{1}{\lambda}\sum_{x\in\mathcal{X}} p(x)\log(q(x)) + \frac{c(\lambda)}{\lambda}.
\]
With λ < 0, the constraint E[f(X)] ≤ α is therefore equivalent to −Σ_{x∈𝒳} p(x) log(q(x)) ≤ −λα + c(λ). Hence
\[
H(X) \;\le\; -\sum_{x\in\mathcal{X}} p(x)\log(q(x)) \;\le\; -\lambda\alpha + c(\lambda),
\]
so −λα + c(λ) is an upper bound on the entropy for every λ < 0. Both inequalities become equalities (hence the upper bound is attained) iff p(x) = q(x) for all x, i.e.
\[
P(X = x) = q(x) = \frac{e^{\lambda f(x)}}{\sum_{y\in\mathcal{X}} e^{\lambda f(y)}},
\qquad
\alpha = \sum_{x\in\mathcal{X}} q(x) f(x) = \sum_{x\in\mathcal{X}} f(x)\,\frac{e^{\lambda f(x)}}{\sum_{y\in\mathcal{X}} e^{\lambda f(y)}}.
\]
The existence of a λ < 0 such that E[f(Z)] = Σ_{x∈𝒳} q(x) f(x) = α is proved in part (c).
(c) Denote g(λ) := Σ_{x∈𝒳} f(x) e^{λf(x)} / Σ_{y∈𝒳} e^{λf(y)}. Then g is a differentiable function with
\[
g'(\lambda) = \sum_{x\in\mathcal{X}} f(x)^2\,\frac{e^{\lambda f(x)}}{\sum_{y\in\mathcal{X}} e^{\lambda f(y)}}
\;-\; \sum_{x\in\mathcal{X}} f(x)\,\frac{e^{\lambda f(x)}}{\bigl(\sum_{y\in\mathcal{X}} e^{\lambda f(y)}\bigr)^2}\,\sum_{y\in\mathcal{X}} f(y)\,e^{\lambda f(y)}
= E[f(Z)^2] - \bigl(E[f(Z)]\bigr)^2,
\]
where Z has the pmf q of part (b) with parameter λ. Since f is not constant and q has full support, this variance is strictly positive, so g'(λ) > 0, which means g is a strictly increasing and continuous function. Furthermore, g(0) = E[f(U)] and g(λ) → min_{x∈𝒳} f(x) as λ → −∞. So for α ∈ (min_{x∈𝒳} f(x), E[f(U)]), the equation g(λ) = α admits a unique solution λ < 0.
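To make parts (b) and (c) concrete, here is a small numerical sketch (the toy set, the function f, and the value of α are assumptions chosen only for illustration). It uses the strict monotonicity of g established in (c) to find λ by bisection and then builds the corresponding Gibbs pmf.

```python
import math

def gibbs_pmf(f_vals, lam):
    """Gibbs pmf q(x) proportional to exp(lam * f(x)) over a finite list of f-values."""
    w = [math.exp(lam * v) for v in f_vals]
    Z = sum(w)
    return [wi / Z for wi in w]

def g(f_vals, lam):
    """g(lambda) = expectation of f under the Gibbs pmf with parameter lambda."""
    return sum(qi * vi for qi, vi in zip(gibbs_pmf(f_vals, lam), f_vals))

def solve_lambda(f_vals, alpha, lo=-50.0, hi=0.0, iters=100):
    """Bisection on the strictly increasing map g to solve g(lambda) = alpha."""
    for _ in range(iters):
        mid = (lo + hi) / 2
        if g(f_vals, mid) < alpha:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

# Toy example: states {0,1,2,3}, f(x) = x, constraint E[f(X)] <= alpha = 1.0,
# which lies below E[f(U)] = 1.5, so case (b) applies and lambda < 0.
f_vals = [0.0, 1.0, 2.0, 3.0]
lam = solve_lambda(f_vals, alpha=1.0)
q = gibbs_pmf(f_vals, lam)
print(lam, q, -sum(p * math.log(p) for p in q))
```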
Question 6 (A revision of the strong law of large numbers (SLLN) from probability theory; please take this question as a reference.) Let X be a real-valued random variable.
(a) Assume additionally that X is non-negative. Show that for every x > 0, we have
\[
P(X \ge x) \le \frac{E[X]}{x}.
\]
(b) Let X be a random variable with mean µ and variance σ². Show that for every ε > 0,
\[
P(|X - \mu| > \varepsilon) \le \frac{\sigma^2}{\varepsilon^2}.
\]
(c) Let (X_n)_{n≥1} be a sequence of i.i.d. random variables with mean µ and variance σ². Show that (1/m) Σ_{n=1}^m X_n converges to µ in probability, i.e., for every ε > 0,
\[
\lim_{m\to+\infty} P\Biggl(\,\Bigl|\frac{1}{m}\sum_{n=1}^{m} X_n - \mu\Bigr| > \varepsilon\Biggr) = 0.
\]
This is a weak version of the SLLN. It can be strengthened by the Borel–Cantelli lemma to the often-used version: P(lim_{m→+∞} (1/m) Σ_{n=1}^m X_n = µ) = 1.
Answer 6 (a) E[X] = E[X 1_{X≥x}] + E[X 1_{X<x}] ≥ E[X 1_{X≥x}] ≥ E[x 1_{X≥x}] = x P(X ≥ x), which gives the inequality after dividing by x > 0.
(b) Similarly to part (a), for any random variable Y and constant ε > 0 we have P(|Y| > ε) ≤ P(Y² ≥ ε²) ≤ E[Y²]/ε², by applying part (a) to the non-negative random variable Y². Applying this with Y = X − µ gives the inequality in the question.
(c) For any integer m, denote Y_m = (1/m) Σ_{n=1}^m X_n − µ; then E[Y_m] = 0 and Var(Y_m) = σ²/m. Hence by part (b),
\[
P(|Y_m| > \varepsilon) \le \frac{\sigma^2}{m\varepsilon^2} \xrightarrow{\;m\to+\infty\;} 0.
\]
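The bound in (b) and the convergence in (c) can also be seen by simulation; in this sketch (the Uniform(0,1) distribution, ε, and the number of trials are arbitrary choices made for illustration), the empirical frequency of |sample mean − µ| > ε is compared with Chebyshev's bound σ²/(mε²).

```python
import random

# i.i.d. Uniform(0,1) samples: mu = 0.5, sigma^2 = 1/12.
mu, var, eps, trials = 0.5, 1 / 12, 0.05, 2000
for m in (10, 100, 1000):
    exceed = sum(
        abs(sum(random.random() for _ in range(m)) / m - mu) > eps
        for _ in range(trials)
    )
    print(m, exceed / trials, var / (m * eps * eps))  # empirical freq vs bound
```

Both columns decrease towards 0 as m grows, as the weak law predicts (for small m the Chebyshev bound may exceed 1 and is then vacuous).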
Question 7 (Optional) Partition the interval [0, 1] into n disjoint sub-intervals of lengths p_1, · · · , p_n. Let X_1, X_2, · · · be i.i.d. random variables, uniformly distributed on [0, 1], and let Z_m(i) be the number of X_1, · · · , X_m that lie in the i-th interval of the partition. Show that the random variables
\[
R_m = \prod_{i=1}^{n} p_i^{Z_m(i)}
\]
satisfy
\[
\frac{1}{m}\log(R_m) \xrightarrow{\;m\to+\infty\;} \sum_{i=1}^{n} p_i \log(p_i)
\]
with probability 1.
Answer 7 Denote by I_i the i-th sub-interval. By the definition of Z_m(i), we have Z_m(i) = Σ_{j=1}^m 1_{X_j∈I_i}, and by the (strong) law of large numbers, for each i,
\[
P\Biggl(\lim_{m\to+\infty} \frac{\sum_{j=1}^{m} 1_{X_j\in I_i}}{m} = p_i\Biggr) = 1.
\]
It is then easy to see that, with probability 1,
\[
\frac{1}{m}\log(R_m) = \frac{1}{m}\sum_{i=1}^{n} Z_m(i)\log(p_i) = \sum_{i=1}^{n} \log(p_i)\,\frac{\sum_{j=1}^{m} 1_{X_j\in I_i}}{m}
\xrightarrow{\;m\to+\infty\;} \sum_{i=1}^{n} p_i \log(p_i),
\]
since the intersection of the n almost-sure events above is again almost sure.
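A simulation sketch of the convergence in Question 7 (the particular partition and sample size below are illustrative assumptions, not part of the solution):

```python
import math
import random

p = [0.5, 0.3, 0.2]                      # sub-interval lengths
edges = [0.0, 0.5, 0.8, 1.0]             # corresponding partition of [0, 1]
target = sum(pi * math.log(pi) for pi in p)

m = 100_000
counts = [0] * len(p)                    # counts[i] plays the role of Z_m(i)
for _ in range(m):
    x = random.random()
    i = next(k for k in range(len(p)) if x < edges[k + 1])
    counts[i] += 1

log_Rm = sum(counts[i] * math.log(p[i]) for i in range(len(p)))
print(log_Rm / m, target)                # the two numbers should be close
```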