Dynamic Programming Part I
1 Introduction
Dynamic Programming (DP) is a formulation and solution methodology that
helps solve complex decision-making problems. It achieves this by breaking
a decision problem into a sequence of smaller, more manageable decisions,
all tied together by the key concept known as the Principle of Optimality
(PoO).
A layman's explanation of the PoO states that, given an optimal decision
sequence, any sub-part of that sequence is also optimal. That is, the PoO
implies that if a solution is the best one overall, then any part of the
solution must also be the best for the corresponding part of the problem.
For example, consider finding the shortest route from point A to point
G via intermediate points B, C, and E. According to PoO, the route from
A to C must also be optimal. Otherwise, a shorter route from A to C could
improve the overall journey to G, which contradicts the assumption that the
original route was the shortest.
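The PoO can be checked mechanically on a toy shortest-route instance. The Python sketch below uses a small hypothetical graph (the edge weights are invented for illustration and are not taken from the text): it enumerates all simple routes from A to G, takes the cheapest, and verifies that its A-to-C prefix is itself a cheapest A-to-C route.

```python
# Illustrating the Principle of Optimality on a small directed graph.
# Edge weights are made up for this sketch.
edges = {
    ("A", "B"): 2, ("A", "C"): 7, ("B", "C"): 3,
    ("C", "E"): 4, ("C", "G"): 6, ("E", "G"): 1,
}

def all_paths(src, dst, path=None):
    """Enumerate all simple paths from src to dst."""
    path = (path or []) + [src]
    if src == dst:
        yield path
        return
    for (u, v) in edges:
        if u == src and v not in path:
            yield from all_paths(v, dst, path)

def cost(path):
    return sum(edges[(u, v)] for u, v in zip(path, path[1:]))

best_AG = min(all_paths("A", "G"), key=cost)   # cheapest route A -> G
best_AC = min(all_paths("A", "C"), key=cost)   # cheapest route A -> C

# The A->C prefix of the optimal A->G route is itself an optimal A->C route.
prefix_AC = best_AG[: best_AG.index("C") + 1]
assert prefix_AC == best_AC
```

If the prefix were not optimal, splicing in a cheaper A-to-C route would improve the whole journey, contradicting the optimality of the A-to-G route, which is exactly the argument made above.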
2 Dynamic Programming Formulation
The main components of a Dynamic Programming formulation are as follows:
1. Stages: These represent the sub-parts of the problem, often linked to
different points in time or steps in the process.
2. State: The state of the system or problem at any given stage. This is
the information available to the decision-maker at that point.
3. Decision: The action or control that the planner chooses at a particular
stage.
4. Cost-to-go function: A function representing the remaining cost
from a current state to the end of the problem.
5. Optimal value: The goal is to determine the best possible value of
the objective function by optimizing the cost-to-go function.
6. Recursive equation: The problem is typically solved using a recursive
formula, breaking the larger problem into smaller, more manageable
sub-problems. The recursion can be built to be forward or backward
according to how stages are explored.
7. Boundary conditions: These are the simplest sub-parts where the
decision-making is trivial, often used as a starting point for the recursive
process.
2.1 Example I: The Stagecoach Problem
As a first example, we consider the Stagecoach Problem, which is closely
related to the Shortest Path Problem.
In the 1800s, a salesman has to travel from city 1 to city 10, and the roads
between these cities are dangerous. For each leg of the journey, the salesman
must purchase travel insurance for his merchandise.
The cost of insurance for traveling between cities i and j is denoted by
aij , as shown in the figure below.
The journey is taken using a stagecoach, which departs each city in the
morning and arrives at another city by evening. The salesman spends the
night at a traveler’s inn before continuing the next leg of the journey the
following day.
The objective is to find the least expensive route (in terms of insurance
costs) that the salesman can take from city 1 to city 10.
The main elements of the DP formulation for this problem are:
1. Stages: Each day of the journey can be viewed as a stage, so the
problem has four stages corresponding to four days of travel. Formally,
t = 1, . . . , 4.
2. State: The state at any given stage is the city from which the salesman
starts the day’s journey, denoted by i where i belongs to the set of cities
Ct at stage t.
Figure 1: The Stagecoach Problem: A map illustrating the cities and the
insurance costs between them.
3. Decision: The decision is to choose the next city, j, that the salesman
will travel to, where j is in the set of cities Ct+1 at the next stage.
4. Cost-to-go function: Let ft (i) represent the minimum insurance cost
for a route that starts at city i on day t and follows an optimal path
onward to the destination.
5. Optimal cost: The objective is to find f1 (1), the minimum insurance
cost starting at city 1.
6. Recursive equation: The recursive equation for stage t is:
ft(i) = min_{j ∈ Ct+1} {aij + ft+1(j)},   ∀i ∈ Ct
7. Boundary conditions: If we are at Stage 4, i.e., at the dawn of day
4, the salesman is either in city 8 or in city 9. The cost of traveling
from these cities to city 10 is already known: f4(i) = ai,10 for i = 8, 9.
We shall solve the problem backwards.
At Stage 3, the salesman could be in city 5, 6, or 7. The minimum
cost-to-go from these cities is calculated as:
Focusing on the first equality, f3(5) is the minimum cost for reaching city
10 starting from city 5 on the third day of travel. The first term is the cost
when the route passes through city 8, paying the insurance cost a58 = 1. The
second term describes the cost if the route goes through city 9.
Now, stage 2:
Finally,
Note that this problem could also have been solved using Dijkstra's Algorithm,
as it is a shortest path problem.
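As a sketch of the backward recursion, the following Python snippet solves a stagecoach instance with the stage structure described above (C1 = {1}, ..., C5 = {10}). Only a58 = 1 is given in the text; every other insurance cost below is an illustrative placeholder, so the resulting value refers to this made-up data, not to Figure 1.

```python
# Backward recursion for a stagecoach instance. Only a[5, 8] = 1 comes from
# the text; all other costs are invented placeholders for this sketch.
INF = float("inf")
stages = [[1], [2, 3, 4], [5, 6, 7], [8, 9], [10]]   # C1 .. C5
a = {  # a[i, j] = insurance cost for the leg from city i to city j
    (1, 2): 4, (1, 3): 3, (1, 4): 5,
    (2, 5): 6, (2, 6): 4, (3, 5): 3, (3, 6): 2, (3, 7): 4,
    (4, 6): 5, (4, 7): 3,
    (5, 8): 1, (5, 9): 4, (6, 8): 3, (6, 9): 2, (7, 8): 4, (7, 9): 3,
    (8, 10): 5, (9, 10): 2,
}

f = {10: 0}        # boundary: zero cost-to-go at the destination
policy = {}        # best next city from each city
for t in range(len(stages) - 2, -1, -1):        # stages, backwards
    for i in stages[t]:
        best = min(((a.get((i, j), INF) + f[j], j) for j in stages[t + 1]),
                   key=lambda pair: pair[0])
        f[i], policy[i] = best
print("minimum insurance cost f1(1) =", f[1])
```

Missing arcs are given infinite cost so the recursion simply skips them; the `policy` dictionary allows the cheapest route to be traced forward from city 1.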
2.2 Example II: The Integer Knapsack Problem
Now we will see how to use dynamic programming to solve the Integer Knapsack
Problem. Recall that we have encountered both the binary version
and the integer version of this problem (the Cutting Stock Problem). Here,
we revisit the integer version.
In the integer knapsack problem, the variables xj for each item j =
1, . . . , n are non-negative integers that represent the quantity of each item.
Of each item, we can take as many copies as the knapsack's capacity allows.
The problem can be formulated as follows:
max Σ_{j=1}^{n} pj xj

Subject to:

Σ_{j=1}^{n} aj xj ≤ b
where pj is the profit from item j, aj is the space or weight item j occupies,
and b is the total capacity (maximum weight or space) of the knapsack.
We now proceed to the Dynamic Programming formulation:
1. Stages: Each item j represents a stage in the decision-making process.
2. State: The state at any given stage is the remaining space (or capacity)
still available in the knapsack, denoted by y at stage j.
3. Decision: The decision at each stage is how many units of item j to
add to the knapsack, represented by xj .
4. Cost-to-go function: Let fj(y) represent the maximum profit attainable
starting from stage j with a remaining capacity of y, while
following an optimal policy in stages j, j + 1, . . . .
5. Optimal value sought: The goal is to find f1 (b), which gives the
maximum possible profit starting at the first item with a knapsack of
capacity of b.
6. Recursive equation: The recursive equation for this problem is:
fj(y) = max_{0 ≤ xj ≤ ⌊y/aj⌋, xj ∈ Z+} {pj xj + fj+1(y − aj xj)}
This equation holds for all stages j and for all remaining capacities
y = 0, 1, . . . , b. The interpretation is: if a number xj of copies of item j
is inserted in the knapsack, a profit pj xj is earned, plus the maximum profit
achievable from items j + 1, . . . , n in the remaining capacity y − aj xj.
7. Boundary Condition: If we reach the last item, the decision becomes
straightforward because no further decisions are needed. The cost-to-go
function at this point is:
fn(y) = pn ⌊y/an⌋   ∀y = 0, 1, . . . , b
This tells us how many copies of the last item we can take, given the
remaining capacity.
The recursion works in a backward fashion.
Consider the following integer knapsack problem:
max 11x1 + 7x2 + 5x3
Subject to:
6x1 + 4x2 + 3x3 ≤ 15
where x1, x2, x3 are non-negative integers.
We will now solve the problem starting from the last stage (item 3). At
stage 3, the boundary condition applies, and we can directly calculate the
maximum profit based on the remaining capacity. For capacities of 3 or more,
we begin adding item 3:
Continuing for higher capacities:
At stage 2, we calculate the maximum profit by adding item 2.
The two arguments in the function refer, respectively, to the choice of not
inserting item 2, thus using the 4 units of capacity to pick item 3, and to the
choice of inserting item 2, gaining a profit of 7.
Finally for Stage 1, it is sufficient to consider only y = 15 as our knapsack
instance has a capacity of 15.
The optimal policy is then traced back:
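The whole computation can be reproduced with a short Python sketch of the backward recursion for this instance (profits 11, 7, 5; weights 6, 4, 3; capacity 15). For convenience, an auxiliary stage with f4(y) = 0 stands in for the explicit boundary condition at the last item.

```python
# Backward recursion for the integer knapsack instance above:
# max 11x1 + 7x2 + 5x3  s.t.  6x1 + 4x2 + 3x3 <= 15, xj nonnegative integers.
p = [11, 7, 5]   # profits
a = [6, 4, 3]    # weights
b = 15           # capacity
n = len(p)

# f[j][y] = max profit using items j, ..., n-1 with remaining capacity y;
# the extra row f[n][y] = 0 plays the role of the boundary condition.
f = [[0] * (b + 1) for _ in range(n + 1)]
best_x = [[0] * (b + 1) for _ in range(n)]
for j in range(n - 1, -1, -1):                 # stages, backwards
    for y in range(b + 1):
        for xj in range(y // a[j] + 1):        # 0 <= xj <= floor(y / a[j])
            value = p[j] * xj + f[j + 1][y - a[j] * xj]
            if value > f[j][y]:
                f[j][y], best_x[j][y] = value, xj

# Trace the optimal policy forward, starting from f1(b).
y, plan = b, []
for j in range(n):
    plan.append(best_x[j][y])
    y -= a[j] * plan[-1]
print("optimal profit:", f[0][b], " plan (x1, x2, x3):", plan)
```

The `best_x` table records, for each state, the maximizing decision, which is exactly what the trace-back step of the method uses.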
2.3 Example III: Non-linear (Discrete) Resource Allocation
In this section, we explore a non-linear version of a knapsack-type problem:
the Resource Allocation problem, which can be solved using Dynamic
Programming.
A corporation has $5 million to allocate among its three plants for possible
expansion. Each plant has submitted a series of proposals detailing the cost
of expansion and the expected revenue generated by the expansion. Each
plant will only be permitted to enact one of its proposals.
Proposal c1 r1 c2 r2 c3 r3
1 0 0 0 0 0 0
2 1 5 2 8 1 4
3 2 6 3 9 - -
4 - - 4 12 - -
The columns represent the different plants and their corresponding proposals.
For each proposal, c indicates the cost of expansion (in millions of
dollars), and r represents the expected revenue (in millions of dollars). The
goal is to maximize the total revenue using the $5 million available. We will
assume that any of the $5 million we don’t spend is lost.
1. Stages: Each plant j = 1, 2, 3 represents a stage in the decision-making
process.
2. State: The state at any stage j is the remaining budget, denoted yj ,
in millions.
3. Decision: The decision at each stage is which proposal xj to choose
for plant j. The chosen proposal will have a corresponding cost cj and
revenue rj .
4. Cost-to-go function: Let fj(y) denote the maximum revenue attainable
starting from stage j (plant j) with budget y remaining and following
an optimal policy for plants j, j + 1, . . ..
5. Optimal value sought: The goal is to find the optimal value at the
first stage, f1 (5).
6. Recursive Equation: The recursive equation for this DP problem is:
fj(y) = max_{xj : cj(xj) ≤ y} {rj(xj) + fj+1(y − cj(xj))}
This equation tells us that at each stage j, given a remaining budget
y, we must choose the proposal xj that maximizes the sum of the
revenue from that proposal and the maximum revenue from the
remaining stages (plants) with budget y − cj(xj). Note the similarity to
the knapsack dynamic programming formulation.
7. Boundary Condition: For the final stage (plant j = 3), the decision
is straightforward since there are no further stages:

f3(y) = max_{x3 : c3(x3) ≤ y} {r3(x3)}   for y = 0, 1, . . . , 5
We will now solve the problem backwards, starting from the last stage
and working towards the first stage.
For plant 3, we directly compute the maximum revenue for each budget
level:
Plant 3 can only choose from proposal 1 (which yields no revenue) or
proposal 2 (which costs 1 million and generates 4 million in revenue).
Next, we compute the maximum revenue for plant 2:
For larger budgets, we compute:
Consider f2(5). The first argument assumes that proposal 1 is selected
for plant 2, that is, the "no investment" option for plant 2, while proposal 2
for plant 3 is chosen by investing $1 million. The second argument
considers investing $2 million in plant 2 for its proposal 2, and $1 million
in plant 3 for its proposal 2. The third option is to invest $3 million in
proposal 3 presented by plant 2, while $1 million is invested in proposal 2
of plant 3. Finally, the last argument represents the choice of investing $4
million in plant 2's proposal 4 and the remaining amount in proposal 2 of
plant 3.
Finally, for plant 1, we calculate the maximum revenue:
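A compact Python sketch of the recursion, with the proposals from the table encoded as (cost, revenue) pairs, computes f1(5) directly.

```python
# Backward recursion for the plant-expansion example: $5M across three plants.
# Proposals are (cost, revenue) pairs, in millions, from the table above
# (the "-" entries are simply omitted).
proposals = [
    [(0, 0), (1, 5), (2, 6)],           # plant 1
    [(0, 0), (2, 8), (3, 9), (4, 12)],  # plant 2
    [(0, 0), (1, 4)],                   # plant 3
]
B = 5
n = len(proposals)

# f[j][y] = max revenue from plants j, ..., n-1 with budget y remaining;
# the extra row f[n][y] = 0 means no revenue once all plants are decided.
f = [[0] * (B + 1) for _ in range(n + 1)]
for j in range(n - 1, -1, -1):             # stages (plants), backwards
    for y in range(B + 1):
        f[j][y] = max(r + f[j + 1][y - c]
                      for (c, r) in proposals[j] if c <= y)
print("maximum revenue f1(5) =", f[0][B])
```

Because every plant includes a zero-cost "do nothing" proposal, the inner `max` is never taken over an empty set, so no special-casing is needed.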
2.4 Example IV: Equipment Replacement
Suppose a shop needs a specific machine continuously for the next five years.
Each new machine costs b = $1000. The maintenance costs for the machine
depend on its age: during the 1st year $60, during the 2nd year $80, during
the 3rd year $120. Let mi be the maintenance cost for a machine during its
i-th year of operation. A machine can be kept for a maximum of three years
before being traded in. The trade-in values depending on the machine’s age
are: after 1 year $800, after 2 years $600, after 3 years $500. We denote by
si the trade-in value of a machine after i years of operation.
Our goal is to minimize costs over this period by determining an optimal
replacement/maintenance strategy.
Recall that we can solve this problem using Dijkstra's shortest path
algorithm on a suitable graph: one node per year, and an arc (i, j) whenever
a new machine is purchased at year i (at cost b) and sold at year j with
salvage value sj−i; the arc cost is cij = b + Σ_{k=1}^{j−i} mk − sj−i.
The elements of the dynamic programming formulation are:
1. Stages: Each stage corresponds to the beginning of each year, t =
1, . . . , 5.
2. State: The state, xt , is the age of the machine at the beginning of year
t.
3. Decision: At each stage, the decision is whether to keep (K) the machine
or replace (R) it.
4. Cost-to-go function: Define ft(x) as the minimum cost starting in
year t with a machine that is x years old and following an optimal
policy through years t, t + 1, . . . , 5.
5. Optimal value sought: We aim to find f1 (0), as we start with a
brand-new machine.
6. Recursive equation: Consider each possible age for the machine at
time t = 1, 2, 3, 4.
• If the machine is three years old at time t, it must be traded in:
ft (3) = −s3 + b + m1 + ft+1 (1) = −500 + 1000 + 60 + ft+1 (1).
While s3 is related to the trade-in of the old machine, b and m1 are
the costs for buying the new machine and to maintain it during
its first year of operation, respectively. Then, ft+1 (1) comprises
the costs for the subsequent year having a 1 year old machine.
• If the machine is one or two years old at time t, we have two
options, replace (R) or keep (K):

ft(2) = min{ −s2 + b + m1 + ft+1(1) [R],  m3 + ft+1(3) [K] }
      = min{ −600 + 1000 + 60 + ft+1(1),  120 + ft+1(3) }

ft(1) = min{ −s1 + b + m1 + ft+1(1) [R],  m2 + ft+1(2) [K] }
      = min{ −800 + 1000 + 60 + ft+1(1),  80 + ft+1(2) }
7. Boundary conditions: At the end of the 5-year period (beginning of
year 6), we assume the machine is no longer needed:
f6 (x) = −sx , x = 1, 2, 3
We now compute the cost-to-go values starting from the end of the period
and moving backwards.
For stage 5 (beginning of year 5)
For stage 4
At stage 3
In stage 2 the machine is only one year old:
In year 1, we start with a brand-new machine:
There are multiple optimal policies that result in the minimum cost of
1280. Let BN denote the "buy new" option. The policies are:
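The backward computation can be condensed into a few lines of Python using the data above; it recovers the minimum total cost of 1280.

```python
# Backward recursion for the equipment-replacement example (data from the text).
b = 1000                        # price of a new machine
m = {1: 60, 2: 80, 3: 120}      # maintenance cost during the i-th year of use
s = {1: 800, 2: 600, 3: 500}    # trade-in value after i years

# Boundary: at the start of year 6 the machine is sold, f6(x) = -s_x.
f = {(6, x): -s[x] for x in (1, 2, 3)}
for t in range(5, 1, -1):                         # stages t = 5, 4, 3, 2
    f[t, 3] = -s[3] + b + m[1] + f[t + 1, 1]      # age 3: must replace
    for x in (1, 2):
        replace = -s[x] + b + m[1] + f[t + 1, 1]  # trade in, buy, maintain
        keep = m[x + 1] + f[t + 1, x + 1]         # keep it one more year
        f[t, x] = min(replace, keep)

# Year 1: we start with no machine, so we buy and maintain the first one.
f1 = b + m[1] + f[2, 1]
print("minimum total cost f1(0) =", f1)
```

Negative intermediate values are expected: they occur when the remaining trade-in revenue exceeds the remaining purchase and maintenance costs.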
2.5 Example V: Non-additive recursion
Here is an example where we multiply terms to get the recursion.
A student is currently taking three courses. It is important that he not
fail all of them. If the probability of failing French is p1 , the probability of
failing English is p2 , and the probability of failing Statistics is p3 , then the
probability of failing all of them is p1 × p2 × p3 . He has left himself with
four hours to study. How should he minimize his probability of failing all his
courses?
Denote the entries in the table below as pt(k), the probability of failing
course t given that k hours are spent on it.
Table 1: Student failure probabilities.
Hours French English Statistics
0 0.8 0.75 0.9
1 0.7 0.7 0.7
2 0.65 0.67 0.6
3 0.62 0.65 0.55
4 0.6 0.62 0.5
1. Stages: Each stage corresponds to a course, t = 1, 2, 3.
2. State: The number of hours x still available for studying for the course
of the current stage and all subsequent stages.
3. Decision: At each stage, decide how many hours k to spend studying
the corresponding subject.
4. Cost-to-go function: Define ft(x) as the probability of failing course t
and all following courses, assuming x hours are available and an
optimal policy is employed for all the following courses.
5. Optimal value sought: We aim to find f1(4), as we have 4 hours
available for studying in total.
6. Recursive equation: The recursion is given as follows for each stage
t:
ft(x) = min_{0 ≤ k ≤ x} {pt(k) · ft+1(x − k)}.
Given x remaining hours of study, the function computes the minimum
failure probability depending on the number of hours k invested in
studying for course t.
7. Boundary conditions: For the final course, we invest all the remaining
time:

f3(x) = p3(x)   ∀x ≤ 4
The solution is computed by backward recursion.
Starting the dynamic programming, boundary conditions are
At stage 2, the recursion becomes
By iterating, one obtains the following:
Table 2: f2 (x) results.
Finally, in stage 1 it is sufficient to compute the value for x = 4, that is
the full study time available:
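This computation can be carried out with a short Python sketch of the backward recursion, using the probabilities in Table 1.

```python
# Backward recursion for the study-time example, with the failure
# probabilities of Table 1 indexed by hours k = 0..4.
p = {
    1: [0.80, 0.70, 0.65, 0.62, 0.60],  # French
    2: [0.75, 0.70, 0.67, 0.65, 0.62],  # English
    3: [0.90, 0.70, 0.60, 0.55, 0.50],  # Statistics
}
H = 4  # total hours available

# Boundary: all remaining time goes to the last course, f3(x) = p3(x).
f = {(3, x): p[3][x] for x in range(H + 1)}
for t in (2, 1):                        # backwards over the courses
    for x in range(H + 1):
        f[t, x] = min(p[t][k] * f[t + 1, x - k] for k in range(x + 1))
print("minimum probability of failing all courses:", f[1, H])
```

Note that the recursion multiplies rather than adds the stage terms, which is exactly what makes this a non-additive example; the structure of the code is otherwise identical to the previous ones.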
2.6 Example VI: The Traveling Salesperson Problem
Recall that the Traveling Salesperson Problem (TSP) asks for a tour visiting
all n cities at minimum total distance. For instance, consider a politician
who begins his tour in New York and has to visit Miami, Dallas, and Chicago
before returning to New York.
Table 3: Distances dij among cities.
New York Miami Dallas Chicago
1. New York - 1334 1559 809
2. Miami 1334 - 1343 1397
3. Dallas 1559 1343 - 921
4. Chicago 809 1397 921 -
1. Stages: Each stage t represents having visited t cities.
2. State: The city i in which the politician currently is, together with the
set S of t cities previously visited, i.e., the pair (i, S). This information
is required to decide where to go next and to compose a feasible tour
(no city, except the starting one, is visited twice).
3. Decision: The next city j to visit along the tour.
4. Cost-to-go function: Define ft(i, S) as the minimum total distance
of a route that starts from city i after visiting the t cities in S and
follows an optimal route back to the destination.
5. Optimal value sought: We aim to find f0(1, ∅), the tour value
starting from New York with no city visited yet.
6. Recursive equation: The recursion is given as follows for each stage
t:
ft(i, S) = min_{j ∉ S, j ≠ i} {dij + ft+1(j, S ∪ {j})}.
Given that the politician is in city i and has already visited the cities
in S, the recursion evaluates all possible next cities j and picks the one
leading to the minimum total distance over stages t, t + 1, . . . .
7. Boundary conditions: For the final stage, the conditions are set for
every possible last city i in which the politician can be located:
fn−1 (i, S = {2, . . . , n}) = di1 ∀i = 2, . . . , n
We proceed backward.
First, we set boundary conditions
At stage 2, let S = {2, 3} :
Let S = {2, 4}
Let S = {3, 4}
At stage 1, given S = {2}
Let S = {3}
When S = {4}
Finally, the optimal value is
Although a dynamic programming approach can be devised to solve the TSP,
the state space here is extremely large due to the dependence on the set S.
Indeed, there exist |S| states for each possible S ⊆ {2, . . . , n}, and there
are 2^{n−1} such subsets. To get an intuition, consider a TSP instance with
20 cities: the number of states in the 10th stage alone is roughly a million.
Therefore, the computational burden makes this method impractical for large
instances, no matter the machine available nowadays (or in the future).
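For completeness, the recursion above (known in the literature as the Held-Karp algorithm) can be sketched in Python for the four-city instance of Table 3, with New York indexed as city 0.

```python
# Held-Karp recursion for the four-city tour (distances from Table 3).
from itertools import combinations

d = [[0,    1334, 1559, 809],
     [1334, 0,    1343, 1397],
     [1559, 1343, 0,    921],
     [809,  1397, 921,  0]]
n = 4

# f[(i, S)] = shortest completion of the tour from city i, where S is the
# frozenset of already-visited cities (excluding the start, city 0).
f = {}
cities = range(1, n)                 # city 0 = New York is the start
full = frozenset(cities)
for i in cities:                     # boundary: all cities visited, go home
    f[i, full] = d[i][0]
for size in range(n - 2, 0, -1):     # |S| = n-2, ..., 1, backwards
    for S in map(frozenset, combinations(cities, size)):
        for i in S:
            f[i, S] = min(d[i][j] + f[j, S | {j}]
                          for j in cities if j not in S)
optimal = min(d[0][j] + f[j, frozenset({j})] for j in cities)
print("optimal tour length:", optimal)
```

The triple loop makes the state-space blow-up discussed above concrete: the middle loop runs over all subsets of the intermediate cities, so the running time grows exponentially with n even though each single state is cheap to evaluate.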