Code Optimization Techniques Guide
T-4
PART-A
Code Optimization
Introduction
• Concerns with machine-independent code optimization:
– Profile and optimize (user)
– Loop, procedure call, and address calculation improvement (compiler)
– Register usage, instruction choice, peephole optimization (compiler)
Introduction
• Organization of an optimizing compiler: the code optimizer
performs control flow analysis, then data flow analysis, then
transformation.
Classifications of
Optimization techniques
Peephole optimization
Local Optimization
Global Optimization
Inter-procedural
Intra-procedural
Loop Optimization
Factors influencing
Optimization
• The target machine: machine-dependent factors can be
parameterized to the compiler for fine tuning
• Machine Architecture
– Cache Size and type
– Cache/Memory transfer
rate
Themes behind
Optimization Techniques
• Avoid redundancy: something already computed need
not be computed again
• Smaller code: less work for CPU, cache, and memory!
• Fewer jumps: jumps interfere with code pre-fetch
• Code locality: code executed close together in time is
generated close together in memory, increasing locality of
reference
• Extract more information about code: more info means
better code generation
Redundancy
Elimination
• Redundancy elimination = determining that two computations
are equivalent and eliminating one.
– Value numbering
• Associates symbolic values to computations and
identifies expressions that have the same value
• Two ways:
– Constant folding
– Constant propagation
Compile-Time
Evaluation
• Constant folding: Evaluation of an expression with constant
operands to replace the expression with a single value
• Example:
area := (22.0/7.0) * r ** 2
area := 3.14286 * r ** 2
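Folding can be sketched as a small recursive pass over an expression tree. The tuple encoding, the `fold` name, and the operator table below are illustrative choices, not from the slides; a real compiler would run the equivalent pass on its own IR.

```python
# Constant folding on a toy expression tree: a node is either a leaf
# (number or variable name) or a tuple ('op', left, right).
import operator

OPS = {'+': operator.add, '-': operator.sub,
       '*': operator.mul, '/': operator.truediv}

def fold(expr):
    """Replace every sub-expression with constant operands by its value."""
    if not isinstance(expr, tuple):
        return expr                          # leaf: constant or variable
    op, left, right = expr
    left, right = fold(left), fold(right)
    if isinstance(left, (int, float)) and isinstance(right, (int, float)):
        return OPS[op](left, right)          # both operands constant
    return (op, left, right)

# area := (22.0/7.0) * r ** 2  becomes  area := 3.14286... * r ** 2
print(fold(('*', ('/', 22.0, 7.0), 'r_squared')))
```

Here `r_squared` stands in for the r ** 2 part, which cannot be folded because r is unknown at compile time.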
Compile-Time
Evaluation
• Constant propagation: Replace a variable with the constant
that has been assigned to it earlier.
• Example:
pi := 3.14286
area := pi * r ** 2
area := 3.14286 * r ** 2
Constant
Propagation
• What does it mean?
– Given an assignment x = c, where c is a constant, replace
later uses of x with uses of c, provided there are no
intervening assignments to x.
• Similar to copy propagation
• Extra feature: It can analyze constant-value
conditionals to determine whether a branch should
be executed or not.
• When is it performed?
– Early in the optimization process.
• What is the result?
– Smaller code
– Fewer registers
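A minimal sketch of the pass over straight-line three-address code, assuming instructions are (dest, arg1, op, arg2) tuples with op = None for a plain copy; this representation and the function name are illustrative:

```python
# Constant propagation over one basic block: remember x = c bindings
# and substitute them into later operands until x is reassigned.
def propagate_constants(block):
    consts = {}                              # var -> known constant value
    result = []
    for dest, a1, op, a2 in block:
        a1 = consts.get(a1, a1)              # substitute known constants
        a2 = consts.get(a2, a2)
        if op is None and isinstance(a1, (int, float)):
            consts[dest] = a1                # record the binding dest = c
        else:
            consts.pop(dest, None)           # dest redefined: forget it
        result.append((dest, a1, op, a2))
    return result

block = [('pi', 3.14286, None, None),        # pi := 3.14286
         ('area', 'pi', '*', 'r2')]          # area := pi * r2
print(propagate_constants(block))            # the use of pi becomes 3.14286
```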
Common Sub-
expression Evaluation
• Identify common sub-expressions present in different
expressions, compute once, and use the result in all the
places.
– The definition of the variables involved should not
change
Example:
a := b * c temp := b * c
… a := temp
… …
x := b * c + 5 x := temp + 5
Common
Subexpression
Elimination
• Local common subexpression elimination
– Performed within basic blocks
– Algorithm sketch:
• Traverse BB from top to bottom
• Maintain table of expressions evaluated so far
– if any operand of the expression is redefined,
remove it from the table
Common
Subexpression
Elimination
• Modify applicable instructions as you go
– generate temporary variable, store the expression
in it and use the variable next time the
expression is encountered.
x = a + b        t = a + b
……               x = t
……               …..
y = a + b        y = t
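The table-driven algorithm above can be sketched as follows, over (dest, arg1, op, arg2) instructions; this representation and the temp-naming scheme are illustrative:

```python
# Local CSE within one basic block: keep a table mapping each
# evaluated expression to the temporary that holds it, and drop
# table entries whose operands get redefined.
def local_cse(block):
    table = {}                       # (arg1, op, arg2) -> temp holding it
    out, ntemp = [], 0
    for dest, a1, op, a2 in block:
        key = (a1, op, a2)
        if key in table:
            out.append((dest, table[key], None, None))   # dest = temp
        else:
            ntemp += 1
            temp = 't%d' % ntemp
            out.append((temp, a1, op, a2))               # temp = a1 op a2
            out.append((dest, temp, None, None))         # dest = temp
            table[key] = temp
        # expressions using the (re)defined variable are now stale
        table = {k: v for k, v in table.items()
                 if dest not in (k[0], k[2])}
    return out

print(local_cse([('x', 'a', '+', 'b'), ('y', 'a', '+', 'b')]))
```

The second `a + b` is replaced by a reuse of the temporary introduced for the first.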
Common
Subexpression
Elimination
c = a + b          t1 = a + b
d = m * n          c = t1
e = b + d          t2 = m * n
f = a + b          d = t2
g = -b             t3 = b + d
h = b + a          e = t3
a = j + a          f = t1
k = m * n          g = -b
j = b + d          h = t1 /* commutative */
a = -b             a = j + a
if m * n go to L   k = t2
                   j = t3
                   a = -b
                   if t2 go to L
the table contains quintuples:
(pos, opd1, opr, opd2, tmp)
Common
Subexpression
Elimination
• Global common subexpression elimination
– Performed on flow graph
– Requires available expression information
• In addition to finding what expressions are available
at the endpoints of basic blocks, we need to know
where each of those expressions was most recently
evaluated (which block and which position within
that block).
Common Sub-
expression Evaluation
(Figure: a flow graph. Block 1: x := a + b. Block 2: a := b.
Block 4: z := a + b + 10.)
“a + b” is not a common sub-expression in 1 and 4, because a is
redefined in block 2 along the path from 1 to 4.
Example:
if (a < b) then          if (a < b) then
  z = x * 2                temp = x * 2
                           z = temp
else                     else
  y = 10                   y = 10
                           temp = x * 2
g = x * 2                g = temp
Code
Motion
• Move an expression out of a loop if its evaluation
does not change inside the loop.
Example:
while ( i < (max - 2) ) …
Equivalent to:
t := max - 2
while ( i < t ) …
Code
Motion
• Safety of code movement:
Movement of an expression e from a basic block bi to
another block bj is safe if it does not introduce any new
occurrence of e along any path.
Strength Reduction
• Replace a costly operator with an equivalent cheaper one.
Example:
                       temp = 5;
for i=1 to 10 do       for i=1 to 10 do
  …                      …
  x = i * 5              x = temp
  …                      …
                         temp = temp + 5
end                    end
• Typical cases of strength reduction occur in address
calculation of array references.
• Applies to integer expressions involving induction variables
(loop optimization)
Dead Code
Elimination
• Dead code is a portion of the program that will not be
executed along any path of the program.
– It can be removed
• Examples:
– No control flows into a basic block
– A variable is dead at a point: its value is not used
anywhere in the program
– An assignment is dead: it assigns a value to a
dead variable
Dead Code
Elimination
• Example:
i = j;
…
x = i + 10
• The optimization can be performed by eliminating the
assignment statement i = j (once copy propagation has
replaced the use of i with j).
• This assignment statement is called a dead assignment.
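A sketch of the elimination itself, as a backward scan over one block. Instructions are simplified to (dest, list-of-used-vars) pairs and `live_out` is the set of variables needed after the block; these names and the encoding are illustrative:

```python
# Dead-assignment elimination: walk the block backwards, tracking
# which variables are still needed; drop assignments to dead ones.
def eliminate_dead(block, live_out):
    live = set(live_out)
    kept = []
    for dest, uses in reversed(block):
        if dest in live:             # the assigned value is consumed later
            live.discard(dest)
            live.update(uses)
            kept.append((dest, uses))
        # else: assignment to a dead variable, drop it
    return kept[::-1]

# Before copy propagation x uses i, so i = j must stay:
print(eliminate_dead([('i', ['j']), ('x', ['i'])], live_out=['x']))
# After copy propagation x uses j directly, so i = j is dead:
print(eliminate_dead([('i', ['j']), ('x', ['j'])], live_out=['x']))
```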
Copy
Propagation
• What does it mean?
– Given an assignment x = y, replace later uses of x
with uses of y, provided there are no intervening
assignments to x or y.
• When is it performed?
– At any level, but usually early in the optimization
process.
i := i + 1
ADD i, #1 INC i
Local
Optimization
Optimization of
Basic Blocks
• Many structure-preserving transformations can be
implemented by construction of DAGs of basic blocks
DAG Representation of a Basic Block (BB)
• Leaves are labeled with a unique identifier (var name or const)
• Interior nodes are labeled by an operator symbol
• Nodes optionally have a list of labels (identifiers)
• Edges relate operands to the operator (interior nodes are
operators)
• An interior node represents a computed value
– Identifiers in its label list are deemed to hold the value
Example:
DAG for BB
t1 := 4 * i
DAG: a * node over leaves 4 and i, labeled t1
t1 := 4 * i
t3 := 4 * i
t2 := t1 + t3
if (i <= 20) goto L1
DAG: one * node over 4 and i, labeled t1, t3; a + node whose
operands are both that * node, labeled t2; a <= node over i and 20,
with target (L1)
Construction of
DAGs for BB
• I/p: Basic block, B
• O/p: A DAG for B containing the following information:
1) A label for each node
2) For leaves the labels are ids or consts
3) For interior nodes the labels are operators
4) For each node a list of attached ids (possibly empty list,
no consts)
Construction of
DAGs for BB
• Data structure and functions:
– Node:
1) Label: label of the node
2) Left: pointer to the left child node
3) Right: pointer to the right child node
4) List: list of additional labels (empty for leaves)
– Node (id): returns the most recent node created for id.
Else return undef
– Create(id,l,r): create a node with label id with l as left
child and r as right child. l and r are optional params.
Construction of
DAGs for BB
• Method:
For each 3AC, A in B
A is of one of the following forms:
1. x := y op z
2. x := op y
3. x := y
1. if ((ny = node(y)) == undef)
     ny = Create(y);
   if (A == type 1) and ((nz = node(z)) == undef)
     nz = Create(z);
Construction of
DAGs for BB
2. If (A == type 1)
     Find a node labelled ‘op’ with left and right children ny and
     nz respectively [determination of common sub-expression]
     If (not found) n = Create(op, ny, nz);
   If (A == type 2)
     Find a node labelled ‘op’ with a single child ny
     If (not found) n = Create(op, ny);
   If (A == type 3) n = Node(y);
3. Remove x from Node(x).list
   Add x to n.list
   Node(x) = n;
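The method above can be sketched in Python. Instructions are (dest, op, operands...) with op = None for a copy; nodes are plain dicts and `node_of` plays the role of Node(id). All of this encoding is illustrative:

```python
# DAG construction for a basic block, reusing an existing operator
# node when one with the same label and children already exists
# (this is where common sub-expressions are detected).
def build_dag(block):
    nodes, node_of = [], {}          # all nodes; id -> node holding its value

    def leaf(x):                     # Node(x), creating a leaf if undefined
        if x not in node_of:
            node = {'label': x, 'kids': (), 'ids': []}
            nodes.append(node)
            node_of[x] = node
        return node_of[x]

    for dest, op, *args in block:
        if op is None:                           # form 3: x := y
            n = leaf(args[0])
        else:                                    # forms 1 and 2
            kids = tuple(leaf(a) for a in args)
            n = next((m for m in nodes
                      if m['label'] == op and m['kids'] == kids), None)
            if n is None:
                n = {'label': op, 'kids': kids, 'ids': []}
                nodes.append(n)
        if dest in node_of and dest in node_of[dest]['ids']:
            node_of[dest]['ids'].remove(dest)    # remove x from Node(x).list
        n['ids'].append(dest)                    # add x to n.list
        node_of[dest] = n                        # Node(x) = n
    return nodes

dag = build_dag([('t1', '*', 4, 'i'),
                 ('t2', '[]', 'a', 't1'),
                 ('t3', '*', 4, 'i')])
print([n['ids'] for n in dag if n['label'] == '*'])   # t1 and t3 share a node
```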
Example: DAG
construction from BB
t1 := 4 * i
DAG: a * node over leaves 4 and i, labeled t1

Example: DAG
construction from BB
t1 := 4 * i
t2 := a [ t1 ]
DAG: a [] node over leaf a and the * node, labeled t2

Example: DAG
construction from BB
t1 := 4 * i
t2 := a [ t1 ]
t3 := 4 * i
DAG: the existing * node is reused; its label list becomes t1, t3

Example: DAG
construction from BB
t1 := 4 * i
t2 := a [ t1 ]
t3 := 4 * i
t4 := b [ t3 ]
DAG: a second [] node over leaf b and the shared * node, labeled t4

Example: DAG
construction from BB
t1 := 4 * i
t2 := a [ t1 ]
t3 := 4 * i
t4 := b [ t3 ]
t5 := t2 + t4
DAG: a + node over the two [] nodes, labeled t5

Example: DAG
construction from BB
t1 := 4 * i
t2 := a [ t1 ]
t3 := 4 * i
t4 := b [ t3 ]
t5 := t2 + t4
i := t5
DAG: the + node’s label list becomes t5, i
DAG of a
Basic Block
• Observations:
– A leaf node for the initial value of an id
– A node n for each statement s
– The children of node n are the last definition (prior to s)
of the operands of n
Optimization of
Basic Blocks
• Common sub-expression elimination: by construction of DAG
– Note: for common sub-expression elimination, we are
actually targeting expressions that compute the same
value.
a := b + c
b := b – d       “b + c” appears twice: common expressions,
c := c + d       but they do not generate the same result
e := b + c
Optimization of
Basic Blocks
• DAG representation identifies expressions that yield
the same result
a := b + c
b := b – d
c := c + d
e := b + c
DAG: leaves b0, c0, d0; a + node (b0, c0) labeled a; a – node
(b0, d0) labeled b; a + node (c0, d0) labeled c; a + node over the
new b and c, labeled e. Since e uses the new values of b and c, it
does not share a node with a.
Optimization of
Basic Blocks
• Dead code elimination: code generation from DAG
eliminates dead code.
a := b + c        a := b + c
b := a – d        d := a – d
d := a – d        c := d + c
c := d + c
(b is not live: the – node carries labels b and d; with b dead, only
d is generated, so the assignment to b disappears)
Loop
Optimization
Loop
Optimizations
• Most important set of optimizations
– Programs are likely to spend more time in loops
• Presumption: Loop has been identified
• Optimizations:
– Loop invariant code removal
– Induction variable strength reduction
– Induction variable reduction
Loops in
Flow Graph
• Dominators:
A node d of a flow graph G dominates a node n, if every
path in G from the initial node to n goes through d.
Corollaries:
Every node dominates itself.
The initial node dominates all nodes in G.
The entry node of a loop dominates all nodes in the loop.
Loops in
Flow Graph
• Each node n has a unique immediate dominator m, which is
the last dominator of n on any path in G from the initial
node to n.
(d ≠ n) && (d dom n) → d dom m
• Dominator tree (T):
A representation of the dominator information of flow graph G.
• The root node of T is the initial node of G
• A node d in T dominates all nodes in its sub-tree
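The dominator sets can be computed with a straightforward iterative fixed point, shown below; production compilers usually use the faster Lengauer-Tarjan algorithm instead. The predecessor-map encoding is an illustrative choice:

```python
# Iterative dominator computation: d dominates n iff d == n or d
# dominates every predecessor of n. Start from "everything dominates
# everything" and shrink to a fixed point.
def dominators(preds, entry):
    nodes = set(preds)
    dom = {n: set(nodes) for n in nodes}
    dom[entry] = {entry}                 # the entry is dominated only by itself
    changed = True
    while changed:
        changed = False
        for n in nodes - {entry}:
            new = {n} | set.intersection(*(dom[p] for p in preds[n]))
            if new != dom[n]:
                dom[n] = new
                changed = True
    return dom

# Diamond CFG: 1 -> 2, 1 -> 3, 2 -> 4, 3 -> 4 (given as predecessors)
preds = {1: [], 2: [1], 3: [1], 4: [2, 3]}
print(sorted(dominators(preds, 1)[4]))   # -> [1, 4]: neither 2 nor 3 dominates 4
```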
Example: Loops in
Flow Graph
(Figure: a flow graph on nodes 1 through 9, and its dominator tree
rooted at node 1.)
Loops in
Flow Graph
• Natural loops:
1. A loop has a single entry point, called the “header”.
The header dominates all nodes in the loop
2. There is at least one path back to the header from the
loop nodes (i.e. there is at least one way to iterate the
loop)
• Loop fission: break a loop into multiple loops over the same
index range but each taking only a part of the loop's body.
• Example: r2 is a derived induction variable.
Loop: r2 = r1 * 4
      r4 = r7 + 3
      r7 = r7 + 1
      r10 = *r2
      r3 = *r4
      r9 = r1 * r3
      r10 = r9 >> 4
      *r2 = r10
      r1 = r1 + 4
      if (r1 < 100) goto Loop
Induction Variable
Strength Reduction
• Create basic induction variables from derived induction
variables.
• Rules: (S: x := y op z)
– op is *, <<, +, or –
– y is an induction variable
– z is invariant
– No other statement modifies x
– x is not y or z
– x is a register
Induction Variable
Strength Reduction
• Transformation:
Insert the following into the bottom of the pre-header:
  new_reg = expression of target statement S
if (opcode(S)) is not add/sub, insert to the bottom of the
pre-header:
  new_inc = inc(y, op, z)
else
  new_inc = inc(x)
(Function inc() calculates the amount of increment for its
1st parameter.)
Insert the following at each update of y:
  new_reg = new_reg + new_inc
Change S to: x = new_reg
Example: Induction Variable
Strength Reduction
Before:              After (pre-header: new_reg = r4 * r9,
                            new_inc = r9):
r5 = r4 - 3          r5 = r4 - 3
r4 = r4 + 1          r4 = r4 + 1
                     new_reg += new_inc
r7 = r4 * r9         r7 = new_reg
r6 = r4 << 2         r6 = r4 << 2
Induction Variable
Elimination
• Remove unnecessary basic induction variables from the loop
by substituting uses with another basic induction variable.
• Rules:
– Find two basic induction variables, x and y
– x and y in the same family
• Incremented at the same place
– Increments are equal
– Initial values are equal
– x is not live at exit of loop
– For each BB where x is defined, there is no use of x
between the first and the last definition of y
Example: Induction
Variable Elimination
Before:              After (r1 eliminated, uses replaced by r2):
r1 = 0               r2 = 0
r2 = 0
r1 = r1 - 1          r2 = r2 - 1
r2 = r2 - 1
r9 = r2 + r4         r9 = r2 + r4
r7 = r1 * r9         r7 = r2 * r9
r4 = *(r1)           r4 = *(r2)
*r2 = r7             *r2 = r7
Induction Variable
Elimination
• Variants:
1. Trivial: induction variables that are never used except to
   increment themselves
• Complexity of elimination:
r4 := r2 + 8         r4 := r1 + rx
r3 := r1 + 4         r3 := r1 + 4
.                    .
.                    .
r1 := r1 + 4         r1 := r1 + 4
r2 := r2 + 4
Loop
Unrolling
• Replicate the body of a loop (N-1) times, resulting in N total
copies.
– Enable overlap of operations from different iterations
– Increase potential of instruction level parallelism (ILP)
• Variants:
– Unroll multiple of known trip counts
– Unroll with remainder loop
– While loop unroll
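The unroll-with-remainder variant can be illustrated on a simple array sum; the hand-unrolled source below stands in for what a compiler would generate:

```python
# Loop unrolled by a factor of 4 plus a remainder loop for the
# leftover iterations when the trip count is not a multiple of 4.
def sum_unrolled(a):
    s, i, n = 0, 0, len(a)
    while i + 4 <= n:            # main loop: four copies of the body
        s += a[i]
        s += a[i + 1]
        s += a[i + 2]
        s += a[i + 3]
        i += 4
    while i < n:                 # remainder loop: at most 3 iterations
        s += a[i]
        i += 1
    return s

print(sum_unrolled([1, 2, 3, 4, 5, 6, 7]))   # -> 28, same as sum()
```

The four straight-line additions in the main loop expose more instruction-level parallelism than a single-statement loop body.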
Optimization
Constant
Folding
• Evaluate constant expressions at compile time
• Only possible when side-effect freeness is guaranteed
c := 1 + 3        c := 4
Global Data
Flow Analysis
• Collect information about the whole program.
• Distribute the information to each block in the flow graph.
Data Flow
Analysis
• IMPORTANT!
– Data flow analysis should never tell us that a
transformation is safe when in fact it is not.
– When doing data flow analysis we must be
• Conservative
– Do not consider information that may
not
preserve the behavior of the program
• Aggressive
– Try to collect information that is as exact as
possible, so we can get the greatest benefit from
our optimizations.
Global Iterative Data
Flow Analysis
• Global:
– Performed on the flow graph
– Goal = to collect information at the beginning and end
of each basic block
• Iterative:
– Construct data flow equations that describe how
information flows through each basic block and solve
them by iteratively converging on a solution.
Global Iterative Data
Flow Analysis
• Components of data flow equations
– Sets containing collected information
• in set: information coming into the BB from outside
(following flow of data)
• gen set: information generated/collected within the
BB
• kill set: information that, due to action within the BB,
will affect what has been collected outside the BB
• out set: information leaving the BB
– Functions (operations on these sets)
• Transfer functions describe how information changes
as it flows through a basic block
• Meet functions describe how information from
multiple paths is combined.
Global Iterative Data
Flow Analysis
• Algorithm sketch
– Typically, a bit vector is used to store the information.
• For example, in reaching definitions, each bit
position corresponds to one definition.
– We use an iterative fixed-point algorithm.
– Depending on the nature of the problem we are solving,
we may need to traverse each basic block in a forward
(top-down) or backward direction.
• The order in which we "visit" each BB is not
important in terms of algorithm correctness, but is
important in terms of efficiency.
– In & Out sets should be initialized in a conservative and
aggressive way.
Typical
problems
• Reaching definitions
– For each use of a variable, find all definitions that reach
it.
• Upward exposed uses
– For each definition of a variable, find all uses that it
reaches.
• Live variables
– For a point p and a variable v, determine whether v is
live at p.
• Available expressions
– Find all expressions whose value is available at some
point p.
Global Data
Flow Analysis
• A typical data flow equation:
S: statement
out[S] = gen[S] ∪ (in[S] – kill[S])
in[S]: information that goes into S
kill[S]: information killed by S
gen[S]: new information generated by S
out[S]: information that goes out of S
Global Data
Flow Analysis
• The notion of gen and kill depends on the desired
information.
• In some cases, in may be defined in terms of out - equation
is solved as analysis traverses in the backward direction.
• Data flow analysis follows control flow graph.
– Equations are set at the level of basic blocks, or even for
a statement
Points and
Paths
• Point within a basic block:
– A location between two consecutive statements.
– A location before the first statement of the basic block.
– A location after the last statement of the basic block.
• Path: A path from a point p1 to pn is a sequence of points
p1, p2, … pn such that for each i : 1 ≤ i < n,
– pi is a point immediately preceding a statement and pi+1 is
the point immediately following that statement in the
same block, or
– pi is the last point of some block and pi+1 is the first point in
the successor block.
Example: Paths
and Points
(Figure: a flow graph with blocks B1 to B6.)
B1: d1: i := m – 1;  d2: j := n;  d3: a := u1
B2: d4: i := i + 1
B3: d5: j := j - 1
B5: d6: a := u2
Path: p1, p2, p3, p4, p5, p6 … pn
Reaching
Definition
• A definition of a variable x is a statement that assigns or may
assign a value to x.
– Unambiguous definition: a statement that certainly
assigns a value to x
• An assignment to x
• Reading a value from an I/O device into x
– Ambiguous definition: a statement that may assign a
value to x
• A call to a procedure with x as a parameter (call by
reference)
• A call to a procedure which can access x (x being in the
scope of the procedure)
• x is an alias for some other variable (aliasing)
• An assignment through a pointer that could refer to x
Reaching
Definition
• A definition d reaches a point p
– if there is a path from the point immediately following d
to p, and
– d is not killed along the path (i.e. there is no
redefinition of the same variable along the path)
• A definition of a variable is killed between two points when
there is another definition of that variable along the path.
Example: Reaching
Definition
(Figure: the flow graph from the previous slide.)
B1: d1: i := m – 1;  d2: j := n;  d3: a := u1
B2: d4: i := i + 1
B3: d5: j := j - 1
B5: d6: a := u2
The definition of i (d1) reaches p1 (the point just before d4).
It is killed by d4, so it does not reach p2 (the point just after d4).
The definition of i (d1) does not reach B3, B4, B5 and B6.
Reaching
Definition
• Conservative view: a definition is assumed to reach a point
even if it might not actually do so.
– Only an unambiguous definition kills an earlier definition
– All edges of the flow graph are assumed to be traversed.
if (a == b) then a = 2
else if (a == b) then a = 4
The definition “a = 4” is not reachable, but the analysis
conservatively assumes it may reach later points.
Data Flow analysis of a
Structured Program
• Region: A graph G’ = (N’, E’) which is a portion of the control
flow graph G.
– The set of nodes N’ is in G’ such that
• N’ includes a header h
• h dominates all nodes in N’
– The set of edges E’ is in G’ such that
• E’ contains all edges a → b such that a, b are in N’
Data Flow analysis of a
Structured Program:
Composition of
Regions
S → S1 ; S2
(Figure: the region for sequential composition: S1 followed by S2.)
Data Flow analysis of a
Structured Program:
Composition of
Regions
S → if E then S1 else S2
(Figure: the region branches on E to S1 or S2.)
Data Flow analysis of a
Structured Program:
Composition of
Regions
S → do S1 while E
(Figure: S1 followed by “if E goto S1”, a back edge to S1.)
Data Flow
Equations
• Each region (or NT) has four attributes:
– gen[S]: Set of definitions generated by the block S.
If a definition d is in gen[S], then d reaches the end
of block S.
– kill[S]: Set of definitions killed by block S.
If d is in kill[S], d never reaches the end of block S.
Every path from the beginning of S to the end of S
must have a definition for a (where a is defined
by d).
Data Flow
Equations
– in[S]: The set of definitions that are live at the entry
point of block S.
– out[S]: The set of definitions that are live at the exit
point of block S.
• The data flow equations are inductive or syntax
directed.
– gen and kill are synthesized attributes.
– in is an inherited attribute.
Data Flow
Equations
• gen[S] concerns a single basic block. It is the set of
definitions in S that reach the end of S.
• In contrast, out[S] is the set of definitions (possibly defined
in some other block) live at the end of S considering all
paths through S.
Data Flow Equations
Single statement, S: d: a := b + c
gen[S] = {d}
kill[S] = Da – {d}    (Da: the set of all definitions of a)
out[S] = gen[S] ∪ (in[S] – kill[S])
Data Flow Equations
Composition, S: S1 ; S2
gen[S] = gen[S2] ∪ (gen[S1] – kill[S2])
kill[S] = kill[S2] ∪ (kill[S1] – gen[S2])
in[S1] = in[S]
in[S2] = out[S1]
out[S] = out[S2]
Data Flow Equations
if-then-else, S: if E then S1 else S2
gen[S] = gen[S1] ∪ gen[S2]
kill[S] = kill[S1] ∩ kill[S2]
in[S1] = in[S]
in[S2] = in[S]
out[S] = out[S1] ∪ out[S2]
Data Flow
Equations
Loop, S: do S1 while E
gen[S] = gen[S1]
kill[S] = kill[S1]
Data Flow
Analysis
• The attributes are computed for each region. The equations
can be solved in two phases:
– gen and kill can be computed in a single pass over a basic
block.
– in and out are computed iteratively.
• The initial condition for in for the whole program is the
empty set
• in can be computed top-down
• Finally out is computed
Dealing
with Loops
• Due to the back edge, in[S] cannot be used as in[S1]
• in[S1] and out[S1] are interdependent.
• The equation is solved iteratively.
• The general equations for in and out:
in[S] = ∪ (out[Y] : Y is a predecessor of S)
out[S] = gen[S] ∪ (in[S] – kill[S])
Reaching
definitions
• What is safe?
– To assume that a definition reaches a point even if it
turns out not to.
– The computed set of definitions reaching a point p
will be a superset of the actual set of definitions
reaching p
– Goal : make the set of reaching definitions as small
as possible (i.e. as close to the actual set as possible)
Reaching
definitions
• How are the gen and kill sets defined?
– gen[B] = {definitions that appear in B and reach the
end of B}
– kill[B] = {all definitions that never reach the end of
B}
• What is the direction of the analysis?
– forward
– out[B] = gen[B] ∪ (in[B] – kill[B])
Reaching
definitions
• What is the confluence operator?
– union
– in[B] = ∪ out[P], over the predecessors P of B
• How do we initialize?
– start small
• Why? Because we want the resulting set to be as
small as possible
– for each block B initialize out[B] = gen[B]
Computation of gen
and kill Sets
for each basic block BB do
  gen(BB) = ∅; kill(BB) = ∅;
  for each statement (d: x := y op z) in sequential order in BB, do
    kill(BB) = kill(BB) U G[x];
    G[x] = {d};
  endfor
  gen(BB) = U G[x] : for all ids x
endfor
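The same computation in Python, with a block given as a list of (definition-id, defined-variable) pairs and `all_defs_of` as the program-wide table of definitions per variable (a slight rephrasing of the G[x] table above; all names are illustrative):

```python
# gen: the last definition of each variable in the block reaches its
# end. kill: each definition d of x kills every other definition of x.
def gen_kill(block, all_defs_of):
    kill, last = set(), {}
    for d, x in block:
        kill |= all_defs_of[x] - {d}
        last[x] = d                      # most recent definition of x
    gen = set(last.values())
    return gen, kill

all_defs_of = {'i': {'d1', 'd4'}, 'j': {'d2', 'd5'}}
print(gen_kill([('d4', 'i')], all_defs_of))   # -> ({'d4'}, {'d1'})
```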
Computation of in
and out Sets
for all basic blocks BB
  in(BB) = ∅
for all basic blocks BB
  out(BB) = gen(BB)
change = true
while (change) do
  change = false
  for each basic block BB, do
    old_out = out(BB)
    in(BB) = U (out(Y)) : for all predecessors Y of BB
    out(BB) = gen(BB) + (in(BB) – kill(BB))
    if (old_out != out(BB)) then change = true
  endfor
endwhile
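The same fixed-point loop in Python, over per-block gen/kill sets and a predecessor map (the encoding is illustrative):

```python
# Iterative reaching-definitions solver: out = gen U (in - kill),
# in = union of predecessors' out, repeated until nothing changes.
def reaching_defs(blocks, preds, gen, kill):
    IN = {b: set() for b in blocks}
    OUT = {b: set(gen[b]) for b in blocks}
    change = True
    while change:
        change = False
        for b in blocks:
            IN[b] = set()
            for p in preds[b]:
                IN[b] |= OUT[p]
            new = gen[b] | (IN[b] - kill[b])
            if new != OUT[b]:
                OUT[b] = new
                change = True
    return IN, OUT

blocks = ['B1', 'B2']
preds = {'B1': [], 'B2': ['B1', 'B2']}       # B2 has a self-loop
gen = {'B1': {'d1'}, 'B2': {'d4'}}           # d1, d4 both define i
kill = {'B1': {'d4'}, 'B2': {'d1'}}
IN, OUT = reaching_defs(blocks, preds, gen, kill)
print(IN['B2'], OUT['B2'])   # both defs reach B2's entry; only d4 leaves
```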
Live Variable
(Liveness) Analysis
• Liveness: For each point p in a program and each variable
y, determine whether y can be used before being
redefined, starting at p.
• Attributes
– use = set of variables used in the BB prior to their
definition
– def = set of variables defined in the BB prior to any use of
the variable
– in = set of variables that are live at the entry point of a
BB
– out = set of variables that are live at the exit point of a
BB
Live Variable
(Liveness) Analysis
• Data flow equations:
in[B] = use[B] ∪ (out[B] – def[B])
out[B] = ∪ in[S] : S ∈ succ(B)
– 1st equation: a variable is live coming into the block if either
• it is used before redefinition in B, or
• it is live coming out of B and is not redefined in B
– 2nd equation: a variable is live coming out of B iff it is live
coming into one of its successors.
Example:
Liveness
r1 = r2 + r3     r2, r3, r4, r5 are all live here, as they are
r6 = r4 – r5     consumed later; r6 is dead, as it is redefined later
r4 = 4           r4 is dead here, as it is redefined.
r6 = 8           So is r6. r2, r3, r5 are live
r6 = r2 + r3
r7 = r4 – r5
What does this mean? r6 = r4 – r5 is useless: it produces a
dead value!! Get rid of it!
Computation of use
and def Sets
for each basic block BB do
  def(BB) = ∅; use(BB) = ∅;
  for each statement (x := y op z) in sequential order, do
    for each operand y, do
      if (y not in def(BB))
        use(BB) = use(BB) U {y};
    endfor
    def(BB) = def(BB) U {x};
  endfor
endfor
def is the union of all the LHSs;
use is all the ids used before being defined
Computation of in
and out Sets
for all basic blocks BB
  in(BB) = ∅;
change = true;
while (change) do
  change = false
  for each basic block BB do
    old_in = in(BB);
    out(BB) = U {in(Y) : for all successors Y of BB}
    in(BB) = use(BB) U (out(BB) – def(BB))
    if (old_in != in(BB)) then change = true
  endfor
endwhile
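Put together, use/def plus the backward equations give a complete little liveness solver; the set encoding and successor map are illustrative:

```python
# Backward liveness: out = union of successors' in,
# in = use U (out - def), iterated to a fixed point.
def liveness(blocks, succs, use, defs):
    IN = {b: set() for b in blocks}
    OUT = {b: set() for b in blocks}
    change = True
    while change:
        change = False
        for b in blocks:
            OUT[b] = set()
            for s in succs[b]:
                OUT[b] |= IN[s]
            new = use[b] | (OUT[b] - defs[b])
            if new != IN[b]:
                IN[b] = new
                change = True
    return IN, OUT

blocks = ['B1', 'B2']
succs = {'B1': ['B2'], 'B2': []}
use = {'B1': {'r2', 'r3'}, 'B2': {'r1'}}     # B1: r1 = r2 + r3
defs = {'B1': {'r1'}, 'B2': {'r7'}}          # B2: r7 = r1 * 2
IN, OUT = liveness(blocks, succs, use, defs)
print(sorted(IN['B1']), sorted(OUT['B1']))   # r2, r3 live in; r1 live out
```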
DU/UD
Chains
• Convenient way to access/use reaching definition
information.
• Def-Use chains (DU chains)
– Given a def, what are all the possible consumers of the
definition produced
• Use-Def chains (UD chains)
– Given a use, what are all the possible producers of the
definition consumed
Example:
DU/UD Chains
1: r1 = MEM[r2+0]
2: r2 = r2 + 1
3: r3 = r1 * r4
(two parallel successor blocks:)
4: r1 = r1 + 5        7: r7 = r6
5: r3 = r5 – r1       8: r2 = 0
6: r7 = r3 * 2        9: r7 = r7 + 1
(joining into:)
10: r8 = r7 + 5
11: r1 = r3 – r8
12: r3 = r1 * 2
DU chain of r1: (1) -> 3,4   (4) -> 5   (12) ->
DU chain of r3: (3) -> 11   (5) -> 11
UD chain of r1: (12) -> 11
UD chain of r7: (10) -> 6,9
Some Things to
Think About
• Liveness and Reaching definitions are basically the same
thing!
– All dataflow is basically the same with a few parameters
• Meaning of gen/kill (use/def)
• Backward / Forward
• All paths / some paths (must/may)
– So far, we have looked at may analysis algorithms
– How do you adjust to do must algorithms?
• Dataflow can be slow
– How to implement it efficiently?
– How to represent the info?
Generalizing
Dataflow Analysis
• Transfer function
– How information is changed by BB
out[BB] = gen[BB] + (in[BB] – kill[BB]) forward
analysis
in[BB] = gen[BB] + (out[BB] – kill[BB]) backward
analysis
• Meet/Confluence function
– How information from multiple paths is combined
in[BB] = U out[P] : P is pred of BB forward analysis
out[BB] = U in[P] : P is succ of BB backward
analysis
Generalized Dataflow
Algorithm
change = true;
while (change)
  change = false;
  for each BB
    apply meet function
    apply transfer function
    if any set changed, change = true;
Example: Liveness by
Upward Exposed Uses
for each basic block BB, do
  gen[BB] = upward exposed uses in BB
  kill[BB] = definitions in BB
(Figure: an example contrasting reaching and available definitions:)
3: r4 = 4          defs 1,2 reach; 1,2 available
4: r6 = 8          defs 1,3,4 reach; 1,3,4 available
5: r6 = r2 + r3
6: r7 = r4 – r5    defs 1,2,3,4 reach; 1 available
Available Definition
Analysis (Adefs)
• A definition d is available at a point p if along all paths
from d to p, d is not killed
• Remember, a definition of a variable is killed between 2
points when there is another definition of that variable
along the path
– r1 = r2 + r3 kills previous definitions of r1
• Algorithm:
– Forward dataflow analysis as propagation occurs from
defs downwards
– Use the Intersect function as the meet operator to
guarantee the all-path requirement
– gen/kill/in/out similar to reaching defs
• Initialization of in/out is the tricky part
Compute Adef
gen/kill Sets
for each basic block BB do
  gen(BB) = ∅; kill(BB) = ∅;
  for each statement (d: x := y op z) in sequential order in BB, do
    kill(BB) = kill(BB) U G[x];
    G[x] = {d};
  endfor
  gen(BB) = U G[x] : for all ids x
endfor
Exactly the same computation as for reaching definitions!
Compute Adef
in/out Sets
U = universal set of all definitions in the program
in(0) = ∅; out(0) = gen(0)
for each basic block BB, (BB != 0), do
  in(BB) = ∅; out(BB) = U – kill(BB)
change = true
while (change) do
  change = false
  for each basic block BB, do
    old_out = out(BB)
    in(BB) = ∩ out(Y) : for all predecessors Y of BB
    out(BB) = gen(BB) + (in(BB) – kill(BB))
    if (old_out != out(BB)) then change = true
  endfor
endwhile
Available Expression
Analysis (Aexprs)
• An expression is the RHS of an operation
– Ex: in “r2 = r3 + r4”, “r3 + r4” is an expression
• An expression e is available at a point p if along all paths
from e to p, e is not killed.
• An expression is killed between two points when one of its
source operands is redefined
– Ex: “r1 = r2 + r3” kills all expressions involving r1
• Algorithm:
– Forward dataflow analysis
– Use the Intersect function as the meet operator
to guarantee the all-path requirement
– Looks exactly like Adefs, except gen/kill/in/out are the
RHSs of operations rather than the LHSs
Available
Expression
• Input: A flow graph with e_kill[B] and e_gen[B]
• Output: in[B] and out[B]
• Method:
in[B1] := ∅; out[B1] := e_gen[B1];
for each basic block B != B1
  out[B] = U – e_kill[B];
change = true
while (change)
  change = false;
  for each basic block B != B1
    in[B] = ∩ out[P] : for all predecessors P of B
    old = out[B]
    out[B] = e_gen[B] ∪ (in[B] – e_kill[B])
    if (out[B] != old) change = true;
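A sketch of this method in Python; note the intersection meet, which is what makes it an all-paths ("must") analysis, in contrast to the union used for reaching definitions. The encoding is illustrative:

```python
# Available expressions: forward analysis with intersection as meet.
# out[B] starts "full" (U - e_kill) so the intersections do not
# collapse to empty before the fixed point is reached.
def available_exprs(blocks, preds, e_gen, e_kill, universe, entry):
    OUT = {b: universe - e_kill[b] for b in blocks}
    OUT[entry] = set(e_gen[entry])
    IN = {b: set() for b in blocks}
    change = True
    while change:
        change = False
        for b in blocks:
            if b == entry:
                continue
            IN[b] = set.intersection(*(OUT[p] for p in preds[b]))
            new = e_gen[b] | (IN[b] - e_kill[b])
            if new != OUT[b]:
                OUT[b] = new
                change = True
    return IN, OUT

universe = {'b+c', 'd+e'}
blocks = ['B1', 'B2', 'B3', 'B4']
preds = {'B1': [], 'B2': ['B1'], 'B3': ['B1'], 'B4': ['B2', 'B3']}
e_gen = {'B1': {'b+c'}, 'B2': {'d+e'}, 'B3': set(), 'B4': set()}
e_kill = {'B1': set(), 'B2': set(), 'B3': {'d+e'}, 'B4': set()}
IN, OUT = available_exprs(blocks, preds, e_gen, e_kill, universe, 'B1')
print(IN['B4'])   # d+e is killed on the B3 path, so only b+c is available
```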
Efficient Calculation
of Dataflow
• Order in which the basic blocks are visited is important
(faster convergence)
• Forward analysis – DFS order
– Visit a node only when all its predecessors have been
visited
• Backward analysis – Post DFS order
– Visit a node only when all of its successors have been
visited
Representing Dataflow
Information
• Requirements – Efficiency!
– Large amount of information to store
– Fast access/manipulation
• Bitvectors
– General strategy used by most compilers
– Bit positions represent defs (rdefs)
– Efficient set operations: union/intersect/isone
– Used for gen, kill, in, out for each BB
Optimization using
Dataflow
• Classes of optimization
1. Classical (machine independent)
• Reducing operation count (redundancy
elimination)
• Simplifying operations
2. Machine specific
• Peephole optimizations
• Take advantage of specialized
hardware features
3. Instruction Level Parallelism (ILP)
enhancing
• Increasing parallelism
• Possibly increase instructions
Types of Classical
Optimizations
• Operation-level – One operation in isolation
– Constant folding, strength reduction
– Dead code elimination (global, but 1 op at a time)
• Local – Pairs of operations in same BB
– May or may not use dataflow analysis
• Global – Again pairs of operations
– Pairs of operations in different BBs
• Loop – Body of a loop
Constant
Folding
• Simplify an operation based on the values of its source operands
– Constant propagation creates opportunities for this
• All constant operands
– Evaluate the op, replace with a move
• r1 = 3 * 4        r1 = 12
• r1 = 3 / 0 ??? Don’t evaluate excepting ops!
  What about FP?
– Evaluate conditional branches, replace with BRU or noop
• if (1 < 2) goto BB2        convert to goto BB2
• if (1 > 2) goto BB2        convert to a noop (dead code)
• Algebraic identities
– r1 = r2 + 0, r2 – 0, r2 | 0, r2 ^ 0, r2 << 0, r2 >> 0        r1 = r2
– r1 = 0 * r2, 0 / r2, 0 & r2        r1 = 0
– r1 = r2 * 1, r2 / 1        r1 = r2
Strength
Reduction
• Replace expensive ops with cheaper ones
– Constant propagation creates opportunities for this
• Power-of-2 constants
– Mult by power of 2:  r1 = r2 * 8        r1 = r2 << 3
– Div by power of 2:   r1 = r2 / 4        r1 = r2 >> 2
– Rem by power of 2:   r1 = r2 % 16       r1 = r2 & 15
• More exotic
– Replace multiply by a constant with a sequence of shifts and
adds/subs
• r1 = r2 * 6
– r100 = r2 << 2; r101 = r2 << 1; r1 = r100 + r101
• r1 = r2 * 7
– r100 = r2 << 3; r1 = r100 – r2
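The rewrites above are easy to sanity-check by brute force. Note that the shift/mask forms of division and remainder are only exact for non-negative integers; negative operands need extra care:

```python
# Strength-reduced forms of the operations listed above.
def mul6(x):
    return (x << 2) + (x << 1)   # x*6 = x*4 + x*2

def mul7(x):
    return (x << 3) - x          # x*7 = x*8 - x

def div4(x):
    return x >> 2                # x / 4, non-negative x only

def rem16(x):
    return x & 15                # x % 16, non-negative x only

for v in range(200):
    assert mul6(v) == v * 6
    assert mul7(v) == v * 7
    assert div4(v) == v // 4
    assert rem16(v) == v % 16
print("all strength-reduced forms agree")
```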
Dead Code
Elimination
• Remove statement d: x := y op z whose result is never
consumed.
• Rules:
– The DU chain for d is empty
– dest(d) is not live out of the block
Constant
Propagation
• Forward propagation of moves/assignments of the form
d: rx := L, where L is a literal
Forward Copy
Propagation
• Forward propagation of the RHS of assignments or movs.
r1 := r2             r1 := r2
.                    .
.                    .
.                    .
r4 := r1 + 1         r4 := r2 + 1
– Reduces chains of dependency
– Possibly creates dead code
Forward Copy
Propagation
• Rules:
Statement dS is the source of the copy propagation
Statement dT is the target of the copy propagation
– dS is a mov statement
– src(dS) is a register
– dT uses dest(dS)
– dS is an available definition at dT
– src(dS) is an available expression at dT
Backward Copy
Propagation
• Backward propagation of the LHS of an assignment.
dT: r1 := r2 + r3        r4 := r2 + r3
    r5 := r1 + r6        r5 := r4 + r6
dS: r4 := r1             (dead code)
• Rules:
– dT and dS are in the same basic block
– dest(dT) is a register
– dest(dT) is not live in out[B]
– dest(dS) is a register
– dS uses dest(dT)
– dest(dS) is not used between dT and dS
– dest(dS) is not defined between dT and dS
– There is no use of dest(dT) after the first definition of
dest(dS)
Local Common Sub-
Expression Elimination
• Benefits:
– Reduced computation
– Generates mov statements, which can get copy propagated
• Rules:
– dS and dT have the same expression
– src(dS) == src(dT) for all sources
– For all sources x, x is not redefined between dS and dT
Example:
dS: r1 := r2 + r3        dS: r1 := r2 + r3
                             r100 := r1
dT: r4 := r2 + r3        dT: r4 := r100
Global Common Sub-
Expression Elimination
• Rules:
– dS and dT have the same expression
– src(dS) == src(dT) for all sources of dS and dT
– The expression of dS is available at dT
Unreachable Code
Elimination