Mod 4

The document discusses intermediate code generation in compilers, highlighting its role in translating source programs into machine-understandable code while maintaining machine independence. It covers various representations of intermediate code, such as high-level and low-level IR, syntax trees, and three-address code, along with their advantages for optimization and efficient code generation. Additionally, it explains the significance of directed acyclic graphs (DAGs) and the structure of three-address instructions, including quadruples and triples.

MODULE 4

Intermediate Code Generation

Intermediate code generation
• In the analysis-synthesis model of a compiler, the
front end of a compiler translates a source program
into an independent intermediate code, then the
back end of the compiler uses this intermediate
code to generate the target code (which can be
understood by the machine).
• Intermediate code is the bridge through which the
source program is translated into the machine program.
• Static checking includes type checking, which
ensures that operators are applied to
compatible operands.
• It also includes any syntactic checks that
remain after parsing.
• For example, static checking assures that a
break-statement in C is enclosed within a
while-, for-, or switch-statement; an error is
reported if such an enclosing statement does
not exist.
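The break-statement check described above can be sketched as a walk over a syntax tree. This is only an illustration: the tuple-based tree shape and the names BREAKABLE and check_breaks are assumptions made for this sketch, not part of any real compiler.

```python
# Hypothetical mini-AST: each statement is a tuple (kind, child, child, ...).
BREAKABLE = {"while", "for", "switch"}

def check_breaks(stmt, depth=0, errors=None):
    """Collect one error for every 'break' with no enclosing breakable statement."""
    if errors is None:
        errors = []
    kind, *children = stmt
    if kind == "break" and depth == 0:
        errors.append("break statement not within while, for, or switch")
    # Entering a while/for/switch makes a nested break legal.
    inner = depth + 1 if kind in BREAKABLE else depth
    for child in children:
        check_breaks(child, inner, errors)
    return errors

ok = ("block", ("while", ("block", ("break",))))
bad = ("block", ("break",), ("while", ("break",)))
print(check_breaks(ok))
print(check_breaks(bad))
```

Only the first break in the second tree is reported, because the other one sits inside a while.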
Significance of Intermediate code

• If a compiler translates the source language directly to its target machine
language, without the option of generating intermediate code, then a full
native compiler is required for each new machine.
• Intermediate code eliminates the need for a new full compiler for every unique
machine by keeping the analysis portion the same for all the compilers.
• The second part of the compiler, synthesis, is changed according to the target
machine.
• It becomes easier to improve code performance by applying code optimization
techniques to the intermediate code.
• With a suitably defined intermediate
representation, a compiler for language i and
machine j can then be built by combining the
front end for language i with the backend for
machine j.
• This approach to creating a suite of compilers can
save a considerable amount of effort: m × n
compilers can be built by writing just m front
ends and n back ends.
Intermediate Representation
Intermediate code can be represented in a variety
of ways, each with its own benefits.
• High Level IR - High-level intermediate code
representation is very close to the source
language itself. It can be easily generated from
the source code, and code modifications that
enhance performance are easy to apply. But for
target machine optimization, it is less preferred.
• Low Level IR - This one is close to the target
machine, which makes it suitable for register and
memory allocation, instruction set selection, etc.
It is good for machine-dependent optimizations.
• Intermediate code can be either language
specific (e.g., Byte Code for Java) or language
independent (three-address code).
• Syntax trees are high level; they depict the
natural hierarchical structure of the source
program and are well suited to tasks like static
type checking.
• Three-address code can range from high- to
low-level, depending on the choice of
operators.
Advantages of Intermediate Code Generation
• It is machine independent, so it can be used
across different platforms.
• It makes code optimization easier. A
machine-independent code optimizer can be
applied to the intermediate code to improve
the generated code.
• It enables efficient code generation.
• By reusing an existing front end, a new compiler
for a given back end can be built.
Representations of Intermediate Code
• The intermediate code can be represented in
the form of postfix notation, syntax tree,
directed acyclic graph, three address codes,
Quadruples, and triples.
Variants of syntax tree
• Nodes in a syntax tree represent constructs in
the source program; the children of a node
represent the meaningful components of a
construct.
• A directed acyclic graph (hereafter called a
DAG) for an expression identifies the common
sub-expressions (sub-expressions that occur
more than once) of the expression.
Directed Acyclic graphs for Expressions
• Like the syntax tree for an expression, a DAG has
leaves corresponding to atomic operands and
interior nodes corresponding to operators.
• The difference is that a node N in a DAG has more
than one parent if N represents a common sub-
expression; in a syntax tree, the tree for the
common sub-expression would be replicated as
many times as the sub-expression appears in the
original expression.
• Thus, a DAG not only represents expressions more
succinctly, it gives the compiler important clues
regarding the generation of efficient code to
evaluate the expressions.
• DAG for the expression
a + a * (b - c) + (b - c) * d
• The leaf for a has two parents, because a appears
twice in the expression.
• More interestingly, the two occurrences of the
common subexpression b - c are represented by
one node, the node labeled -.
• That node has two parents, representing its two
uses in the subexpressions a * (b - c) and (b - c) * d.
• Even though b and c appear twice in the
complete expression, their nodes each have one
parent, since both uses are in the common
subexpression b - c.
• The sequence of steps shown in the figure constructs
the DAG, provided Node and Leaf return an
existing node, if possible.
• We assume that entry-a points to the symbol-
table entry for a, and similarly for the other
identifiers.
• When the call to Leaf(id, entry-a) is repeated
at step 2, the node created by the previous call
is returned, so p2 = p1.
• The SDD of Fig. 6.4 can construct either syntax trees or
DAG’s.
• It was used to construct syntax trees where functions
Leaf and Node created a fresh node each time they were
called.
• It will construct a DAG if, before creating a new node,
these functions first check whether an identical node
already exists.
• If a previously created identical node exists, the existing
node is returned.
• For instance, before constructing a new node, Node(op,
left, right) we check whether there is already a node with
label op, and children left and right, in that order.
• If so, Node returns the existing node; otherwise, it
creates a new node.
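The difference between the two behaviours of Leaf and Node can be sketched as follows for the expression a + a * (b - c) + (b - c) * d. The Builder class, its share flag, and the build helper are hypothetical names introduced for this sketch; the step names p1 to p13 follow the usual textbook presentation.

```python
class Node:
    def __init__(self, label, *children):
        self.label = label
        self.children = children

class Builder:
    """share=False creates a fresh node per call (syntax tree);
    share=True returns an existing identical node if there is one (DAG)."""
    def __init__(self, share):
        self.share = share
        self.cache = {}      # (label, children...) -> node already created
        self.created = 0     # number of distinct nodes created

    def _get(self, label, *children):
        key = (label, *children)
        if self.share and key in self.cache:
            return self.cache[key]
        node = Node(label, *children)
        self.cache[key] = node
        self.created += 1
        return node

    def leaf(self, name):
        return self._get("id", name)

    def node(self, op, left, right):
        return self._get(op, left, right)

def build(b):
    # a + a * (b - c) + (b - c) * d
    p1 = b.leaf("a")
    p2 = b.leaf("a")
    p3 = b.leaf("b")
    p4 = b.leaf("c")
    p5 = b.node("-", p3, p4)
    p6 = b.node("*", p2, p5)
    p7 = b.node("+", p1, p6)
    p8 = b.leaf("b")
    p9 = b.leaf("c")
    p10 = b.node("-", p8, p9)
    p11 = b.leaf("d")
    p12 = b.node("*", p10, p11)
    p13 = b.node("+", p7, p12)
    return p1, p2, p5, p10, p13

tree = Builder(share=False)
build(tree)
dag = Builder(share=True)
p1, p2, p5, p10, root = build(dag)
print(tree.created, dag.created)
```

With sharing switched on, p2 is the same object as p1 and p10 is the same object as p5, so the DAG needs fewer nodes than the tree.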
The Value number method for
constructing DAG’s
• Often, the nodes of a syntax tree or DAG are stored
in an array of records, each row of the array
represents one record, and therefore one node.
• In each record, the first field is an operation code,
indicating the label of the node.
• The leaves have one additional field, which holds
the lexical value (either a symbol-table pointer or a
constant, in this case), interior nodes have two
additional fields indicating the left and right
children.
• In this array, we refer to nodes by giving the
integer index of the record for that node within
the array.
• This integer historically has been called the value
number for the node or for the expression
represented by the node.
• Let the signature of an interior node be the triple
<op, l, r>, where op is the label, l its left child's
value number, and r its right child's value number.
• A unary operator may be assumed to have r = 0.
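A minimal sketch of the value-number method, assuming a Python list for the array of records and a dictionary keyed by signature for the search. The class name ValueNumberTable and its lookup method are made up for this illustration. Value numbers start at 1 here so that 0 stays free to mean "no right child".

```python
class ValueNumberTable:
    def __init__(self):
        self.records = []        # records[k - 1] is the node with value number k
        self.by_signature = {}   # signature -> value number

    def lookup(self, op, *fields):
        """Return the value number for this signature, creating a record if needed."""
        sig = (op, *fields)
        if sig not in self.by_signature:
            self.records.append(sig)
            self.by_signature[sig] = len(self.records)
        return self.by_signature[sig]

# Nodes for the assignment i = i + 10
table = ValueNumberTable()
i = table.lookup("id", "i")            # leaf: symbol-table entry for i
ten = table.lookup("num", 10)          # leaf: constant 10
plus = table.lookup("+", i, ten)       # interior node <+, 1, 2>
assign = table.lookup("=", i, plus)    # interior node <=, 1, 3>
again = table.lookup("+", i, ten)      # same signature, so no new record

for number, record in enumerate(table.records, start=1):
    print(number, record)
```

A dictionary is one convenient way to find an existing signature; a hash table over the array, as compilers usually do, serves the same purpose.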
Three Address Code
• Three-address code is a linearized
representation of a syntax tree or a DAG in
which explicit names correspond to the
interior nodes of the graph.
• In three-address code, there is at most one
operator on the right side of an instruction;
that is, no built-up arithmetic expressions are
permitted.
• Thus a source-language expression like
x+y*z
might be translated into the sequence of three-
address instructions
t1 = y * z
t2 = x + t1
where t1 and t2 are compiler-generated
temporary names.
Example:
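A small sketch of how such a sequence could be produced from an expression tree by a post-order walk. The tuple encoding of the tree and the function name gen_tac are assumptions for this illustration.

```python
import itertools

def gen_tac(expr):
    """Translate a nested (op, left, right) tuple into three-address instructions."""
    code = []
    counter = itertools.count(1)

    def walk(e):
        if isinstance(e, str):          # a name is already an address
            return e
        op, left, right = e
        l = walk(left)                  # operands first (post-order)
        r = walk(right)
        temp = f"t{next(counter)}"      # fresh compiler-generated temporary
        code.append(f"{temp} = {l} {op} {r}")
        return temp

    walk(expr)
    return code

# x + y * z
for line in gen_tac(("+", "x", ("*", "y", "z"))):
    print(line)
```

Each interior node of the tree gets exactly one instruction and one temporary, which is why the result has at most one operator per right-hand side.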
Addresses and Instructions
• Three-address code is built from two
concepts: addresses and instructions.
• Three-address code can be implemented using
records with fields for the addresses; records
called quadruples and triples.
An address can be one of the following:
1. A name. For convenience, we allow source-
program names to appear as addresses in three-
address code. In an implementation, a source
name is replaced by a pointer to its symbol-table
entry, where all information about the name is
kept.
2. A constant. In practice, a compiler must deal with
many different types of constants and variables.
Type conversions within expressions are
considered.
3. A compiler-generated temporary. It is useful,
especially in optimizing compilers, to create a
distinct name each time a temporary is needed.
These temporaries can be combined, if possible,
when registers are allocated to variables.
Symbolic Labels
• Symbolic labels will be used by instructions
that alter the flow of control.
• A symbolic label represents the index of a
three-address instruction in the sequence of
instructions.
• Actual indexes can be substituted for the
labels, either by making a separate pass or by
back-patching.
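The "separate pass" option can be sketched as two loops: the first records the instruction index each label stands for, the second substitutes those indexes into the jumps. The textual instruction format (labels written as "L1:" on their own line) and the name resolve_labels are assumptions for this sketch.

```python
def resolve_labels(lines):
    """Replace symbolic labels in goto targets with instruction indexes."""
    index_of = {}
    instructions = []
    # Pass 1: a label names the index of the next real instruction.
    for line in lines:
        if line.endswith(":"):
            index_of[line[:-1]] = len(instructions)
        else:
            instructions.append(line)
    # Pass 2: substitute the recorded index for each label after 'goto'.
    resolved = []
    for ins in instructions:
        words = ins.split()
        if "goto" in words:
            target = words.index("goto") + 1
            words[target] = str(index_of[words[target]])
        resolved.append(" ".join(words))
    return resolved

program = [
    "L1:",
    "if i >= n goto L2",
    "i = i + 1",
    "goto L1",
    "L2:",
    "x = i",
]
for index, ins in enumerate(resolve_labels(program)):
    print(index, ins)
```

Back-patching reaches the same result in a single pass by remembering which jumps still lack a target and filling them in once the label is seen.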
Common three-address instruction
forms:
1. Assignment instructions of the form x = y op z, where op is a
binary arithmetic or logical operation, and x, y, and z are
addresses.
2. Assignments of the form x = op y, where op is a unary
operation. Essential unary operations include unary minus,
logical negation, and conversion operators that, for example,
convert an integer to a floating-point number.
3. Copy instructions of the form x = y, where x is assigned the
value of y.
4. An unconditional jump goto L. The three-address instruction
with label L is the next to be executed.
5. Conditional jumps of the form if x goto L and ifFalse x goto L.
These instructions execute the instruction with label L next if
x is true and false, respectively. Otherwise, the following
three-address instruction in sequence is executed next, as
usual.
6. Conditional jumps such as if x relop y goto L,
which apply a relational operator (<, ==, >=, etc.) to x and
y, and execute the instruction with label L next if x
stands in relation relop to y. If not, the three-
address instruction following if x relop y goto L is
executed next, in sequence.
7. Procedure calls and returns are implemented
using the following instructions: param x for
parameters; call p, n and y = call p, n for
procedure and function calls, respectively; and
return y, where y, representing a returned value,
is optional.
• Their typical use is as the sequence of three-
address instructions
param x1
param x2
...
param xn
call p, n
generated as part of a call of the procedure p(x1,
x2, ..., xn).
• The integer n, indicating the number of actual
parameters in "call p, n", is not redundant
because calls can be nested.
• That is, some of the first param statements could
be parameters of a call that comes after p returns
its value; that value becomes another parameter
of the later call.
8. Indexed copy instructions of the form x = y[i] and x[i]
= y. The instruction x = y[i] sets x to the value in the
location i memory units beyond location y. The
instruction x[i] = y sets the contents of the location i
units beyond x to the value of y.
9. Address and pointer assignments of the form x = &y,
x = *y, and *x = y. The instruction x = &y sets the r-
value of x to be the location (l-value) of y. Presumably
y is a name, perhaps a temporary, that denotes an
expression with an l-value such as A[i][j], and x is a
pointer name or temporary. In the instruction x = *y,
presumably y is a pointer or a temporary whose r-value
is a location. The r-value of x is made equal to the
contents of that location. Finally, *x = y sets the r-value
of the object pointed to by x to the r-value of y.
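To see why the count n in the procedure-call form (item 7) matters, here is a sketch that emits param and call instructions for a nested call f(a, g(b)). The tuple encoding of calls and the name gen_call are assumptions; the sketch emits each param as soon as its argument has been evaluated, which is one possible ordering among several.

```python
import itertools

def gen_call(call, code, temps):
    """Emit param/call instructions for (name, [args]); args may be nested calls."""
    name, args = call
    for arg in args:
        if isinstance(arg, tuple):              # nested call: evaluate it first
            arg = gen_call(arg, code, temps)
        code.append(f"param {arg}")
    temp = f"t{next(temps)}"
    code.append(f"{temp} = call {name}, {len(args)}")
    return temp

code = []
gen_call(("f", ["a", ("g", ["b"])]), code, itertools.count(1))
for line in code:
    print(line)
```

In the output, "param a" comes before the call of g but belongs to f. The 1 in "call g, 1" is what says that only the most recent param is g's argument.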
Three address code Representations
• The description of three-address instructions
specifies the components of each type of
instruction, but it does not specify the
representation of these instructions in a data
structure.
• In a compiler, these instructions can be
implemented as objects or as records with
fields for the operator and the operands.
• Three such representations are called
"quadruples," "triples," and "indirect triples."
Quadruples
• A quadruple (or quad) has four fields, which we
call op, arg 1, arg 2, and result.
• Some rules:
1. Instructions with unary operators like x = minus
y or x = y do not use arg 2. Note that for a copy
statement like x = y, op is =, while for most other
operations, the assignment operator is implied.
2. Operators like param use neither arg 2 nor
result.
3. Conditional and unconditional jumps put the
target label in result.
Three-address code for the assignment a = b*-c+b*-c;
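The figure for this assignment is not reproduced here. As a stand-in, the following sketch lists the quadruples that the usual translation of a = b * -c + b * -c would give, following the three rules above. The Quad record and the to_text helper are hypothetical names, and None marks an unused field.

```python
from collections import namedtuple

Quad = namedtuple("Quad", "op arg1 arg2 result")

quads = [
    Quad("minus", "c", None, "t1"),   # unary operator: arg2 unused
    Quad("*", "b", "t1", "t2"),
    Quad("minus", "c", None, "t3"),
    Quad("*", "b", "t3", "t4"),
    Quad("+", "t2", "t4", "t5"),
    Quad("=", "t5", None, "a"),       # copy statement: op is =
]

def to_text(q):
    """Render a quadruple as a three-address instruction."""
    if q.op == "=":
        return f"{q.result} = {q.arg1}"
    if q.arg2 is None:
        return f"{q.result} = {q.op} {q.arg1}"
    return f"{q.result} = {q.arg1} {q.op} {q.arg2}"

for q in quads:
    print(to_text(q))
```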
Triples
• A triple has only three fields, which we call op,
arg 1, and arg 2.
• Using triples, we refer to the result of an
operation x op y by its position, rather than by
an explicit temporary name.
• Thus, instead of the temporary t1 , a triple
representation would refer to position (0).
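A sketch of triples for the assignment a = b * -c + b * -c, with a tiny evaluator to show that positions stand in for temporaries. The encoding is an assumption made for this sketch: an integer argument means "result of the triple at that position" and a string means a name, so integer constants are deliberately left out.

```python
triples = [
    ("minus", "c", None),   # (0)
    ("*", "b", 0),          # (1) refers to the result of (0)
    ("minus", "c", None),   # (2)
    ("*", "b", 2),          # (3)
    ("+", 1, 3),            # (4)
    ("=", "a", 4),          # (5) a = result of (4)
]

def run(triples, env):
    """Evaluate the triples in order; values[k] is the result of triple (k)."""
    values = []

    def val(arg):
        return values[arg] if isinstance(arg, int) else env[arg]

    for op, arg1, arg2 in triples:
        if op == "minus":
            values.append(-val(arg1))
        elif op == "*":
            values.append(val(arg1) * val(arg2))
        elif op == "+":
            values.append(val(arg1) + val(arg2))
        elif op == "=":
            env[arg1] = val(arg2)       # arg1 names the target of the copy
            values.append(env[arg1])
    return env

print(run(triples, {"b": 3, "c": 2})["a"])
```

Because results are named by position, moving a triple would force every reference to it to be renumbered.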
Indirect Triples
Indirect triples consist of a listing of pointers to triples,
rather than a listing of triples themselves.
• For example, let us use an array instruction to
list pointers to triples in the desired order.
• With indirect triples, an optimizing compiler
can move an instruction by reordering the
instruction list, without affecting the triples
themselves.
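A sketch of that idea for a = b * -c + b * -c: the triples stay fixed, and only the list of pointers (here, indexes) is reordered. The names instruction, reordered, and run, and the convention that an integer argument refers to a triple by position, are assumptions for this illustration.

```python
import operator

BINARY = {"*": operator.mul, "+": operator.add}

triples = [
    ("minus", "c", None),   # (0)
    ("*", "b", 0),          # (1)
    ("minus", "c", None),   # (2)
    ("*", "b", 2),          # (3)
    ("+", 1, 3),            # (4)
    ("=", "a", 4),          # (5)
]

# The instruction list holds pointers to triples in execution order.
instruction = [0, 1, 2, 3, 4, 5]
# Moving the second product ahead of the first only changes this list.
reordered = [2, 3, 0, 1, 4, 5]

def run(order, env):
    """Execute triples in the given order; results are keyed by triple position."""
    values = {}

    def val(arg):
        return values[arg] if isinstance(arg, int) else env[arg]

    for index in order:
        op, arg1, arg2 = triples[index]
        if op == "minus":
            values[index] = -val(arg1)
        elif op == "=":
            env[arg1] = values[index] = val(arg2)
        else:
            values[index] = BINARY[op](val(arg1), val(arg2))
    return env["a"]

print(run(instruction, {"b": 3, "c": 2}), run(reordered, {"b": 3, "c": 2}))
```

Both orders give the same value for a, and no triple had to be renumbered to make the move.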
