Compiler Design Essentials

The document discusses compiler design and construction. A compiler converts source code into object code by translating high-level code into machine-level code. The compiling process includes lexical, syntax, and semantic analysis at the front-end, and code generation and optimization at the back-end. Compilers verify programs for errors, optimize code for faster execution, and allow programs to run on different machines than where they were compiled.

Uploaded by

mbilipaka

COMPILER DESIGN AND CONSTRUCTION

By Evans Ombati Maoncha

LECTURE 3 AND 4

Compiler
A compiler is software that converts source code into object code. In other words, it translates
a high-level language into machine (binary) language. This step is necessary to make the program
executable, because the computer understands only binary language.

Some compilers translate the high-level language into assembly language as an intermediate step,
whereas others translate it directly into machine code. This process of converting source code
into machine code is called compilation.

The compiling process includes basic translation mechanisms and error detection. It goes through
lexical, syntax, and semantic analysis at the front end, and code generation and optimization at
the back end.

Steps for Language processing systems

Before looking at compilers themselves, you first need to understand a few other tools that work
with them.

Figure 1: Steps for language processing systems

• Preprocessor: The preprocessor is considered part of the compiler. It is a tool that produces
input for the compiler. It deals with macro processing, augmentation, language extension, etc.

• Interpreter: Like a compiler, an interpreter processes a program written in a high-level
language. The main difference between the two is that an interpreter reads and executes the code
line by line, whereas a compiler reads the entire code at once and creates the machine code.

• Assembler: It translates assembly language code into machine-understandable language. The
output of the assembler is known as an object file, which is a combination of machine
instructions and the data required to store these instructions in memory.

• Linker: The linker links and merges various object files to create an executable file. These
files might have been produced by separate assemblers. The main task of a linker is to search for
called modules in a program and to find out the memory location where all modules are stored.

• Loader: The loader is a part of the OS. It loads executable files into memory and runs them. It
also calculates the size of a program (instructions and data) and creates the additional memory
space it needs.

• Cross-compiler: A cross compiler runs on one platform and generates executable code for a
different platform.

• Source-to-source Compiler: The term is used when the source code of one programming language is
translated into the source code of another language.

Why use a Compiler?

• The compiler checks the entire program, so syntax and semantic errors are reported before the
program runs.
• The executable file is optimized by the compiler, so it executes faster.
• Allows you to create internal structure in memory.
• There is no need to execute the program on the same machine on which it was built.
• Translates the entire program into another language.
• Generates files on disk.
• Links the files into an executable format.
• Checks for syntax errors and data types.
• Helps you to enhance your understanding of language semantics.
• Helps to handle language performance issues.
• Opportunity for a non-trivial programming project.
• The techniques used for constructing a compiler can be useful for other purposes as well.

Application of Compilers
• Compiler design helps in the full implementation of high-level programming languages.
• Supports optimization for computer architecture parallelism.
• Design of new memory hierarchies of machines.
• Widely used for translating programs.
• Used with other software productivity tools.

Analysis of a Source Program

We can analyze source code in three main steps, which are further divided into different phases.
The three steps are:

1. Linear Analysis

The characters of the code are read from left to right and grouped into sequences that have a
collective meaning. We call these groups tokens.

2. Hierarchical Analysis

The tokens are grouped hierarchically, in a nested manner, according to their collective meaning.

3. Semantic Analysis

In this step, we check whether the components of the source code are appropriate in meaning.

Phases/Structure of Compiler
The compilation process takes place in several phases. The output of each phase acts as the
input for the next phase. The phases of the compilation process are as follows:

1. Lexical Analyzer

• It takes the high-level language source code as the input.
• It scans the characters of the source code from left to right, hence the alternative name
‘scanner’.
• It groups the characters into lexemes. A lexeme is a group of characters that has some
meaning.
• Each lexeme corresponds to a token.
• It removes white space and comments.
• It detects and reports lexical errors.
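
The scanning steps above can be sketched in a few lines of Python. This is only an illustration, not a production scanner: the token categories (`NUMBER`, `ID`, `OP`, `PUNCT`) and the tiny C-like character set are assumptions made for the example.

```python
import re

# Hypothetical token categories for a tiny C-like language (an assumption).
TOKEN_SPEC = [
    ("COMMENT", r"/\*.*?\*/"),      # block comments are discarded
    ("NUMBER",  r"\d+"),
    ("ID",      r"[A-Za-z_]\w*"),
    ("OP",      r"[+\-*/=]"),
    ("PUNCT",   r"[();{}]"),
    ("SKIP",    r"\s+"),            # white space is discarded
    ("ERROR",   r"."),              # any other character is a lexical error
]
MASTER = re.compile(
    "|".join(f"(?P<{name}>{pattern})" for name, pattern in TOKEN_SPEC),
    re.DOTALL,
)

def tokenize(source):
    """Scan the source from left to right and return (tokens, errors)."""
    tokens, errors = [], []
    for match in MASTER.finditer(source):
        kind, lexeme = match.lastgroup, match.group()
        if kind in ("SKIP", "COMMENT"):
            continue                                  # not passed on to the parser
        if kind == "ERROR":
            errors.append(f"illegal character {lexeme!r} at offset {match.start()}")
            continue
        tokens.append((kind, lexeme))                 # each lexeme yields one token
    return tokens, errors
```

Calling `tokenize("x = 1 $")` returns the tokens for `x`, `=` and `1`, plus one error message for the illegal `$`.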

2. Syntax Analyzer

• ‘Parser’ is the other name for the syntax analyzer.
• The output of the lexical analyzer is its input.
• It checks for syntax errors in the source code.
• It does this by constructing a parse tree from the tokens.
• For the syntax to be correct, the parse tree must conform to the rules of the source
language grammar.
• The grammar of such languages is a context-free grammar.
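
As a rough illustration of how a parser builds a tree from tokens, here is a small recursive-descent sketch in Python. The expression grammar in the docstring is a hypothetical one chosen for the example, tokens are plain strings, and the tree is represented as nested tuples. Real parsers are usually generated from a full grammar.

```python
def parse(tokens):
    """Build a nested-tuple tree for the hypothetical context-free grammar
         expr   -> term (('+' | '-') term)*
         term   -> factor (('*' | '/') factor)*
         factor -> NUMBER | ID | '(' expr ')'
    and raise SyntaxError if the tokens do not fit it."""
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def advance():
        nonlocal pos
        token = peek()
        pos += 1
        return token

    def factor():
        token = advance()
        if token is None:
            raise SyntaxError("unexpected end of input")
        if token == "(":
            node = expr()
            if advance() != ")":
                raise SyntaxError("expected ')'")
            return node
        if token.isalnum():              # simplification: alphanumeric names or numbers
            return token
        raise SyntaxError(f"unexpected token {token!r}")

    def term():
        node = factor()
        while peek() in ("*", "/"):
            op = advance()
            node = (op, node, factor())
        return node

    def expr():
        node = term()
        while peek() in ("+", "-"):
            op = advance()
            node = (op, node, term())
        return node

    tree = expr()
    if peek() is not None:               # leftover tokens mean the input is not an expr
        raise SyntaxError(f"unexpected token {peek()!r}")
    return tree
```

For the tokens `a + b * 2` the result is `("+", "a", ("*", "b", "2"))`, so the multiplication is nested below the addition, as the grammar rules require.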

3. Semantic Analyzer

• It verifies the parse tree produced by the syntax analyzer.
• It checks the validity of the code in terms of the programming language, such as the
compatibility of data types and the declaration and initialization of variables.
• It produces a verified parse tree, which we also call an annotated parse tree.
• It also performs flow checking, type checking, etc.
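
A minimal sketch of type checking that produces an annotated tree is shown below. It assumes the tree is a nested tuple `(operator, left, right)` with string leaves, and that `symbols` is a hypothetical mapping from declared names to type names. Both are illustrative choices, not a fixed format.

```python
def annotate(node, symbols):
    """Return (annotated_tree, type). `symbols` maps declared names to types."""
    if isinstance(node, str):
        if node.isdigit():                       # integer literal
            return (node, "int"), "int"
        if node not in symbols:
            raise TypeError(f"undeclared variable {node!r}")
        return (node, symbols[node]), symbols[node]
    op, left, right = node
    left_tree, left_type = annotate(left, symbols)
    right_tree, right_type = annotate(right, symbols)
    if left_type != right_type:                  # strict rule: operand types must match
        raise TypeError(
            f"incompatible operand types {left_type} and {right_type} for {op!r}"
        )
    # Every node of the annotated tree carries its type as the last element.
    return (op, left_tree, right_tree, left_type), left_type
```

With `symbols = {"a": "int[10]", "b": "int"}`, checking the tree `("=", "a", "b")` raises a `TypeError`, because an array and an integer are not compatible.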

4. Intermediate Code Generator (ICG)

• It generates an intermediate code.
• This code is neither in the high-level language nor in machine language. It is in an
intermediate form.
• It is later converted to machine language, but only the last two phases are platform
dependent.
• The intermediate code is independent of the target platform. The machine code is then
generated according to the platform.
• An example of an intermediate code is three-address code.
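
To make three-address code concrete, the sketch below flattens an expression tree into instructions that each have at most one operator and three addresses. The nested-tuple tree format and the temporary names `t1`, `t2`, ... are assumptions for illustration.

```python
import itertools

def to_three_address(tree):
    """Flatten a nested-tuple expression tree into three-address instructions."""
    code = []
    counter = itertools.count(1)

    def emit(node):
        if isinstance(node, str):            # leaf: a name or constant needs no instruction
            return node
        op, left, right = node
        left_addr = emit(left)
        right_addr = emit(right)
        temp = f"t{next(counter)}"           # fresh temporary (hypothetical naming scheme)
        code.append(f"{temp} = {left_addr} {op} {right_addr}")
        return temp

    result = emit(tree)
    return code, result
```

For the tree of `a + b * c` this yields the two instructions `t1 = b * c` and `t2 = a + t1`, with the final value held in `t2`.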

5. Code Optimizer

• It optimizes the intermediate code.
• Its function is to transform the code so that it executes faster and uses fewer resources
(CPU, memory).
• It removes useless lines of code and rearranges the code.
• The meaning of the source code remains the same.
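
Two classic optimizations, constant folding and dead-code elimination, can be sketched on straight-line three-address code as follows. The instruction format `(dest, left, op, right)`, the `live_out` set, and the assumption that every destination is assigned only once are simplifications made for this example. Real optimizers handle far more cases.

```python
import operator

FOLD = {"+": operator.add, "-": operator.sub, "*": operator.mul}

def optimize(code, live_out):
    """Optimize straight-line three-address code.

    code     -- list of (dest, left, op, right); operands are names (str) or ints,
                and each dest is assumed to be assigned only once
    live_out -- set of names whose values are still needed after this block
    """
    # Pass 1: constant folding and propagation.
    known = {}                                   # dest -> value computed at compile time
    folded = []
    for dest, left, op, right in code:
        left, right = known.get(left, left), known.get(right, right)
        if (isinstance(left, int) and isinstance(right, int)
                and op in FOLD and dest not in live_out):
            known[dest] = FOLD[op](left, right)  # no run-time instruction needed
        else:
            folded.append((dest, left, op, right))

    # Pass 2: dead-code elimination, walking backwards from the live names.
    needed = set(live_out)
    kept = []
    for dest, left, op, right in reversed(folded):
        if dest in needed:
            kept.append((dest, left, op, right))
            needed.update(x for x in (left, right) if isinstance(x, str))
    kept.reverse()
    return kept
```

Given `t1 = 2 * 3`, `t2 = x + t1`, `t3 = y - 1`, `z = t2 * 1` with only `z` live afterwards, the sketch keeps `t2 = x + 6` and `z = t2 * 1`. The unused `t3` is dropped and `t1` is computed at compile time, while the value of `z` is unchanged.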

6. Target Code Generator

• Finally, it converts the optimized intermediate code into the machine code.
• This is the final stage of the compilation.
• The machine code which is produced is relocatable.
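
The final translation step can be sketched as a table-driven mapping from three-address instructions to assembly-like text. The one-register machine and its `LOAD`, `ADD`, `SUB`, `MUL`, `DIV` and `STORE` mnemonics are hypothetical, invented only for this example. Operands are kept as symbolic names, which loosely mirrors relocatable code whose final addresses are fixed later.

```python
# Hypothetical instruction set of a one-register machine (an assumption).
MNEMONIC = {"+": "ADD", "-": "SUB", "*": "MUL", "/": "DIV"}

def generate(code):
    """Translate (dest, left, op, right) instructions into assembly-like lines."""
    lines = []
    for dest, left, op, right in code:
        lines.append(f"LOAD  R0, {left}")              # bring the first operand into R0
        lines.append(f"{MNEMONIC[op]}   R0, {right}")  # combine it with the second operand
        lines.append(f"STORE {dest}, R0")              # write the result back
    return lines
```

A real code generator would also allocate registers and select instructions for the actual target. This sketch only shows the shape of the mapping.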

Phases of Compiler
All these phases of a compiler are divided into two sections:
a) Front End

The phases of lexical analysis, syntax analysis, semantic analysis, and intermediate code
generation come under this category.

b) Back End

The last two phases, code optimization and target code generation, come under the back end.

Types of Compilers
The different types of compiler are as follows:

• Single Pass Compilers
• Two Pass Compilers
• Multipass Compilers

1. Single Pass Compiler

In a single pass compiler, the source code is transformed directly into machine code. An example
is a compiler for the Pascal language.

2. Two Pass Compiler

A two pass compiler is divided into two sections:

1. Front end: It maps legal code into an Intermediate Representation (IR).

2. Back end: It maps the IR onto the target machine.

The two pass method also simplifies the retargeting process and allows multiple front ends.

3. Multipass Compilers

The multipass compiler processes the source code or syntax tree of a program several times. It
divides a large program into multiple small programs and processes them, producing multiple
intermediate codes. Each pass takes the output of the previous pass as its input, so it requires
less memory. It is also known as a ‘Wide Compiler’.

Other classifications of compilers are:

4. Cross Compilers

They produce executable machine code for a platform other than the one on which the compiler is
running.

5. Bootstrap Compilers

These compilers are written in the programming language that they compile.

6. Source to source/transcompiler

These compilers convert the source code of one programming language to the source code of
another programming language.

7. Decompiler

Strictly speaking, it is not a compiler but the reverse of one. It converts machine code into a
high-level language.

Features of a Compiler
The features are as follows:

• Compilation speed.
• The correctness of machine code.
• The meaning of code should not change.
• Speed of machine code.
• Good error detection.
• Checking the code correctly according to grammar.

Uses/Application of Compilers
• Helps to make the code independent of the platform.
• Makes the code free of syntax and semantic errors.
• Generates executable files of code.
• Translates the code from one language to another.

Error detection and Recovery in Compiler
During compilation, all possible errors made by the user are detected and reported to the user
in the form of error messages. This process of locating errors and reporting them to users is
called the error handling process.

Functions of an Error handler

• Detection
• Reporting
• Recovery

Classification of Errors

Compile-time errors are of three types:

Lexical phase errors

These errors are detected during the lexical analysis phase. Typical lexical errors are:

• Exceeding the permitted length of an identifier or numeric constant
• The appearance of illegal characters
• Unmatched string

Example 1 : printf("Hello World");$

This is a lexical error since an illegal character $ appears at the end of the statement.

Example 2 : This is a comment */

This is a lexical error since the end of the comment is present but its beginning is not.

Error recovery:

Panic Mode Recovery

In this method, successive characters from the input are removed one at a time until a
designated set of synchronizing tokens is found. Synchronizing tokens are delimiters such as ;
or }

• The advantage is that it is easy to implement and is guaranteed not to go into an infinite
loop.
• The disadvantage is that a considerable amount of input is skipped without checking it
for additional errors.
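
Panic mode recovery in a scanner can be sketched as follows. The token pattern is a deliberately small assumption (numbers, identifiers and a few operators and delimiters, with no string literals), and the function name `scan_with_panic_mode` is hypothetical.

```python
import re

# Optional white space followed by one token of a tiny hypothetical language.
TOKEN = re.compile(r"\s*(\d+|[A-Za-z_]\w*|[-+*/=(){};])")
SYNC = ";}"          # synchronizing delimiters named in the text

def scan_with_panic_mode(source):
    """Return (tokens, errors), skipping ahead to ';' or '}' after an illegal character."""
    tokens, errors = [], []
    pos = 0
    while pos < len(source):
        match = TOKEN.match(source, pos)
        if match:
            tokens.append(match.group(1))
            pos = match.end()
            continue
        if source[pos].isspace():                # white space before an illegal character
            pos += 1
            continue
        errors.append(f"illegal character {source[pos]!r} at offset {pos}")
        # Panic mode: remove characters one at a time until a synchronizing delimiter.
        while pos < len(source) and source[pos] not in SYNC:
            pos += 1
    return tokens, errors
```

Scanning `a = 1 $ 2; b = 3;` reports the `$` once and resumes at the next `;`. The `2` after the `$` is skipped unchecked, which is the disadvantage noted above. Every step moves `pos` forward, so the loop cannot run forever.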

Syntactic phase errors

These errors are detected during the syntax analysis phase. Typical syntax errors are:

• Errors in structure
• Missing operator
• Misspelled keywords
• Unbalanced parenthesis

Example: swicth(ch)
{
.......
.......
}

The keyword switch is incorrectly written as swicth. Hence, an “Unidentified
keyword/identifier” error occurs.

Error recovery:

1. Panic Mode Recovery
o In this method, successive tokens from the input are discarded one at a time until one of a
designated set of synchronizing tokens is found. Synchronizing tokens are delimiters such
as ; or }
o The advantage is that it is easy to implement and is guaranteed not to go into an infinite
loop.
o The disadvantage is that a considerable amount of input is skipped without checking it for
additional errors.
2. Statement Mode recovery
o In this method, when a parser encounters an error, it performs the necessary correction on
the remaining input so that the rest of the input statement allows the parser to parse
ahead.
o The correction can be the deletion of extra semicolons, replacing a comma with a semicolon,
or inserting a missing semicolon.
o While performing a correction, utmost care should be taken not to go into an infinite loop.
o A disadvantage is that it is difficult to handle situations where the actual error occurred
before the point of detection.
3. Error production
o If the common errors that can be encountered are known in advance, they can be incorporated
by augmenting the grammar with error productions that generate erroneous constructs.
o If this is used, appropriate error messages can be generated during parsing, and parsing
can continue.
o The disadvantage is that it is difficult to maintain.
4. Global Correction
o The parser examines the whole program and tries to find the closest error-free match for it.
o The closest match is the program that needs the fewest insertions, deletions, and changes
of tokens to recover from the erroneous input.
o Due to its high time and space complexity, this method is not implemented in practice.
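
Of the four methods, panic mode is the simplest to show in code. The sketch below checks statements of a hypothetical form `ID = VALUE ;` and, on an error, discards tokens up to and including the next `;` before carrying on. The statement form and the function name `parse_statements` are assumptions for illustration.

```python
def parse_statements(tokens):
    """Check statements of the hypothetical form  ID = VALUE ;  and recover
    from errors in panic mode. Returns (good_statements, errors)."""
    statements, errors = [], []
    pos = 0
    while pos < len(tokens):
        window = tokens[pos:pos + 4]
        if (len(window) == 4 and window[0].isidentifier() and window[1] == "="
                and window[2].isalnum() and window[3] == ";"):
            statements.append((window[0], window[2]))
            pos += 4
            continue
        errors.append(f"syntax error in statement starting at token {pos}")
        # Panic mode: discard tokens until the synchronizing token ';' is found,
        # then step past it so that parsing resumes with the next statement.
        while pos < len(tokens) and tokens[pos] != ";":
            pos += 1
        pos += 1
    return statements, errors
```

For the token stream of `a = 1; b = = 2; c = 3;` the sketch accepts the first and third statements and reports one error for the second, so a single mistake does not hide the rest of the program from the parser.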

Semantic errors

These errors are detected during the semantic analysis phase. Typical semantic errors are:

• Incompatible types of operands
• Undeclared variables
• Mismatch between actual arguments and formal parameters

Example : int a[10], b;
.......
.......
a = b;

It generates a semantic error because the types of a and b are incompatible.

Error recovery

• If the error “Undeclared Identifier” is encountered, a symbol table entry is made for the
corresponding identifier in order to recover from it.
• If the data types of two operands are incompatible, automatic type conversion is done
by the compiler.
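
Both recovery actions can be sketched briefly. The symbol table here is a plain dictionary, the placeholder type `"unknown"` and the widening order in `RANK` are hypothetical choices for the example, and a real compiler would apply its own language's conversion rules.

```python
def check_uses(declared, uses):
    """declared: dict of name -> type; uses: identifier names in order of use.
    Returns the (possibly extended) symbol table and the error messages."""
    symbols = dict(declared)
    errors = []
    for name in uses:
        if name not in symbols:
            errors.append(f"undeclared identifier {name!r}")
            symbols[name] = "unknown"        # recovery: enter it so it is reported only once
    return symbols, errors

RANK = {"int": 0, "float": 1}                # hypothetical widening order

def common_type(left, right):
    """Pick the wider numeric type, or fail if no automatic conversion applies."""
    if left in RANK and right in RANK:
        return left if RANK[left] >= RANK[right] else right
    raise TypeError(f"incompatible types {left} and {right}")
```

With these definitions, a second use of the same undeclared name produces no further message, `common_type("int", "float")` gives `"float"`, and `common_type("int[10]", "int")` fails, as in the `a = b` example.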

Difference Between Compiler and Interpreter
A compiler checks the whole program at once and displays all the errors together once the whole
program has been checked. An interpreter, on the other hand, checks the program line by line. If
an error is detected, execution stops.

Interpreter
An interpreter is a program that directly executes the instructions of a high-level language
program without first converting the whole program into machine code. In programming, we can
execute a program in two ways: through compilation or through an interpreter. The common way is
to use a compiler.
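
The two ways of executing a program can be contrasted on a small expression tree. In this sketch, `interpret` evaluates the tree directly, while `compile_to_stack` first translates the whole tree into instructions for a hypothetical stack machine that `run` then executes. The tree format, the `PUSH` and `APPLY` instructions and all function names are assumptions for illustration.

```python
import operator

OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul}

def value_of(leaf, env):
    """A leaf is either an integer literal or a variable looked up in env."""
    return int(leaf) if leaf.isdigit() else env[leaf]

def interpret(node, env):
    """Interpreter style: walk the tree and execute it directly."""
    if isinstance(node, str):
        return value_of(node, env)
    op, left, right = node
    return OPS[op](interpret(left, env), interpret(right, env))

def compile_to_stack(node):
    """Compiler style: translate the whole tree once, before any execution."""
    if isinstance(node, str):
        return [("PUSH", node)]
    op, left, right = node
    return compile_to_stack(left) + compile_to_stack(right) + [("APPLY", op)]

def run(program, env):
    """Execute the translated program on a simple stack machine."""
    stack = []
    for kind, arg in program:
        if kind == "PUSH":
            stack.append(value_of(arg, env))
        else:
            right = stack.pop()
            left = stack.pop()
            stack.append(OPS[arg](left, right))
    return stack.pop()
```

Both routes give the same result for the same tree and variables. The difference is that the translated program can be run again with new values without analyzing the tree a second time.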

Difference Between Compilers and Interpreters

| Sr. No. | Compilers | Interpreters |
|---|---|---|
| 1. | It converts the whole program into machine code at once. | It translates only one statement at a time. |
| 2. | It takes more time to analyze the source code. In other words, compile time is more. However, the overall execution time is less. | It takes comparatively less time to analyze the source code. In other words, compile time is less. However, the overall execution time is more. |
| 3. | It generates an intermediate object code. Therefore, more memory is used. | It does not generate any intermediate object code. Hence, it is memory efficient. |
| 4. | The whole program is compiled and then it shows all the errors together. Therefore, debugging is difficult. | It stops the translation if any error occurs. Hence, debugging is easier. |
| 5. | Programming languages like C, C++, Java, etc. use a compiler. | Programming languages like Python, Ruby, PHP, etc. use an interpreter. These interpreted languages are also called scripting languages. |

Summary
• A compiler is a computer program that transforms source code written in a high-level
language into low-level machine language.
• Correctness, speed of compilation, and preserving the meaning of the code are some
important features of compiler design.
• Compilers are divided into three types: 1) Single Pass Compilers, 2) Two Pass Compilers, and
3) Multipass Compilers.
• The word “compiler” was first used in the early 1950s by Grace Murray Hopper.
• Steps for a language processing system are: Preprocessor, Interpreter, Assembler, Linker/Loader.
• Important compiler construction tools are 1) Scanner generators, 2) Syntax-directed translation
engines, 3) Parser generators, 4) Automatic code generators.
• A main task of the compiler is to check the entire program for syntax and semantic errors.

Frequently Asked Questions (FAQs)
Q1. What is a compiler?

A1. It is software that converts the source code into machine code. The process is called
compilation.

Q2. What are the phases/structure of a compiler?

A2. The phases are:

• lexical analyzer
• syntax analyzer
• semantic analyzer
• intermediate code generator
• code optimizer
• target code generator

Q3. What is a symbol table?

A3. It is a data structure that the compiler uses to look up identifiers easily. It stores
identifiers together with their types.

Q4. What is the difference between compiler and interpreter?

A4. The compiler checks the code as a whole, whereas the interpreter checks it line by line.

Q5. What is a decompiler?

A5. It converts machine code to the source code. It is the reverse of the compiler.

Q6. What Is Linear Analysis?

A6. Linear analysis is the step in which the stream of characters making up the source program
is read from left to right and grouped into tokens, which are sequences of characters having a
collective meaning. It is also called lexical analysis or scanning.
