0% found this document useful (0 votes)

21 views68 pages

Toc Unit 1

Uploaded by

Kashish Bhalla

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views68 pages

Toc Unit 1

Uploaded by

Kashish Bhalla

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 68

Theory of Computation

Presented By
Dr Saravanan S ME,Ph.D
School of Computing Science and Engineering
VIT Bhopal University, Bhopal
Theory of Automata

• Theory of automata is a theoretical branch of

computer science and mathematical. It is the
study of abstract machines and the
computation problems that can be solved using
these machines. The abstract machine is called
the automata. The main motivation behind
developing the automata theory was to develop
methods to describe and analyse the dynamic
behaviour of discrete systems.
• The State is represented by circles, and
the Transitions is represented by arrows.
Symbols:
• Symbols are an entity or individual objects,
which can be any letter, alphabet or any
picture.
Example:
1, a, b, #
Alphabets:
• Alphabets are a finite set of symbols. It is denoted by ∑.
• Examples:
• ∑ = {a, b}

• ∑ = {A, B, C, D}

• ∑ = {0, 1, 2}

• ∑ = {0, 1, ....., 5]

• ∑ = {#, β, Δ}
String:
• It is a finite collection of symbols from the alphabet. The string is
denoted by w.
Example 1:
• If ∑ = {a, b}, various string that can be generated from ∑ are {ab,
aa, aaa, bb, bbb, ba, aba.....}.
• A string with zero occurrences of symbols is known as an empty
string. It is represented by ε.
• The number of symbols in a string w is called the length of a string.
It is denoted by |w|.
• Example 2:
w = 010
Number of Sting |w| = 3
Language:
• A language is a collection of appropriate
string. A language which is formed over Σ can
be Finite or Infinite.
• Example: 1
L1 = {Set of string of length 2} = {aa, bb, ba,
bb}
L2 = {Set of all strings starts with 'a'} = {a, aa,
aaa, abb, abbb, ababb}
Finite Automata

• Finite automata are used to recognize patterns.

• It takes the string of symbol as input and changes
its state accordingly. When the desired symbol is
found, then the transition occurs.
• At the time of transition, the automata can either
move to the next state or stay in the same state.
• Finite automata have two states, Accept
state or Reject state. When the input string is
processed successfully, and the automata reached
its final state, then it will accept.
Formal Definition of FA

• A finite automaton is a collection of 5-tuple

(Q, ∑, δ, q0, F), where:
• Q: finite set of states
• ∑: finite set of the input symbol
• q0: initial state
• F: final state
• δ: Transition function
Finite Automata Model

• Finite automata can be represented by input tape

and finite control.
• Input tape: It is a linear tape having some
number of cells. Each input symbol is placed in
each cell.
• Finite control: The finite control decides the next
state on receiving particular input from input tape.
The tape reader reads the cells one by one from
left to right, and at a time only one input symbol
is read.
Transition Diagram
Transition Table

Present State Next state for Next State of Input 1

Input 0

→q0 q1 q2

q1 q0 q2

*q2 q2 q2
Types of Automata
Deterministic Finite Automata(DFA)
• DFA refers to deterministic finite automata.
Deterministic refers to the uniqueness of the
computation. The finite automata are called
deterministic finite automata if the machine is
read an input string one symbol at a time.
• In DFA, there is only one path for specific input
from the current state to the next state.
• DFA does not accept the null move, i.e., the DFA
cannot change state without any input character.
• DFA can contain multiple final states. It is used in
Lexical Analysis in Compiler
Formal Definition of DFA

A DFA is a collection of 5-tuples same as we

described in the definition of FA
• Q: finite set of states
• ∑: finite set of the input symbol
• q0: initial state
• F: final state
• δ: Transition function Q x ∑→Q
Example 1

• Q = {{q0, q1, q2} ,∑ = {0, 1},

δ ,q0 = {q0} ,F = {q2} }= ?

State 0 1

→q0 q0 q1

q1 q2 q1

*q2 q2 q2
Example 2

DFA with ∑ = {0, 1} accepts all starting with 0.

Example 3

DFA with ∑ = {0, 1} accepts all ending with 0.

Non-Deterministic Finite
Automata(NFA or NDFA)
• NFA stands for non-deterministic finite automata.
It is easy to construct an NFA than DFA for a
given regular language.
• The finite automata are called NFA when there
exist many paths for specific input from the
current state to the next state.
• Every NFA is not DFA, but each NFA can be
translated into DFA.
• NFA is defined in the same way as DFA but with
the following two exceptions, it contains multiple
next states, and it contains ε transition.
Example
Formal definition of NFA

A DFA is a collection of 5-tuples same as we

described in the definition of FA
• Q: finite set of states
• ∑: finite set of the input symbol
• q0: initial state
• F: final state
• δ: Transition function δ: Q x ∑ →2Q
Example

Present State Next state for Next State of

Input 0 Input 1

→q0 q0, q1 q1
q1 q2 q0
*q2 q2 q1, q2
Example 2
Present State 0 1

→q0 q0, q1 q0, q2

q1 q3 ε
q2 q2, q3 q3
*q3 q3 q3
Example 3
Problem 1
• Let M =({ q0,q1,q2,q3},{0,1},δ,q0,q0) and the
transition functions are
δ (q0, 0) =q2 δ (q2, 0) =q0
δ (q0, 1) =q1 δ (q2, 1) =q3
δ (q1, 0) =q3 δ (q3, 0) =q1
δ (q1, 1) =q0 δ (q3, 1) =q2
a) Represent M by its State table
b) Represents M by its State diagram
c) Which of the following strings are accepted by
1)0101 2) 110101 3) 01001
Problem 2

• String are{ 001, 1001,01101}

Conversion from NFA to DFA

Let, M = (Q, ∑, δ, q0, F) is an NFA which accepts

the language L(M). There should be equivalent
DFA denoted by M' = (Q', ∑', q0', δ', F') such that
L(M) = L(M').
Steps for converting NFA to DFA

• Step 1: Initially Q' = ϕ

• Step 2: Add q0 of NFA to Q'. Then find the
transitions from this start state.
• Step 3: In Q', find the possible set of states for
each input symbol. If this set of states is not in
Q', then add it to Q'.
• Step 4: In DFA, the final state will be all the
states which contain F(final states of NFA)
Example 1: Convert the given NFA to DFA.

State 0 1

→q0 {q0, q1} {q1}

*q1 ϕ {q0, q1}

Solution
State 0 1

→[q0] [q0, q1] [q1]

*[q1] ϕ [q0, q1]

*[q0, q1] [q0, q1] [q0, q1]

Conversion from NFA with ε to DFA

Steps for converting NFA with ε to DFA:

Step 1: We will take the ε-closure for the starting state of NFA
as a starting state of DFA.
Step 2: Find the states for each input symbol that can be
traversed from the present. That means the union of transition
value and their closures for each state of NFA present in the
current state of DFA.
Step 3: If we found a new state, take it as current state and
repeat step 2.
Step 4: Repeat Step 2 and Step 3 until there is no new state
present in the transition table of DFA.
Step 5: Mark the states of DFA as a final state which contains
the final state of NFA.
Example

Convert the given NFA into its equivalent DFA

Solution
• ε-closure(q0) = {q0, q1, q2}
• ε-closure(q1) = {q1, q2}
• ε-closure(q2) = {q2}
• δ'(A, 0) = A
• δ'(A, 1) = B
• δ'(A, 2) = C
• δ'(B, 0) = ϕ
• δ'(B, 1) = B
• δ'(B, 2) = C
Minimization of DFA

• Step 1: Remove all the states that are

unreachable from the initial state via any set
of the transition of DFA.
• Step 2: Draw the transition table for all pair of
states.
• Step 3: Now split the transition table into two
tables T1 and T2. T1 contains all final states,
and T2 contains non-final states.
• Step 4: Find similar rows from T1 such that:
COMPILER
DEFINITION
 Compiler is a program that reads a program written in
one language – source program – and translate it into
an equivalent program in another language – target
language
 Compiler reports to its user the presence of error in
source program

source program Target program

COMPILER

Error Message
Types of Compiler

There are 4 types

– Single – Pass
– Multi- Pass
– Load and Go
– Debugging
Analysis – Synthesis Model

There are 2 pars to compilation

1.Analysis Model
 breaks up the source program into
constituent pieces and creates an
intermediate representation of the source
program
 Tree Syntax tree
Analysis – Synthesis model
2.Synthesis Model
Constructs the desired target program from the
intermediate representation
Software Tools :
 Structural Editors
Pretty Printers
Static Checkers
Interpreters
Text formatters
Silicon compilers
Query interpreters
Context of compilers

Figure 1. A Language-Processing System

Analysis of the source program
Analysis consists of 3 phases
1.Linear analysis
 source program is read from left-to-right
 Grouped into tokens  sequences of
characters having a collective meaning.
2.Hierarchical analysis
 tokens are grouped hierarchically into
nested collections with collective meaning.

3.semantic analysis
 to ensure that the components of a
program fit together meaningfully.
Phases of a Compiler
Source Program

1
Lexical Analyzer

2
Syntax Analyzer

3
Semantic Analyzer

Symbol-table Manager Error Handler

4 Intermediate Code
Generator

5
Code Optimizer

6
Code Generator

Target Program
Lexical Analysis
• Also called scanning
Scans Input & Identify tokens
Removes White spaces and comments

An Example
Position := initial + rate * 60 ;

All are tokens

Token

• Three types of tokens:

• Terminal Symbols (TRM)-
• Keywords and Operators, Literals (LIT),
• Identifiers (IDN).
Example
• int a = 10;
Tokens
int (keyword),
a(identifier),
=(operator),
10(constant)
;(punctuation-semicolon)
Total token =5
Example 2
int main()
{
// printf() sends the string inside quotation to
// the standard output (the display)
printf("Welcome to GeeksforGeeks!"); return 0;
}
• Tokens 'int', 'main', '(', ')', '{', 'printf', '(', '
"Welcome to GeeksforGeeks!" ', ')', ';', 'return',
'0', ';', '}‘

Total number of tokens = 14

Lexeme

• It is a sequence of characters in the source

code that are matched by given predefined
language rules for every lexeme to be specified
as a valid token. int, goto,main
Example
main is lexeme of type identifier(token)
(,),{,} are lexemes of type punctuation(token)
Pattern

• It specifies a set of rules that a scanner follows

to create a token.
• It must start with the alphabet, followed by
the alphabet or a digit.
• The sequence of characters that make the
keyword.
Example
(, ), {, }
LEX

• Lex is a program that generates lexical

analyzer. It is used with YACC parser
generator.
• The lexical analyzer is a program that
transforms an input stream into a sequence of
tokens.
The function of Lex is as follows:
• Lex file format
{definitions } %% { rules } %% { user subroutines }
Syntax Analysis
 The Syntactic Analysis is also called Parsing.

 Tokens are grouped into grammatical phrases represented by a Parse Tree

 The hierarchical structure is expressed by recursive rules, called

Productions.

Parse Tree: An Example

Semantic Analysis
 The Semantic Analysis phase checks the program for
semantic errors (Type Checking) and gathers type information
for the successive phases.

 Type Checking: Check types of operands .No real number as

index for array; etc.
Intermediate Code Generation
 An intermediate code is generated as a program for an abstract machine.

 The intermediate code should be easy to translate into the target program.

 As intermediate code we consider the three-address code, similar to

assembly:

• sequence of instructions with at most three operands such that:

 There is at most one operator, in addition to the assignment.

• make explicit the operators precedence.

 Temporary names must be generated to compute intermediate

operations.
Code Optimization
 This phase attempts to improve the intermediate code so that
faster-running

• machine code can be obtained.

 Different compilers adopt different optimization techniques.

Code Generation
This phase generates the target code consisting of assembly code.
1. Memory locations are selected for each variable;
2. Instructions are translated into a sequence of assembly instructions;
3. Variables and intermediate results are assigned to memory registers.
The Structure of a Compiler
Code Generator
[Intermediate Code Generator]

Non-optimized Intermediate Code

Scanner
[Lexical Analyzer]

Tokens

Code Optimizer
Parser
[Syntax Analyzer]
Optimized Intermediate Code
Parse tree

Code Optimizer
Semantic Process
[Semantic analyzer] Target machine code

Abstract Syntax Tree w/ Attributes

Symbol Table Management
• A symbol table is a data structure containing a record for
each identifier, with fields for the attributes of the
identifier.

• The data structure allows us to find the record for each

identifier quickly and to store or retrieve data from that
record quickly.
Error Detection and Reporting
• Each phase encounter errors
 The lexical phase can detect errors where the characters
remaining in the input do not form any token of the language.
 Errors where the token stream violates the structure rules
(syntax)of the language are determined by the syntax analysis
phase.
 During semantic analysis the compiler tries to detect constructs
that have the right syntactic structure but no meaning to the
operation involved
 e.g., if we try to add two identifiers, one of which is the name
of an array, and the other the name of a procedure
Compiler-Construction Tools
Other names are
 Compiler – compiler
 Compiler – generators
 Translator – writing system

For compiler construction there are special tools like :

• 1- Scanner generators Eg.Lex and Flex
• 2- Parser generators Eg. Yacc and Bison
• 3- Syntax-directed translation engines
• 4- Code-generator generators
• 5- Data-flow analysis engines

Unit I
No ratings yet
Unit I
30 pages
Unit-1 TOC
No ratings yet
Unit-1 TOC
68 pages
TOC Syllabus
No ratings yet
TOC Syllabus
346 pages
Theory of Automata - Lecture - 1
No ratings yet
Theory of Automata - Lecture - 1
36 pages
UNIT-1 Complete Notes
No ratings yet
UNIT-1 Complete Notes
23 pages
TOC Notes
No ratings yet
TOC Notes
104 pages
Automata
No ratings yet
Automata
62 pages
Automata & Complexity Theory Cosc3025: Chapter One
100% (1)
Automata & Complexity Theory Cosc3025: Chapter One
27 pages
CH 1
No ratings yet
CH 1
27 pages
Basics of Language
No ratings yet
Basics of Language
13 pages
ACD Unit 1
No ratings yet
ACD Unit 1
20 pages
Flat 1 Unit
No ratings yet
Flat 1 Unit
30 pages
It 2502 CT & CD Notes-Unit 1
No ratings yet
It 2502 CT & CD Notes-Unit 1
42 pages
Chapter One Introduction To Automata
No ratings yet
Chapter One Introduction To Automata
28 pages
Automata An Complexity Theory - Ch01
No ratings yet
Automata An Complexity Theory - Ch01
28 pages
1.TOC Lecture 1
No ratings yet
1.TOC Lecture 1
29 pages
Finite Automata: Definitions & Types
No ratings yet
Finite Automata: Definitions & Types
51 pages
Unit1 Dfa & Nfa
No ratings yet
Unit1 Dfa & Nfa
74 pages
DR Varun Bansal
No ratings yet
DR Varun Bansal
54 pages
Dfa, Nfa
No ratings yet
Dfa, Nfa
25 pages
Automata: The Methods and The Madness
No ratings yet
Automata: The Methods and The Madness
84 pages
Notes All5units
No ratings yet
Notes All5units
363 pages
Chapter Two
No ratings yet
Chapter Two
28 pages
06 Nondeterminism TG DFA
No ratings yet
06 Nondeterminism TG DFA
82 pages
Finite Automata Basics
No ratings yet
Finite Automata Basics
146 pages
Finite Automata: Formal Definition of FA
No ratings yet
Finite Automata: Formal Definition of FA
39 pages
Flat Unit 1
No ratings yet
Flat Unit 1
36 pages
Module 1 - PART 2
No ratings yet
Module 1 - PART 2
63 pages
Notes All5units
No ratings yet
Notes All5units
345 pages
CC 7 Unit 4
No ratings yet
CC 7 Unit 4
15 pages
Module 1 Part 1
No ratings yet
Module 1 Part 1
64 pages
Automata Learning
No ratings yet
Automata Learning
445 pages
FLAT Unit 1 Notes (Typed)
No ratings yet
FLAT Unit 1 Notes (Typed)
18 pages
Unit - 1
No ratings yet
Unit - 1
36 pages
Finite Automata
No ratings yet
Finite Automata
45 pages
Lecture 1
No ratings yet
Lecture 1
17 pages
Automata Chapter 1 2 &3
No ratings yet
Automata Chapter 1 2 &3
36 pages
Automata Theory for CS Students
No ratings yet
Automata Theory for CS Students
44 pages
Finite Automata and Formal Language (19CS3501) : Department of Computer Science
No ratings yet
Finite Automata and Formal Language (19CS3501) : Department of Computer Science
37 pages
Intro to Finite Automata
No ratings yet
Intro to Finite Automata
54 pages
Automata Theory Essentials
No ratings yet
Automata Theory Essentials
19 pages
TOC Intro To Syllabus
No ratings yet
TOC Intro To Syllabus
47 pages
TOC Lectures
No ratings yet
TOC Lectures
54 pages
Automata (Unit - 1)
100% (1)
Automata (Unit - 1)
12 pages
Flat Notes 1
No ratings yet
Flat Notes 1
28 pages
APP Unit 5
No ratings yet
APP Unit 5
86 pages
Unit 4 FSA and Grammar
No ratings yet
Unit 4 FSA and Grammar
39 pages
Automata Theory & Complexity Basics
No ratings yet
Automata Theory & Complexity Basics
23 pages
Unit-2 Introduction To Finite Automata PDF
No ratings yet
Unit-2 Introduction To Finite Automata PDF
68 pages
Flat
No ratings yet
Flat
38 pages
Finite Automata
No ratings yet
Finite Automata
16 pages
CD Digital Notes
No ratings yet
CD Digital Notes
126 pages
Unit 1 2
No ratings yet
Unit 1 2
34 pages
ATCD
No ratings yet
ATCD
114 pages
Anitha Christopher Automata Theory Lecture Notes
No ratings yet
Anitha Christopher Automata Theory Lecture Notes
80 pages
Automata Stud
No ratings yet
Automata Stud
240 pages
Flat KJR
No ratings yet
Flat KJR
19 pages
3 Fa
No ratings yet
3 Fa
64 pages
Unit-II (Introduction To Finite Automata)
No ratings yet
Unit-II (Introduction To Finite Automata)
80 pages
Graph DFS BFS
No ratings yet
Graph DFS BFS
47 pages
Mod 1 A
No ratings yet
Mod 1 A
34 pages
Relational Algebra
No ratings yet
Relational Algebra
34 pages
Normalisation
No ratings yet
Normalisation
56 pages
LAB EXPT-5 - Linear Filtering
No ratings yet
LAB EXPT-5 - Linear Filtering
7 pages
CSE422 Assignment 1 Uninformed and Local Search
No ratings yet
CSE422 Assignment 1 Uninformed and Local Search
2 pages
ZQX Ddr3 BDW Ult r1b 0903 Acer Aspire r3-471t
No ratings yet
ZQX Ddr3 BDW Ult r1b 0903 Acer Aspire r3-471t
43 pages
Relays-Online
No ratings yet
Relays-Online
2 pages
Altaf Resume
No ratings yet
Altaf Resume
3 pages
Ravali Data Engineer GCP
No ratings yet
Ravali Data Engineer GCP
8 pages
COE C
No ratings yet
COE C
212 pages
Java Theory QuestionsC10 Class As The Basis of All Computation
No ratings yet
Java Theory QuestionsC10 Class As The Basis of All Computation
3 pages
R07 Set No. 2
No ratings yet
R07 Set No. 2
7 pages
Amiga Floppy Drive Compatibility List
No ratings yet
Amiga Floppy Drive Compatibility List
3 pages
Manuale Operatore ENG
No ratings yet
Manuale Operatore ENG
9 pages
VIPA 315-4NE12 Manual
No ratings yet
VIPA 315-4NE12 Manual
180 pages
Family Tree and Connect by
No ratings yet
Family Tree and Connect by
22 pages
OptiPlex 7080 vs 7070: Specs Guide
No ratings yet
OptiPlex 7080 vs 7070: Specs Guide
3 pages
Map Network Drives with Group Policy
No ratings yet
Map Network Drives with Group Policy
12 pages
SND Version
No ratings yet
SND Version
15 pages
Server-Side Request Forgery (SSRF)
No ratings yet
Server-Side Request Forgery (SSRF)
5 pages
Caves Server Log
No ratings yet
Caves Server Log
10 pages
Gas Management System
No ratings yet
Gas Management System
28 pages
Assignment01 Oops
No ratings yet
Assignment01 Oops
2 pages
Al 0796 Ict 1 Mock Hihs 2024
No ratings yet
Al 0796 Ict 1 Mock Hihs 2024
5 pages
AWS Infrastructure for Students
No ratings yet
AWS Infrastructure for Students
8 pages
Unit 1 3
No ratings yet
Unit 1 3
9 pages
5MF - Create Equipment - S4CLD2408 - Master - Data - EN - XX
No ratings yet
5MF - Create Equipment - S4CLD2408 - Master - Data - EN - XX
10 pages
Rtos Faq Vector
No ratings yet
Rtos Faq Vector
56 pages
Unit 2 Omputer Network Aktu
100% (1)
Unit 2 Omputer Network Aktu
30 pages
IT Leadership & Support Expertise
No ratings yet
IT Leadership & Support Expertise
2 pages
Discrete 4 Synergy Core User Manual
No ratings yet
Discrete 4 Synergy Core User Manual
17 pages
Unit 12
No ratings yet
Unit 12
22 pages

Toc Unit 1

Uploaded by

Toc Unit 1

Uploaded by

Theory of Computation

• Theory of automata is a theoretical branch of

• Finite automata are used to recognize patterns.

• A finite automaton is a collection of 5-tuple

• Finite automata can be represented by input tape

Present State Next state for Next State of Input 1

A DFA is a collection of 5-tuples same as we

• Q = {{q0, q1, q2} ,∑ = {0, 1},

DFA with ∑ = {0, 1} accepts all starting with 0.

DFA with ∑ = {0, 1} accepts all ending with 0.

A DFA is a collection of 5-tuples same as we

Present State Next state for Next State of

→q0 q0, q1 q0, q2

• String are{ 001, 1001,01101}

Let, M = (Q, ∑, δ, q0, F) is an NFA which accepts

• Step 1: Initially Q' = ϕ

→q0 {q0, q1} {q1}

*q1 ϕ {q0, q1}

→[q0] [q0, q1] [q1]

*[q1] ϕ [q0, q1]

*[q0, q1] [q0, q1] [q0, q1]

Steps for converting NFA with ε to DFA:

Convert the given NFA into its equivalent DFA

• Step 1: Remove all the states that are

source program Target program

There are 4 types

There are 2 pars to compilation

Figure 1. A Language-Processing System

Symbol-table Manager Error Handler

All are tokens

• Three types of tokens:

Total number of tokens = 14

• It is a sequence of characters in the source

• It specifies a set of rules that a scanner follows

• Lex is a program that generates lexical

 Tokens are grouped into grammatical phrases represented by a Parse Tree

 The hierarchical structure is expressed by recursive rules, called

Parse Tree: An Example

 Type Checking: Check types of operands .No real number as

 As intermediate code we consider the three-address code, similar to

• sequence of instructions with at most three operands such that:

 There is at most one operator, in addition to the assignment.

• make explicit the operators precedence.

 Temporary names must be generated to compute intermediate

• machine code can be obtained.

 Different compilers adopt different optimization techniques.

Non-optimized Intermediate Code

Abstract Syntax Tree w/ Attributes

• The data structure allows us to find the record for each

For compiler construction there are special tools like :

You might also like