0% found this document useful (0 votes)

21 views45 pages

Formal Languages & Automata Theory

The document discusses converting NFAs to equivalent DFAs and describes operations on regular languages such as concatenation, union, and Kleene star. It provides examples of constructing regular expressions by combining simpler languages.

Uploaded by

Cyrus Li

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views45 pages

Formal Languages & Automata Theory

Uploaded by

Cyrus Li

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 45

NFA to DFA conversion and regular expressions

CSCI 3130 Formal Languages and Automata Theory

Siu On CHAN
Fall 2018
Chinese University of Hong Kong

1/22
DFAs and NFAs are equally powerful

NFA can do everything a DFA can do

How about the other way?

Every NFA is equivalent to some DFA for the same language

2/22
NFA → DFA algorithm

Given an NFA, figure out

1. the initial active states

2. how the set of active states changes upon reading an input
symbol

3/22
NFA → DFA example

ε,1
NFA: q0 q1 ε q2
0

Initial active states (before reading any input)?

4/22
NFA → DFA example

ε,1
NFA: q0 q1 ε q2
0

Initial active states (before reading any input)?

partial
DFA: {q0 , q1 , q2 }

How does the set of active states change?

NFA → DFA example

ε,1
NFA: q0 q1 ε q2
0

Initial active states (before reading any input)?

partial
DFA: {q0 , q1 , q2 }

How does the set of active states change?

NFA → DFA example

ε,1
NFA: q0 q1 ε q2
0

Initial active states (before reading any input)?

partial 1
DFA: {q0 , q1 , q2 } {q1 , q2 }

How does the set of active states change?

NFA → DFA example

ε,1
NFA: q0 q1 ε q2
0

Initial active states (before reading any input)?

0 0,1

partial 1
1
DFA: {q0 , q1 , q2 } {q1 , q2 } ∅
0

How does the set of active states change?

4/22
NFA → DFA summary

0 0,1

1
DFA: 1
{q0 , q1 , q2 } {q1 , q2 } ∅
0

Every DFA state corresponds to a subset of NFA states

A DFA state is accepting if it contains an accepting NFA state

5/22
Regular expressions
Regular expressions

Powerful string matching feature in advanced editors (e.g. Vim,

Emacs) and modern programming languages (e.g. PERL, Python)

PERL regex examples:

colou?r matches “color”/“colour”
[A-Za-z]*ing matches any word ending in “ing”

We will learn to parse complicated regex recursively

by building up from simpler ones
Also construct the language matched by the expression recursively

Will focus on regular expressions in formal language theory

(notations differ from PERL/Python/POSIX regex)

6/22
String concatenation

st = abbbab
s = abb ts = bababb
t = bab ss = abbabb
sst = abbabbbab
s = x1 . . . xn , t = y1 . . . ym
⇓
st = x1 . . . xn y1 . . . ym

7/22
Operations on languages

• Concantenation of languages L1 and L2

L1 L2 = {st : s ∈ L1 , t ∈ L2 }

• n-th power of language L

Ln = {s1 s2 . . . sn | s1 , s2 , . . . , sn ∈ L}

• Union of L1 and L2

L1 ∪ L2 = {s | s ∈ L1 or s ∈ L2 }

8/22
Example

L1 = {0, 01} L2 = {ε, 1, 11, 111, . . . }

L1 L2 = {0, 01, 011, 0111, . . . } ∪ {01, 011, 0111, 01111, . . . }

= {0, 01, 011, 0111, . . . }
0 followed by any number of 1s

L12 = {00, 001, 010, 0101} L22 = L2

L2n = L2 for any n > 1

L1 ∪ L2 = {0, 01, ε, 1, 11, 111, . . . }

9/22
Operations on languages

The star of L are contains strings made up of zero or more chunks

from L

L∗ = L0 ∪ L1 ∪ L2 ∪ . . .

Example: L1 = {0, 01} and L2 = {ε, 1, 11, 111, . . . }

What is L1∗ ? L2∗ ?

10/22
Example

L1 = {0, 01}

L10 = {ε}
L11 = {0, 01}
L12 = {00, 001, 010, 0101}
L13 = {000, 0001, 0010, 00101, 0100, 01001, 01010, 010101}

Which of the following are in L1∗ ?

00100001 00110001 10010001

11/22
Example

L1 = {0, 01}

L10 = {ε}
L11 = {0, 01}
L12 = {00, 001, 010, 0101}
L13 = {000, 0001, 0010, 00101, 0100, 01001, 01010, 010101}

Which of the following are in L1∗ ?

00100001 00110001 10010001
Yes No No

11/22
Example

L1 = {0, 01}

L10 = {ε}
L11 = {0, 01}
L12 = {00, 001, 010, 0101}
L13 = {000, 0001, 0010, 00101, 0100, 01001, 01010, 010101}

Which of the following are in L1∗ ?

00100001 00110001 10010001
Yes No No

L1∗ contains all strings such that any 1 is preceded by a 0

11/22
Example

L2 = {ε, 1, 11, 111, . . . }

any number of 1s

L20 = {ε}
L21 = L2
L22 = L2
L2n = L2 (n > 1)

12/22
Example

L2 = {ε, 1, 11, 111, . . . }

any number of 1s

L2∗ = L20 ∪ L21 ∪ L22 ∪ . . .

L20 = {ε}
= {ε} ∪ L2 ∪ L2 ∪ . . .
L21 = L2
= L2
L22 = L2
L2n = L2 (n > 1)
L2∗ = L2

12/22
Combining languages

We can construct languages by starting with simple ones, like {0}

and {1}, and combining them

{0}({0} ∪ {1})∗ ⇒ 0(0 + 1)∗

all strings that start with 0

13/22
Combining languages

We can construct languages by starting with simple ones, like {0}

and {1}, and combining them

{0}({0} ∪ {1})∗ ⇒ 0(0 + 1)∗

all strings that start with 0

({0}{1}∗ ) ∪ ({1}{0}∗ ) ⇒ 01∗ + 10∗

0 followed by any number of 1s, or
1 followed by any number of 0s

13/22
Combining languages

We can construct languages by starting with simple ones, like {0}

and {1}, and combining them

{0}({0} ∪ {1})∗ ⇒ 0(0 + 1)∗

all strings that start with 0

({0}{1}∗ ) ∪ ({1}{0}∗ ) ⇒ 01∗ + 10∗

0 followed by any number of 1s, or
1 followed by any number of 0s

0(0 + 1)∗ and 01∗ + 10∗ are regular expressions

Blueprints for combining simpler languages into complex ones

13/22
Syntax of regular expressions

A regular expression over Σ is an expression formed by the following

rules

• The symbols ∅ and ε are regular expressions

• Every symbol a in Σ is a regular expression
• If R asd S are regular expressions, so are R + S, RS and R∗

Examples:
∅ ε
∗
0(0 + 1)∗ 1 (ε + 0)
01∗ + 10∗ (0 + 1)∗ 01(0 + 1)∗

A language is regular if it is represented by a regular expression

14/22
Understanding regular expressions

Σ = {0, 1}

01∗ = 0(1)∗ represents {0, 01, 011, 0111, . . . }

0 followed by any number of 1s

01∗ is not (01)∗

15/22
Understanding regular expressions

0 + 1 yields {0, 1} strings of length 1

(0 + 1)∗ yields {ε, 0, 1, 00, 01, 10, 11, . . . } any string

(0 + 1)∗ 010 any string that ends in 010

(0 + 1)∗ 01(0 + 1)∗ any string containing 01

16/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1))∗ + ((0 + 1)(0 + 1)(0 + 1))∗

17/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1))∗ + ((0 + 1)(0 + 1)(0 + 1))∗

((0 + 1)(0 + 1))∗ ((0 + 1)(0 + 1)(0 + 1))∗

17/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1))∗ + ((0 + 1)(0 + 1)(0 + 1))∗

((0 + 1)(0 + 1))∗ ((0 + 1)(0 + 1)(0 + 1))∗

(0 + 1)(0 + 1) (0 + 1)(0 + 1)(0 + 1)

17/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1))∗ + ((0 + 1)(0 + 1)(0 + 1))∗

((0 + 1)(0 + 1))∗ ((0 + 1)(0 + 1)(0 + 1))∗

(0 + 1)(0 + 1) (0 + 1)(0 + 1)(0 + 1)

strings of length 2 strings of length 3

17/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1))∗ + ((0 + 1)(0 + 1)(0 + 1))∗

((0 + 1)(0 + 1))∗ ((0 + 1)(0 + 1)(0 + 1))∗

strings of even length strings whose length is a
multiple of 3

(0 + 1)(0 + 1) (0 + 1)(0 + 1)(0 + 1)

strings of length 2 strings of length 3

17/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1))∗ + ((0 + 1)(0 + 1)(0 + 1))∗
strings whose length is even or a multiple of 3
= strings of length 0, 2, 3, 4, 6, 8, 9, 10, 12, . . .

((0 + 1)(0 + 1))∗ ((0 + 1)(0 + 1)(0 + 1))∗

strings of even length strings whose length is a
multiple of 3

(0 + 1)(0 + 1) (0 + 1)(0 + 1)(0 + 1)

strings of length 2 strings of length 3

17/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1))∗

18/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1))∗

(0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1)

18/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1))∗

(0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1)

(0 + 1)(0 + 1) (0 + 1)(0 + 1)(0 + 1)

18/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1))∗

(0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1)

(0 + 1)(0 + 1) (0 + 1)(0 + 1)(0 + 1)

strings of length 2 strings of length 3

18/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1))∗

(0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1)

strings of length 2 or 3

(0 + 1)(0 + 1) (0 + 1)(0 + 1)(0 + 1)

strings of length 2 strings of length 3

18/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1))∗
strings that can be broken into blocks, where each block has length 2
or 3

(0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1)

strings of length 2 or 3

(0 + 1)(0 + 1) (0 + 1)(0 + 1)(0 + 1)

strings of length 2 strings of length 3

18/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1))∗
strings that can be broken into blocks, where each block has length 2
or 3

Which are in the language?

ε 1 01 011 00110 011010110

19/22
Understanding regular expressions

What language does the following represent?

((0 + 1)(0 + 1) + (0 + 1)(0 + 1)(0 + 1))∗
strings that can be broken into blocks, where each block has length 2
or 3

Which are in the language?

ε 1 01 011 00110 011010110
3 7 3 3 3 3

The regular expression represents all strings except 0 and 1

19/22
Understanding regular expressions

What language does the following represent?

ends in at most two 0s
z }| {
(1 + 01 + 001)∗ (ε + 0 + 00)
| {z }
at most two 0s between two consecutive 1s

20/22
Understanding regular expressions

What language does the following represent?

ends in at most two 0s
z }| {
∗
(1 + 01 + 001) (ε + 0 + 00)
| {z }
at most two 0s between two consecutive 1s

20/22
Understanding regular expressions

What language does the following represent?

ends in at most two 0s
z }| {
∗
(1 + 01 + 001) (ε + 0 + 00)
| {z }
at most two 0s between two consecutive 1s

Never three consecutive 0s

The regular expression represents strings not containing 000

Examples:

ε 00 0110010110 0010010

20/22
Writing regular expressions

Write a regular expression for all strings with two consecutive 0s

21/22
Writing regular expressions

Write a regular expression for all strings with two consecutive 0s

(anything)00(anything)

(0 + 1)∗ 00(0 + 1)∗

21/22

Automata - Chap3+regularexpressionlanguages - 2
No ratings yet
Automata - Chap3+regularexpressionlanguages - 2
61 pages
Automata Theory: CS411-2012S-02 Formal Languages
No ratings yet
Automata Theory: CS411-2012S-02 Formal Languages
33 pages
FLAT Unit 1 August 2023
No ratings yet
FLAT Unit 1 August 2023
69 pages
21CS51 ATCD MODULE 2 - 1 Regular Expressions
No ratings yet
21CS51 ATCD MODULE 2 - 1 Regular Expressions
148 pages
CMP3008 LN4 RegularExpressions
No ratings yet
CMP3008 LN4 RegularExpressions
45 pages
Module 2flat
No ratings yet
Module 2flat
26 pages
Lec 02
No ratings yet
Lec 02
25 pages
Slide1 New Toc
No ratings yet
Slide1 New Toc
22 pages
Chapter 3 - Regular Expressions
No ratings yet
Chapter 3 - Regular Expressions
49 pages
CSC236: Finite Automata & Languages
No ratings yet
CSC236: Finite Automata & Languages
44 pages
Automata Lectuee3
No ratings yet
Automata Lectuee3
27 pages
Dfa 2
No ratings yet
Dfa 2
51 pages
Finite Automata and Regular Languages
No ratings yet
Finite Automata and Regular Languages
98 pages
CH 3 - Regular Languages Amd Regular Grammars
No ratings yet
CH 3 - Regular Languages Amd Regular Grammars
67 pages
Lecture 1
No ratings yet
Lecture 1
47 pages
Introduction to Alphabets and Regular Expressions
No ratings yet
Introduction to Alphabets and Regular Expressions
21 pages
Regular Languages & Finite Automata
No ratings yet
Regular Languages & Finite Automata
140 pages
Unit-2 Regular Expression and Languages
No ratings yet
Unit-2 Regular Expression and Languages
42 pages
TOA Lecture 03
No ratings yet
TOA Lecture 03
63 pages
Bcs503 Module 2
No ratings yet
Bcs503 Module 2
46 pages
Unit 3 - Regular Expression
No ratings yet
Unit 3 - Regular Expression
45 pages
Computability 05
No ratings yet
Computability 05
28 pages
CH 3 - Regular Languages Amd Regular Grammars
No ratings yet
CH 3 - Regular Languages Amd Regular Grammars
67 pages
Chapter 3 Regular Expressions Notes
100% (1)
Chapter 3 Regular Expressions Notes
36 pages
cs212 Lect02 63 Inter
No ratings yet
cs212 Lect02 63 Inter
39 pages
Presentation 7741 Content Document 20250625033851PM
No ratings yet
Presentation 7741 Content Document 20250625033851PM
68 pages
Lec 1 IntroToAutomataTheory
100% (1)
Lec 1 IntroToAutomataTheory
20 pages
Unit Ii
No ratings yet
Unit Ii
25 pages
Regular Expressions & Grammars in CS
No ratings yet
Regular Expressions & Grammars in CS
75 pages
Automata & Regular Expressions Guide
No ratings yet
Automata & Regular Expressions Guide
52 pages
Unit I
No ratings yet
Unit I
37 pages
Toc Unit 2
No ratings yet
Toc Unit 2
29 pages
Regular Expressions and Languages
No ratings yet
Regular Expressions and Languages
16 pages
Toc CHP-2
No ratings yet
Toc CHP-2
15 pages
Compilation Techniques
No ratings yet
Compilation Techniques
21 pages
Chapter Two
No ratings yet
Chapter Two
59 pages
TOC Unit-2
No ratings yet
TOC Unit-2
124 pages
Specification of Tokens
No ratings yet
Specification of Tokens
21 pages
AT&CD Unit 1
No ratings yet
AT&CD Unit 1
19 pages
CS 346: Compilers: Lexical Analyzer Lexical Analyzer
No ratings yet
CS 346: Compilers: Lexical Analyzer Lexical Analyzer
52 pages
Atcd Unit-2 PDF
No ratings yet
Atcd Unit-2 PDF
21 pages
Unit22pdf 2021 03 13 13 38 11
No ratings yet
Unit22pdf 2021 03 13 13 38 11
114 pages
Vision 2023 Toc Chapter 3 Regular Expression 59
No ratings yet
Vision 2023 Toc Chapter 3 Regular Expression 59
8 pages
Lec 03 - Finite Languages
No ratings yet
Lec 03 - Finite Languages
29 pages
Unit 2 - (Regular Language
No ratings yet
Unit 2 - (Regular Language
26 pages
Automata Lecture 03 RE
No ratings yet
Automata Lecture 03 RE
20 pages
Regular Languages and Regular Grammars
No ratings yet
Regular Languages and Regular Grammars
20 pages
Regular Expressions & Automata
No ratings yet
Regular Expressions & Automata
28 pages
Regular Expression: Operations On Regular Language
No ratings yet
Regular Expression: Operations On Regular Language
11 pages
Theory of Automata Lecture#3: by Riaz Ahmad Ziar R.ziar@kardan - Edu.af
No ratings yet
Theory of Automata Lecture#3: by Riaz Ahmad Ziar R.ziar@kardan - Edu.af
19 pages
CompilerD L3
No ratings yet
CompilerD L3
36 pages
Chapter 3
No ratings yet
Chapter 3
10 pages
Theory of Computation: Dr. Krishnendu Rarhi E: Krishnendu.e9621@cumail - in
No ratings yet
Theory of Computation: Dr. Krishnendu Rarhi E: Krishnendu.e9621@cumail - in
44 pages
Pcdunit2 Continuation
No ratings yet
Pcdunit2 Continuation
26 pages
Intro to Regular Expressions
No ratings yet
Intro to Regular Expressions
27 pages
Chapter 2 REGULAR EXPRESSION
No ratings yet
Chapter 2 REGULAR EXPRESSION
26 pages
Manual - Es4000 - Standard
No ratings yet
Manual - Es4000 - Standard
15 pages
Final Models
No ratings yet
Final Models
10 pages
CSEC JUNE IT 2023 P2 Solution
No ratings yet
CSEC JUNE IT 2023 P2 Solution
16 pages
02 ACR Integration
No ratings yet
02 ACR Integration
41 pages
F.Y. B.tech. (AIML) 2023-24 Final Syllabus
No ratings yet
F.Y. B.tech. (AIML) 2023-24 Final Syllabus
44 pages
Comprehensive SynchroPro 4D Hands-On Training Manual
No ratings yet
Comprehensive SynchroPro 4D Hands-On Training Manual
26 pages
Preprints202306 1026 v1
No ratings yet
Preprints202306 1026 v1
18 pages
67f804cdf3829e65bdb04297 Rujevagububo
No ratings yet
67f804cdf3829e65bdb04297 Rujevagububo
8 pages
Gui Design in C++.
No ratings yet
Gui Design in C++.
23 pages
SAP Sales-Service Cloud 2011 Release Preview Service Final
100% (1)
SAP Sales-Service Cloud 2011 Release Preview Service Final
79 pages
Byvision Version p80018110
No ratings yet
Byvision Version p80018110
63 pages
CBSE Class 11 English Core 2021-22-Merged
No ratings yet
CBSE Class 11 English Core 2021-22-Merged
8 pages
Sany New CRM User Manual-Engineer APP-en v1.0
No ratings yet
Sany New CRM User Manual-Engineer APP-en v1.0
58 pages
Cyient 2023
No ratings yet
Cyient 2023
426 pages
Pointers
No ratings yet
Pointers
12 pages
Office Administrator Job Description For Resume
100% (1)
Office Administrator Job Description For Resume
7 pages
AnalytixLabs - Visualization & Analytics With Excel-VBA, SQL & Tableau
No ratings yet
AnalytixLabs - Visualization & Analytics With Excel-VBA, SQL & Tableau
16 pages
Unit 1
No ratings yet
Unit 1
41 pages
Bme 431
No ratings yet
Bme 431
2 pages
ECE 863 AFSD - Week2 (Feb.10 15)
No ratings yet
ECE 863 AFSD - Week2 (Feb.10 15)
33 pages
PM80 Handheld Terminal Quick Guide
No ratings yet
PM80 Handheld Terminal Quick Guide
2 pages
Pan Os New Features
No ratings yet
Pan Os New Features
102 pages
Video Sum4
No ratings yet
Video Sum4
5 pages
DNSSec Tutorial 4 - Phil Regnauld and Hervey Allen PDF
No ratings yet
DNSSec Tutorial 4 - Phil Regnauld and Hervey Allen PDF
9 pages
Digital Channel Enrollment Form
No ratings yet
Digital Channel Enrollment Form
2 pages
Ix Maths Question Bank 1
No ratings yet
Ix Maths Question Bank 1
29 pages
CL10 Math (S) PA1 QP 23-24
No ratings yet
CL10 Math (S) PA1 QP 23-24
4 pages
IPython - Beyond Normal Python - Python Data Science Handbook
No ratings yet
IPython - Beyond Normal Python - Python Data Science Handbook
3 pages
Database Test 1
No ratings yet
Database Test 1
2 pages
Linear Algebra Course Guide
No ratings yet
Linear Algebra Course Guide
137 pages