Theoretical Aspects of Lexical Analysis/Exercise 1

From Wiki**3

< Theoretical Aspects of Lexical Analysis


Problem

Use Thompson's algorithm to build the NFA for the following regular expression. Build the corresponding DFA and minimize it.

  • (a|b)*

Solution

The non-deterministic finite automaton (NFA), built by applying Thompson's algorithm to the regular expression (a|b)* is the following:

[Expand] NFA for (a|b)*
nfa 0 0 start->0 7 7 0->7 1 1 0->1 2 2 1->2 4 4 1->4 3 3 2->3 a 5 5 4->5 b 6 6 3->6 5->6 6->7 6->1

Applying the determination algorithm to the above NFA, the following determination table is obtained:

In α∈Σ move(In, α) ε-closure(move(In, α)) In+1 = ε-closure(move(In, α))
- - 0 0, 1, 2, 4, 7 0
0 a 3 1, 2, 3, 4, 6, 7 1
0 b 5 1, 2, 4, 5, 6, 7 2
1 a 3 1, 2, 3, 4, 6, 7 1
1 b 5 1, 2, 4, 5, 6, 7 2
2 a 3 1, 2, 3, 4, 6, 7 1
2 b 5 1, 2, 4, 5, 6, 7 2
Graphically, the DFA is represented as follows:
dfa 0 0 start->0 1 1 0->1 a 2 2 0->2 b 1->1 a 1->2 b 2->1 a 2->2 b

Given the minimization tree to the right, the final minimal DFA is:

dfamin 012 012 start->012 012->012 a 012->012 b
The minimization tree is as follows. As can be seen, the states are indistinguishable.
mintree {0, 1, 2} {0, 1, 2} {} {} {0, 1, 2}->{} NF {0, 1, 2} {0, 1, 2} {0, 1, 2}->{0, 1, 2} F {0, 1, 2} {0, 1, 2} {0, 1, 2}->{0, 1, 2} a,b