Toc 1

THEORY OF COMPUTATION (AUBER) EXAM CRIB SHEET
The regular grammar rules must be one of the

following four forms:
Regular Languages and FA
non-terminal terminal non-terminal |

terminal | non-terminal | .
A language is a set of strings over a finite alphabet

. All languages are finite or countably infinite.
A Deterministic Finite Automata DFA is

M = (Q, , , q0, F), where Q is a finite set of states;
is an alphabet; q0 Q is the initial state; F Q is
the set of final states; and is the transition function
from (Q ) to Q.
The set of all languages over * is uncountably

infinite.
L, L = L = (analogue to multiplication by 0)
L, L{}= {}L = L (multiplication by 1 analogue)
A Nondeterministic Finite Automata NFA is

M = (Q, , , q0, F), where, Q is a finite set of states,
is an alphabet, q0 Q is the initial state, F Q is
the set of final states, and is the transition relation,
which is a finite subset of (Q ( {})) Q.
L n = LLL L (n times)
L 0 = {}
L1 = L
L* = L0 L1 L2 L3 L, L*
A language is regular if and only if it can be

described by a regular expression, regular grammar,
NFA, or DFA or made up using regular language
closure properties. All finite languages are regular.
L+ = L1 L2 L3
* = 0 = {}
Regular languages are closed under: union,

concatenation, Kleene closure *, complementation,
intersection, and reversal.
String Reversal (wR) inductive definition:

if |w| = 0, w = wR = .
If |w| 1, then a , w = ua and wR = auR.
A palindrome is a string w, such that w = wR.
Language Reversal: LR = {wR: w L}.
Subset construction (State set method) for

converting an NFA to a DFA: States in the DFA
(M) will correspond to sets of states in the NFA
(M). =. Compute E(q) for all q in Q (the set of
states reachable on -transitions).
s = E(s). ' (Q, a) = {E(p) : p Q
and (q, a, p)
for some q Q}. Q 2Q that appear in .
F = {Q Q' : Q F }.
The "pure" regular expressions are finite strings

over the alphabet {"(", ")", , , *}:
and each particular member of is a regular
expression.
If , are regular expressions, then so are
, , *, and ().
Creating a regular expression from an NFA

(State Elimination): Convert transitions to simple
regular expressions. Add new start and final state
with appropriate -transitions. Remove all other
states, one state at a time reconstructing regular
expression paths through the state being removed.
For each pair of incoming and outgoing paths via the
Operator precedence (from highest to lowest):

( ), *, concatenation, , .
A regular grammar has a set of non-terminals,
terminals, rules, and a starting non-terminal.
{Terminals} = .
1
state being removed (from x to y, labeled and ,

respectively), add a transition from x to y with the
regular expression * (where is the regular
expression on the self-loop on the state being
removed, or ). Union together the regular
expressions for parallel transitions. When only the
start and final state remain with one transition, that
regular expression is the answer.
Algorithm for minimizing a DFA (State

Reduction by Set Partitioning): Remove any
unreachable states as a first step. Break apart the
equivalence classes of states at each iteration based
on their distinguishability against the equivalence
classes of the previous iteration.
Initially, 0 = {[F], [Q - F]}. For any two states p and
q Q and any n 1, q n p iff:
q n-1 p, AND for all a , (p, a) n-1 (q, a).
Iterate until n-1 equals n. The result, , is this final
equivalence relation. Join together states from the
equivalence classes to create the minimal state DFA.
Pumping Lemma for Regular Languages: Let L

be an infinite regular language. Then m 1, such
that for any string w L, where |w| m, x, y, z,
such that w = xyz and |xy| m, and |y| 1,
i
and i 0, xy z is in L.
The pumping lemma is used to prove languages are
not regular by contradiction.
Context Free Languages and PDA
Strategy: After you assume the language is regular,

you may use closure properties to make your proof
easier/possible by constructing a related language
that you can more easily prove is not regular.
Choose w that depends only on m. Show that for
each possible partitioning (there may be several
cases) that you can find a i that contradicts the
pumping theorem.
A context-free grammar (CFG) is a quadruple

G = (V, , S, R), where:
V is the rule alphabet, which contains
nonterminals (symbols that are used in the grammar
but that do not appear in strings in the language) and
terminals,
(the set of terminals) is a subset of V,
A decision procedure is an algorithm that answers a

question (usually "yes" or "no") and terminates.
There are decision procedures for deciding whether:
L(M) = , L(M) = *, L(M) is infinite, L(M) is
finite, w L(M), L(M1) = L(M2), L(M1) L(M2),
etc. for NFAs M, M1, M2.
S (the start symbol) is an element of V - .

R (the set of rules (productions)) is a finite subset
of (V - ) V*,
Any sequence of the form: w0 G w1 G w2 G . . .
G wn
e.g., (S) (SS) ((S)S)
Equivalence classes of a language: Based on the

equivalence relation x L y if, z *, xz L iff yz
L. If a z exists that violates the test, it is called a
distinguisher. L is an equivalence class that
partitions *.
is called a derivation in G. Each wi is called a

sentential form where in each step, one nonterminal
is replaced with the RHS (right hand side) of one of
its associated rules. The language generated by G is
{w * : S G* w}.
The Myhill-Nerode Theorem states that if L is

regular that there is a DFA that accepts L that has
precisely L states. Also, a language is regular iff L
is finite.
A language L is context free if L = L(G) for some

context -free grammar G. All regular languages are
context free.
any configuration. Determinism implies uniquely

defined machine behavior.
In a leftmost derivation, the leftmost nonterminal is

always the one that is replaced.
In a rightmost derivation, the rightmost
nonterminal is always the one that is replaced.
Nondeterministic PDAs accept the class of context

free languages.
A parse tree captures the essential structure of a

derivation, but does not contain complete
information about the time ordering of rule
applications. Several derivations can therefore be
associated with the same parse tree. Each parse tree
has a unique leftmost derivation and a unique
rightmost derivation. The root of the parse tree is as
a rule S.
There are algorithms to convert from a PDA to a

CFG and from a CFG to a PDA.
"Top-down parse" conversion: Given G = (V, ,
R, S), construct M such that L(M) = L(G).
M = ({p, q}, , V, , p, {q}), where contains:
((p, , ), (q, S)) push the start symbol on the stack
as the "goal"
The leaves of a parse tree are terminals or s. The

yield of a parse tree is the concatenation of the
leaves.
((q, , A), (q, x)) for each rule A x in R replace

left hand side with right hand side
A context free grammar is ambiguous if it has

multiple parse trees for some string in the language.
A context free language is inherently ambiguous if
there is no unambiguous grammar for the language.
((q, a, a), (q, )) for each a read an input

character and pop it from the stack
The resulting machine can execute a leftmost
derivation of an input string and it will usually be
nondeterministic.
A pushdown automata (PDA) is a sextuple

M = (Q, , , , s, F), , where:
Q is a finite set of states; is the input alphabet;
is the stack alphabet;
"Bottom-up parse" conversion: Given G = (V, ,

R, S), construct M such that L(M) = L(G).
s Q is the initial state; F Q is the set of final

states, and
M = ({p, q}, , V, , p, {q}), where contains:

((p, , xR), (p, A)) for each rule A x in R
reduce using the rule A x
is the transition relation. It is a finite subset of

(Q ( {}) *) (Q *).
((p, a, ), (p, a)) for each a shift input symbol

a onto the stack
The first grouping describes the machine computer

that allows the rule to follow (state, optional input
symbol, and symbols at the top of the stack). The
second grouping describes what to do (go to a new
state and replace the top of stack.)
((p, , S), (q, )) accept by popping the start

symbol off the stack
The resulting machine can execute a rightmost
derivation (backwards) of an input string and it will
usually be nondeterministic.
For the machine to accept a string, some

computation must leave the machine in a final state,
consume all the input. L(M) is the set of all such
strings. Other computations on that string which fail
are ignored.
CFLs are closed under union, concatenation,

Kleene closure, and intersection with a regular
language.
A PDA is deterministic if, for each input and state,

there is at most one possible transition to follow in
enumerable iff there is a Turing machine that

lexicographically enumerates it.
CFL Pumping Lemma: For every infinite CFL, L,

there is an integer m, such that any string w L
Theorem: A language is recursive iff it is

lexicographically Turing-enumerable.
where |w| m can be rewritten as w = uvxyz

in such a way that: |vy| 1; |vxy| m; and i 0,
uv xy z L.
A function is recursive iff there is a TM that

computes it and halts for every input in the domain.
There are decision procedures that will determine

for a CFG, G, whether w L(G), whether L(G) = ,
or whether L(G) is infinite.
Let 0 be the input alphabet to a Turing machine M.

Let L 0*. M accepts (semidecides) L iff for
any string w 0*, w L M halts on input w,
and w L M does not halt on input w (M(w) = ).
Unrestricted Languages and Turing Machines
L is Turing recognizable
(recursively enumerable) iff there is a Turing
machine that accepts (semidecides) it.
A Turing machine is a tuple (Q, , , , q0, # , F ):
The class of recursively enumerable languages is

closed under union, intersection, concatenation,
Kleene closure, and reversal, but NOT complement.
Theorem: A language is recursive iff both it and its

complement are recursively enumerable.
Q is a finite set of internal states;

is the input alphabet;
Theorem: If a language is RE and not recursive,

then its complement is not RE.
is a tape alphabet;
A language is Turing-enumerable iff there is a

Turing machine that enumerates it.
is the transition function;

# is a special blank symbol
A language is recursively enumerable iff it is

Turing-enumerable.
q0 Q is the initial state;
A function is partially recursive iff there is a TM

that computes it and halts for every input in its
domain, but the domain is restricted to a recursively
enumerable set.
F Q is the set of final states.

Let 0 be the input alphabet. Then M decides a
language L 0* iff for any string w 0* it is true
that: if w L then M accepts w (halts in the "Y"
state), and if w L then M rejects w (halts in the
"N" state).
An unrestricted, or Type 0, or phrase structure

grammar G is a quadruple (V, , R, S), where
V is an alphabet;
A language L is Turing decidable (recursive) if

there is a Turing machine M that decides it.
(the set of terminals) is a subset of V;
The class of recursive languages is closed under

union, intersection, concatenation, Kleene closure,
complement, and reversal.
R (the set of rules) is a finite subset of

V*(V-)V* V* (left hand sides must contain one
nonterminal); and
We say that M lexicographically enumerates L if

M enumerates the elements of L in lexicographic
order. A language L is lexicographically Turing-
S (the start symbol) is an element of V - .
We define derivations just as we did for context-free

grammars. The language generated by G is
{w * : S G* w}.
Primitive recursive function template, by which a

function can be defined recursively through
f (x, 0) = g1(x)
There is no notion of a derivation tree or

rightmost/leftmost derivation for unrestricted
grammars.
f(x, y+1) = h(g2(x, y), f(x, y))

from defined functions g1, g2 and h.
Theorem: A language is generated by an

unrestricted grammar if and only if it is recursively
enumerable (i.e., it is accepted (semidecided) by
some Turing machine M).
We say that G computes f if, for all w, v *,
SwS G* v iff v = f(w).
A function f is called grammatically computable
iff there is a grammar G that computes it.
Theorem: A function f is recursive iff it is
grammatically computable.
The universal TM (U), is a TM interpreter.
L(U) = H = {M, w: TM M halts on input w}.
H is RE, but not recursive. H is not RE.
Let L1, L2 * be languages. A reduction from L1
to L2 is a recursive function : * * such that
x L1 iff (x) L2.
Theorem: If L1 is not recursive and there is a
reduction from L1 to L2, then L2 is not recursive.
Theorem: If L1 is not RE and there is a reduction
from L1 to L2, then L2 is not RE.
Proof technique for showing a language is
undecidable: Choose some known undecidable
language L1. (H will usually work for this.) Assume
that L2 (your language) is decidable. Construct a
reduction from L1 to L2. You have just shown that L1
is decidable. Contradiction. Therefore L2 is
undecidable.
Rices Theorem: No nontrivial property of the
recursively enumerable languages is decidable.
Nontrivial means that for all RE languages the
property is not always true or always false.

Toc 1

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Toc 1

Uploaded by

Copyright:

Available Formats

THEORY OF COMPUTATION (AUBER) EXAM CRIB SHEET

The regular grammar rules must be one of the

Regular Languages and FA

non-terminal terminal non-terminal |

A language is a set of strings over a finite alphabet

A Deterministic Finite Automata DFA is

The set of all languages over * is uncountably

A Nondeterministic Finite Automata NFA is

A language is regular if and only if it can be

Regular languages are closed under: union,

String Reversal (wR) inductive definition:

Subset construction (State set method) for

The "pure" regular expressions are finite strings

Creating a regular expression from an NFA

Operator precedence (from highest to lowest):

state being removed (from x to y, labeled and ,

Algorithm for minimizing a DFA (State

Pumping Lemma for Regular Languages: Let L

Context Free Languages and PDA

Strategy: After you assume the language is regular,

A context-free grammar (CFG) is a quadruple

A decision procedure is an algorithm that answers a

S (the start symbol) is an element of V - .

Equivalence classes of a language: Based on the

is called a derivation in G. Each wi is called a

The Myhill-Nerode Theorem states that if L is

A language L is context free if L = L(G) for some

any configuration. Determinism implies uniquely

In a leftmost derivation, the leftmost nonterminal is

Nondeterministic PDAs accept the class of context

A parse tree captures the essential structure of a

There are algorithms to convert from a PDA to a

The leaves of a parse tree are terminals or s. The

((q, , A), (q, x)) for each rule A x in R replace

A context free grammar is ambiguous if it has

((q, a, a), (q, )) for each a read an input

A pushdown automata (PDA) is a sextuple

"Bottom-up parse" conversion: Given G = (V, ,

s Q is the initial state; F Q is the set of final

M = ({p, q}, , V, , p, {q}), where contains:

is the transition relation. It is a finite subset of

((p, a, ), (p, a)) for each a shift input symbol

The first grouping describes the machine computer

((p, , S), (q, )) accept by popping the start

For the machine to accept a string, some

CFLs are closed under union, concatenation,

A PDA is deterministic if, for each input and state,

enumerable iff there is a Turing machine that

CFL Pumping Lemma: For every infinite CFL, L,

Theorem: A language is recursive iff it is

where |w| m can be rewritten as w = uvxyz

A function is recursive iff there is a TM that

There are decision procedures that will determine

Let 0 be the input alphabet to a Turing machine M.

Unrestricted Languages and Turing Machines

A Turing machine is a tuple (Q, , , , q0, # , F ):

The class of recursively enumerable languages is

Theorem: A language is recursive iff both it and its

Q is a finite set of internal states;

Theorem: If a language is RE and not recursive,

A language is Turing-enumerable iff there is a

is the transition function;