Theory of Computation PCCST302 KTU 2024 Scheme - Model Question Paper and Answers

 



When you google something, the search engine essentially performs pattern matching: it identifies and retrieves web pages containing the search keyword(s), then displays them in a predetermined (typically relevance-based) order. Cast this pattern matching as a decision problem. Also write down the language corresponding to your decision problem.

🔍 Step 1: What happens when you Google something

When you type a keyword, say "Turing Machine", Google:

  1. Looks through its index of web pages (a huge database of text).

  2. Searches for matches — web pages that contain the keyword "Turing Machine".

  3. Returns those pages (ranked by relevance).

This is a pattern-matching task — find whether the keyword appears in a document or not.


💡 Step 2: Casting this as a Decision Problem

In automata theory, a decision problem is one that has a yes/no answer.

So, we can define the problem as follows:

Decision Problem:
Given a web page (or document) w and a search keyword (pattern) p,
decide whether the keyword p appears anywhere in the web page w.

✅ Output YES — if p appears in w
❌ Output NO — if p does not appear in w


Formal Definition

Let’s define it formally as a decision problem:

PATTERN-MATCHING Problem:
Input: Two strings p and w, where p is the pattern (keyword) and w is the text (web page content).
Question: Does p occur as a substring of w?


🧩 Step 3: Defining the Language

Every decision problem can be represented as a language:
the set of all inputs for which the answer is “YES”.

So, for this problem, the language is:

L_{pattern} = { ⟨p, w⟩ ∣ the pattern p occurs as a substring in w }

That is:

  • Each element of the language is a pair ⟨pattern, text⟩.

  • The pair is in the language if and only if the pattern appears in the text.


🧠 Example

Let’s take:

  • p="data"p = "data"

  • w="Ilovedatascience"w = "I love data science"

Since “data” occurs in the web page text,

"data","Ilovedatascience"Lpattern\langle "data", "I love data science" \rangle \in L_{pattern}

But:

  • p="robot"p = "robot"

  • w="Ilovedatascience"w = "I love data science"

Then,

"robot","Ilovedatascience"Lpattern\langle "robot", "I love data science" \rangle \notin L_{pattern}

⚙️ Step 4: Is this Problem Decidable?

✅ Yes — pattern matching is decidable.

Because:

  • There exists a finite, mechanical procedure (like the Knuth-Morris-Pratt algorithm or even a simple scan)
    that can determine in finite time whether p appears in w.

So, a Turing machine can be constructed to:

  1. Read both p and w

  2. Slide p along w

  3. Compare character by character

  4. Halt and accept if a match is found, else reject
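
To make the procedure concrete, here is a minimal Python sketch of the naive slide-and-compare decider described above (a real search engine would use an index or the KMP algorithm, but any correct finite procedure is enough for decidability):

```python
def in_L_pattern(p: str, w: str) -> bool:
    """Decide whether <p, w> is in L_pattern: does p occur as a substring of w?"""
    if p == "":
        return True                       # the empty pattern occurs everywhere
    for i in range(len(w) - len(p) + 1):  # slide p along w
        # compare character by character at offset i
        if all(w[i + j] == p[j] for j in range(len(p))):
            return True                   # match found: accept
    return False                          # no offset matched: reject

print(in_L_pattern("data", "I love data science"))   # True
print(in_L_pattern("robot", "I love data science"))  # False
```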


📘 Final Answer 

Concept | Description
--- | ---
Decision Problem | Given a pattern p and a text w, determine whether p appears in w.
Language | L_{pattern} = { ⟨p, w⟩ : p occurs as a substring of w }
Output | YES (accept) if p is in w; NO (reject) otherwise.
Decidability | Decidable — a Turing Machine can check this in finite time.


Construct a DFA for the language consisting of strings over the alphabet Σ = {a, b} that contains no more than one occurrence of the string aa. (Note that the string aaa contains two occurrences of aa.)
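
One possible construction (a sketch, not the unique answer): track how many occurrences of aa have appeared so far (0 or 1) and whether the previous symbol was an a. That needs five states:

  • q0: no occurrence yet, previous symbol not a (start state)

  • q1: no occurrence yet, previous symbol a

  • q2: one occurrence, previous symbol a

  • q3: one occurrence, previous symbol not a

  • qd: dead state (a second occurrence appeared)

All states except qd are accepting. A small Python simulation of this transition table (the state names are my own labels):

```python
# Transition table for the sketched DFA.
DELTA = {
    ("q0", "a"): "q1", ("q0", "b"): "q0",
    ("q1", "a"): "q2", ("q1", "b"): "q0",  # second a in a row: first occurrence of aa
    ("q2", "a"): "qd", ("q2", "b"): "q3",  # yet another a: second (overlapping) occurrence
    ("q3", "a"): "q2", ("q3", "b"): "q3",
    ("qd", "a"): "qd", ("qd", "b"): "qd",  # dead state: trap
}
ACCEPTING = {"q0", "q1", "q2", "q3"}

def accepts(s: str) -> bool:
    state = "q0"
    for ch in s:
        state = DELTA[(state, ch)]
    return state in ACCEPTING

print(accepts("abaab"))  # True: exactly one occurrence of aa
print(accepts("aaa"))    # False: aaa contains two (overlapping) occurrences
print(accepts("aabaa"))  # False: two occurrences of aa
```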


Is the class of languages recognized by NFAs closed under complement? Explain your answer.
Flipping the accepting and non-accepting states of an NFA does not, in general, yield an automaton for the complement; that trick is only valid for DFAs.
However, the class of languages recognized by NFAs (the regular languages) is closed under complement,
because any NFA can be converted to an equivalent DFA, and DFAs are closed under complement.

Thus, we can always convert the NFA to an equivalent DFA, complement that DFA (swap accepting and non-accepting states), and view the result as an NFA, since every DFA is a special case of an NFA.
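
A minimal sketch of that pipeline in Python, assuming the NFA is given as a dictionary of transition sets with no ε-moves (ε-closure is omitted for brevity):

```python
from itertools import chain

def nfa_to_complement_dfa(alphabet, delta, start, accepting):
    """Subset construction, then flip acceptance.
    delta: dict (state, symbol) -> set of states (an NFA without epsilon-moves).
    Returns a DFA recognizing the COMPLEMENT of the NFA's language."""
    start_set = frozenset([start])
    dfa_delta, seen, todo = {}, {start_set}, [start_set]
    while todo:                            # determinize: explore reachable subsets
        S = todo.pop()
        for a in alphabet:
            T = frozenset(chain.from_iterable(delta.get((q, a), ()) for q in S))
            dfa_delta[(S, a)] = T
            if T not in seen:
                seen.add(T)
                todo.append(T)
    # Complement at the DFA level: accept subsets containing NO accepting NFA state
    dfa_accepting = {S for S in seen if not (S & accepting)}
    return dfa_delta, start_set, dfa_accepting

# Illustrative NFA (an assumption): accepts strings over {a,b} ending in 'a'
delta = {("p", "a"): {"p", "q"}, ("p", "b"): {"p"}}
dd, s0, acc = nfa_to_complement_dfa("ab", delta, "p", {"q"})

def complement_accepts(w):
    S = s0
    for ch in w:
        S = dd[(S, ch)]
    return S in acc

print(complement_accepts("ab"))  # True: "ab" does not end in 'a'
print(complement_accepts("ba"))  # False: "ba" ends in 'a'
```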

Give a context-free grammar that generates the language: L_pair = { a^i b^j c^k | i = j or j = k, where i, j, k ≥ 0 }

This language is the union of two simple context-free patterns:

  • L_1 = { a^n b^n c^k | n, k ≥ 0 } (equal numbers of a's and b's, an arbitrary number of c's), and

  • L_2 = { a^i b^n c^n | i, n ≥ 0 } (equal numbers of b's and c's, an arbitrary number of a's).

A straightforward CFG for L_pair = L_1 ∪ L_2 is obtained by giving one nonterminal for each case and making the start symbol choose between them.



Grammar

Nonterminals: S,S1,S2,A,B,C,D
Terminals: {a,b,c}
Start symbol: S

Productions:

S → S1 | S2 
S1 → A B 
A → ε | a A b (generates a^n b^n) 
B → ε | c B (generates c^k) 

S2 → C D 
C → ε | a C (generates a^i) 
D → ε | b D c (generates b^n c^n)
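
As a quick sanity check on what the grammar should generate, here is a direct Python membership test for L_pair (it tests the language definition itself, not the grammar):

```python
import re

def in_L_pair(s: str) -> bool:
    """True iff s = a^i b^j c^k with i == j or j == k."""
    m = re.fullmatch(r"(a*)(b*)(c*)", s)
    if not m:
        return False                     # not of the form a* b* c*
    i, j, k = (len(g) for g in m.groups())
    return i == j or j == k

print(in_L_pair("aabbc"))   # True  (i = j = 2)
print(in_L_pair("abbcc"))   # True  (j = k = 2)
print(in_L_pair("aabcc"))   # False (i != j and j != k)
```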

Construct a PDA to recognize the language generated by the
following grammar:
S → aA
A → aABC | bB | a
B → b
C → c

PDA = (Q, Σ, Γ, δ, q, S, accept-by-empty-stack) where:

  • Q = {q}

  • Σ = {a, b, c}

  • Γ = {S, A, B, C, a, b, c} (we push terminals on the stack too)

  • Start state q, initial stack symbol S. Accept when input is consumed and stack is empty.

Transitions (informal description):

  1. For each production X → α, add an ε-transition that replaces X on top of stack by the string α (pushed so that the leftmost symbol of α becomes the new top at the time of matching). Concretely (we list pop X then push the RHS in reverse so the terminal that should be matched next is on top):

  • δ(q, ε, S) ⊢ (q, A a) (replace S by aA, so push A then a so a is on top)

  • δ(q, ε, A) ⊢ (q, C B A a) (replace A by aABC, push in reverse: C, B, A, a so a is top)

  • δ(q, ε, A) ⊢ (q, B b) (replace A by bB, push B then b)

  • δ(q, ε, A) ⊢ (q, a) (replace A by a)

  • δ(q, ε, B) ⊢ (q, b) (replace B by b)

  • δ(q, ε, C) ⊢ (q, c) (replace C by c)

  2. For matching terminals: when the top of the stack is a terminal equal to the current input symbol, consume it and pop it:

  • δ(q, a, a) ⊢ (q, ε) (read a, pop a)

  • δ(q, b, b) ⊢ (q, ε) (read b, pop b)

  • δ(q, c, c) ⊢ (q, ε) (read c, pop c)

(Other combinations have no moves.)

Trace for input aaabc

Convention: show each configuration as (remaining input, stack) with the stack top on the left. Start with stack containing the start symbol S. Accept by empty stack.

Initial: (aaabc, [S])

  1. Apply S → aA (replace S with aA, pushing A then a so a is top):
    (aaabc, [a, A])

  2. Match terminal a (consume input a, pop a):
    (aabc, [A])

  3. Apply A → aABC (replace A with a A B C, pushing C, B, A, a so a is top):
    (aabc, [a, A, B, C])

  4. Match terminal a (consume a, pop a):
    (abc, [A, B, C])

  5. Apply A → a (replace A with terminal a, push a):
    (abc, [a, B, C])

  6. Match terminal a (consume a, pop a):
    (bc, [B, C])

  7. Replace B → b (push terminal b):
    (bc, [b, C])

  8. Match terminal b (consume b, pop b):
    (c, [C])

  9. Replace C → c (push terminal c):
    (c, [c])

  10. Match terminal c (consume c, pop c):
    (ε, [])

Stack is empty and input is consumed → accepted.
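
This construction can be sketched in a few lines of Python: a configuration is just an (input position, stack) pair, ε-moves expand a nonterminal on top of the stack, and match moves consume input. The nondeterministic choice among the A-productions is handled by backtracking:

```python
GRAMMAR = {                    # X -> list of right-hand sides
    "S": ["aA"],
    "A": ["aABC", "bB", "a"],
    "B": ["b"],
    "C": ["c"],
}

def pda_accepts(w: str, stack=("S",), i=0) -> bool:
    """Single-state PDA, accept by empty stack. Stack top is stack[0]."""
    if not stack:
        return i == len(w)                   # empty stack + consumed input: accept
    top, rest = stack[0], stack[1:]
    if top in GRAMMAR:                       # epsilon-move: expand nonterminal
        return any(pda_accepts(w, tuple(rhs) + rest, i) for rhs in GRAMMAR[top])
    # top is a terminal: match it against the current input symbol and pop it
    return i < len(w) and w[i] == top and pda_accepts(w, rest, i + 1)

print(pda_accepts("aaabc"))  # True: reproduces the trace above
print(pda_accepts("aab"))    # False: the grammar does not generate aab
```

Because every production here starts with a terminal, each expansion is immediately checked against the input, so the backtracking search always terminates.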

If G is a context free grammar and w is a string of length l in L(G), how long is a derivation of w in G, if G is in Chomsky normal form? How would your answer change if G is in Greibach normal form?

  • If G is in Chomsky Normal Form (CNF) and w ∈ L(G) has length |w| = l > 0, then any complete derivation of w uses exactly 2l - 1 production applications (steps). (If w = ε and the CNF grammar allows S → ε as the only ε-rule, that derivation has 1 step.)

  • If G is in Greibach Normal Form (GNF) and |w| = l, then there is a leftmost derivation of w with exactly l steps (one production per terminal). GNF cannot generate ε except by special convention, so the l = 0 case is separate.

Brief proofs / reasoning:

CNF. In CNF every non-start production is either A → BC or A → a. Let x be the number of uses of A → BC in a derivation and y the number of uses of A → a. Each A → BC increases the number of nonterminals in the sentential form by 1, while each A → a decreases it by 1. Starting with one nonterminal S and ending with zero nonterminals gives

1 + x - y = 0, hence y = x + 1.

But every terminal in the final string is produced by an A → a application, so y = l. Hence x = l - 1 and the total number of productions applied is

x + y = (l - 1) + l = 2l - 1.

GNF. In GNF every production has the form A → aα, where a is a terminal and α is a (possibly empty) string of nonterminals. In a leftmost derivation each application replaces the leftmost nonterminal by a production whose first symbol is a terminal, thereby producing exactly one terminal in the leftmost part of the sentential form. So to produce l terminals you need exactly l leftmost steps. Thus there is a leftmost derivation of length exactly l.
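
The 2l - 1 count is easy to verify mechanically. The sketch below brute-forces leftmost derivations in a small CNF grammar for { a^n b^n : n ≥ 1 } (the grammar itself is an illustrative assumption) and reports the derivation length:

```python
from collections import deque

# A CNF grammar for {a^n b^n : n >= 1} (chosen for illustration):
#   S -> AB | AC,  C -> SB,  A -> a,  B -> b
CNF = {
    "S": [("A", "B"), ("A", "C")],
    "C": [("S", "B")],
    "A": [("a",)],
    "B": [("b",)],
}

def derivation_length(target: str):
    """Breadth-first search over leftmost derivations; returns the number
    of production applications needed to derive `target` (or None)."""
    queue = deque([(("S",), 0)])
    while queue:
        form, steps = queue.popleft()
        if len(form) > len(target):          # CNF forms never shrink: prune
            continue
        i = next((k for k, sym in enumerate(form) if sym in CNF), None)
        if i is None:                        # all terminals: check the result
            if "".join(form) == target:
                return steps
            continue
        if "".join(form[:i]) != target[:i]:  # terminal prefix must match
            continue
        for rhs in CNF[form[i]]:             # expand the leftmost nonterminal
            queue.append((form[:i] + rhs + form[i + 1:], steps + 1))
    return None

for n in (1, 2, 3):
    w = "a" * n + "b" * n
    print(w, derivation_length(w), "expected", 2 * len(w) - 1)
```

The GNF claim can be checked against the PDA question above: that grammar is in GNF, and deriving aaabc (l = 5) used exactly 5 productions (S → aA, A → aABC, A → a, B → b, C → c).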

For each of the following decision problems about a Turing machine M, indicate whether it is decidable or not.
1. Does M take more than 1008 steps on some input?
2. Does M accept ε?
3. Is L(M) context free?

1. Algorithm: enumerate all strings x with |x| ≤ 1008 (finitely many; this suffices because in 1008 steps M can read at most the first 1008 input cells, so every longer input behaves like its length-1008 prefix); simulate M on each x for 1009 steps. If any simulation reaches step 1009, answer YES; if every simulation halts (accepting or rejecting) within 1008 steps, answer NO. This procedure always halts and is correct, hence the property is decidable.
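
A sketch of this decider in Python, assuming a hypothetical step-bounded simulator steps_used(M, x, cap) supplied by the caller (Python has no built-in Turing machine simulator):

```python
from itertools import product

def takes_more_than_1008_steps(M, alphabet, steps_used) -> bool:
    """Decider sketch. `steps_used(M, x, cap)` is an assumed helper that
    runs M on input x for at most `cap` steps and returns the number of
    steps actually performed (capped at `cap`)."""
    # Finitely many inputs to test (astronomically many, but decidability
    # only needs finiteness): lengths 0..1008 over the input alphabet.
    for length in range(1009):
        for chars in product(alphabet, repeat=length):
            if steps_used(M, "".join(chars), cap=1009) >= 1009:
                return True     # M took a 1009th step on this input
    return False                # every input halts within 1008 steps
```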

2. This is the classical acceptance problem restricted to a fixed input. The general acceptance problem
A_TM = { ⟨M, w⟩ ∣ M accepts w } is undecidable. Reduce from it: given ⟨M, w⟩, build a machine M' that on input ε first writes w on its tape (or otherwise simulates having w as input) and then simulates M on w; M' accepts ε iff M accepts w. Thus a decider for “does a TM accept ε?” would decide A_TM, a contradiction. So it is undecidable.

3. This is a nontrivial property of the (recursively enumerable) language of a TM. By Rice’s theorem, any nontrivial semantic property of the language recognized by a TM (a property that holds for some recursively enumerable languages and fails for others) is undecidable. Being context-free is such a nontrivial property: there exist RE languages that are context-free and RE languages that are not. Therefore it is undecidable to test, given ⟨M⟩, whether L(M) is a context-free language.

Let L1 and L2 be Turing-recognizable languages over the same alphabet Σ. Prove that L1 ∩ L2 is also Turing-recognizable.

💡 Step 1. What it means to be Turing-recognizable

A language L is Turing-recognizable (or recursively enumerable) if there exists a Turing Machine (TM) that:

  • Accepts every string in L.

  • May run forever (does not have to halt) for strings not in L.

So the machine doesn’t have to decide; it only needs to recognize.


💡 Step 2. Given two such languages

Let:

  • M1 recognize L1

  • M2 recognize L2

We want a new TM M that recognizes L1 ∩ L2, i.e.,
it accepts a string w if and only if both M1 and M2 accept w.


💡 Step 3. The key idea — run both machines in parallel

If we just run M1M_1 completely first, it might never halt!
So instead, we simulate both gradually:

  • On input w:

    1. Run M1 for 1 step.

    2. Run M2 for 1 step.

    3. Run M1 for another step.

    4. Run M2 for another step.

    5. Continue alternating forever (this is called dovetailing).

Whenever both machines have accepted w, our new machine accepts (sketched below).
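
A Python sketch of the dovetailed simulation, modelling each machine as a generator that yields once per simulated step (the generators and toy “machines” are illustrative assumptions, not real TMs):

```python
def recognize_intersection(run1, run2, w):
    """Dovetailed recognizer sketch for L1 ∩ L2.

    run1(w) and run2(w) are assumed step-wise simulations of M1 and M2:
    generators that yield None after each step and yield True once the
    machine accepts (a non-accepting run may yield None forever)."""
    m1, m2 = run1(w), run2(w)
    acc1 = acc2 = False
    while not (acc1 and acc2):
        if not acc1:
            acc1 = next(m1) is True    # one step of M1
        if not acc2:
            acc2 = next(m2) is True    # one step of M2
    return True                        # both accepted, so w is in L1 ∩ L2

# Toy stand-ins for M1 and M2:
def make_runner(pred, delay):
    def run(w):
        def gen():
            for _ in range(delay):     # pretend to compute for `delay` steps
                yield None
            while not pred(w):         # never accept: run forever
                yield None
            yield True                 # accept
        return gen()
    return run

run1 = make_runner(lambda w: "ab" in w, delay=3)        # L1: contains "ab"
run2 = make_runner(lambda w: len(w) % 2 == 0, delay=5)  # L2: even length
print(recognize_intersection(run1, run2, "abba"))       # True: w is in both
# For a w not in both languages the call may run forever, as a recognizer may.
```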


💡 Step 4. Why this works

  • If w is in both L1 and L2:
    Eventually, both M1 and M2 will accept — so our new machine will also accept.

  • If w is not in both:
    At least one of M1 or M2 will never accept,
    so our machine will never reach the “both accepted” condition — it runs forever.
    (That’s allowed for recognizers.)


Therefore:
The intersection of two Turing-recognizable languages is also Turing-recognizable.

