Parser Flashcards

Question 1

Q

What does the parser do?

Answer

A

Group tokens into grammatical phrases
Discovers underlying structure of program
finds syntax errors

Question 2

Q

What is output of parser for legal program?

Answer

A

Abstract syntax tree (+ symbol table)
intermediate code
object code

Question 3

Q

terminal symbol

Answer

A

Tokens returned by scanner

Question 4

Q

productions

Question 5

Q

Production LHS

Answer

A

Single non-terminal

Question 6

Q

Production RHS

Answer

A

Either epsilon or a sequence of one or more terminals and/or non-terminals

Question 7

Q

CFG 4-tuple (N, E, P, S)

Answer

A

N - set of nonterminals
E - set of terminals
P - set of productions
S - start non-terminals

Question 8

Q

Leftmost derivation

Answer

A

The leftmost nonterminal is always chosen to be replaced

Question 9

Q

Rightmost derivation

Answer

A

The rightmost nonterminal is always chosen to be replaced

Question 10

Q

How to construct a parse tree

Answer

A

Start with start non-terminal
Repeat: 1) choose a leaf nonterm X 2) choose a production X->alpha 3) The symbols in alpha become the children of X in the tree

Question 11

Q

The derived string is formed by reading leaf nodes from _______ to _______

Answer

A

The derived string is formed by reading the leaf nodes from left to right

Question 12

Q

Ambiguous grammer

Answer

A

> 1 leftmost or > 1 right-most derivation or > 1 parse tree

Question 13

Q

How to write a grammer to express precedence

Answer

A

Use a different nonterminal for each precendence leve
Start by writing a rule for the operator with the lowest precedence
exp->

Question 14

Q

What causes ambiguity?

Answer

A

Ill defined precedence and/or associativity

Question 15

Q

For left associativity, use

Answer

A

left recursion

Question 16

Q

For right associativity, use

Answer

A

right recursion

Question 17

Q

How to force left associativity?

Answer

A

exp->exp MINUS exp | term

exp->exp MINUS term | term

Question 18

Q

Syntax directed translation

Answer

A

defined by augmenting the CFS: a translation rule is defined for each production

Question 19

Q

A translation rule defines the translation of the LHS non-terminal as a function of:

Answer

A

constants
the RHS nonterminal’s translations
the RHS tokens’ values

Question 20

Q

To translate an input string using syntax directed translation:

Answer

A

Build the parse tree

2. Use the translation rules to computer the translation of each non-terminal in the tree, working bottom up

Question 21

Q

Why work bottom up for syntax directed translation?

Answer

A

A non-terminal’s value may depend on the value of the symbols on the RHS so need to work bottom up so those values are available

Question 22

Q

AST vs. Parse Tree

Answer

A

AST: operators appear at internal nodes instead of leaves, chains of single productions are collapsed, listed a flattened, syntactic details are omitted

Question 23

Q

Context Free Grammar

Answer

A

A set of recursive rewriting rules to generate patterns of strings

Question 24

Q

CFG in compiler

Answer

A

Start with a string and end with a parse tree for w if w exits in L(G)

Question 25

Q

syntax directed translation

Answer

A

translate a sequence of tokens into a sequence of actions

Question 26

Q

For ASTS, when we execute an SDT rule:

Answer

A

we construct a new node object, which becomes the value of LHS.trans
populate the node’s fields with the translations of the RHS non-terminals

Question 27

Q

How to know if a string is in the language of the CFG?

Answer

A

Yes iff the string can be derived from the CFG’s start non-terminal
Yes iff we can build a parse tree for the string, with the CFG’s start nonterminal at the root

Question 28

Q

CYK algorithm used for what?

Answer

A

Used to parse any CFG (to determine for any input whether that input is in the language of a given CFG)

Question 29

Q

CYK essentially does what?

Answer

A

Builds all of the possible parse trees for all (consecutive) fragments of the input, bottom up.

Question 30

Q

When does CYK accept input?

Answer

A

Iff it is able to build a complete a complete parse tree with the start non-terminal at the root and the entire input at the leaves.

Question 31

Q

Chomsky Normal Form

Answer

A

X->singular terminal
X->2 non-terminals
X->epsilon is not allowed unless the nonterminal is the start non-terminal
If S->epsilon and S is the start non-term, then S cannot occur on the RHS of any rule

Question 32

Q

How to convert from CFG to CNF

Answer

A

Eliminate epsilon rules
Eliminate rules with exactly 1 non-terminal on the right (unit rules)
Fix remaining rules so all rules have single terminal or exactly two non-terminals on the right

Question 33

Q

how to eliminate epsilon for CNF?

Answer

A

A->epsilon
Take any other rule that has A on the RHS and copy it with nothing
F->(A) F->()

Question 34

Q

Idea of predictive parser

Answer

A

build parse tree top down, keep track of work to be done, use a parse table to decide how to do the parse

Question 35

Q

scanned tokens + stack contents correspond to what in predictive parser?

Answer

A

Leaves of the current (incomplete) parse tree

Question 36

Q

Rows of parse table are indexed by what?

Answer

A

indexed by nonterminals

Question 37

Q

columns of parse table are indexed by what?

Answer

A

columns are indexed by the tokens

Question 38

Q

Each element of the parse table is what?

Answer

A

Each element of the table for the row indexed by nonterminal X is either empty or contains the RHS of a grammar rule for X

Question 39

Q

What are the first two things pushed onto the stack for a predictive parser?

Answer

A

1) EOF terminal

2) start nonterminal

Question 40

Q

What happens if top-of-stack symbol is a nonterminal X?

predictive parser

Answer

A

Use nonterminal X and the current token t to index into the parse table and choose a production with X on the LHS (RHS is in table[x][t]
pop x from stack and push the chosen production’s RHS

Question 41

Q

For nonterminal, which direction are chosen RHS pushed onto stack (predictive parser)

Answer

A

Push symbols from R to L

Question 42

Q

What happens if top-of-stack symbol is terminal (predictive parser)?

Answer

A

Match it with the current token; if it matches, pop it and call the scanner to get the next token

Question 43

Q

3 ways to terminate predictive parser?

Answer

A

Top of stack is non-terminal, and parse table entry is empty
top-of-stack=terminal but doesn’t match curr token
stack is empty

Question 44

Q

When is predictive parser input accepted?

Answer

A

Stack is empty?

Question 45

Q

When is predictive parser input rejected

Answer

A

nonterminal and no parse table entry

terminal and doesn’t match current token

Question 46

Q

Is it always possible to build a predictive parser given a CFG?

Answer

A

Only possible to build a predictive parser for CFG if CFG is LL(1)

Question 47

Q

Example of something not in LL(1) because LL(1) only allows 1 look ahead

Answer

A

S->(S)|()

Question 48

Q

How to know if grammar is LL(1)?

Answer

A

If build the parse table and no element of the table contains more than 1 grammar rule RHS, then LL(1)

Question 49

Q

2 properties that preclude grammar from being LL(1)

Answer

A

Left recursive grammar

Grammars that are not left factored

Question 50

Q

A nonterminal X is useless if:

Answer

A

1) you can’t derive a sequence that includes X

2) You can’t derive a string (epsilon or a sequence of terminals) from X

Question 51

Q

Transform left recursion into non-left recursion

A->Aa|B

Answer

A

A->BA’

A’->aA’|epsilon

Question 52

Q

Meaning of left factored

Answer

A

A nonterminal has 2 productions whose RHS have a common prefix

Question 53

Q

Can a LL(1) grammar be left factored?

Question 54

Q

A grammar is not LL(1) if it is:

Answer

A

Left recursive

Not left factored

Question 55

Q

How to check if grammar is in LL(1)

Answer

A

Build the parse table; if any element in the table contains more than 1 grammar rule RHS, the grammar is not LL(1)

Question 56

Q

Idea of first sets

Answer

A

for a sequence a, FIRST(a) is the set of terminals that begin the strings derivable from a

Question 57

Q

Can grammar be LL(1) if a is current token

A->alpha, B->beta and a is in FIRST(alpha) and FIRST(beta)

Question 58

Q

FOLLOW sets are defined for what?

Answer

A

single non-terminals

Question 59

Q

Define FIRST sets for what?

Answer

A

RHS of each production

define FIRST sets for arbitrary sequences of terminals/non-terms/epsilon

Question 60

Q

Can epsilon be in a follow set?

Answer

A

No, epsilon can never be a follow set

Question 61

Q

What does the semantic stack hold?

Answer

A

Nonterminal’s translations

Question 62

Q

When parse is finished, what does semantic stack hold?

Answer

A

Translation of the root nonterminal (translation of the whole input)

Question 63

Q

How are values pushed onto the semantic stack?

Answer

A

Add actions to grammar rules

Question 64

Q

Actions for a grammar rule must:

Answer

A

Pop the translations of all RHS nonterminals

compute and push the translation of the LHS nonterminal

Answer 62

A

part of RHS of grammar rules

Pushed onto normal stack, when action # is top of stack, popped and action carried out

Answer 63

A

not performed until all derivations of LHS are carried out

Answer 64

A

Right to left because predictive parser does a leftmost derivation