Introduction to Computational Linguistics

More documents

Recommendations

Info

20. Greibach Normal Form 78 Recall that a derivation defines a set of constituent occurrences, which in turn constitute the nodes of the tree. Notice that each occurrence of a nonterminal is replaced by some right hand side of a rule during a derivation that leads to a terminal string. After it has been replaced, it is gone and can no longer figure in a derivation. Given a tree, a linearization is an ordering of the nodes which results from a valid derivation in the following way. We write x ⊳ y if the constituent of x is expanded before the constituent of y is. One can characterize exactly what it takes for such an order to be a linearization. First, it is linear. Second if x > y then also x ⊳ y. It follows that the root is the first node in the linearization. Linearizations are closely connected with search strategies in a tree. We shall present examples. The first is a particular case of the so–called depth–first search and the linearization shall be called leftmost linearization. It is as follows. x ⊳ y iff x > y or x ⊏ y. (Recall that x ⊏ y iff x precedes y. Trees are always considered ordered.) For every tree there is exactly one leftmost linearization. We shall denote the fact that there is a leftmost derivation of ⃗α from X by X ⊢ l G ⃗α. We can generalize the situation as follows. Let ◭ be a linear ordering uniformly defined on the leaves of local subtrees. That is to say, if B and C are isomorphic local trees (that is, if they correspond to the same rule ρ) then ◭ orders the leaves B linearly in the same way as ⊳ orders the leaves of C (modulo the unique (!) isomorphism). In the case of the leftmost linearization the ordering is the one given by ⊏. Now a minute’s reflection reveals that every linearization of the local subtrees of a tree induces a linearization of the entire tree but not conversely (there are orderings which do not proceed in this way, as we shall see shortly). X ⊢G ◭ ⃗α denotes the fact that there is a derivation of ⃗α from X determined by ◭. Now call π a priorization for G = 〈S, N, A, R〉 if π defines a linearization on the local tree H ρ , for every ρ ∈ R. Since the root is always the first element in a linearization, we only need to order the daughters of the root node, that is, the leaves. Let this ordering be ◭. We write X ⊢ π G ⃗α if X ⊢◭ G ⃗α for the linearization ◭ defined by π. Proposition 26 Let π be a priorization. Then X ⊢ π G ⃗x iff X ⊢ G ⃗x. A different strategy is the breadth–first search. This search goes through the tree in increasing depth. Let S n be the set of all nodes x with d(x) = n. For each n, S n shall be ordered linearly by ⊏. The breadth–first search is a linearization ∆, which is defined as follows. (a) If d(x) = d(y) then x ∆ y iff x ⊏ y, and (b) if d(x) < d(y) then x ∆ y. The difference between these search strategies, depth–first and breadth–first, can be made very clear with tree domains.
20. Greibach Normal Form 79 Definition 27 A tree domain is a set T of strings of natural numbers such that (i) if ⃗x if a prefix of ⃗y ∈ T then also ⃗x ∈ T, (b) if ⃗x j ∈ T and i < j then also ⃗xi ∈ T. We define ⃗x > ⃗y if ⃗x is a proper prefix of ⃗y and ⃗x ⊏ ⃗y iff ⃗x = ⃗ui⃗v and ⃗y = ⃗u j⃗w for some sequences ⃗u, ⃗v, ⃗w and numbers i < j. The depth–first search traverses the tree domain in the lexicographical order, the breadth–first search in the numerical order. Let the following tree domain be given. ε ❅ ❅ ❅ ❅ 0 1 2 ✂ ❇ ✂ ❇ 00 ✂ 10 ❇ 11 20 The depth–first linearization is (202) ε, 0, 00, 1, 10, 11, 2, 20 The breadth–first linearization, however, is (203) ε, 0, 1, 2, 00, 10, 11, 20 Notice that with these linearizations the tree domain ω ∗ cannot be enumerated. Namely, the depth–first linearization begins as follows. (204) ε, 0, 00, 000, 0000, . . . So we never reach 1. The breadth–first linearization goes like this. (205) ε, 0, 1, 2, 3, . . . So, we never reach 00. On the other hand, ω ∗ is countable, so we do have a linearization, but it is more complicated than the given ones.
Page 1 and 2:
Introduction to Computational Lingu
Page 3 and 4:
2. Practical Remarks Concerning OCa
Page 5 and 6:
3. Welcome To The Typed Universe 5
Page 7 and 8:
Page 9 and 10:
Page 11 and 12:
4. Function Definitions 11 4 Functi
Page 13 and 14:
5. Modules 13 can find out why the
Page 15 and 16:
5. Modules 15 let length l = length
Page 17 and 18:
6. Sets and Functors 17 write it do
Page 19 and 20:
7. Hash Tables 19 to actually see t
Page 21 and 22:
8. Combinators 21 can be used on an
Page 23 and 24:
9. Objects and Methods 23 of type
Page 25 and 26:
10. Characters, Strings and Regular
Page 27 and 28: 10. Characters, Strings and Regular
Page 29 and 30: 10. Characters, Strings and Regular
Page 31 and 32: 11. Interlude: Regular Expressions
Page 37 and 38: 12. Finite State Automata 37 Suppos
Page 39 and 40: 12. Finite State Automata 39 There
Page 41 and 42: 12. Finite State Automata 41 The pr
Page 43 and 44: 12. Finite State Automata 43 Proof.
Page 45 and 46: 13. Complexity and Minimal Automata
Page 51 and 52: 14. Digression: Time Complexity 51
Page 53 and 54: 14. Digression: Time Complexity 53
Page 55 and 56: 15. Finite State Transducers 55 Con
Page 57 and 58: 15. Finite State Transducers 57 And
Page 59 and 60: 16. Finite State Morphology 59 Then
Page 61 and 62: 17. Using Finite State Transducers
Page 67 and 68: 18. Context Free Grammars 67 18 Con
Page 69 and 70: 18. Context Free Grammars 69 symbol
Page 71 and 72: 18. Context Free Grammars 71 which
Page 73 and 74: 19. Parsing and Recognition 73 Z
Page 75 and 76: 19. Parsing and Recognition 75 numb
Page 77: 20. Greibach Normal Form 77 The red
Page 81 and 82: 20. Greibach Normal Form 81 Proposi
Page 83 and 84: 20. Greibach Normal Form 83 nonterm
Page 85 and 86: 21. Pushdown Automata 85 pushdown.
Page 87 and 88: 21. Pushdown Automata 87 machine is
Page 89 and 90: 22. Shift-Reduce-Parsing 89 differe
Page 91 and 92: 23. Some Metatheorems 91 If the loo
Page 93 and 94: 23. Some Metatheorems 93 This proof
Page 95 and 96: 23. Some Metatheorems 95 a decompos
show all

Introduction to Computational Linguistics

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?