23.12.2014 Views

Scrambling as the Combination of Relaxed Context-Free Grammars in

Scrambling as the Combination of Relaxed Context-Free Grammars in

Scrambling as the Combination of Relaxed Context-Free Grammars in

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 6<br />

<strong>Scrambl<strong>in</strong>g</strong> <strong>as</strong> <strong>the</strong> <strong>Comb<strong>in</strong>ation</strong> <strong>of</strong> <strong>Relaxed</strong><br />

<strong>Context</strong>-<strong>Free</strong> <strong>Grammars</strong> <strong>in</strong> a<br />

Model-Theoretic Grammar Formalism<br />

Ralph Debusmann<br />

Programm<strong>in</strong>g Systems Lab<br />

Universität des Saarlandes<br />

Postfach 15 11 50<br />

66041 Saarbrücken, Germany<br />

rade@ps.uni-sb.de<br />

6.1 Introduction<br />

Five years after <strong>the</strong> first ESSLLI workshop on Model-<br />

Theoretic Syntax (MTS), Pullum and Scholz (2001)<br />

stated that s<strong>in</strong>ce <strong>the</strong> work on MTS had largely focused<br />

on reformulat<strong>in</strong>g exist<strong>in</strong>g GES frameworks, <strong>in</strong> a sense, it<br />

had been done <strong>in</strong> <strong>the</strong> shadow <strong>of</strong> Generative-Enumerative<br />

Syntax (GES).<br />

In <strong>the</strong> follow<strong>in</strong>g five years, <strong>the</strong> bulk <strong>of</strong> work h<strong>as</strong> still<br />

been <strong>in</strong>vested <strong>in</strong> model-<strong>the</strong>oretic reformulations <strong>of</strong> GES<br />

frameworks. Reformulations <strong>of</strong> GB can be found <strong>in</strong><br />

(Rogers, 1996, 2003), <strong>of</strong> LFG <strong>in</strong> (Blackburn and Gardent,<br />

1995), <strong>of</strong> GPSG <strong>in</strong> (Kracht, 1995) and (Rogers,<br />

1996, 2003), <strong>of</strong> HPSG <strong>in</strong> (Kepser, 2000) and (Kepser and<br />

Mönnich, 2003), and <strong>of</strong> TAG <strong>in</strong> (Rogers, 2003).<br />

Recently (Rogers, 2004), <strong>the</strong>re have been attempts to<br />

step out <strong>of</strong> <strong>the</strong> shadow <strong>of</strong> GES, and to use MTS not<br />

only to reformulate and compare exist<strong>in</strong>g frameworks,<br />

but to utilize <strong>the</strong> more declarative, clarify<strong>in</strong>g perspective<br />

<strong>of</strong> MTS to also explore extensions <strong>of</strong> <strong>the</strong>m. This is what<br />

we set out to do <strong>in</strong> this paper <strong>as</strong> well.<br />

We b<strong>as</strong>e our work on <strong>the</strong> model-<strong>the</strong>oretic meta grammar<br />

formalism <strong>of</strong> Extensible Dependency Grammar<br />

(XDG) (Debusmann, 2006). XDG can be used to axiomatize<br />

grammatical <strong>the</strong>ories b<strong>as</strong>ed on dependency grammar,<br />

to extend <strong>the</strong>m, and to implement <strong>the</strong>m us<strong>in</strong>g <strong>the</strong><br />

constra<strong>in</strong>t-b<strong>as</strong>ed XDG Development Kit (XDK) (Debusmann<br />

et al., 2004), (Debusmann and Duchier, 2007).<br />

XDG is novel <strong>in</strong> support<strong>in</strong>g <strong>the</strong> axiomatization <strong>of</strong> multidimensional<br />

grammatical <strong>the</strong>ories, where <strong>the</strong> l<strong>in</strong>guistic<br />

<strong>as</strong>pects <strong>of</strong> e.g. syntax and semantics can be modeled modularly<br />

by separate dependency analyses.<br />

This paper contributes a new, previously unpublished<br />

formalization <strong>of</strong> XDG <strong>in</strong> first-order logic (section 6.2),<br />

and <strong>the</strong> first results on <strong>the</strong> closure properties <strong>of</strong> <strong>the</strong> str<strong>in</strong>g<br />

languages licensed by XDG (section 6.3). The closure<br />

properties are proven b<strong>as</strong>ed on <strong>the</strong> operation <strong>of</strong> grammar<br />

composition, where <strong>the</strong> str<strong>in</strong>g language result<strong>in</strong>g from <strong>the</strong><br />

composition <strong>of</strong> two grammars G 1 and G 2 is <strong>the</strong> difference,<br />

union or <strong>in</strong>tersection <strong>of</strong> that <strong>of</strong> G 1 and G 2 .<br />

In section 6.4, we recap <strong>the</strong> axiomatization <strong>of</strong> <strong>Context</strong>-<br />

<strong>Free</strong> Grammar (CFG) <strong>of</strong> (Debusmann, 2006), which we<br />

employ <strong>as</strong> our launch pad to go beyond CFG <strong>in</strong> section<br />

6.5. First, we explore <strong>the</strong> relaxation <strong>of</strong> <strong>the</strong> contiguity<br />

criterion <strong>of</strong> CFG, and second, we explore <strong>the</strong> <strong>in</strong>tersection<br />

<strong>of</strong> CFGs. This br<strong>in</strong>gs us <strong>in</strong>to <strong>the</strong> position to formulate a<br />

simple and elegant account <strong>of</strong> German scrambl<strong>in</strong>g loosely<br />

b<strong>as</strong>ed on (Duchier and Debusmann, 2001).<br />

6.2 Extensible Dependency Grammar<br />

XDG models tuples <strong>of</strong> dependency graphs shar<strong>in</strong>g <strong>the</strong><br />

same set <strong>of</strong> nodes, which are anchored by <strong>the</strong> same str<strong>in</strong>g<br />

<strong>of</strong> words. The components <strong>of</strong> <strong>the</strong> tuple are called dimensions,<br />

and XDG analyses multigraphs.<br />

Figure 6.1 shows an example multigraph with two dimensions:<br />

SYN provides a syntactic, and SEM a semantic<br />

analysis <strong>in</strong> terms <strong>of</strong> predicate-argument structure. The<br />

nodes are identified by <strong>in</strong>dices (1 to 6), and <strong>as</strong>sociated<br />

with words (e.g. Mary, wants, etc.). The edge labels on<br />

SYN are subj for “subject”, v<strong>in</strong>f for “full <strong>in</strong>f<strong>in</strong>itive”, part<br />

for “particle”, obj for “object” and adv for “adverb”. On<br />

SEM, ag stands for “agent”, pat for “patient” and th for<br />

“<strong>the</strong>me”.<br />

Contrary to o<strong>the</strong>r dependency-b<strong>as</strong>ed grammar formalisms<br />

such <strong>as</strong> (Gaifman, 1965), XDG dimensions need<br />

not be projective trees, but can <strong>in</strong> fact be general graphs<br />

<strong>as</strong> <strong>in</strong> Word Grammar (Hudson, 1990). An example is <strong>the</strong><br />

SEM dimension <strong>in</strong> Figure 6.1, which is not a tree but a<br />

directed acyclic graph (DAG). Here, to, which does not<br />

have any semantic content, h<strong>as</strong> no ancestor, and Mary,<br />

which is <strong>the</strong> agent <strong>of</strong> both wants and eat, h<strong>as</strong> two.<br />

Multigraphs are constra<strong>in</strong>ed by grammars specify<strong>in</strong>g:<br />

1. A multigraph type determ<strong>in</strong><strong>in</strong>g <strong>the</strong> possible dimensions,<br />

words, edge labels and additional attributes<br />

<strong>as</strong>sociated with <strong>the</strong> nodes called node attributes.<br />

45


46 <strong>Scrambl<strong>in</strong>g</strong> <strong>as</strong> <strong>the</strong> <strong>Comb<strong>in</strong>ation</strong> <strong>of</strong> <strong>Relaxed</strong> <strong>Context</strong>-<strong>Free</strong> <strong>Grammars</strong> <strong>in</strong> a Model-Theoretic Grammar Formalism<br />

SYN:<br />

subj<br />

v<strong>in</strong>f<br />

adv<br />

SYN:<br />

v<strong>in</strong>f<br />

part<br />

obj<br />

1<br />

2<br />

3<br />

4 5<br />

6<br />

part!<br />

obj<br />

adv*<br />

Mary<br />

SEM:<br />

ag<br />

wants<br />

ag<br />

to<br />

th<br />

eat spaghetti<br />

th<br />

pat<br />

today<br />

SEM:<br />

part <<br />

th*<br />

eat<br />

↓<br />

< obj < adv<br />

1<br />

Mary<br />

2<br />

wants<br />

3<br />

to<br />

4 5<br />

eat spaghetti<br />

6<br />

today<br />

ag!<br />

(subj)<br />

eat<br />

pat<br />

(obj)<br />

Figure 6.1: XDG multigraph for Mary wants to eat<br />

spaghetti today.<br />

Figure 6.2: Lexical entry for <strong>the</strong> word eat<br />

2. A lexicon determ<strong>in</strong><strong>in</strong>g a subset <strong>of</strong> <strong>the</strong> node attributes<br />

<strong>of</strong> each node, depend<strong>in</strong>g on <strong>the</strong> <strong>as</strong>sociated word.<br />

3. A set <strong>of</strong> pr<strong>in</strong>ciples stipulat<strong>in</strong>g <strong>the</strong> well-formedness<br />

conditions <strong>of</strong> <strong>the</strong> multigraphs.<br />

XDG is a meta grammar formalism. Instances <strong>of</strong> XDG<br />

are def<strong>in</strong>ed by fix<strong>in</strong>g a multigraph type and a set <strong>of</strong> pr<strong>in</strong>ciples,<br />

and leav<strong>in</strong>g <strong>the</strong> lexicon variable.<br />

XDG pr<strong>in</strong>ciples stipulate e.g. treeness, DAG-ness, projectivity,<br />

valency and order constra<strong>in</strong>ts. They can also<br />

constra<strong>in</strong> <strong>the</strong> relation <strong>of</strong> multiple dimensions, which is<br />

used e.g. <strong>in</strong> <strong>the</strong> l<strong>in</strong>k<strong>in</strong>g pr<strong>in</strong>ciple to constra<strong>in</strong> <strong>the</strong> relation<br />

between arguments on SEM and <strong>the</strong>ir syntactic realization<br />

on SYN. Some pr<strong>in</strong>ciples are lexicalized, i.e., <strong>the</strong>y<br />

constra<strong>in</strong> <strong>the</strong> analysis with respect to <strong>the</strong> lexicon.<br />

The lexicon constra<strong>in</strong>s all dimensions simultaneously,<br />

and <strong>the</strong>reby synchronizes <strong>the</strong>m. Figure 6.2 depicts an example<br />

graphical lexical entry for <strong>the</strong> word eat. On SYN,<br />

by <strong>the</strong> lexicalized valency pr<strong>in</strong>ciple, <strong>the</strong> lexical entry licenses<br />

zero or one <strong>in</strong>com<strong>in</strong>g edges labeled v<strong>in</strong>f, precisely<br />

one part, zero or one obj, arbitrary many adv dependents,<br />

and no o<strong>the</strong>r <strong>in</strong>com<strong>in</strong>g and outgo<strong>in</strong>g edges. By <strong>the</strong> order<br />

pr<strong>in</strong>ciple, <strong>the</strong> part dependents must precede <strong>the</strong> head<br />

eat, which must precede <strong>the</strong> obj and <strong>the</strong> adv dependents.<br />

On SEM, <strong>the</strong> lexical entry licenses arbitrary many <strong>in</strong>com<strong>in</strong>g<br />

th edges, and requires precisely one ag dependent and<br />

zero or one pat dependents (valency pr<strong>in</strong>ciple). It licenses<br />

no o<strong>the</strong>r <strong>in</strong>com<strong>in</strong>g and outgo<strong>in</strong>g edges. The patient must<br />

be realized by <strong>the</strong> object (l<strong>in</strong>k<strong>in</strong>g pr<strong>in</strong>ciple). The realization<br />

<strong>of</strong> <strong>the</strong> agent is not constra<strong>in</strong>ed.<br />

6.2.1 Multigraph<br />

We turn to <strong>the</strong> formalization <strong>of</strong> XDG. Contrary to (Debusmann,<br />

2006), which is higher-order, our formalization<br />

is first-order, and hence called FO XDG. We beg<strong>in</strong> with<br />

multigraphs. Multigraphs are formulated over <strong>the</strong> labeled<br />

dom<strong>in</strong>ance relation. This corresponds to <strong>the</strong> transitive<br />

closure <strong>of</strong> <strong>the</strong> labeled edge relation, where <strong>the</strong> label is <strong>the</strong><br />

label <strong>of</strong> <strong>the</strong> first edge. The purpose <strong>of</strong> <strong>in</strong>clud<strong>in</strong>g this relation<br />

and not <strong>the</strong> labeled edge relation itself is to stay <strong>in</strong><br />

first-order logic: if we <strong>in</strong>cluded only <strong>the</strong> labeled edge relation,<br />

we could not express <strong>the</strong> transitive closure without<br />

extend<strong>in</strong>g <strong>the</strong> logic with fixpo<strong>in</strong>ts or second-order quantification.<br />

Def<strong>in</strong>ition 6.2.1 (Multigraph). Given a f<strong>in</strong>ite set <strong>of</strong> dimensions<br />

D, a f<strong>in</strong>ite set <strong>of</strong> words W , a f<strong>in</strong>ite set <strong>of</strong> edge<br />

labels L, a f<strong>in</strong>ite set <strong>of</strong> attributes A, and a f<strong>in</strong>ite set <strong>of</strong><br />

set types T , a multigraph M = (V,E + ,


Ralph Debusmann 47<br />

Def<strong>in</strong>ition 6.2.4 (Multigraph <strong>of</strong> Multigraph Type). A<br />

multigraph M = (V,E + ,


48 <strong>Scrambl<strong>in</strong>g</strong> <strong>as</strong> <strong>the</strong> <strong>Comb<strong>in</strong>ation</strong> <strong>of</strong> <strong>Relaxed</strong> <strong>Context</strong>-<strong>Free</strong> <strong>Grammars</strong> <strong>in</strong> a Model-Theoretic Grammar Formalism<br />

2. fixed recognition problem (FRP): G is fixed and s is<br />

variable<br />

3. <strong>in</strong>stance recognition problem (IRP): <strong>the</strong> pr<strong>in</strong>ciples<br />

are fixed, and <strong>the</strong> lexicon and s are variable<br />

In (Debusmann, 2007), we prove us<strong>in</strong>g results from<br />

(Vardi, 1982), that <strong>the</strong> URP is PSPACE-complete, <strong>the</strong><br />

FRP and IRP are NP-complete.<br />

6.2.6 Example Pr<strong>in</strong>ciples<br />

We present a number <strong>of</strong> illustrative example pr<strong>in</strong>ciples.<br />

For generality, <strong>the</strong> pr<strong>in</strong>ciples are parametrized by <strong>the</strong> dimensions<br />

that <strong>the</strong>y constra<strong>in</strong>.<br />

Tree pr<strong>in</strong>ciple. Given a dimension d, <strong>the</strong> tree pr<strong>in</strong>ciple<br />

stipulates that 1) <strong>the</strong>re must be no cycles, 2) <strong>the</strong>re<br />

is precisely one node without a mo<strong>the</strong>r (<strong>the</strong> root), 3) all<br />

nodes have zero or one mo<strong>the</strong>rs, and 4) <strong>the</strong>re cannot be<br />

two edges between mo<strong>the</strong>r and daughter:<br />

tree d =<br />

∀v : ¬(v→ + d v) ∧<br />

∃!v : ¬∃v ′ : v ′ → d<br />

v ∧<br />

∀v : (¬∃v ′ : v ′ → d<br />

v) ∨(∃!v ′ : v ′ → d<br />

v) ∧<br />

l<br />

∀v : ∀v ′ : ∀l : v−→ d v ′ ⇒ (∀l ′ : v−→ l′<br />

d v ′ ⇒ l = l ′ )<br />

Projectivity pr<strong>in</strong>ciple. Given a dimension d, <strong>the</strong> projectivity<br />

pr<strong>in</strong>ciple forbids cross<strong>in</strong>g edges by stipulat<strong>in</strong>g<br />

that all nodes positioned between a head and a dependent<br />

must be below <strong>the</strong> head.<br />

projectivity d =<br />

∀v,v ′ :<br />

(v→ d<br />

v ′ ∧ v < v ′ ⇒ ∀v ′′ : v < v ′′ ∧ v ′′ < v ′ ⇒ v→ + d v′′ ) ∧<br />

(v→ d<br />

v ′ ∧ v ′ < v ⇒ ∀v ′′ : v ′ < v ′′ ∧ v ′′ < v ⇒ v→ + d v′′ )<br />

For example, this pr<strong>in</strong>ciple is violated on <strong>the</strong> SEM dimension<br />

<strong>in</strong> Figure 6.1, where wants is positioned between eat<br />

and Mary, but is not below eat.<br />

To expla<strong>in</strong> <strong>the</strong> lexicalized valency, order and l<strong>in</strong>k<strong>in</strong>g<br />

pr<strong>in</strong>ciples, we show an example concrete lexical entry for<br />

eat <strong>in</strong> Figure 6.3, model<strong>in</strong>g <strong>the</strong> graphical lexical entry <strong>in</strong><br />

Figure 6.2.<br />

Valency pr<strong>in</strong>ciple. Given a dimension d, <strong>the</strong> valency<br />

pr<strong>in</strong>ciple constra<strong>in</strong>s <strong>the</strong> <strong>in</strong>com<strong>in</strong>g and outgo<strong>in</strong>g edges <strong>of</strong><br />

each node accord<strong>in</strong>g to <strong>the</strong> lexical attributes <strong>in</strong> and out <strong>of</strong><br />

type 2 (dl d)×{!,+,,∗} , which models <strong>the</strong> function (dl d) →<br />

{!,+,,∗} from edge labels on d to card<strong>in</strong>alities, where<br />

! stands for “one”, + for “more than one”, for “zero or<br />

one”, and ∗ for “arbitrary many”.<br />

valency d =<br />

∀v : ∀l :<br />

((l,!) ∈ (d v).<strong>in</strong> ⇒ ∃!v ′ : v ′ −→ d v) ∧<br />

((l,+) ∈ (d v).<strong>in</strong> ⇒ ∃v ′ : v ′ −→ d v) ∧<br />

l<br />

l<br />

((l,) ∈ (d v).<strong>in</strong> ⇒ ¬∃v ′ : v ′ −→ d v ∨ ∃!v ′ : v ′ −→ d v) ∧<br />

(¬(l,!) ∈ (d v).<strong>in</strong> ∧ ¬(l,+) ∈ (d v).<strong>in</strong> ∧ ¬(l,) ∈ (d v).<strong>in</strong> ∧<br />

¬(l,∗) ∈ (d v).<strong>in</strong> ⇒ ¬∃v ′ : v ′ −→ d v) ∧<br />

((l,!) ∈ (d v).out ⇒ ∃!v ′ : v−→ d v ′ ) ∧<br />

...<br />

l<br />

l<br />

l<br />

l<br />

The rema<strong>in</strong><strong>in</strong>g part <strong>of</strong> <strong>the</strong> pr<strong>in</strong>ciple deal<strong>in</strong>g with <strong>the</strong> outgo<strong>in</strong>g<br />

edges proceeds analogously. Given <strong>the</strong> concrete<br />

lexical entry <strong>in</strong> Figure 6.3, <strong>the</strong> pr<strong>in</strong>ciple constra<strong>in</strong>s node<br />

eat on SYN such that <strong>the</strong>re can be zero or one <strong>in</strong>com<strong>in</strong>g<br />

edges labeled v<strong>in</strong>f, <strong>the</strong>re must be precisely one part dependent,<br />

zero or one obj dependents, arbitrary many adv<br />

dependents, and no o<strong>the</strong>r <strong>in</strong>com<strong>in</strong>g or outgo<strong>in</strong>g edges.<br />

Order pr<strong>in</strong>ciple. Given a dimension d, <strong>the</strong> order pr<strong>in</strong>ciple<br />

constra<strong>in</strong>s <strong>the</strong> order <strong>of</strong> <strong>the</strong> dependents <strong>of</strong> each<br />

node accord<strong>in</strong>g to <strong>the</strong> lexical attribute order <strong>of</strong> type<br />

2 (dl d)×(dl d) . The order attribute models a partial order on<br />

dl d, where we require that dl d <strong>in</strong>cludes <strong>the</strong> special label<br />

↑. The only purpose <strong>of</strong> ↑ is to denote <strong>the</strong> head <strong>the</strong> partial<br />

order specified by <strong>the</strong> order attribute, which is why <strong>the</strong><br />

pr<strong>in</strong>ciple also stipulates that <strong>the</strong>re must not be any edges<br />

labeled with ↑.<br />

order d =<br />

∀v : ∀v ′ : ¬v ↑<br />

−→ d v ′ ∧<br />

∀v : ∀l : ∀l ′ : (l,l ′ ) ∈ (d v).order ⇒<br />

(∀v ′ : l = ↑ ∧ v−→ l′<br />

d v ′ ⇒ v < v ′ ) ∧<br />

(∀v ′ : v−→ l<br />

d v ′ ∧ l ′ = ↑ ⇒ v ′ < v) ∧<br />

(∀v ′ : ∀v ′′ : v−→ l<br />

d v ′ ∧ v−→ l′<br />

d v ′′ ⇒ v ′ < v ′′ )<br />

For <strong>in</strong>stance, given <strong>the</strong> concrete lexical entry <strong>in</strong> Figure<br />

6.3, <strong>the</strong> order pr<strong>in</strong>ciple orders all part dependents to<br />

<strong>the</strong> left <strong>of</strong> <strong>the</strong> head eat, and to <strong>the</strong> left <strong>of</strong> <strong>the</strong> obj and adv<br />

dependents <strong>of</strong> eat. The head is ordered to <strong>the</strong> left <strong>of</strong> its<br />

obj and adv dependents, and <strong>the</strong> obj precede <strong>the</strong> adv dependents.<br />

L<strong>in</strong>k<strong>in</strong>g pr<strong>in</strong>ciple. Given two dimensions d 1 and d 2 ,<br />

<strong>the</strong> l<strong>in</strong>k<strong>in</strong>g pr<strong>in</strong>ciple requires for all edges from v to v ′<br />

labeled l on d 1 that if <strong>the</strong>re is a label l ′ ∈ (d 1 v).l<strong>in</strong>k, <strong>the</strong>n<br />

<strong>the</strong>re must be a correspond<strong>in</strong>g edge from v to v ′ labeled<br />

l ′ on d 2 . The lexical attribute l<strong>in</strong>k <strong>of</strong> type 2 (dl d 1)×(dl d 2 )<br />

models <strong>the</strong> function (dl d 1 ) → 2 (dl d 2) mapp<strong>in</strong>g labels on<br />

d 1 to sets <strong>of</strong> labels on d 2 .<br />

l<strong>in</strong>k<strong>in</strong>g d1 ,d 2<br />

=<br />

∀v : ∀v ′ : ∀l : v−→ d1<br />

v ′ ⇒<br />

l<br />

(∃l ′ : (l,l ′ ) ∈ (d 1 v).l<strong>in</strong>k ⇒ v l′<br />

−→ d2<br />

v ′ )<br />

This is only one <strong>in</strong>stance <strong>of</strong> a family <strong>of</strong> l<strong>in</strong>k<strong>in</strong>g pr<strong>in</strong>ciples.<br />

O<strong>the</strong>rs are presented e.g. <strong>in</strong> (Debusmann, 2006). In<br />

<strong>the</strong> concrete lexical entry <strong>in</strong> Figure 6.3, d 1 = SEM and<br />

d 2 = SYN, and <strong>the</strong> l<strong>in</strong>k<strong>in</strong>g pr<strong>in</strong>ciple stipulates e.g. that<br />

<strong>the</strong> patient <strong>of</strong> eat on SEM must be realized by its object<br />

on SYN.<br />

6.2.7 Example <strong>Grammars</strong><br />

To illustrate how XDG grammars look like, we present<br />

two example grammars. The first, G 1 , models <strong>the</strong> str<strong>in</strong>g<br />

language L 1 <strong>of</strong> equally many <strong>as</strong>, bs and cs, <strong>in</strong> any order:<br />

L 1 = {s ∈ (a ∪ b ∪ c) + | |w| a = |w| b = |w| c }


Ralph Debusmann 49<br />

eat ↦→ ⎧ ⎧<br />

⎪⎨<br />

⎪ ⎨<br />

⎧⎪ SYN : ⎨<br />

⎪ ⎩ ⎪ ⎩<br />

⎪⎩<br />

⎧<br />

⎨<br />

SEM :<br />

⎩<br />

<strong>in</strong> : {(v<strong>in</strong>f,)}<br />

out : {(part,!),(obj,),(adv,∗)}<br />

order : {(part,↑),(part,obj),<br />

(part,adv),(↑,obj),<br />

(↑,adv),(obj,adv)}<br />

<strong>in</strong> : {(th,∗)}<br />

out : {(ag,!),(pat,)}<br />

l<strong>in</strong>k : {(pat,obj)}<br />

⎫<br />

⎬<br />

⎭<br />

⎫<br />

⎪⎬<br />

⎪⎭<br />

⎫<br />

⎪⎬<br />

⎪⎭<br />

, ...<br />

⎫<br />

⎪⎬<br />

⎪⎭<br />

Figure 6.3: Concrete lexical entry for eat<br />

This grammar demonstrates how to do count<strong>in</strong>g. On its<br />

sole dimension called ID (for “immediate dom<strong>in</strong>ance”, <strong>in</strong><br />

analogy to GPSG), we count us<strong>in</strong>g a cha<strong>in</strong> <strong>of</strong> <strong>as</strong>, each <strong>of</strong><br />

which is required to take one b and one c. An example<br />

analysis is depicted <strong>in</strong> Figure 6.4. Here, <strong>the</strong> a with <strong>in</strong>dex<br />

1 builds a cha<strong>in</strong> with <strong>the</strong> a with <strong>in</strong>dex 6. The first a takes<br />

<strong>the</strong> b with <strong>in</strong>dex 3 and <strong>the</strong> c with <strong>in</strong>dex 4, and <strong>the</strong> second<br />

a <strong>the</strong> b with <strong>in</strong>dex 2 and <strong>the</strong> c with <strong>in</strong>dex 5.<br />

LP:<br />

1 2 3 4 5 6 7<br />

a<br />

1<br />

2<br />

a<br />

3<br />

3<br />

b<br />

3<br />

3<br />

c<br />

Figure 6.6: G 2 LP example analysis <strong>of</strong> a a b b c c<br />

c<br />

c<br />

c<br />

ID:<br />

1<br />

a<br />

b<br />

2<br />

b<br />

c<br />

3<br />

b<br />

a<br />

b<br />

4<br />

c<br />

Figure 6.4: G 1 ID example analysis <strong>of</strong> a b b c c a<br />

G 1 makes use <strong>of</strong> <strong>the</strong> tree pr<strong>in</strong>ciple and <strong>the</strong> valency pr<strong>in</strong>ciple,<br />

where <strong>the</strong> latter does <strong>the</strong> count<strong>in</strong>g. The lexicon is<br />

depicted graphically <strong>in</strong> Figure 6.5. The cha<strong>in</strong> <strong>of</strong> <strong>as</strong> is built<br />

by <strong>the</strong> lexical entry for a licens<strong>in</strong>g zero or one <strong>in</strong>com<strong>in</strong>g<br />

and outgo<strong>in</strong>g edges labeled a. In addition, we require<br />

each a to take precisely one b and precisely one c dependent.<br />

The lexical entries for b and c require precisely one<br />

<strong>in</strong>com<strong>in</strong>g edge labeled resp. b and c.<br />

ID:<br />

ID<br />

a<br />

a<br />

b!<br />

c!<br />

ID<br />

5<br />

c<br />

c<br />

b!<br />

6<br />

a<br />

ID<br />

c!<br />

G 2 makes use <strong>of</strong> <strong>the</strong> tree, valency and order pr<strong>in</strong>ciples.<br />

The lexical entries for <strong>the</strong> latter two are depicted <strong>in</strong> Figure<br />

6.7. Here, <strong>the</strong> word a is lexically ambiguous: it can<br />

ei<strong>the</strong>r be a root (leftmost lexical entry), or a dependent<br />

(second from <strong>the</strong> left). As <strong>the</strong> grammar uses <strong>the</strong> tree pr<strong>in</strong>ciple,<br />

only one a will ever become <strong>the</strong> root, <strong>as</strong> which it<br />

licenses arbitrary many 1 dependents, followed by and<br />

one or more 2 dependents, followed by one or more 3<br />

dependents.<br />

LP:<br />

LP<br />

a<br />

↓<br />

1*<br />

2+<br />

< 1 < 2 < 3<br />

3+<br />

LP<br />

1!<br />

a<br />

↓<br />

LP<br />

2! 3!<br />

Figure 6.7: G 2 lexical entries for a, b and c<br />

b<br />

↓<br />

LP<br />

c<br />

↓<br />

a<br />

Figure 6.5: G 1 lexical entries for a, b and c<br />

The second example grammar, G 2 , models <strong>the</strong> str<strong>in</strong>g<br />

language L 2 <strong>of</strong> arbitrary many <strong>as</strong> followed by arbitrary<br />

many bs followed by arbitrary many cs:<br />

L 2 = a + b + c +<br />

With this grammar, we demonstrate how to do order<strong>in</strong>g.<br />

On its sole dimension LP (for “l<strong>in</strong>ear precedence”), <strong>the</strong><br />

idea is for <strong>the</strong> leftmost a to be <strong>the</strong> root, hav<strong>in</strong>g arbitrary<br />

many outgo<strong>in</strong>g edges to arbitrary many o<strong>the</strong>r <strong>as</strong> (labeled<br />

1), and bs (2) and cs (3) to its right. We show an example<br />

analysis <strong>in</strong> Figure 6.6.<br />

b<br />

c<br />

6.3 Closure Properties<br />

In <strong>the</strong> new formalization presented <strong>in</strong> section 6.2, <strong>the</strong><br />

models and hence also <strong>the</strong> str<strong>in</strong>g languages <strong>of</strong> XDGs are<br />

constra<strong>in</strong>ed by FO formul<strong>as</strong>. This suggests that b<strong>as</strong>ic set<br />

operations on <strong>the</strong> str<strong>in</strong>g languages <strong>of</strong> XDGs can all be<br />

expressed, and that <strong>the</strong> XDG str<strong>in</strong>g languages are closed<br />

under <strong>the</strong>se set operations. In this section, we will show<br />

that <strong>the</strong> XDG languages are <strong>in</strong>deed closed under <strong>the</strong> follow<strong>in</strong>g<br />

b<strong>as</strong>ic set operations:<br />

1. difference and complement<br />

2. union<br />

3. <strong>in</strong>tersection


50 <strong>Scrambl<strong>in</strong>g</strong> <strong>as</strong> <strong>the</strong> <strong>Comb<strong>in</strong>ation</strong> <strong>of</strong> <strong>Relaxed</strong> <strong>Context</strong>-<strong>Free</strong> <strong>Grammars</strong> <strong>in</strong> a Model-Theoretic Grammar Formalism<br />

6.3.1 Grammar Composition<br />

For our pro<strong>of</strong>, we take a detour and def<strong>in</strong>e <strong>the</strong> composition<br />

<strong>of</strong> two XDGs.<br />

Def<strong>in</strong>ition 6.3.1 (Grammar Composition). We<br />

def<strong>in</strong>e <strong>the</strong> composition <strong>of</strong> G = G 1 ⊚ G 2 <strong>of</strong> G 1<br />

and G 2 given two grammars G 1 = (MT 1 ,lex 1 ,P 1 )<br />

and G 2 = (MT 2 ,lex 2 ,P 2 ) with multigraph types<br />

MT 1 = (D 1 ,W 1 ,L 1 ,dl 1 ,A 1 ,T 1 ,dat 1 ) and MT 2 =<br />

(D 2 ,W 2 ,L 2 ,dl 2 ,A 2 ,T 2 ,dat 2 ), and a pr<strong>in</strong>ciples composition<br />

operator ◦.<br />

The prerequisites are that 1) <strong>the</strong> sets <strong>of</strong> dimensions<br />

must be disjo<strong>in</strong>t, and 2) <strong>the</strong> sets <strong>of</strong> words must be <strong>the</strong><br />

same.<br />

The result<strong>in</strong>g grammar G = G 1 ⊚ G 2 with<br />

G = (MT,lex,P) h<strong>as</strong> multigraph type MT =<br />

(D,W,L,dl,A,T,dat) with:<br />

D = D 1 ∪ D 2<br />

W = W 1<br />

L = L 1 ∪ L 2<br />

dl = dl 1 ∪ dl 2<br />

A = A 1 ∪ A 2<br />

T = T 1 ∪ T 2<br />

dat = dat 1 ∪ dat 2<br />

The lexicon lex <strong>of</strong> G is def<strong>in</strong>ed such that lex w conta<strong>in</strong>s<br />

<strong>the</strong> product <strong>of</strong> <strong>the</strong> lexical entries for w from G 1 and G 2 ,<br />

for all w ∈ W: 1<br />

lex w = {e 1 ∪ e 2 | e 1 ∈ lex 1 w ∧ e 2 ∈ lex 2 w}<br />

The pr<strong>in</strong>ciples P <strong>of</strong> G are def<strong>in</strong>ed us<strong>in</strong>g <strong>the</strong> pr<strong>in</strong>ciples<br />

composition operator, which comb<strong>in</strong>es <strong>the</strong> conjunction <strong>of</strong><br />

all pr<strong>in</strong>ciples <strong>in</strong> P 1 and <strong>the</strong> conjunction <strong>of</strong> all pr<strong>in</strong>ciples<br />

<strong>in</strong> P 2 :<br />

P = { V<br />

V<br />

◦ φ 2 }<br />

φ 2 ∈P 2<br />

φ 1 ∈P 1<br />

φ 1<br />

Next, we def<strong>in</strong>e what it means for a multigraph to be<br />

restricted to a subset <strong>of</strong> <strong>the</strong> dimensions <strong>of</strong> its multigraph<br />

type.<br />

Def<strong>in</strong>ition 6.3.2 (Multigraph Restriction). Given a<br />

multigraph M = (V,E + ,


6.4. LCFGS AS XDGS 51<br />

6.3.3 Union<br />

Proposition 6.3.6 (Union). The XDG str<strong>in</strong>g languages<br />

are closed under union.<br />

Pro<strong>of</strong>. Given two grammars G 1 and G 2 with str<strong>in</strong>g languages<br />

L G 1 and L G 2 , we can construct a grammar<br />

G = G 1 ∪ G 2 with L G = L G 1 ∪ L G 2 us<strong>in</strong>g grammar<br />

composition with pr<strong>in</strong>ciples composition operator ◦ = ∨:<br />

ID:<br />

ID<br />

LP:<br />

LP<br />

a<br />

a<br />

1*<br />

a<br />

2+<br />

b!<br />

3+<br />

a<br />

c!<br />

ID<br />

a<br />

1!<br />

LP<br />

a<br />

b!<br />

c!<br />

ID<br />

LP<br />

b!<br />

c!<br />

ID<br />

b<br />

c<br />

2! 3!<br />

LP<br />

P =<br />

V<br />

φ 1 ∈P 1<br />

φ 1<br />

∨<br />

V<br />

φ 2 ∈P 2<br />

φ 2<br />

a<br />

↓<br />

< 1 < 2 < 3<br />

a<br />

↓<br />

b<br />

↓<br />

c<br />

↓<br />

Figure 6.9: G 3 lexical entries for a, b and c<br />

6.3.4 Intersection<br />

Proposition 6.3.7 (Intersection). The XDG str<strong>in</strong>g languages<br />

are closed under <strong>in</strong>tersection.<br />

Pro<strong>of</strong>. Given two grammars G 1 and G 2 with str<strong>in</strong>g languages<br />

L G 1 and L G 2 , we can construct a grammar<br />

G = G 1 ∩ G 2 with L G = L G 1 ∩ L G 2 us<strong>in</strong>g grammar<br />

composition with pr<strong>in</strong>ciples composition operator ◦ = ∧:<br />

P =<br />

6.3.5 Example<br />

V<br />

φ 1 ∈P 1<br />

φ 1<br />

∧<br />

V<br />

φ 2 ∈P 2<br />

φ 2<br />

As an example, we present <strong>the</strong> <strong>in</strong>tersection <strong>of</strong> <strong>the</strong> two<br />

grammars G 1 and G 2 from section 6.2 to obta<strong>in</strong> <strong>the</strong> language<br />

L 3 = L 1 ∩ L 2 <strong>of</strong> n <strong>as</strong> followed by n bs followed by<br />

n cs.<br />

L 3 = L 1 ∩ L 2 = {s ∈ a n b n c n | n ≥ 1}<br />

The models <strong>of</strong> G 3 are multigraphs with two dimensions:<br />

<strong>the</strong> dimension ID from G 1 , and <strong>the</strong> dimension LP<br />

from G 2 . ID ensures that <strong>the</strong>re are equally many <strong>as</strong>, bs<br />

and cs, where<strong>as</strong> LP ensures that <strong>the</strong> <strong>as</strong> precede <strong>the</strong> bs precede<br />

<strong>the</strong> cs. We depict an example analysis <strong>in</strong> Figure 6.8.<br />

ID:<br />

1<br />

a<br />

LP:<br />

1<br />

a<br />

a<br />

1<br />

b<br />

2<br />

a<br />

2<br />

2<br />

a<br />

2<br />

c<br />

b<br />

3<br />

b<br />

3<br />

3<br />

b<br />

3<br />

c<br />

4<br />

b<br />

4<br />

b<br />

Figure 6.8: G 3 ID/LP example analysis <strong>of</strong> a a b b c c<br />

The lexicon <strong>of</strong> G 3 is <strong>the</strong> product <strong>of</strong> <strong>the</strong> lexicons <strong>of</strong> G 1<br />

and G 2 . We depict it <strong>in</strong> Figure 6.9. Note that <strong>the</strong> product<br />

construction <strong>of</strong> <strong>the</strong> lexicon yields two lexical entries for a<br />

which are different on LP, but equal on ID.<br />

5<br />

c<br />

5<br />

c<br />

6<br />

c<br />

6<br />

c<br />

6.4 LCFGs <strong>as</strong> XDGs<br />

(Debusmann, 2006) <strong>in</strong>cludes a constructive pro<strong>of</strong> b<strong>as</strong>ed<br />

on (McCawley, 1968) and (Gaifman, 1965) that reformulates<br />

lexicalized CFGs (LCFGs) <strong>as</strong> XDGs. LCFGs are<br />

CFGs where each rule h<strong>as</strong> precisely one term<strong>in</strong>al symbol<br />

on its right hand side. Given an LCFG G, it is e<strong>as</strong>y<br />

to construct an XDG G ′ with one dimension called DERI<br />

(for “derivation tree”). The derivation trees <strong>of</strong> <strong>the</strong> LCFG<br />

stand <strong>in</strong> <strong>the</strong> follow<strong>in</strong>g correspondence to <strong>the</strong> models on<br />

DERI:<br />

1. The non-term<strong>in</strong>al nodes <strong>in</strong> <strong>the</strong> derivation tree correspond<br />

to <strong>the</strong> nodes on DERI.<br />

2. The labels <strong>of</strong> <strong>the</strong> non-term<strong>in</strong>al nodes <strong>in</strong> <strong>the</strong> derivation<br />

tree are represented by <strong>the</strong> <strong>in</strong>com<strong>in</strong>g edge labels<br />

<strong>of</strong> <strong>the</strong> correspond<strong>in</strong>g nodes on DERI, except for <strong>the</strong><br />

root, which h<strong>as</strong> no <strong>in</strong>com<strong>in</strong>g edge.<br />

3. The term<strong>in</strong>al nodes <strong>in</strong> <strong>the</strong> derivation tree correspond<br />

to <strong>the</strong> words on DERI.<br />

We depict an example LCFG derivation tree and <strong>the</strong> correspond<strong>in</strong>g<br />

XDG DERI tree <strong>in</strong> Figure 6.10.<br />

a<br />

a<br />

S<br />

S<br />

B<br />

b<br />

B<br />

b<br />

1<br />

a<br />

S B<br />

Figure 6.10: LCFG derivation tree (left) and correspond<strong>in</strong>g<br />

XDG DERI tree (right)<br />

The constructed XDG grammar uses <strong>the</strong> tree, projectivity,<br />

valency and order pr<strong>in</strong>ciples. The lexicon <strong>in</strong>cludes<br />

for each rule A → B 1 ...B k aB k+1 ...B n (1 ≤ k ≤ n) <strong>of</strong> <strong>the</strong><br />

LCFG, given that each non-term<strong>in</strong>al occurs at most once<br />

on <strong>the</strong> RHS, and given that A is not <strong>the</strong> start symbol, a<br />

lexical entry graphically depicted <strong>in</strong> Figure 6.11. Here,<br />

<strong>the</strong> anchor is <strong>the</strong> term<strong>in</strong>al symbol a <strong>of</strong> <strong>the</strong> RHS <strong>of</strong> <strong>the</strong><br />

2<br />

a<br />

B<br />

3<br />

b<br />

4<br />

b


52 <strong>Scrambl<strong>in</strong>g</strong> <strong>as</strong> <strong>the</strong> <strong>Comb<strong>in</strong>ation</strong> <strong>of</strong> <strong>Relaxed</strong> <strong>Context</strong>-<strong>Free</strong> <strong>Grammars</strong> <strong>in</strong> a Model-Theoretic Grammar Formalism<br />

LCFG rule. We require precisely one <strong>in</strong>com<strong>in</strong>g edge labeled<br />

by <strong>the</strong> LHS <strong>of</strong> <strong>the</strong> rule, i.e., A. 2 As for <strong>the</strong> outgo<strong>in</strong>g<br />

edges, we require precisely one for each non-term<strong>in</strong>al on<br />

<strong>the</strong> RHS <strong>of</strong> <strong>the</strong> rule. The order requirements reflect <strong>the</strong><br />

order among <strong>the</strong> non-term<strong>in</strong>als and <strong>the</strong> anchor.<br />

A!<br />

B1 ! B n !<br />

...<br />

B k ! B k+1 !<br />

S → NP NP VP sah VP → NP VP helfen<br />

VP → NP füttern NP → John<br />

NP → Mary NP → Peter<br />

NP → Tiere<br />

Figure 6.13: LCFG G ID<br />

a<br />

B 1


Ralph Debusmann 53<br />

S<br />

S<br />

S<br />

NP<br />

NP<br />

VP<br />

saw<br />

NP<br />

NP<br />

VP<br />

saw<br />

NP<br />

NP<br />

VP<br />

saw<br />

John<br />

Mary<br />

NP<br />

VP<br />

help<br />

John<br />

Mary<br />

NP<br />

VP<br />

help<br />

NP<br />

VP<br />

John<br />

Mary<br />

help<br />

Peter NP feed<br />

NP<br />

Peter<br />

feed<br />

Peter<br />

NP<br />

feed<br />

animals<br />

animals<br />

animals<br />

Figure 6.14: Derivation trees<br />

we get precisely <strong>the</strong> correct str<strong>in</strong>g language, <strong>the</strong> derivation<br />

trees do not represent <strong>the</strong> syntactic dependencies<br />

between verbs and <strong>the</strong>ir non-verbal dependents, e.g. between<br />

sah and John and Mary. This renders <strong>the</strong> grammar<br />

practically useless: it is impossible to determ<strong>in</strong>e <strong>the</strong> semantics<br />

<strong>of</strong> a sentence without <strong>the</strong>se syntactic dependencies.<br />

6.5.3 Intersect<strong>in</strong>g LCFGs<br />

To recap our two previous ide<strong>as</strong>, relax<strong>in</strong>g G ID lead to<br />

overgeneration, and <strong>the</strong> sole use <strong>of</strong> <strong>the</strong> topological LCFG<br />

G LP made us lose essential syntactic dependencies. Our<br />

third idea is now to <strong>in</strong>tersect G ID and G LP , follow<strong>in</strong>g section<br />

6.3.4. An analysis <strong>of</strong> <strong>the</strong> result<strong>in</strong>g grammar G ID/LP =<br />

G ID ∩ G LP is a pair <strong>of</strong> two derivation trees, or, <strong>in</strong> terms <strong>of</strong><br />

XDG, two dimensions: one derivation tree for G ID called<br />

ID tree, and one derivation tree for G LP called LP tree. We<br />

show an example <strong>in</strong> Figure 6.17.<br />

ID:<br />

NP<br />

animals<br />

LP:<br />

MF<br />

animals<br />

NP<br />

John<br />

MF<br />

John<br />

S<br />

NP<br />

Mary NP<br />

MF<br />

S<br />

Mary<br />

Peter<br />

MF<br />

Peter<br />

VP<br />

VP<br />

feed<br />

VC<br />

feed<br />

saw<br />

help<br />

VC<br />

help<br />

Figure 6.17: Analysis <strong>of</strong> G ID/LP<br />

saw<br />

This idea comb<strong>in</strong>es <strong>the</strong> best <strong>of</strong> both worlds: through<br />

G LP , we avoid overgeneration, and G ID represents <strong>the</strong> essential<br />

syntactic dependencies. That is, <strong>the</strong> two <strong>in</strong>tersected<br />

grammars can be considered <strong>as</strong> “help<strong>in</strong>g out” each<br />

o<strong>the</strong>r.<br />

6.6 Use or Abuse <strong>of</strong> Intersection<br />

A related approach to model scrambl<strong>in</strong>g by <strong>in</strong>tersection<br />

h<strong>as</strong> been put forward <strong>in</strong> <strong>the</strong> context <strong>of</strong> Range Concatenation<br />

<strong>Grammars</strong> (RCG) (Boullier, 2000). Here, <strong>the</strong> structures<br />

generated by <strong>the</strong> two comb<strong>in</strong>ed grammars are correlated<br />

only by <strong>the</strong>ir yields. In his paper “Uses and abuses<br />

<strong>of</strong> <strong>in</strong>tersected languages”, Chiang (2004) observes that<br />

from <strong>the</strong> po<strong>in</strong>t <strong>of</strong> view <strong>of</strong> strong generative capacity, this<br />

use <strong>of</strong> <strong>in</strong>tersection amounts to only constra<strong>in</strong><strong>in</strong>g <strong>the</strong> tail<br />

end <strong>of</strong> o<strong>the</strong>rwise <strong>in</strong>dependent parallel processes, which<br />

he calls weak parallelism. He argues that it is e<strong>as</strong>y to<br />

overestimate how much control this k<strong>in</strong>d <strong>of</strong> parallelism<br />

<strong>of</strong>fers. He argues that <strong>the</strong> treatment <strong>of</strong> scrambl<strong>in</strong>g <strong>in</strong><br />

(Boullier, 2000) is not general enough, <strong>as</strong> it relies on<br />

nonexistent <strong>in</strong>formation <strong>in</strong> <strong>the</strong> surface str<strong>in</strong>g.<br />

Intersection <strong>in</strong> XDG <strong>of</strong>fers more f<strong>in</strong>e-gra<strong>in</strong>ed control<br />

<strong>as</strong> Boullier’s, and we argue that it thus does not fall <strong>in</strong>to<br />

<strong>the</strong> category <strong>of</strong> “abuse”. First, <strong>the</strong> dimensions <strong>of</strong> XDG are<br />

synchronized by <strong>the</strong> <strong>in</strong>put str<strong>in</strong>g and <strong>the</strong> correspond<strong>in</strong>g<br />

nodes, which are shared among all dimensions. Second,<br />

XDG allows to stipulate any number <strong>of</strong> additional constra<strong>in</strong>ts<br />

to correlate <strong>the</strong> two <strong>in</strong>tersected grammars, such<br />

<strong>as</strong> <strong>the</strong> l<strong>in</strong>k<strong>in</strong>g pr<strong>in</strong>ciple. L<strong>in</strong>k<strong>in</strong>g constra<strong>in</strong>ts could e.g.<br />

be used to synchronize <strong>the</strong> rules <strong>of</strong> <strong>the</strong> two comb<strong>in</strong>ed<br />

CFGs. For <strong>in</strong>stance, we could use it to require that specific<br />

rules <strong>in</strong> one <strong>of</strong> <strong>the</strong> comb<strong>in</strong>ed CFGs can only be used<br />

synchronously with specific rules <strong>in</strong> <strong>the</strong> o<strong>the</strong>r CFG, similar<br />

to Multitext grammars (Melamed, 2003), (Melamed<br />

et al., 2004).<br />

6.7 Conclusions<br />

We have shown that XDGs can be comb<strong>in</strong>ed us<strong>in</strong>g grammar<br />

composition, such that <strong>the</strong> str<strong>in</strong>g language <strong>of</strong> <strong>the</strong> result<strong>in</strong>g<br />

grammar is e.g. <strong>the</strong>ir <strong>in</strong>tersection. Us<strong>in</strong>g a model<strong>the</strong>oretic<br />

axiomatization <strong>of</strong> LCFG <strong>in</strong> XDG, we could <strong>the</strong>n<br />

explore <strong>the</strong> relaxation <strong>of</strong> <strong>the</strong> LCFG contiguity criterion,<br />

and, crucially, <strong>the</strong> <strong>in</strong>tersection <strong>of</strong> LCFGs. Toge<strong>the</strong>r, <strong>the</strong>se<br />

two ide<strong>as</strong> lead us to a model <strong>of</strong> one <strong>of</strong> <strong>the</strong> most complicated<br />

phenomena <strong>in</strong> syntax by <strong>the</strong> comb<strong>in</strong>ation <strong>of</strong> two


54 BIBLIOGRAPHY<br />

grammars formulated <strong>in</strong> one <strong>of</strong> <strong>the</strong> simplest <strong>of</strong> all grammar<br />

formalisms.<br />

Acknowledgments<br />

I’d like to thank Pr<strong>of</strong>. Gert Smolka from Programm<strong>in</strong>g<br />

Systems Lab <strong>in</strong> Saarbrücken, <strong>the</strong> people from <strong>the</strong><br />

CHORUS project, and <strong>the</strong> International Graduate College<br />

(IGK) Saarbrücken/Ed<strong>in</strong>burgh for support<strong>in</strong>g my research<br />

over <strong>the</strong> years. I’d also like to thank <strong>the</strong> anonymous<br />

reviewers <strong>of</strong> this paper for <strong>the</strong>ir valuable suggestions.<br />

Bibliography<br />

Apt, Krzyszt<strong>of</strong> R. (2003). Pr<strong>in</strong>ciples <strong>of</strong> Constra<strong>in</strong>t Programm<strong>in</strong>g.<br />

Cambridge University Press.<br />

Blackburn, Patrick and Claire Gardent (1995). A specification<br />

language for Lexical Functional <strong>Grammars</strong>. In<br />

Proceed<strong>in</strong>gs <strong>of</strong> EACL 1995. Dubl<strong>in</strong>/IE.<br />

Boullier, Pierre (2000). Range Concatenation <strong>Grammars</strong>.<br />

In Proceed<strong>in</strong>gs <strong>of</strong> <strong>the</strong> Sixth International Workshop<br />

on Pars<strong>in</strong>g Technologies (IWPT 2000), pp. 53–64.<br />

Trento/IT.<br />

Chiang, David (2004). Uses and abuses <strong>of</strong> <strong>in</strong>tersected<br />

languages. In Proceed<strong>in</strong>gs <strong>of</strong> TAG+7, pp. 9–15. Vancouver/CA.<br />

Debusmann, Ralph (2006). Extensible Dependency<br />

Grammar: A Modular Grammar Formalism B<strong>as</strong>ed On<br />

Multigraph Description. Ph.D. <strong>the</strong>sis, Universität des<br />

Saarlandes.<br />

Debusmann, Ralph (2007). The complexity <strong>of</strong> First-<br />

Order Extensible Dependency Grammar. Technical report,<br />

Saarland University.<br />

Debusmann, Ralph, Denys Duchier, and Joachim Niehren<br />

(2004). The XDG grammar development kit. In<br />

Proceed<strong>in</strong>gs <strong>of</strong> <strong>the</strong> MOZ04 Conference, volume 3389<br />

<strong>of</strong> Lecture Notes <strong>in</strong> Computer Science, pp. 190–201.<br />

Spr<strong>in</strong>ger, Charleroi/BE.<br />

Debusmann, Ralph and Denys Duchier (2007).<br />

XDG Development Kit. Http://www.mozartoz.org/mogul/<strong>in</strong>fo/debusmann/xdk.html.<br />

Duchier, Denys and Ralph Debusmann (2001). Topological<br />

dependency trees: A constra<strong>in</strong>t-b<strong>as</strong>ed account<br />

<strong>of</strong> l<strong>in</strong>ear precedence. In Proceed<strong>in</strong>gs <strong>of</strong> ACL 2001.<br />

Toulouse/FR.<br />

Gaifman, Haim (1965). Dependency systems and phr<strong>as</strong>estructure<br />

systems. Information and Control, 8(3):304–<br />

337.<br />

Gerdes, Kim and Sylva<strong>in</strong> Kahane (2001). Word order<br />

<strong>in</strong> German: A formal dependency grammar us<strong>in</strong>g<br />

a topological hierarchy. In ACL 2001 Proceed<strong>in</strong>gs.<br />

Toulouse/FR.<br />

Hudson, Richard A. (1990). English Word Grammar. B.<br />

Blackwell, Oxford/UK.<br />

Kathol, Andre<strong>as</strong> (1995). L<strong>in</strong>earization-B<strong>as</strong>ed German<br />

Syntax. Ph.D. <strong>the</strong>sis, Ohio State University, Ohio/US.<br />

Kepser, Stephan (2000). A coalgebraic modell<strong>in</strong>g <strong>of</strong><br />

Head-driven Phr<strong>as</strong>e Structure Grammar. In Proceed<strong>in</strong>gs<br />

<strong>of</strong> AMiLP 2000.<br />

Kepser, Stephan and Uwe Mönnich (2003). Graph properties<br />

<strong>of</strong> HPSG feature structures. In Gerhard Jäger,<br />

Paola Monachesi, Gerald Penn, and Shuly W<strong>in</strong>tner,<br />

eds., Proceed<strong>in</strong>gs <strong>of</strong> Formal Grammar 2003, pp. 115–<br />

124.<br />

Kracht, Marcus (1995). Syntactic codes and grammar<br />

ref<strong>in</strong>ement. Journal <strong>of</strong> Language, Logic and Information,<br />

4:41–60.<br />

McCawley, J. D. (1968). Concern<strong>in</strong>g <strong>the</strong> b<strong>as</strong>e component<br />

<strong>of</strong> a Transformational Grammar. Foundations <strong>of</strong><br />

Language, 4:243–269.<br />

Melamed, I. Dan (2003). Multitext grammars and synchronous<br />

parsers. In Proceed<strong>in</strong>gs <strong>of</strong> <strong>the</strong> Human Language<br />

Technology Conference and <strong>the</strong> North American<br />

Association for Computational L<strong>in</strong>guistics (HLT-<br />

NAACL), pp. 158–165. Edmonton/CA.<br />

Melamed, I. Dan, Giorgio Satta, and Benjam<strong>in</strong> Well<strong>in</strong>gton<br />

(2004). Generalized Multitext <strong>Grammars</strong>. In Proceed<strong>in</strong>gs<br />

<strong>of</strong> ACL 2004. Barcelona/ES.<br />

Pullum, Ge<strong>of</strong>frey K. and Barbara C. Scholz (2001).<br />

On <strong>the</strong> dist<strong>in</strong>ction between model-<strong>the</strong>oretic and<br />

generative-enumerative syntactic frameworks. In<br />

Philippe de Groote, Glyn Morrill, and Christian Retoré,<br />

eds., Logical Aspect <strong>of</strong> Computational L<strong>in</strong>guistics:<br />

4th International Conference, Lecture Notes <strong>in</strong><br />

Artificial Intelligence, pp. 17–43. Spr<strong>in</strong>ger, Berl<strong>in</strong>/DE.<br />

Rogers, James (1996). A model-<strong>the</strong>oretic framework for<br />

<strong>the</strong>ories <strong>of</strong> syntax. In Proceed<strong>in</strong>gs <strong>of</strong> ACL 1996.<br />

Rogers, James (2003). Syntactic structures <strong>as</strong> multidimensional<br />

trees. Journal <strong>of</strong> Research on Language<br />

and Computation, 1(3/4).<br />

Rogers, James (2004). On scrambl<strong>in</strong>g, ano<strong>the</strong>r perspective.<br />

In Proceed<strong>in</strong>gs <strong>of</strong> TAG+7. Vancouver/CA.<br />

Schulte, Christian (2002). Programm<strong>in</strong>g Constra<strong>in</strong>t Services,<br />

volume 2302 <strong>of</strong> Lecture Notes <strong>in</strong> Artificial Intelligence.<br />

Spr<strong>in</strong>ger-Verlag.<br />

Vardi, Moshe Y. (1982). The complexity <strong>of</strong> relational<br />

query languages. In Proceed<strong>in</strong>gs <strong>of</strong> <strong>the</strong> fourteenth annual<br />

ACM symposium on Theory <strong>of</strong> Comput<strong>in</strong>g, pp.<br />

137–146. ACM Press, San Francisco/US.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!