Connectionist Modeling of Experience-based Effects in Sentence ...
Chapter 3. Connectionist Modelling of Language Comprehension
vides much contextual information helping to interpret current input. The context of an utterance has a great influence on ambiguity resolution and predictions of content.
There have been some accounts of providing connectionist networks with temporal representation of an explicit nature, which posed limits to the number and richness of representations. Elman (1990) describes a simple way to provide a connectionist network with memory, the simple recurrent network (SRN, figure 3.1). The hidden representations in the network are copied into a so-called context layer, which influences the hidden representations in the next step through weighted activation feeding. This memory loop works without any explicit representation of time or of relations between input chunks; it is the iterative procedure of copying and feeding back itself that produces temporal relations on an implicit level. Because every copy of the activation pattern has been influenced by all earlier copies, the contextual memory reaches into the "past" in a continuously graded way over several input steps. The information of earlier input representations is still present in the representation as a trace, but newer input has more influential power. Elman (1990) writes:

"In this account, memory is neither passive nor a separate subsystem. One cannot properly speak of a memory for sequences; that memory is inextricably bound up with the rest of the processing mechanism."

This very simple way of supplying memory yields architecturally determined properties that can abstractly be described as storage limitations, memory span, or decay of memorized representations over time. These are properties explicitly accounted for in symbolic models like ACT-R or CC-READER.
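One forward step of the SRN described above can be sketched as follows. This is a minimal illustration, not Elman's implementation: the layer sizes and random weight initialization are assumptions, while the one-for-one copy into the context layer and the initial context activation of 0.5 follow Elman's (1990) description.

```python
import numpy as np

rng = np.random.default_rng(0)

n_in, n_hid, n_out = 4, 8, 4
W_ih = rng.normal(scale=0.1, size=(n_hid, n_in))   # trainable: input -> hidden
W_ch = rng.normal(scale=0.1, size=(n_hid, n_hid))  # trainable: context -> hidden
W_ho = rng.normal(scale=0.1, size=(n_out, n_hid))  # trainable: hidden -> output

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def srn_step(x, context):
    # Input units and context units jointly activate the hidden units ...
    hidden = sigmoid(W_ih @ x + W_ch @ context)
    # ... which feed forward to the output units ...
    output = sigmoid(W_ho @ hidden)
    # ... while the hidden pattern is copied one-for-one (fixed weight 1.0)
    # into the context layer for the next time step.
    return output, hidden.copy()

# Process a sequence: the context layer carries an implicit, graded trace
# of all earlier inputs, without any explicit encoding of time.
context = np.full(n_hid, 0.5)   # context units initially set to 0.5 (Elman, 1990)
for x in rng.normal(size=(3, n_in)):
    y, context = srn_step(x, context)
```

Note that the copy connections carry no trainable parameters; only the dashed connections of figure 3.1 (here `W_ih`, `W_ch`, `W_ho`) would be adjusted during learning.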
Figure 3.1: Architecture of a simple recurrent network (SRN, Elman, 1990). The solid line represents fixed one-to-one connections to the context layer. Dashed lines represent trainable connections.
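The graded, decaying trace described above can also be demonstrated numerically: perturbing only the first input of a sequence and tracking how far the context state drifts at each later step shows that earlier inputs leave a trace that fades as newer input dominates. Sizes and weights below are again illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n_in, n_hid = 4, 8
W_ih = rng.normal(scale=0.1, size=(n_hid, n_in))
W_ch = rng.normal(scale=0.1, size=(n_hid, n_hid))

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def run(inputs):
    """Return the context state after each input of the sequence."""
    context = np.full(n_hid, 0.5)
    states = []
    for x in inputs:
        context = sigmoid(W_ih @ x + W_ch @ context)
        states.append(context)
    return states

seq = rng.normal(size=(6, n_in))
seq_perturbed = seq.copy()
seq_perturbed[0] += 1.0          # change only the first input

# Distance between the two context trajectories at each step: largest
# right after the perturbed input, shrinking over subsequent steps.
drift = [np.linalg.norm(a - b)
         for a, b in zip(run(seq), run(seq_perturbed))]
```

With small recurrent weights the loop is contractive, so the trace of the perturbed first input never vanishes exactly but is progressively overwritten by newer input, which is the architecturally determined decay mentioned above.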