Unsupervised Recursive Sequence Processing - Institute of ...

In a trained map, neurons spread in regions of the data space where a high sample density can be observed, resulting in large U-values at the borders between clusters. Consequently, the U-Matrix forms a 3D landscape on the lattice of neurons, with valleys corresponding to meaningful clusters and hills at the cluster borders. The U-Matrix of weight vectors can also be constructed for SOM-S. Based on this matrix, the sequence entries can be clustered into meaningful categories, from which Markov models can be extracted as described above. Note that the U-Matrix is built using only the weights assigned to the neurons, while the context information of SOM-S is ignored so far (footnote 6). However, since context information is used for training, clusters emerge which are meaningful with respect to the temporal structure, and in this way they contribute implicitly to the topological ordering of the map and to the U-Matrix. Partially overlapping, noisy, and ambiguous input elements are separated during training, because the different temporal contexts contain enough information to activate and produce characteristic clusters on the map. Thus, the temporal structure captured by the training allows a reliable reconstruction of the input sequences, which could not have been achieved by the standard SOM architecture.

5 Experiments

5.1 Mackey-Glass time series

The first task is to learn the dynamics of the real-valued chaotic Mackey-Glass time series

$$\frac{dx}{d\tau} = b\,x(\tau) + \frac{a\,x(\tau - d)}{1 + x(\tau - d)^{10}}$$

using a = 0.2, b = −0.1, d = 17. This is the same setup as given in [41], making a comparison of the results possible (footnote 7). Three types of maps with 100 neurons have been trained: a 6-neighbor map without context, giving the standard SOM; a map with 6 neighbors and with context (SOM-S); and a 7-neighbor map providing a hyperbolic grid with context utilization (H-SOM-S).
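For readers who want to reproduce a comparable input signal, the following is a minimal sketch of how such a series could be generated numerically. It is not taken from the paper: the function name `mackey_glass`, the explicit Euler scheme with step `dt = 0.1`, the constant initial history `x0 = 1.2`, and the discarded transient are all assumptions made for illustration. Only a, b, d and the sample period of 3 come from the text.

```python
import numpy as np

def mackey_glass(n_samples, a=0.2, b=-0.1, d=17.0, dt=0.1,
                 sample_period=3.0, x0=1.2, discard=1000):
    """Euler integration of dx/dtau = b*x(tau) + a*x(tau-d) / (1 + x(tau-d)**10).

    dt, x0 and discard are illustrative assumptions, not values from the paper.
    """
    delay_steps = int(round(d / dt))           # delay d expressed in Euler steps
    stride = int(round(sample_period / dt))    # Euler steps between two samples
    total = (discard + n_samples) * stride     # number of Euler steps to take
    # Constant initial history on [-d, 0] (assumption).
    x = np.full(delay_steps + 1 + total, x0, dtype=float)
    for t in range(delay_steps, delay_steps + total):
        x_del = x[t - delay_steps]             # delayed value x(tau - d)
        x[t + 1] = x[t] + dt * (b * x[t] + a * x_del / (1.0 + x_del ** 10))
    # Drop the history and the transient, then keep every `stride`-th value.
    start = delay_steps + 1 + discard * stride
    return x[start::stride][:n_samples]

series = mackey_glass(500)
```

A higher-order integrator (for example a Runge-Kutta scheme with interpolated delay terms) would be more accurate; plain Euler is used here only to keep the sketch short.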
Each run has been computed with 1.5 · 10^5 presentations starting at random positions within the Mackey-Glass series, using a sample period of ∆t = 3; the neuron weights have been initialized with white noise within [0.6, 1.4]. The context has been taken into account by decreasing the parameter from η = 1 to η = 0.97. The learning rate is decreased exponentially from 0.1 to 0.005 for both the weight and the context update. The initial neighborhood cooperativity is 10, which is annealed to 1 during training.

Footnote 6: Preliminary experiments indicate that the context also orders topologically and yields meaningful clusters. However, the number of neurons in context clusters is small compared to the total number of neurons, and statistically significant results could not be obtained.

Footnote 7: We would like to thank T. Voegtlin for providing data for comparison.

Figure 2 shows the temporal quantization error for the above setups: the temporal quantization error is expressed by the average standard deviation of the given sequence and the mean unit receptive field for 29 time steps into the past. Similar to Voegtlin's results, we observe large cyclic oscillations driven by the periodicity of the training series for standard SOM. Since SOM does not take contextual information into account, this quantization result can be seen as an upper bound for temporal models, at least for the indices > 0 reaching into the past (trivially, SOM is a very good quantizer of scalar elements without history). The oscillating shape of the curve is explained by the continuity of the series and its quasi-periodic dynamics; the extrema are due to the nature of the series rather than to special model properties. Obviously, the very restricted context of RSOM does not yield a long-term improvement of the temporal quantization error. However, the displayed error periodicity is anti-cyclic compared to the original series. Interestingly, the data-optimal topology of neural gas (NG), which also does not take contextual information into account, allows a reduction of the overall quantization error; however, the main characteristics, such as the periodicity, remain the same as for standard SOM.

RecSOM leads to a much better quantization error than RSOM and NG. The error is minimal for the immediate past (left side of the diagram) and increases when going back in time, which is reasonable because of the weighting of the context influence by (1 − η). The increase of the quantization error is smooth, and the final value after 29 time steps is better than the default given by standard SOM. In addition, almost no periodicity can be observed for RecSOM. SOM-S and H-SOM-S further improve the results: only some periodicity can be observed, and the overall quantization error increases smoothly for the past values. Note that these models are superior to RecSOM in this task while requiring less computational power.
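The temporal quantization error discussed above can be illustrated with a short sketch. The code below implements one plausible reading of the definition: for each lag into the past, it measures the root mean squared deviation between the past value and the mean past value over the receptive field of the current winner. The function name `temporal_quantization_error`, the toy sine series, and the context-free scalar codebook are hypothetical and serve only as a stand-in for a trained map.

```python
import numpy as np

def temporal_quantization_error(series, winners, max_lag=29):
    """Per-lag deviation of past inputs from the mean receptive field of the winner.

    series[t] is the scalar input at time t, winners[t] the index of the winning unit.
    """
    series = np.asarray(series, dtype=float)
    winners = np.asarray(winners)
    t = np.arange(max_lag, len(series))        # time steps with a full history
    errors = np.zeros(max_lag + 1)
    for lag in range(max_lag + 1):
        past = series[t - lag]                 # input observed `lag` steps earlier
        sq_dev = np.zeros_like(past)
        for unit in np.unique(winners[t]):
            mask = winners[t] == unit          # receptive field of this unit
            sq_dev[mask] = (past[mask] - past[mask].mean()) ** 2
        errors[lag] = np.sqrt(sq_dev.mean())
    return errors

# Hypothetical usage: a context-free scalar quantizer on a noisy sine wave.
rng = np.random.default_rng(0)
series = np.sin(0.3 * np.arange(2000)) + 0.05 * rng.standard_normal(2000)
codebook = np.linspace(-1.0, 1.0, 10)          # stand-in for trained scalar weights
winners = np.abs(series[:, None] - codebook[None, :]).argmin(axis=1)
errors = temporal_quantization_error(series, winners)
```

Because the toy quantizer ignores history, its error is small at index 0 and grows for past indices, which mirrors the qualitative behavior described for models without context.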
H-SOM-S allows a slightly better representation of the immediate past compared to SOM-S, due to the hyperbolic topology of the lattice structure, which better matches the characteristics of the input data.

[Figure 2: plot of the quantization error (vertical axis, 0 to 0.2) against the index of past inputs (horizontal axis, 0 to 30; index 0: present) for SOM∗, RSOM∗, NG, RecSOM∗, SOM-S, and H-SOM-S.]

Fig. 2. Temporal quantization errors of different model setups for the Mackey-Glass series. Results indicated by ∗ are taken from [41].