Unsupervised Recursive Sequence Processing - Institute of ...

too much contortion. A global isometric embedding, however, is not possible in general. Interestingly, for all such tessellations a data similarity measure is defined, and a possibly non-isometric visualization in the 2D plane can be achieved. While 6 neighbors per neuron lead to standard Euclidean triangular meshes, for a grid with 7 neighbors or more the graph becomes part of the 2-dimensional hyperbolic plane. As already mentioned, exponential neighborhood growth is possible, and hence an adequate data representation can be expected for the visualization of domains with a high connectivity of the involved objects. SOM with hyperbolic neighborhood (HSOM) has already proved well-suited for text representation, as demonstrated for a non-recursive model in [29].

3 SOM for sequences (SOM-S)

In the following, we introduce the adaptation of SOMSD for sequences and the general triangular grid structure, SOM for sequences (SOM-S). Standard SOMs operate on a rectangular neuron grid embedded in a real-valued vector space. More flexibility for the topological setup can be obtained by describing the grid in terms of a graph: neural connections are realized by assigning each neuron a set of direct neighbors. The distance of two neurons is given by the length of a shortest path within the lattice of neurons; each edge is assigned the unit length 1. The number of neighbors may vary (also within a single map). Fewer than 6 neighbors per neuron lead to a subsiding neighborhood, resulting in graphs with small numbers of nodes. Choosing more than 6 neighbors per neuron yields, as argued above, an exponential increase of the neighborhood size, which is convenient for representing sequences with potentially exponential context diversification.
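The graph-based distance just described can be computed directly by a breadth-first search over the neighbor lists, with every edge counting as length 1. The following Python sketch illustrates this; the function name and the tiny example lattice (a center neuron with a closed ring of 6 neighbors) are our own illustration, not part of the paper:

```python
from collections import deque

def grid_distance(adj, a, b):
    """Shortest-path distance between neurons a and b on the
    neighborhood graph; every edge has unit length 1."""
    if a == b:
        return 0
    seen = {a}
    queue = deque([(a, 0)])
    while queue:
        node, d = queue.popleft()
        for nb in adj[node]:
            if nb == b:
                return d + 1
            if nb not in seen:
                seen.add(nb)
                queue.append((nb, d + 1))
    return float("inf")  # neurons in disconnected components

# Tiny example lattice: neuron 0 in the center, neurons 1-6
# forming a closed ring around it (so the center has 6 neighbors).
adj = {
    0: [1, 2, 3, 4, 5, 6],
    1: [0, 2, 6], 2: [0, 1, 3], 3: [0, 2, 4],
    4: [0, 3, 5], 5: [0, 4, 6], 6: [0, 5, 1],
}
print(grid_distance(adj, 1, 4))  # prints 2 (shortest path runs via the center)
```

On this example, neurons 1 and 4 sit on opposite sides of the ring, so the path along the ring has length 3 but the path through the center neuron has length 2.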

Unlike standard SOM or HSOM, we do not assume that a distance-preserving embedding of the lattice into the two-dimensional plane, or into another globally parameterized two-dimensional manifold with global metric structure such as the hyperbolic plane, exists. Rather, we assume that the distance of neurons within the grid is computed directly on the neighborhood graph, which might be obtained by any non-overlapping triangulation of the topological two-dimensional plane. 4 For our experiments, we have implemented a grid generator for a circular triangle meshing around a center neuron, which requires the desired number of neurons and the neighborhood degree n as parameters. Neurons at the lattice edge possess fewer than n neighbors, and if the chosen total number of neurons does not fill up the outer neuron circle, the neurons there are connected to others in a maximally symmetric way. Figure 1 shows a small map with 7 neighbors for the inner neurons and a total of 29 neurons perfectly filling up the outer edge. For ≥ 7 neighbors, the exponential neighborhood increase can be observed, for which an embedding into

4 Since the lattice is fixed during training, these values have to be computed only once.
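As the footnote notes, the lattice is fixed during training, so the neuron distances can be precomputed once, e.g. by one breadth-first search per neuron, and then merely looked up by the neighborhood function during training. A minimal Python sketch under these assumptions follows; the Gaussian kernel is the usual SOM choice rather than something prescribed by the text, and all names and the small example lattice are illustrative:

```python
from collections import deque
import math

def all_grid_distances(adj):
    """All-pairs shortest-path distances on the neuron lattice
    (unit edge length), via one breadth-first search per neuron.
    The lattice is fixed during training, so this runs only once."""
    dist = {}
    for start in adj:
        d = {start: 0}
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for nb in adj[node]:
                if nb not in d:
                    d[nb] = d[node] + 1
                    queue.append(nb)
        dist[start] = d
    return dist

def neighborhood_strength(dist, winner, neuron, sigma):
    """Gaussian neighborhood on the precomputed lattice distance
    (a standard SOM kernel, assumed here for illustration)."""
    return math.exp(-dist[winner][neuron] ** 2 / (2.0 * sigma ** 2))

# Small hand-built lattice: center neuron 0 with a closed ring 1-6.
adj = {
    0: [1, 2, 3, 4, 5, 6],
    1: [0, 2, 6], 2: [0, 1, 3], 3: [0, 2, 4],
    4: [0, 3, 5], 5: [0, 4, 6], 6: [0, 5, 1],
}
dist = all_grid_distances(adj)  # computed once, reused every epoch
```

During training, `neighborhood_strength(dist, winner, neuron, sigma)` then weights each neuron's update by its lattice distance to the winner without any further graph traversal.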

