25.10.2016 Views

SAP HANA Predictive Analysis Library (PAL)

sap_hana_predictive_analysis_library_pal_en

sap_hana_predictive_analysis_library_pal_en

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Expected Result<br />

DOCTOPICDIST_TBL:<br />

3.1.11 Self-Organizing Maps<br />

Self-organizing feature maps (SOMs) are one of the most popular neural network methods for cluster analysis.<br />

They are sometimes referred to as Kohonen self-organizing feature maps, after their creator, Teuvo Kohonen,<br />

or as topologically ordered maps. SOMs aim to represent all points in a high-dimensional source space by<br />

points in a low-dimensional (usually 2-D or 3-D) target space, such that the distance and proximity<br />

relationships are preserved as much as possible. This makes SOMs useful for visualizing low-dimensional<br />

views of high-dimensional data, akin to multidimensional scaling.<br />

SOMs can also be viewed as a constrained version of k-means clustering, in which the cluster centers tend to<br />

lie in low-dimensional manifold in the feature or attribute space. The learning process mainly includes three<br />

steps:<br />

1. Initialize the weighted vectors in each unit.<br />

2. Select the Best Matching Unit (BMU) for every point and update the weighted vectors of BMU and its<br />

neighbors.<br />

3. Repeat Step 2 until convergence or the maximum iterations are reached.<br />

An important variant is batch SOM, which updates the weighted vectors only at the end of every learning<br />

epoch. It requires that the whole set of training data is present, and is independent on the order of input<br />

vectors.<br />

The SOM approach has many applications such as virtualization, web document clustering, and speech<br />

recognition.<br />

Prerequisites<br />

●<br />

●<br />

The first column of the input data is an ID column and the other columns are of integer or double data<br />

type.<br />

The input data does not contain null value. The algorithm will issue errors when encountering null values.<br />

<strong>SAP</strong> <strong>HANA</strong> <strong>Predictive</strong> <strong>Analysis</strong> <strong>Library</strong> (<strong>PAL</strong>)<br />

<strong>PAL</strong> Functions P U B L I C 103

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!