SOM tutorial part 1

A SOM does not need a target output to be specified unlike many other types of network. Instead, where the node weights match the input vector, that area of the lattice is selectively optimized to more closely resemble the data for the class the input vector is a member of. From an initial distribution of random weights, and over many iterations, the SOM eventually settles into a map of stable zones. Each zone is effectively a feature classifier, so you can think of the graphical output as a type of feature map of the input space. If you take another look at the trained network shown in figure 1, the blocks of similar colour represent the individual zones. Any new, previously unseen input vectors presented to the network will stimulate nodes in the zone with similar weight vectors.

Now let's take a look at each step in detail. I'll supplement my descriptions with extracts from the source code where appropriate. Hopefully, for those of you who can program, the code will help reinforce your understanding of the principles involved.

Prior to training, each node's weights must be initialized. Typically these will be set to small standardized random values. The weights in the SOM from the accompanying code project are initialized so that 0 < w < 1. Nodes are defined in the source code by the class CNode. Here are the relevant parts of that class:

class CNode

{

private:

//this node's weights

vector<double> m_dWeights;

//its position within the lattice

double m_dX,

m_dY;

//the edges of this node's cell. Each node, when draw to the client

//area, is represented as a rectangular cell. The colour of the cell

//is set to the RGB value its weights represent.

int m_iLeft;

int m_iTop;

int m_iRight;

int m_iBottom;

public:

CNode(int lft, int rgt, int top, int bot, int NumWeights):m_iLeft(lft),

m_iRight(rgt),

m_iBottom(bot),

m_iTop(top)

{

//initialize the weights to small random variables

for (int w=0; w<NumWeights; ++w)

{

m_dWeights.push_back(RandFloat());

}

//calculate the node's center

m_dX = m_iLeft + (double)(m_iRight - m_iLeft)/2;

m_dY = m_iTop + (double)(m_iBottom - m_iTop)/2;

}

...

};

You can see that the weights are initialized automatically by the constructor when a CNode object is instantiated. The member variables m_iLeft, m_iRight, m_iTop and m_iBottom are only used to render the network to the display area - each node is represented by a rectangular cell described by these values, as shown previously in Figure 3.

To determine the best matching unit, one method is to iterate through all the nodes and calculate the Euclidean distance between each node's weight vector and the current input vector. The node with a weight vector closest to the input vector is tagged as the BMU.

where V is the current input vector and W is the node's weight vector. In code this equation translates to:

double CNode::GetDistance(const vector<double> &InputVector)

{

double distance = 0;

for (int i=0; i<m_dWeights.size(); ++i)

{

distance += (InputVector[i] - m_dWeights[i]) * (InputVector[i] - m_dWeights[i]);

}

return sqrt(distance);

}

As an example, to calculate the distance between the vector for the colour red (1, 0, 0) with an arbitrary weight vector (0.1, 0.4, 0.5)