Data-driven cluster reinforcement and visualization in sparsely-matched self-organizing maps

IEEE Trans Neural Netw Learn Syst. 2012 May;23(5):846-52. doi: 10.1109/TNNLS.2012.2190768.

Abstract

A self-organizing map (SOM) is a self-organized projection of high-dimensional data onto a typically 2-dimensional (2-D) feature map, wherein vector similarity is implicitly translated into topological closeness in the 2-D projection. However, when there are more neurons than input patterns, it can be challenging to interpret the results, due to diffuse cluster boundaries and limitations of current methods for displaying interneuron distances. In this brief, we introduce a new cluster reinforcement (CR) phase for sparsely-matched SOMs. The CR phase amplifies within-cluster similarity in an unsupervised, data-driven manner. Discontinuities in the resulting map correspond to between-cluster distances and are stored in a boundary (B) matrix. We describe a new hierarchical visualization of cluster boundaries displayed directly on feature maps, which requires no further clustering beyond what was implicitly accomplished during self-organization in SOM training. We use a synthetic benchmark problem and previously published microbial community profile data to demonstrate the benefits of the proposed methods.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Data Interpretation, Statistical
  • Environmental Monitoring / methods*
  • Information Storage and Retrieval / methods*
  • Neural Networks, Computer*
  • Pattern Recognition, Automated / methods*
  • Water Microbiology*