Laplacian mixture modeling for network analysis and unsupervised learning on graphs

PLoS One. 2018 Oct 1;13(10):e0204096. doi: 10.1371/journal.pone.0204096. eCollection 2018.

Abstract

Laplacian mixture models identify overlapping regions of influence in unlabeled graph and network data in a scalable and computationally efficient way, yielding useful low-dimensional representations. By combining Laplacian eigenspace and finite mixture modeling methods, they provide probabilistic or fuzzy dimensionality reductions or domain decompositions for a variety of input data types, including mixture distributions, feature vectors, and graphs or networks. Optimal recovery by the algorithm is proven analytically for a nontrivial class of cluster graphs. Heuristic approximations for scalable high-performance implementations are described and empirically tested. Connections to PageRank and community detection in network analysis demonstrate the wide applicability of this approach. The origins of fuzzy spectral methods, beginning with generalized heat or diffusion equations in physics, are reviewed and summarized. Comparisons to other dimensionality reduction and clustering methods for challenging unsupervised machine learning problems are also discussed.
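As a rough illustration of what "overlapping regions of influence" can look like on a graph, the following Python sketch computes soft (fuzzy) memberships from personalized PageRank diffusion. It is not the Laplacian mixture model algorithm from the paper; it only shows the related PageRank-style idea the abstract alludes to. The toy graph (two triangles joined by one bridge edge), the seed nodes, the restart probability `alpha`, and the function names `personalized_pagerank` and `soft_memberships` are all assumptions made for this example.

```python
def personalized_pagerank(adj, seed, alpha=0.15, iters=200):
    """Power iteration for a random walk that restarts at `seed`.

    adj   -- adjacency list of an undirected graph (every node has a neighbor)
    alpha -- restart probability (an assumed, commonly used value)
    """
    n = len(adj)
    p = [0.0] * n
    p[seed] = 1.0
    for _ in range(iters):
        nxt = [0.0] * n
        for u, nbrs in enumerate(adj):
            # Each node passes (1 - alpha) of its mass evenly to its neighbors.
            share = (1.0 - alpha) * p[u] / len(nbrs)
            for v in nbrs:
                nxt[v] += share
        # The remaining alpha of the total mass returns to the seed.
        nxt[seed] += alpha
        p = nxt
    return p


def soft_memberships(adj, seeds, alpha=0.15):
    """Normalize the per-seed diffusion scores at each node so they sum to 1."""
    scores = [personalized_pagerank(adj, s, alpha) for s in seeds]
    memberships = []
    for node in range(len(adj)):
        total = sum(col[node] for col in scores)
        memberships.append([col[node] / total for col in scores])
    return memberships


# Hypothetical toy graph: triangle {0, 1, 2}, triangle {3, 4, 5}, bridge 2-3.
adj = [[1, 2], [0, 2], [0, 1, 3], [2, 4, 5], [3, 5], [3, 4]]
memberships = soft_memberships(adj, seeds=[0, 5])

for node, row in enumerate(memberships):
    print(node, [round(x, 3) for x in row])
```

Each row gives one node's fractional membership in the region around each seed. Nodes deep inside a triangle lean strongly toward their own seed, while the bridge nodes 2 and 3 receive more mixed values, which is the kind of fuzzy, overlapping assignment the abstract describes.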

MeSH terms

  • Algorithms
  • Cluster Analysis*
  • Finite Element Analysis
  • Fuzzy Logic
  • Models, Statistical
  • Unsupervised Machine Learning*

Grants and funding

The author(s) received no specific funding for this work.