Overlapping community structure detection in large-scale networks

Community-detection, Apache-spark, Graph-mining, Large-scale-networks, Bigclam, Machine-learning, Scala


This is my undergraduate thesis. In this project, I give a general overview of communities and their real-life applications. With the rapid growth of network scale in recent years, detecting overlapping communities in large-scale networks has become a difficult task even for state-of-the-art methods. To address this, the BigCLAM method is implemented on the Apache Spark framework, chosen for its strength in distributed parallel computation.

GitHub: BigCLAM-ApacheSpark

This project is referenced in awesome-community-detection as an implementation of the method from the paper "Overlapping Community Detection at Scale: a Nonnegative Matrix Factorization Approach" (WSDM 2013).
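
For readers unfamiliar with the method, the model from the cited paper can be summarized as follows. This is a sketch in the paper's notation, not a description of this project's code: F_u is the nonnegative community-affiliation vector of node u, E is the edge set, and N(u) is the set of neighbors of u.

```latex
% Probability that nodes u and v are connected
p(u, v) = 1 - \exp\left(-F_u F_v^{\top}\right), \qquad F_u \in \mathbb{R}_{\ge 0}^{K}

% Log-likelihood of the observed graph
\ell(F) = \sum_{(u,v) \in E} \log\left(1 - \exp\left(-F_u F_v^{\top}\right)\right)
          - \sum_{(u,v) \notin E} F_u F_v^{\top}

% Gradient with respect to one row F_u (all other rows held fixed)
\nabla \ell(F_u) = \sum_{v \in N(u)} F_v \,
                   \frac{\exp\left(-F_u F_v^{\top}\right)}{1 - \exp\left(-F_u F_v^{\top}\right)}
                   - \sum_{v \notin N(u)} F_v

% The non-neighbor sum is computed from a cached global sum
\sum_{v \notin N(u)} F_v = \sum_{v} F_v - F_u - \sum_{v \in N(u)} F_v
```

The last identity is what makes the approach practical on large graphs: each row update only needs the neighbors of u plus one shared sum over all rows, so the per-node cost depends on the node's degree rather than on the size of the network. Each row is updated by projected gradient ascent, with negative entries clipped to zero.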

