This is a my undergraduate thesis. In project, I provide a general view of communities and its the real life applications. In recent years, with the rapid growth of network scale, it is a difficult task to detect overlapping communities in large-scale networks for state of the art methods. This method is implemented in the Apache Spark framework for its power in distributed parallel computation.
This project was referenced in awesome-community-detection regarding the method: Overlapping Community Detection at Scale: a Nonnegative Matrix Factorization Approach (WSDM 2013)
Every comment would be appreciated.
If you want to use parts of any code of mine:
let me know and use it!