Detecting Communities Flashcards
What is a graph partitioning problem?
Detecting two communities
Name the various methods used to solve for graph partitioning
Girvan-Newman algorithm - exploits betweenness centrality
Minimum cut - optimal division
Modularity Maximization - identify communities by varying the division
Hierarchical Clustering
Generative Model
What do we use? Laplacian
How do we find the partition to a Laplacian Network?
The partition is determined by the eigenvector associated with the first non-trivial eigenvalue
How do we find the partition to a Laplacian Network?
The partition is determined by the eigenvector associated with the first non-trivial eigenvalue
How do we find the optimal solution for cut size (R)?
- Create a vector, s, in which if an element is part of group 1, it equals +1, in group 2 it equals -1.
- Using calc identities, we can reduce it such that the goal is to find an s that minimizes r
What is the first constraint on the vector s when optimizing?
1.The possible vector s must lie on any of the vertices of the hypercube, abs(s) = sqrt(n) where n is the number of dimensions
What is the second constraint on the vector s when optimizing?
- The summation of the s elements must equal the difference between the number of nodes in each community (n1 - n2).
R (the cut size) is proportional to one of the eigenvalues in the matrix. How do we minimize R then?
Choose the smallest non-trivial eigenvalue which is lamba_2.