DBSCAN Flashcards
What the density based do ?
locates cluster even if they do not belong regions of high density, and in any
separates outliers
What is the two parameters in DBScan
MinPts
š(epsilon):
What is š(epsilon) ?
š(epsilon): Maximum radius of the neighborhood for some point pā¢ Neighborhood of p: points within a radius of from a point p
What is minpts?
Minimum number of points in an neighborhood of a point
Give me the Characterization of points
Core point
Border point
Noise point
What the diff between core and border ?
Core has more than specific number of minPts
Border has fewer than minPts but has core in region
Direct density reachable
The link between core and any point in region of core point
Density-reachable
The link between core to another point in diffrent cluster but most be connect to other core points to access to the point
Density-connected
The link between two point in different cluster but most be between them core point
Why the epsilon important?
If is very small their will not core point
If very large the results in all point forming single cluster
How determining EPS epsilon?
Support have minPts=4 then the EPS is the distance of 4 neighbour
The pors of DBScan
Does not require the number of clusters a priori.
āŖ it can capture arbitrary shape cluster
āŖ it can identify (noises)
Cons of DBScan
1-Fails in case of varying density clusters.
2- Does not work well of high dimensional data.
3-Sensitive to Parameters.