Neo4j data science algorithms3 Flashcards

1
Q

Graph Isomorphism

A

A concept in graph theory where two graphs are considered isomorphic if there is a one-to-one correspondence between their nodes and edges that preserves adjacency. For example, graph isomorphism can be used to detect structural similarities in chemical compounds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Graph Partitioning

A

The process of dividing a graph into smaller, disjoint subgraphs while minimizing the number of edges between them. For example, graph partitioning is used in parallel computing to distribute workloads efficiently.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Walktrap Algorithm

A

A community detection algorithm based on random walks, which groups nodes into communities by considering the probability of random walks staying within the same community. For example, Walktrap can identify tightly connected groups in social networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Shortest Path Algorithms

A

Algorithms that find the shortest path between nodes in a graph, such as Dijkstra’s and Bellman-Ford. For example, these algorithms can be used for routing in transportation networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Graph Database Sharding

A

A technique used to distribute a graph database across multiple servers or clusters to improve scalability and performance. For example, sharding can be used to handle large-scale graph data across distributed systems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Triadic Closure

A

The tendency for two nodes with a common neighbor to form a direct connection, a concept used in social network analysis. For example, triadic closure is used to predict the likelihood of friendships forming in social networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Motif Detection

A

The process of identifying recurring subgraph patterns in a graph, which can reveal underlying structural or functional properties. For example, motif detection is used in bioinformatics to identify common interaction patterns in protein networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

GraphQL Cypher

A

A Neo4j extension that combines GraphQL with the Cypher query language to allow flexible and powerful querying of graph data. For example, GraphQL Cypher enables complex queries on graph databases with a simple API.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Bipartite Graph

A

A type of graph where nodes can be divided into two disjoint sets such that no two nodes within the same set are adjacent. For example, a bipartite graph can model relationships between users and the items they purchase.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Neo4j APOC (Awesome Procedures on Cypher)

A

A library of utilities and functions that extend the capabilities of Cypher, Neo4j’s query language. For example, APOC provides functions for data integration, graph algorithms, and data transformation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Personalized PageRank

A

A variation of the PageRank algorithm that computes the importance of nodes with respect to a personalized set of starting nodes. For example, Personalized PageRank can be used in recommendation systems to prioritize items similar to those a user has interacted with.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Betweenness Centrality

A

A measure of a node’s importance based on the number of shortest paths that pass through it, used to identify critical nodes in a network. For example, nodes with high betweenness centrality are often key connectors in communication networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Markov Clustering Algorithm (MCL)

A

A scalable graph clustering algorithm based on flow simulations, which identifies clusters by simulating random walks. For example, MCL is used in bioinformatics for clustering proteins based on their interaction networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

K-Core Decomposition

A

A method for decomposing a graph into subgraphs where each node has at least k connections, used to identify dense regions. For example, k-core decomposition can be used to find influential groups in social networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Dynamic Graphs

A

Graphs that change over time, where nodes and edges can be added or removed, requiring specialized algorithms for analysis. For example, dynamic graph analysis is used to study evolving social networks or transaction networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

GraphBLAS

A

A specification for implementing graph algorithms using linear algebra operations, enabling efficient computations on large graphs. For example, GraphBLAS can be used to optimize graph analytics in high-performance computing environments.

17
Q

Graph Neural Network Explainers

A

Techniques used to interpret the predictions of Graph Neural Networks by identifying important nodes and edges. For example, explainers can help understand why a GNN classified a node in a certain way.

18
Q

Homophily

A

The tendency for nodes in a graph to connect with similar nodes, a common phenomenon in social networks. For example, homophily explains why people with similar interests are more likely to become friends.

19
Q

Graph Convolutional Autoencoders

A

Models that learn graph representations by encoding node features and reconstructing graph structures, used for tasks like anomaly detection. For example, these autoencoders can identify unusual patterns in network traffic data.

20
Q

Neo4j Fabric

A

A feature in Neo4j that enables the querying of multiple databases as if they were a single, federated database. For example, Neo4j Fabric allows seamless integration and querying across distributed graph datasets.

21
Q

Gibbs Sampling for Graphs

A

A Markov Chain Monte Carlo algorithm used to sample from the probability distribution of node labels, often used in Bayesian networks. For example, Gibbs sampling can be used to infer missing labels in a partially labeled graph.