neo4j glossary of terms Flashcards

1
Q

allocator (cluster)

A

A component in the cluster that allocates databases to servers according to the topology constraints specified and an allocation strategy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

asynchronous replication (cluster)

A

Asynchronous replication is used by secondary copies to poll for new transactions, which means they cannot be guaranteed to have received the most recent transactions. This enables efficient scale-out of read-performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Aura instance

A

A fully-managed database represented by a single DBID, that is running in the Neo4j Aura cloud.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

auto-commit transaction

A

An automatically committed transaction that contains a single query.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Bolt protocol

A

Bolt is a protocol used for interaction between Neo4j instances and drivers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

bookmark

A

A marker the client can request from the cluster to ensure that it is able to read its own writes so that the application’s state is consistent and only databases that have a copy of the bookmark are permitted to respond.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

category (Bloom)

A

A category is based on a node label and is defined in a Perspective as a way of visually distinguishing nodes with the same label(s).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

causal consistency

A

All servers in a cluster agree on the order in which transactions take place. The position of a server on the causal chain can be guaranteed using a bookmark.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

cluster

A

A Neo4j DBMS that spans multiple servers working together to increase fault tolerance and/or read scalability. Databases on a cluster may be configured to replicate across servers in the cluster thus achieving read scalability or high availability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

client application

A

Software that interacts with a Neo4j server.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

commit

A

A commit is the successful completion of a transaction, which ensures durability of any changes made. For more details, visit Operations Manual → Transaction management.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Composite database

A

Composite databases are the means to access partitioned graph data with a single Cypher query.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

constraint

A

Constraints are sets of data modeling rules that ensure the data is consistent and reliable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Core server (Neo4j 4)

A

A server in a cluster operating in read/write mode. This is replaced in Neo4j 5 by the database-level configuration of primary and secondary databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Cypher®

A

Neo4j’s graph query language.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

data model

A

A data model defines how information is organized in a database. A good data model will make querying and understanding your data easier. In Neo4j, the data models have a graph structure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

database

A

A database is a container used by the DBMS to manage and store graph data. The physical structure of data is controlled by the database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

database vs graph

A

Databases are the physical containers of graph data. Graphs are the logical structure of data in Neo4j.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Database Management System

A

Database Management System, or DBMS, capable of managing multiple databases. A DBMS may run on a single server, or span several servers configured as a cluster.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

database schema

A

The prescribed property existence and datatypes for nodes and relationships.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

deallocate (cluster)

A

An act of removing a database from a server or a server from a cluster without loss of data or reduced fault tolerance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

degree (of a node)

A

The number of relationships of a specific node; loops are counted twice.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

disaster recovery (cluster)

A

A manual intervention to restore availability of a cluster, or databases within a cluster.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

driver

A

A software library that provides access to Neo4j from a particular programming language.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

election (cluster)

A

In the event that the Raft leader becomes unresponsive, followers automatically trigger an election and vote for a new leader.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

entity

A

A node or a relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

expression (Cypher)

A

A component of a Cypher query which produces values. It may be used in projections, as a predicate, or when setting properties on graph elements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

fabric

A

Fabric is the architectural design of a unified system that provides a single access point to local or distributed graph data.

29
Q

fault tolerance (cluster)

A

A guarantee that a cluster can maintain a database’s persistence and availability in the event of one or more servers failing.

30
Q

follower (cluster)

A

A primary copy of a database acting as a follower, receives and acknowledges synchronous writes from the leader.

31
Q

Generative AI (GenAI)

A

A type of artificial intelligence (AI) system that generates text, images, or other media in response to prompts.

32
Q

Generative Pre-Trained Transformer (GPT)

A

A type of GenAI model that combines two forms of training to produce foundation models. Specifically: Pre-Training: Training general purpose capabilities on vast quantities of data. Fine-Tuning: Training a finite number of supervised ML tasks on a small amount of hand-picked data.

33
Q

graph

A

A logical representation of a set of nodes where some pairs are connected by relationships.

34
Q

index

A

Data structure that improves read performance of a database.

35
Q

knowledge graph

A

A specific type of graph that has an organizing principle so that a user (or a computer system) can reason about the underlying data. The organizing principle provides an additional layer of structure that adds context to support knowledge discovery.

36
Q

label

A

Marks a node as a member of a named and indexed subset. A node may be assigned zero or more labels.

37
Q

Language Model (LM)

A

An ML approach that models the probability distribution over a sequence of words. Predicts the probabilities of next word/character in a sequence. Applications in GenAI as well as embedding, classification, and other ML tasks.

38
Q

Large Language Model (LLM)

A

LMs consisting of large neural networks (billions of parameters) trained on large quantities of data often using self-supervised/semi-supervised approaches. Trained for general tasks and currently seen as the “GenAI for language/text”.

39
Q

LLM Hallucination

A

Language models generate text that is incorrect, nonsensical, or unreal. Appear to answer questions confidently even if they do not have facts. May provide contradicting or inconsistent responses to similar prompts.

40
Q

leader (cluster)

A

A single primary copy of a database is designated as the leader. It receives all write transactions from clients and replicates writes synchronously to followers and asynchronously to secondary copies of the database.

41
Q

motif

A

A description of a specific pattern within a graph.

42
Q

node

A

A node represents an entity or discrete object in your graph data model. Nodes can be connected by relationships, hold data in properties, and are classified by labels.

43
Q

operator

A

A symbol representing a mathematical or logical operation.

44
Q

parameter

A

Named value provided when running a Cypher statement.

45
Q

path

A

A sequence of nodes and the relationships connecting them, that does not contain duplicate relationships. Several paths can match a pattern.

46
Q

pattern

A

A specific arrangement of nodes and relationships that can be matched in a graph. A pattern follows a motif.

47
Q

perspective (Bloom)

A

A Perspective defines a certain business view or domain that can be found in the target Neo4j graph. A single Neo4j graph can be viewed through different Perspectives, each tailored for a different business purpose.

48
Q

primary (cluster)

A

A copy of the database that is able to process write transactions and is eligible to be elected as a leader. It participates in fault tolerant writes as it is part of the majority required to acknowledge and commit write transactions.

49
Q

primary vs secondary (cluster)

A

In a cluster, databases can operate in either primary or secondary mode. Primary databases are able to process write and read transactions, ensuring fault tolerance. Secondary databases are replicated asynchronously from primaries, and their main purpose is to provide read scaling within the cluster.

50
Q

property

A

Properties are key-value pairs that are used for storing data on nodes and relationships.

51
Q

query (Cypher)

A

A statement that retrieves or writes information to a database.

52
Q

Raft group

A

A group of servers that are participating in hosting a particular database in primary mode.

53
Q

Raft group member

A

A server that is participating in a Raft group. A server can be a member of one or more groups.

54
Q

Raft log

A

A shared log between all Raft group members that is guaranteed to be consistently updated and viewed by those members. The log contains both database data and operational state of the Raft group.

55
Q

Raft protocol

A

The networking mechanism that enables a database to replicate its data across multiple servers to give high availability for accessing the data and high durability to the data stored.

56
Q

Read Replica (Neo4j v4)

A

A server in a cluster operating in read-only mode. This is replaced in Neo4j 5 by the database-level configuration of primary and secondary databases.

57
Q

read scaling

A

Distributing query load by creating additional database copies hosted in secondary mode (read-only).

58
Q

relationship

A

A relationship represents a connection between nodes in your graph data model. Relationships connect a source node to a target node, hold data in properties, and are classified by type.

59
Q

secondary (cluster)

A

An asynchronously replicated copy of the database that provides read scaling within the cluster.

60
Q

seed (cluster)

A

A seed is a database dump or a full backup used to create a database on a cluster. This is sometimes called seeding.

61
Q

server

A

A physical machine, a virtual machine, or a container running an instance of Neo4j. Servers can be standalone or part of a cluster.

62
Q

session

A

A causally linked sequence of transactions.

63
Q

session consistency

A

An alternative name for Neo4j’s causal consistency.

64
Q

standalone

A

A single server running Neo4j and not part of a cluster.

65
Q

system database

A

A database used by Neo4j to store system information.

66
Q

synchronous replication (cluster)

A

Synchronous replication requires the leader primary to replicate a transaction and block the commit until a quorum of the follower primaries acknowledges that the transaction is successfully replicated. Once the transaction is replicated, the commit is allowed to proceed. This ensures data durability and consistency within the cluster.

67
Q

tenant (Aura)

A

An isolated environment that contains its own database instances, configurations, and resources.

68
Q

topology (cluster)

A

A configuration that describes how the copies of a database should be spread across the servers in a cluster, see primary mode and secondary mode.

69
Q

transaction

A

A transaction comprises a unit of work performed against a database. It is treated in a coherent and reliable way, independent of other transactions. Transactions comply with the AC