Big Data Lecture 13 Graph Databases Flashcards

Question 1

Q

What properties does a well-defined query language have? Why?

Answer

A

<ul><li>declarative (hidden what it does),</li><li>functional (composable).</li></ul>

So that we can parralelize the query, optimize behind the scenes. It makes it more concise and readable.<br></br>

Question 2

Q

Why do we use graph databases?

Answer

A

So that we can avoid expensive joins on multiple tables when we query!

Question 3

Q

How are data linked in a graph database?

Answer

A

Using pointers, that is fast.

Question 4

Q

What are the two families of graph databases?

Answer

A

Labeled property graphs, and triple stores.

Question 5

Q

How can we represent graph connectivity in memory?

Answer

A

Using adjancency matrix, or incidence graph (nodes x edges: 1 = in, -1 = out, 0 = nothing).

Question 6

Q

How is data stored in Labeled Property Graphs?

Answer

A

Both edges and nodes can have properties (flat table) and labels.

Question 7

Q

What extra datatypes are in Cypher?

Answer

A

Node, Relationship and Path.

Question 8

Q

Is there order between types in neo4j?

Question 9

Q

What does Null mean in Neo4j?

Answer

A

Absent data!

Question 10

Q

How do we query data on graphs?

Answer

A

We use pattern matching for values and edges!

Question 11

Q

How is data sharded in Neo4j?

Answer

A

In overlapping shards, so that we have time to load them when chasing the pointers. But for all we know its all difficult and complicated.

Question 12

Q

How are edges for one node stored in memory?

Answer

A

As a linked list, which allows for fast querying.

Question 13

Q

What are triple stores?

Answer

A

Data is stored in tripples (subject)-(property)-(object), where object can be left empty. And subject, or property cannot be literal.

Question 14

Q

What formats are there for triple store?

Answer

A

<ol><li>RDF/XML,</li><li>Turtle,</li><li>JSON-LD,</li><li>RDFa,</li><li>N-Triples.</li></ol>

Question 15

Q

What are query languages for graph databases?

Answer

A

Cypher and SPARQL.

Question 16

Q

What were graph databases used for in AI historically?

Answer

A

We can build ontologies on top of them, on which we can run logical entailments. For example OWL, or OWL2.