XQueries Flashcards

Question

Consistency

Answer 1

Every read receives the most recent write, or an error.

Answer 2

More capacity by adding new nodes.

Answer 3

Often achieved by very simple interface.

Answer 4

Even if nodes fail, the remaining subnetworks can continue their work.

Answer 5

States we cannot achieve consistency, availability and partition tolerance all at the same time.

Answer 6

Single node DBMS.

Answer 7

NoSQL databases.

Answer 8

NoSQL databases.

Answer 9

An alternative of ACID to compensate for the CAP theorem. Stands for Basically Available, Soft state, Eventually consistent.

Answer 10

Rather than enforcing consistency, it will ensure availability of data instead.

Answer 11

The database state might occasionally be inconsistent but will eventually be made consistent.

Answer 12

- key-value stores - document store - column stores - graph databases

Answer 13

Given a key and value, we can insert data into a database. Given a key, we can find a value in a database.

Answer 14

- Apache Cassandra - Amazon DynamoDB - Apache Voldemort - Memcached - Redis - Riak

Answer 15

Each key value pair (k, v) is stored at some node.

Answer 16

- Assign values v for key k to integer between 0 and (2^n)-1, in which (2^n)-1 gives us the amount of space to store places for nodes, as well as duplicates for these nodes. We do the hash function for these numbers. - Distribute nodes to some of the integers (typically random). - If (k,v) is assigned to integer i, then store at node following i.

Answer 17

Can be done easily with horizontal fragmentation, aka we split C horizontally and store it at different locations. Then, any key-value pairs that would have been in C that are stored in A are now transferred over to C.

Answer 18

Ensures availability, storing copies of the key-value pairs on multiple nodes. For example, if we have a key-value pair which is stored in the north-eastern A node, if A receives a duplicate then it will be stored in B. If we receive multiple duplicates, we store one at each consecutive node.

Answer 19

- Scalability -> simple adding via horizonal fragmentation - Availability and fault-tolerance -> via replication. - High performance -> apply a hash function to determine anode, then ask the node. Same with writing.

Answer 20

One of the problems with key-value stores is that we cannot ensure consistency due to CAP Theorem. Therefore, we provide eventual consistency, which allows multiple versions of data item to be present at the same time (versioning). If the newer version is not available, the older one is updated and used instead.

Answer 21

Database which stores a collection of documents. Document is essentially semi-structured data associated with an object id. These documents are typically represented in JSON.

Answer 22

For simplicity reasons, the difference is syntax alone. XML: Anna 57904 JSON: {"students":[ {"name " : "Anna", "number" : 57904}, ... ]}

Answer 23

- creating/managing collections - insert/update/delete documents - finding documents - indexing documents

Answer 24

db.createCollection("students")

Answer 25

db.students.insert({name: "Anna"})

Answer 26

db.studuents.find({name: "Anna"})

Answer 27

db.students.createIndex({name: 1})

Answer 28

- horizontal fragmentation -> collections are split into horizontal fragments based upon shard key, which is the indexed field in all documents. - replication -> horizontal fragments of collections are replicated.

Answer 29

Hard to explain, but examples include Google Bigtable and Apache HBase.

Answer 30

- creating tables - inserting rows - finding documents

Answer 31

create 'STUDENT', 'Name', 'ID'

Answer 32

put 'STUDENT', 'row1', 'Name:Fname','Anna'

Answer 33

get 'STUDENT', 'row1' scan 'STUDENT'

Answer 34

Fragmentation is split into two: - Top level -> rows are divided into regions. - Bottom level -> regions store different column families in different nodes.

Answer 35

Each item has a timestamp and one can access past versions of the database if setup.

Answer 36

Simply, stores data as a graph. Data is accessed using SQL-like path query language. Also implements indexes.