NoSQL Flashcards

1
Q

Which is a limit of relational databases?

A

Joins can be costly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which type of NoSQL database is based on sets of nodes and edges between nodes?

A

Graph database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which is not an advantage of NoSQL databases?

A

Support for joins

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

T/F. Expressiveness of query language is important to data science tasks.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

T/F. Data quality should be taken into account during data preparation.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What technique is used to visualize correlations between two variables?

A

Scatter plots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Which is not a type of data set used in data science model building?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What markup language can be used for sharing models between tools?

A

Predictive Modelling Markup Language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

CSV/TSV

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

JSON

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Python

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

MongoDB

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

cursor

A

a pointer to a record - helpful when an entire result set can’t fit into memory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

outlier

A

find a dirty data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Documents consist of keys and values.

18
Q

In JSON structures, keys are usually what data type?

19
Q

csv.DictReader Python function can be used to read tabular data into a Python dictionary.

20
Q

Which driver is used in Python to work with MongoDB?

21
Q

When converting data to MongoDB format, we can use Python dictionary functions.

22
Q

Data type conversions rarely need to be performed during data loads.

23
Q

In MongoDB, what can we create to reduce the time needed to find a document based on the value of an attribute?

24
Q

Data frames provide what type of abstract structure?

25
Embedded documents are used to avoid joins.
TRUE
26
What is the top level data structure in Cassandra called?
Keyspace
27
Duplicating data in wide column databases is uncommon.
FALSE
28
Which command is used to specify the nodes in a cluster to connect to?
Cluster
29
One way to catch errors when preparing data is to use which Python statement?
Except
30
Spark and Cassandra can run on the same cluster nodes.
TRUE
31
A Cassandra data model should be based on how you will query the database.
TRUE
32
Graphs consists of rows and tables
FALSE
33
Directed edges are not used in hierarchical relations.
FALSE
34
Using separate files for nodes and edges can simplify data loading.
TRUE
35
What driver is used with Python to work with Neo4j databases?
Py2Neo
36
What data structure can be used to map from nodes and edges to a table data structure?
DataFrame
37
Graphs are especially useful for modeling networks like social networks and road systems.
True
38