NoSQL Flashcards

1
Q

Which is a limitation of relational databases?

A

Joins can be costly.

2
Q

Which type of NoSQL database is based on sets of nodes and edges between nodes?

A

Graph database

3
Q

Which is not an advantage of NoSQL databases?

A

Support for joins

4
Q

T/F. The expressiveness of a query language is important to data science tasks.

A

TRUE

5
Q

T/F. Data quality should be taken into account during data preparation.

A

TRUE

6
Q

What technique is used to visualize correlations between two variables?

A

Scatter plots

7
Q

Which is not a type of data set used in data science model building?

A
8
Q

What markup language can be used for sharing models between tools?

A

Predictive Model Markup Language (PMML)

9
Q
A
10
Q

CSV/TSV

A
11
Q

JSON

A
12
Q

Python

A
13
Q

MongoDB

A
14
Q

cursor

A

A pointer to a record in a result set; helpful when the entire result set can't fit into memory.
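
A minimal sketch of reading through a cursor in small batches, so the full result set never has to sit in memory at once. It uses Python's built-in sqlite3 module purely for illustration; the table name `readings`, the helper `iter_rows`, and the batch size are assumptions, not part of the original card.

```python
import sqlite3

def iter_rows(cursor, batch_size=2):
    """Yield rows one at a time while fetching them from the cursor in small batches."""
    while True:
        batch = cursor.fetchmany(batch_size)
        if not batch:
            break
        yield from batch

# Hypothetical in-memory table used only for this example.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE readings (id INTEGER PRIMARY KEY, value REAL)")
conn.executemany(
    "INSERT INTO readings (value) VALUES (?)",
    [(v,) for v in (1.5, 2.5, 3.0, 4.0, 5.0)],
)

cursor = conn.execute("SELECT id, value FROM readings ORDER BY id")
# Only one batch of rows is held in memory at any time.
total = sum(value for _id, value in iter_rows(cursor))
conn.close()
```

The same pattern applies to other database drivers that expose cursors, although method names and batching behavior vary by driver.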

15
Q

outlier

A

A value that lies far from the rest of the data; outliers can help find a dirty data set.

16
Q
A
17
Q

Documents consist of keys and values.

A

TRUE

18
Q

In JSON structures, keys are what data type?

A

String
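
A short illustration using Python's standard json module: a non-string dictionary key is converted to a string when the dictionary is encoded as JSON. The sample record is made up for this sketch.

```python
import json

# A Python dict may use an integer key, but JSON object keys are strings.
record = {1: "one", "name": "Ada"}

encoded = json.dumps(record)   # the integer key 1 is written as "1"
decoded = json.loads(encoded)  # keys come back as Python str
```

After the round trip, `decoded` is keyed by `"1"` rather than `1`, which is worth remembering when comparing loaded data with the original.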

19
Q

The csv.DictReader class in Python can be used to read tabular data into Python dictionaries, one per row.

A

TRUE
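
A minimal sketch of csv.DictReader in use. The data is read from an in-memory string through io.StringIO so the example is self-contained; the column names and values are invented for illustration.

```python
import csv
import io

# Stand-in for a CSV file on disk; the first line supplies the dictionary keys.
raw = "name,city,age\nAda,London,36\nGrace,New York,45\n"

reader = csv.DictReader(io.StringIO(raw))
rows = list(reader)  # one dict per data row

first = rows[0]
```

Every value comes back as a string (here `first["age"]` is `"36"`), so numeric columns still need converting before analysis.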

20
Q

Which driver is used in Python to work with MongoDB?

A

pymongo

21
Q

When converting data to MongoDB format, we can use Python dictionary functions.

A

TRUE
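
One way this might look, as a sketch: ordinary dictionary operations (copying, `pop`, `del`) reshape a flat row into a nested document. The helper `row_to_document` and the field names are hypothetical.

```python
def row_to_document(row):
    """Reshape a flat row (e.g. from csv.DictReader) into a nested document."""
    doc = dict(row)  # copy so the caller's row is not modified
    # Group the address columns into one embedded sub-document.
    doc["address"] = {"city": doc.pop("city"), "country": doc.pop("country")}
    # Drop keys whose value is an empty string rather than storing blanks.
    for key in [k for k, v in doc.items() if v == ""]:
        del doc[key]
    return doc

row = {"name": "Ada", "city": "London", "country": "UK", "nickname": ""}
document = row_to_document(row)
```

With pymongo, a dictionary shaped like `document` could then be passed to a collection's `insert_one` method; that step is left out here because it needs a running server.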

22
Q

Data type conversions rarely need to be performed during data loads.

A

FALSE
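
A small sketch of why conversions come up so often: values read from text files arrive as strings and usually need converting before loading. The function `convert_types` and the column names are assumptions made for this example.

```python
from datetime import date

def convert_types(row):
    """Convert the string fields of one raw row to more useful Python types."""
    return {
        "name": row["name"],                          # already a string
        "age": int(row["age"]),                       # "36" -> 36
        "height_m": float(row["height_m"]),           # "1.65" -> 1.65
        "joined": date.fromisoformat(row["joined"]),  # "2020-01-15" -> date
    }

raw_row = {"name": "Ada", "age": "36", "height_m": "1.65", "joined": "2020-01-15"}
typed_row = convert_types(raw_row)
```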

23
Q

In MongoDB, what can we create to reduce the time needed to find a document based on the value of an attribute?

A

Index
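
A conceptual sketch of what an index buys you, written in plain Python rather than against MongoDB itself. A dictionary stands in for the index here; MongoDB's actual indexes are B-tree structures, and the documents and helper names below are invented for illustration.

```python
from collections import defaultdict

documents = [
    {"_id": 1, "name": "Ada", "city": "London"},
    {"_id": 2, "name": "Grace", "city": "New York"},
    {"_id": 3, "name": "Alan", "city": "London"},
]

def scan(docs, field, value):
    """Without an index: inspect every document."""
    return [doc for doc in docs if doc[field] == value]

def build_index(docs, field):
    """Build a lookup from each field value to the documents that hold it."""
    index = defaultdict(list)
    for doc in docs:
        index[doc[field]].append(doc)
    return index

city_index = build_index(documents, "city")  # built once, reused per query
via_index = city_index["London"]             # direct lookup, no full scan
via_scan = scan(documents, "city", "London")
```

In pymongo, the corresponding step would be a call such as `collection.create_index("city")` on a real collection.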

24
Q

Data frames provide what type of abstract structure?

A

Table

25
Q

Embedded documents are used to avoid joins.

A

TRUE
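
A sketch contrasting the two layouts with plain Python dictionaries standing in for documents. The customer and order data are hypothetical.

```python
# Normalized layout: customers and orders are stored separately,
# so the application has to match them up (a join) at read time.
customers = {1: {"name": "Ada"}}
orders = [
    {"customer_id": 1, "item": "keyboard"},
    {"customer_id": 1, "item": "monitor"},
]
joined_items = [o["item"] for o in orders if o["customer_id"] == 1]

# Embedded layout: the orders live inside the customer document,
# so a single read returns everything with no join.
customer_doc = {
    "_id": 1,
    "name": "Ada",
    "orders": [{"item": "keyboard"}, {"item": "monitor"}],
}
embedded_items = [o["item"] for o in customer_doc["orders"]]
```

Both layouts yield the same items; the embedded version trades some duplication and larger documents for a simpler read.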

26
Q

What is the top level data structure in Cassandra called?

A

Keyspace

27
Q

Duplicating data in wide column databases is uncommon.

A

FALSE

28
Q

Which command is used to specify the nodes in a cluster to connect to?

A

Cluster

29
Q

One way to catch errors when preparing data is to use which Python statement?

A

except (as part of a try/except block)
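
A minimal sketch of catching conversion errors during data preparation, so that one bad value does not stop the whole load. The function name `parse_ages` and the sample values are assumptions.

```python
def parse_ages(values):
    """Convert raw values to int, setting aside anything that fails."""
    clean, rejected = [], []
    for raw in values:
        try:
            clean.append(int(raw))
        except (ValueError, TypeError):
            # ValueError: text that is not a number; TypeError: e.g. None.
            rejected.append(raw)
    return clean, rejected

clean, rejected = parse_ages(["36", "n/a", "45", None, " 7 "])
```

Keeping the rejected values, rather than silently dropping them, makes it easier to judge data quality afterwards.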

30
Q

Spark and Cassandra can run on the same cluster nodes.

A

TRUE

31
Q

A Cassandra data model should be based on how you will query the database.

A

TRUE
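
A conceptual sketch in plain Python, not Cassandra code: the same readings are stored twice, each copy organized around one query. The dictionaries stand in for two query-specific tables, and the names `readings_by_sensor` and `readings_by_day` are hypothetical.

```python
from collections import defaultdict

readings = [
    {"sensor": "s1", "day": "2024-01-01", "temp": 20.5},
    {"sensor": "s2", "day": "2024-01-01", "temp": 19.0},
    {"sensor": "s1", "day": "2024-01-02", "temp": 21.0},
]

# One "table" per query: each is keyed by what that query filters on.
readings_by_sensor = defaultdict(list)  # query: all readings for a sensor
readings_by_day = defaultdict(list)     # query: all readings for a day

for r in readings:
    # Every reading is written to both structures (deliberate duplication).
    readings_by_sensor[r["sensor"]].append(r)
    readings_by_day[r["day"]].append(r)

s1_readings = readings_by_sensor["s1"]
new_year_readings = readings_by_day["2024-01-01"]
```

Each query becomes a single key lookup, at the cost of writing the data more than once.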

32
Q

Graphs consist of rows and tables.

A

FALSE

33
Q

Directed edges are not used in hierarchical relations.

A

FALSE
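
A small sketch of why direction matters in a hierarchy: each edge below points from an employee to that person's manager, and following the edges upward gives the chain of command. The names and the helper `chain_of_command` are invented for this example.

```python
# Directed edges: (employee, manager). Reversing a pair would change its meaning.
reports_to = [("alice", "carol"), ("bob", "carol"), ("carol", "dana")]
manager_of = dict(reports_to)

def chain_of_command(person):
    """Follow the directed edges upward until someone has no manager."""
    chain = []
    while person in manager_of:
        person = manager_of[person]
        chain.append(person)
    return chain

alice_chain = chain_of_command("alice")
dana_chain = chain_of_command("dana")
```

Here `dana` sits at the top of the hierarchy, so her chain is empty, while `alice` reports up through `carol` to `dana`.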

34
Q

Using separate files for nodes and edges can simplify data loading.

A

TRUE
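
A sketch of the two-file approach, with in-memory strings standing in for a nodes file and an edges file. Loading nodes first means each edge can be checked against known node ids. The column names (`id`, `source`, `target`, `type`) are assumptions.

```python
import csv
import io

# Stand-ins for nodes.csv and edges.csv.
nodes_csv = "id,name\n1,Ada\n2,Grace\n3,Alan\n"
edges_csv = "source,target,type\n1,2,KNOWS\n2,3,KNOWS\n"

# Step 1: load all nodes, keyed by id.
nodes = {row["id"]: row for row in csv.DictReader(io.StringIO(nodes_csv))}

# Step 2: load edges, verifying that both endpoints exist.
edges = []
for row in csv.DictReader(io.StringIO(edges_csv)):
    if row["source"] not in nodes or row["target"] not in nodes:
        raise ValueError(f"edge refers to an unknown node: {row}")
    edges.append((row["source"], row["target"], row["type"]))
```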

35
Q

What driver is used with Python to work with Neo4j databases?

A

Py2Neo

36
Q

What data structure can be used to map from nodes and edges to a table data structure?

A

DataFrame

37
Q

Graphs are especially useful for modeling networks like social networks and road systems.

A

TRUE

38
Q
A