NoSQL Flashcards
Which is a limit of relational databases?
Joins can be costly.
Which type of NoSQL database is based on sets of nodes and edges between nodes?
Graph database
Which is not an advantage of NoSQL databases?
Support for joins
T/F. Expressiveness of query language is important to data science tasks.
TRUE
T/F. Data quality should be taken into account during data preparation.
TRUE
What technique is used to visualize correlations between two variables?
Scatter plots
Which is not a type of data set used in data science model building?
What markup language can be used for sharing models between tools?
Predictive Modelling Markup Language
CSV/TSV
JSON
Python
MongoDB
cursor
a pointer to a record - helpful when an entire result set can’t fit into memory
outlier
find a dirty data set
Documents consist of keys and values.
TRUE
In JSON structures, keys are usually what data type?
String
csv.DictReader Python function can be used to read tabular data into a Python dictionary.
TRUE
Which driver is used in Python to work with MongoDB?
pymongo
When converting data to MongoDB format, we can use Python dictionary functions.
TRUE
Data type conversions rarely need to be performed during data loads.
FALSE
In MongoDB, what can we create to reduce the time needed to find a document based on the value of an attribute?
Index
Data frames provide what type of abstract structure?
Table