Cosmos DB Flashcards
atom-record-sequence
Core type system for CDB engine. CDB translates your data models into ARS
- atom - primitive types
- record - structs
- sequences - arrays
What are the consistency levels of CosmosDB?
- Strong
- Bounded Staleness
- Session
- Consistent Prefix
- Graph-based data
What do the different CosmosDB APIs allow you to do?
Use your existing API calls with CosmosDB. For example, CQL code to access Apache Cassandra DB can be simply ported over for use against CosmosDB
Difference between Azure Table storage and CosmosDB?
Azure Table storage is the predecessor of CosmosDB
Use cases for CosmosDB
Web, mobile, gaming, IoT apps that handle massive amounts of reads and writes at a global scale
Graph database
- Persist relationships in the storage layer
- Both entities(Nodes, AKA vertices) and their relationships(Edges, AKA Labels) to other entities are stored together using Properties(Somewhat like tags)
When Product A. is purchased, products C and D are usually purchased with it. You want to use these to make recommendations to your customers. What is the CosmosDB database/API type that should be used?
Graph
What does “multi-model” database service mean?
There are more than one data access methods (APIs) to access your data. (E;G., CosmosDB is multi-model)
How does the charging structure work for a CosmosDB table and why?
Charging is done by current capacity specified, regardless of the amount of data stored. This is because the capacity is reserved.
What are the 2 methods of migrating data to CosmosDB?
- CosmosDB Data Migration Tool
* AzCopy
Graph Database
One of the models supported by Cosmos DB.
Non-tabular format
Relationships are treated as data rather than schema structure
What is a Graph?
The essential structure of a graph database.
Comprised of
- Vertices/Nodes - represent objects
- Edges/Relationships - Relationships between vertices
- Properties - descriptors of vertices and edges
- Vertice Properties - description of the object
- Edge Properties - description of the relationship
What is the general way to define your Nodes and Edges in a graph database?
Nodes = nouns
- Customer
- Employee
- Product
Relationships = verbs - Likes
- Places
- Belongs To
BCDR
Business Continuity and Disaster Recovery
Multi-master support
- AKA multiple write regions
- Every region defined as a write region is considered a master
- data changes are automatically propagated with single digit latency
Multi-master conflict resolution modes
- Last-Writer-Wins(LWW) - User defined integer value
- Custom (User Defined Function) - User defined stored proc
- Custom (Async) - Conflicts logged and handled at the app level
Multi-region failover
- Only applies to single write regions
- Read region outages take that region offline and redirect traffic
- Write region outages mark the master offline and promote another region to master
Types of data consistency offered
- Strong
- Bounded staleness (set interval lag)
- Session Scoped to client session most widely used
- Consistent Prefix - reads see writes in order
- Eventual Session to eventual 2x less expensive
Key metrics of Cosmos DB to take into consideration
- Total Requests
- Total Throughput
- Total Storage
R/U
- Request Unit.
* The amount of CPU, disk I/O and memory required to read 1KB of data in 1 second
An Cosmos DB ____ is a container for one or more databases. A Cosmos DB ____ is a container for one or more collections. A ____ contains documents. A ____ is an unstructured set of key/value pairs read and written in a JSON format
Account
database
collection
document
Partitioning
Distributing data across underlying resources based on a predefined partition key (E;G., combination of property keys in a document like person and category)
Given the following requirement, what is the R/U requirement
- 25 docs/second
- 20 RUs/doc
500 R/Us (25 x 20)
Once a partition is defined, how are documents placed into a particular partition?
Automatically by Cosmos DB
A partition key is a ____ ____. Documents with the same partition key are always…
- document property
2. stored on the same logical partition
What is a partition hot-spot and what is the risk of having one?
- A partition key that does not evenly divide data across partitions
- Hot partitions can overload the R/Us assigned to that partition and cause throttling
CosmosDB Indexing Modes
- Consistent - updated with every new doc
- Lazy - reads/writes take priority over updating the index
- None