Random Flashcards

1
Q

What does OLTP stand for?

A

Online Transactional Processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does OLAP stand for?

A

Online Analytical Processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a data warehouse primarily used for?

A
  • Business Intelligence
  • Analytics
  • Reporting
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a data lake primarily used for?

A
  • Big Data
  • Machine Learning
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What type of data is stored in a data warehouse?

A

Structured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What type of a data is stored in a data lake?

A

Structured, semi-structured, unstructured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a data lakehouse primarily used for?

A

Combines the querying capabilities of a data warehouse with the schema-less nature and high storage capacity of data lakes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does ACID stand for?

A

Atomicity, Consistency, Isolation, and Durability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does CRUD stand for?

A

Create, Read, Update, Delete

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does Atomicity in ACID mean?

A

Ensures that all operations in a transaction are fully completed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does Consistency in ACID mean?

A

Guarantees the database remains valid before and after a transaction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does Isolation in ACID mean?

A

Ensures transactions run independently without affecting each other’s outcome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does Durability in ACID mean?

A

Once a transaction is committed, it stays permanent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the 1NF (First Normal Form)?

A

All records have the same structure (same number of fields) and each field contains a single value, with no repeating groups of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the 2NF (Second Normal Form)?

A

A non-key field must provide a fact about the key, us the whole key, and nothing but the key.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the 3NF (Third Normal Form)?

A

Non-key fields only depend (describe) the primary key and not other non-key fields.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is data governance?

A

The overall management of data availability, usability, integrity, and security.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is sharding?

A

Distributing data across multiple database instances or servers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are the benefits of sharding?

A

Can handle high volumes of traffic and allows for parallel processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does the acronym CAP (CAP theorem) stand for?

A

Consistency, Availability, Partition Tolerance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What does the CAP theorem state?

A

In a distributed data store, you can only guarantee two of the three properties.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is data modeling?

A

The process of creating a conceptual representation of data and its relationships within a database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is data transformation?

A

The process of converting data from its original format into a format suitable for analysis or processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What are common techniques used in data transformation?

A
  • Data cleansing
  • Data normalization
  • Aggregation
  • Data mapping
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is a Star Schema?

A

A central fact table connected to dimension tables in a simple, denormalized structure for fast querying.

26
Q

What is a Snowflake Schema?

A

A normalized version of the star schema with dimension tables further split into related tables to reduce redundancy.

27
Q

What is data lineage?

A

Tracking of the flow and transformation of data throughout its lifecycle.

28
Q

What is a data mart?

A

A subset of a data warehouse, focused on a specific business line or team.

29
Q

What are OLTP systems typically used for?

A

Real-time transactional processing, such as banking, ecommerce, or bookings.

30
Q

What are OLAP systems typically used for?

A

Processing and analyzing large amounts of data quickly for business intelligence and reporting.

31
Q

What does DSA stand for?

A

Data Structure Analysis

32
Q

How do you optimize queries?

A
  • Analyze execution plan with EXPLAIN
  • Build indexes
  • Minimize SELECT columns
  • Refactor complex queries
33
Q

How do you ensure data quality throughout a pipeline?

A
  • Unit testing code
  • Retries for transient errors
  • Data quality validation (great expectations)
  • Monitoring and alerting
34
Q

What does ETL stand for?

A

Extract, Transform, Load

35
Q

What does ELT stand for?

A

Extract, Load, Transform

36
Q

Are python “lists” mutable or immutable?

A

Mutable

37
Q

How do you add an item to a python “list”?

A
list.append(item)
38
Q

How do you insert an item into a python “list”?

A
list.insert(index, element)
39
Q

How do you remove an item from a python “list”?

A
list.remove(item)
40
Q

How do you slice a python “list”?

A
list.slice(start:stop:step)
41
Q

Provide an example of “list comprehension” in python.

A
[ x * 2 for x in list ]
42
Q

Is a python “tuple” mutable or immutable?

A

Immutable

43
Q

Provide an example of python “tuple” unpacking.

A
a, b, c = tuple
44
Q

Provide an example of setting a “tuple” in python.

A
tuple = (1, 2, 3)
45
Q

Are python “dictionaries” mutable or immutable?

A

Mutable

46
Q

Provide an example of setting a python “dictionary”.

A
dictionary = { 'key1': 'value1', 'key2': 'value2' }
47
Q

Provide an example of adding/updating a python “dictionary” value.

A
dictionary[key] = value
48
Q

Provide an example of removing a python “dictionary” value.

A
dictionary.pop(key)
49
Q

Provide an example of accessing a python “dictionary” keys.

A
dictionary.keys()
50
Q

Provide an example of accessing a python “dictionary” values.

A
dictionary.value()
51
Q

Provide an example of python “dictionary” comprehension.

A
{ k: v * 2 for k, v in dictionary.items() }
52
Q

Are python “sets” mutable or immutable?

A

Mutable, but unique

53
Q

Provide an example of setting a python “set”.

A
set = {1, 2, 3}
54
Q

Provide an example of adding an item to a python “set”.

A
set.add(item)
55
Q

Provide an example of removing an item from a python “set”.

A
set.remove(item)
56
Q

Provide an example of a python “set union” operation.

A
{ set_1 | set_2 }
57
Q

Provide an example of a python “set intersection” operation.

A
{ set_1 & set_2 }
58
Q

Provide an example of a python “set difference” operation.

A
{ set_1 - set_2 }
59
Q

Provide an example of python “set” comprehension.

A
{ x * 2 for x in set if x > 2 }
60
Q

Provide an example of setting a “name tuple” in python.

A
tuple = namedtuple('Point', ['x', 'y'])
61
Q

Provide an example of using a “named tuple” in python.

A
p = Point(1, 2)
print(p.x, p.y)