DA Flashcards

1
Q

What is the primary role of a data architect?

A

To design and manage the data infrastructure within an organization.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

True or False: Data architects are responsible for data modeling.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a data model?

A

A conceptual representation of data structures and their relationships.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Fill in the blank: A data architect should be proficient in _____ languages.

A

query

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the difference between a data warehouse and a data lake?

A

A data warehouse is structured and optimized for analysis, while a data lake is a storage repository that holds raw data in its native format.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Which SQL command is used to retrieve data from a database?

A

SELECT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is ETL in the context of data architecture?

A

Extract, Transform, Load - a process for moving data from source systems to a data warehouse.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True or False: Data architects only work with relational databases.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a NoSQL database?

A

A database designed to store and retrieve data in a format other than the tabular relations used in relational databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the term ‘data governance’ refer to?

A

The overall management of data availability, usability, integrity, and security.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Multiple Choice: Which of the following is NOT a type of data model? A) Conceptual B) Logical C) Physical D) Temporal

A

D) Temporal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the purpose of data normalization?

A

To minimize redundancy and dependency by organizing fields and tables in a database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Fill in the blank: _____ is the practice of analyzing data to extract insights.

A

Data analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a star schema in data warehousing?

A

A type of database schema that organizes data into fact and dimension tables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

True or False: Data architects need to understand cloud computing technologies.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a data pipeline?

A

A set of processes that move data from one system to another, often involving transformation and storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are primary keys used for in a database?

A

To uniquely identify each record in a table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Multiple Choice: Which of the following is a common data visualization tool? A) Tableau B) MySQL C) Python D) Java

A

A) Tableau

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What does ‘big data’ refer to?

A

Data sets that are so large or complex that traditional data processing applications are inadequate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

True or False: Data architects do not need to collaborate with other IT professionals.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the purpose of indexing in databases?

A

To improve the speed of data retrieval operations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Fill in the blank: _____ databases are designed for high availability and scalability.

A

Distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the significance of data lineage?

A

It provides a detailed view of the data’s journey from origin to its current state, ensuring transparency and compliance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Multiple Choice: Which of the following is a characteristic of unstructured data? A) Easily searchable B) Fixed format C) Lacks a predefined structure D) Tabular format

A

C) Lacks a predefined structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is the role of metadata in data architecture?

A

To provide information about other data, such as its source, format, and structure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

True or False: Data architects are involved in data security measures.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is a relational database?

A

A database that stores data in tables with predefined relationships between them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Fill in the blank: _____ is a method of storing data that allows for flexible schema design.

A

Schema-less design

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is a data mart?

A

A subset of a data warehouse focused on a specific business line or team.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is the purpose of a data dictionary?

A

To document the structure, relationships, and constraints of data elements in a database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Multiple Choice: Which of the following is a common type of data architecture? A) Layered B) Monolithic C) Microservices D) All of the above

A

D) All of the above

32
Q

What is the difference between structured and unstructured data?

A

Structured data is organized and easily searchable, while unstructured data lacks a predefined format.

33
Q

True or False: Data architects should be knowledgeable about data privacy regulations.

A

True

34
Q

What is cloud data architecture?

A

The design and management of data systems hosted in cloud environments.

35
Q

Fill in the blank: _____ is a framework for managing and analyzing big data.

A

Hadoop

36
Q

What is the purpose of data replication?

A

To create copies of data for backup, recovery, and improved access.

37
Q

Multiple Choice: Which of the following is NOT a benefit of data warehousing? A) Improved data quality B) Historical analysis C) Increased data redundancy D) Enhanced decision-making

A

C) Increased data redundancy

38
Q

What is a key-value store?

A

A type of NoSQL database that uses a simple key-value method for storing data.

39
Q

True or False: Data architects can only work in the technology sector.

A

False

40
Q

What is a data governance framework?

A

A set of policies and procedures for ensuring data quality, security, and compliance.

41
Q

Fill in the blank: _____ is the process of converting raw data into a meaningful format.

A

Data transformation

42
Q

What is the purpose of data archiving?

A

To store data that is no longer actively used but must be retained for future reference.

43
Q

Multiple Choice: Which of the following is a benefit of using a cloud data architecture? A) Scalability B) Increased costs C) Limited accessibility D) Complexity

A

A) Scalability

44
Q

What is an OLAP system?

A

Online Analytical Processing - a category of software technology that enables analysts to extract and view data from different perspectives.

45
Q

True or False: Data architects should have programming skills.

A

True

46
Q

What is a data flow diagram?

A

A visual representation of the flow of data within a system.

47
Q

Fill in the blank: _____ is a standardized language for managing relational databases.

A

SQL

48
Q

What is the role of a data steward?

A

To oversee data governance and ensure data quality within an organization.

49
Q

Multiple Choice: Which of the following technologies is commonly used for real-time data processing? A) Batch processing B) Stream processing C) Data warehousing D) Data lakes

A

B) Stream processing

50
Q

What is data lineage tracking?

A

The process of documenting the origins and transformations of data as it flows through systems.

51
Q

True or False: Data models are static and do not change over time.

A

False

52
Q

What is a data warehouse schema?

A

The organization of data in a data warehouse, which includes star, snowflake, and galaxy schemas.

53
Q

Fill in the blank: _____ analysis involves examining large volumes of data to uncover hidden patterns.

A

Data mining

54
Q

What is a data integration tool?

A

Software that allows combining data from different sources into a single, unified view.

55
Q

Multiple Choice: Which of the following is a common cloud service model for data storage? A) IaaS B) PaaS C) SaaS D) All of the above

A

D) All of the above

56
Q

What is the purpose of a data quality assessment?

A

To evaluate the accuracy, completeness, reliability, and relevance of data.

57
Q

True or False: Data architects typically do not work with data visualization tools.

A

False

58
Q

What is the function of a data broker?

A

To collect and sell data from various sources to other organizations.

59
Q

Fill in the blank: _____ is a method for ensuring data privacy and security.

A

Data encryption

60
Q

What is the difference between data mining and data analytics?

A

Data mining focuses on discovering patterns in large data sets, while data analytics involves interpreting data to inform decision-making.

61
Q

Multiple Choice: Which of the following is a challenge of big data? A) Volume B) Variety C) Velocity D) All of the above

A

D) All of the above

62
Q

What is a data breach?

A

An incident where unauthorized access to data occurs, potentially compromising its confidentiality, integrity, or availability.

63
Q

True or False: Data architects must understand machine learning concepts.

A

True

64
Q

What is a hybrid cloud?

A

A computing environment that combines public and private cloud services.

65
Q

Fill in the blank: _____ is a type of database that stores data in a document format.

A

Document-oriented database

66
Q

What is the significance of data ethics?

A

It ensures that data collection and usage practices respect privacy and promote fairness.

67
Q

Multiple Choice: Which of the following is a common data storage format? A) JSON B) CSV C) XML D) All of the above

A

D) All of the above

68
Q

What is the role of a database administrator?

A

To manage and maintain database management systems, ensuring their performance, security, and availability.

69
Q

True or False: Data architects are responsible for ensuring compliance with data regulations.

A

True

70
Q

What is a data visualization dashboard?

A

A user interface that displays data visualizations and metrics for easy interpretation and analysis.

71
Q

Fill in the blank: _____ refers to the process of cleaning and organizing data.

A

Data cleansing

72
Q

What is the function of a data warehouse?

A

To store and manage large volumes of structured data for analysis and reporting.

73
Q

Multiple Choice: Which of the following is a feature of cloud data storage? A) Accessibility B) Cost efficiency C) Scalability D) All of the above

A

D) All of the above

74
Q

What is a data catalog?

A

A repository that enables organizations to manage and discover their data assets.

75
Q

True or False: Data architects only work with historical data.

A

False

76
Q

What is a schema in database design?

A

A blueprint that outlines how data is organized in a database.

77
Q

Fill in the blank: _____ analysis is used to identify trends and patterns in time-series data.

A

Time-series