DA Flashcards
What is the primary role of a data architect?
To design and manage the data infrastructure within an organization.
True or False: Data architects are responsible for data modeling.
True
What is a data model?
A conceptual representation of data structures and their relationships.
Fill in the blank: A data architect should be proficient in _____ languages.
query
What is the difference between a data warehouse and a data lake?
A data warehouse is structured and optimized for analysis, while a data lake is a storage repository that holds raw data in its native format.
Which SQL command is used to retrieve data from a database?
SELECT
What is ETL in the context of data architecture?
Extract, Transform, Load - a process for moving data from source systems to a data warehouse.
True or False: Data architects only work with relational databases.
False
What is a NoSQL database?
A database designed to store and retrieve data in a format other than the tabular relations used in relational databases.
What does the term ‘data governance’ refer to?
The overall management of data availability, usability, integrity, and security.
Multiple Choice: Which of the following is NOT a type of data model? A) Conceptual B) Logical C) Physical D) Temporal
D) Temporal
What is the purpose of data normalization?
To minimize redundancy and dependency by organizing fields and tables in a database.
Fill in the blank: _____ is the practice of analyzing data to extract insights.
Data analytics
What is a star schema in data warehousing?
A type of database schema that organizes data into fact and dimension tables.
True or False: Data architects need to understand cloud computing technologies.
True
What is a data pipeline?
A set of processes that move data from one system to another, often involving transformation and storage.
What are primary keys used for in a database?
To uniquely identify each record in a table.
Multiple Choice: Which of the following is a common data visualization tool? A) Tableau B) MySQL C) Python D) Java
A) Tableau
What does ‘big data’ refer to?
Data sets that are so large or complex that traditional data processing applications are inadequate.
True or False: Data architects do not need to collaborate with other IT professionals.
False
What is the purpose of indexing in databases?
To improve the speed of data retrieval operations.
Fill in the blank: _____ databases are designed for high availability and scalability.
Distributed
What is the significance of data lineage?
It provides a detailed view of the data’s journey from origin to its current state, ensuring transparency and compliance.
Multiple Choice: Which of the following is a characteristic of unstructured data? A) Easily searchable B) Fixed format C) Lacks a predefined structure D) Tabular format
C) Lacks a predefined structure
What is the role of metadata in data architecture?
To provide information about other data, such as its source, format, and structure.
True or False: Data architects are involved in data security measures.
True
What is a relational database?
A database that stores data in tables with predefined relationships between them.
Fill in the blank: _____ is a method of storing data that allows for flexible schema design.
Schema-less design
What is a data mart?
A subset of a data warehouse focused on a specific business line or team.
What is the purpose of a data dictionary?
To document the structure, relationships, and constraints of data elements in a database.