Chapter 6 Database Architecture Flashcards

1
Q

What are the components of MySQL architecture?

A

MySQL architecture consists of various components and relationships that facilitate database management.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the four layers of MySQL architecture?

A

The four layers are Tools Layer, Query Processor, Connectors, and Storage Engine.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the Tools Layer in MySQL architecture?

A

The Tools Layer includes user interfaces and applications for database interaction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does the Query Processor do?

A

The Query Processor manages connections and compiles SQL queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are Connectors in MySQL?

A

Connectors are APIs that link applications to the query processor.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the purpose of APIs in MySQL?

A

APIs are older programming interfaces for database interaction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is MySQL Workbench?

A

MySQL Workbench is a desktop application for managing MySQL databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are Utility Programs in MySQL?

A

Utility Programs are tools for database administration and management.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the Command-Line Client?

A

The Command-Line Client is an interactive tool for processing SQL queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are Database Transactions?

A

Database Transactions are operations that ensure data integrity and consistency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is System Data in MySQL?

A

System Data in MySQL is the information stored in the mysql system schema. This schema contains tables with metadata about the server itself, including:

  • Data Dictionary Tables: Store information about database objects like tables and columns.
  • System Tables: Used for operational purposes like user accounts and server logs.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do users interact with User Data in MySQL?

A

User Data is created and managed by database users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are High-End Installations?

A

High-End Installations are advanced setups requiring additional Enterprise features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a Connection in MySQL?

A

A Connection specifies the database name and credentials.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does the Connection Manager do?

A

The Connection Manager creates and manages connections to databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a Query Compilation?

A

Query Compilation generate a detailed execution plan for queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the CSV Storage Engine?

A

The CSV Storage Engine stores data in comma-separated values format.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the MEMORY Storage Engine?

A

The MEMORY Storage Engine stores data in RAM for fast access.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is a Storage Engine?

A

A Storage Engine manages how data is stored in tables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Who is a Database Administrator?

A

A Database Administrator manages database configurations and storage engines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is Block Replacement?

A

Block Replacement is the process of removing stale blocks from the buffer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is a Data Update?

A

A Data Update is the modification of existing data in a block.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is Buffer Capacity?

A

Buffer Capacity is the maximum number of blocks the buffer can hold.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is Block Discarding?

A

Block Discarding is removing blocks from the buffer to free space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is the File System Layer?

A

The File System Layer organizes data stored on physical storage media.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What are Database Queries?

A

Database Queries are requests for data retrieval or manipulation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Multi-Tier Architecture is used how in management?

A

Multi-Tier Architecture groups computers in layers for application management.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is the Top Tier in architecture?

A

The Top Tier interacts directly with end-users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What does the Bottom Tier manage?

A

The Bottom Tier manages resources like databases and email.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What functions does the Middle Tier execute?

A

The Middle Tier executes functions like user authorization and business logic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is Web Architecture?

A

Web Architecture is a multi-tier architecture for web applications.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What role do Web Browsers play in web architecture?

A

Web Browsers are the top tier managing user interaction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What do Web Servers do?

A

Web Servers are the middle tier generating web pages and handling requests.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is the function of Application Servers?

A

Application Servers process requests and communicate with databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What does the Services Tier provide?

A

The Services Tier provides database and authentication services.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is Single-tier architecture?

A

Single-tier architecture contains one layer of software components.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What does Multi-tier architecture contain?

A

Multi-tier architecture contains two or more interconnected software tiers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What is On-premise software?

A

On-premise software is installed and run on customer computers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What are Cloud services?

A

Cloud services are software provided over the internet for a fee.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is Infrastructure-as-a-Service (IaaS)?

A

IaaS involves renting virtual computing resources from cloud providers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What does Platform-as-a-Service (PaaS) provide?

A

PaaS provides a platform for developers to create and manage applications

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What is Software-as-a-Service (SaaS) & Platform as a Service?

A
  1. SaaS is about delivering ready-to-use software applications (like renting a furnished apartment)
  2. PaaS provides a platform for developers to create and manage applications (like leasing land to build a house).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What is Elastic Compute Cloud (EC2)?

A

EC2 is Amazon’s IaaS offering for virtual computing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What is Microsoft Azure?

A

Microsoft Azure is Microsoft’s PaaS offering with various cloud services.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What is Salesforce?

A

Salesforce is a SaaS for sales management and customer relations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

What is a Cloud database?

A

A Cloud database is offered as a PaaS cloud service.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What is a Virtual machine (VM)?

A

A VM emulates a complete computing environment on a server.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Who is a Customer in cloud services?

A

A Customer is an end-user accessing cloud services.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What is Storage media?

A

Storage media are physical devices for storing data in the cloud.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What is a Network?

A

A Network connects multiple computers for data exchange.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

What is an Operating system?

A

An Operating system is software managing computer hardware and resources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

What is a Database?

A

A Database is a structured collection of data for storage and retrieval.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What is a Server in a computer network?

A

A Server is a computer providing data or services to other computers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

What is a Shared memory computer?

A

A Shared memory computer has processors that share the same memory and storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

What is a Shared storage computer?

A

A Shared storage computer has processors that share storage but have private memory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

What is a Shared nothing computer?

A

A Shared nothing computer has processors that share neither memory nor storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

What are Parallel computers?

A

Parallel computers have multiple processors working under a single OS.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

What is a Local area network?

A

A Local area network consists of cables connecting devices within a small area.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

What is the Ethernet protocol?

A

The Ethernet protocol is a common communication protocol for local area networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

What are Data transmission risks?

A

Data transmission risks are potential vulnerabilities when sending data online.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

What is a corp Cloud service provider?

A

Cloud service providers are companies offering database management in the cloud.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

What is Scalable storage?

A

Scalable storage is adjustable storage capacity based on customer needs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

What are Peak processing loads?

A

Peak processing loads refer to high demand periods requiring additional resources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

What is a Wide Area Network?

A

A Wide Area Network spans multiple facilities across large distances.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

What is a Node in networking?

A

A Node is a computer connected in a network.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

What is Shared Memory?

A

Shared Memory is when multiple processors share both memory and storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

What is Shared Storage?

A

Shared Storage is when multiple processors share storage but not memory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
68
Q

What is Shared Nothing?

A

Shared Nothing means each processor has private memory and storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
69
Q

What is the Ethernet Protocol?

A

The Ethernet Protocol is a standard for connecting computers in a network.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
70
Q

What is a Parallel Database?

A

A Parallel Database runs on parallel computers or clusters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
71
Q

What is a Distributed Database?

A

A Distributed Database operates on multiple computers via a wide area network.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
72
Q

What is Query Decomposition?

A

Query Decomposition is breaking queries into parts for concurrent execution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
73
Q

What is a Local Area Network?

A

A Local Area Network covers a small geographic area.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
74
Q

What are Technical Challenges in databases?

A

Technical Challenges arise from slow wide area networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
75
Q

What is a Unified View of Data?

A

A Unified View of Data provides consistent data representation for users and programmers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
76
Q

What is Data Location Impact?

A

Data Location Impact refers to the significance of data placement on query processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
77
Q

What are Remote Queries?

A

Remote Queries access data from different locations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
78
Q

What is Cluster Management Software?

A

This software coordinates nodes in a cluster.

Node: computer on network

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
79
Q

What are Memory Access Limitations?

A

Memory Access Limitations indicate local networks can’t support shared memory access.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
80
Q

What is Data Visibility?

A

Data Visibility means only administrators see physical data locations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
81
Q

What is Concurrency in databases?

A

Concurrency is the simultaneous execution of processes or queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
82
Q

What is Query Execution Speed?

A

Query Execution Speed is the time taken to process database queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
83
Q

What is Employee Data Storage?

A

Employee Data Storage refers to local nodes storing data for geographically distributed employees.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
84
Q

What are Communication Protocols?

A

Communication Protocols are rules for data exchange between networked devices.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
85
Q

What is a Processor?

A

A Processor is a component that executes instructions and processes data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
86
Q

What is the Two-Phase Commit protocol?

A

The Two-Phase Commit protocol ensures all nodes commit or rollback updates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
87
Q

What is a Transaction Coordinator?

A
  • A transaction coordinator is a software component that manages distributed transaction processes.
  • It ensures that all parts of a transaction are completed successfully or that the entire transaction is rolled back if there is an error.
88
Q

What is a Local Log?

A

A Local Log is temporary storage for pending updates on nodes.

89
Q

What is a Commit Message?

A

A Commit Message is an instruction to finalize updates in a database.

90
Q

What is Rollback?

A

Rollback is the reversal of changes made during a transaction.

91
Q

What is a Confirmation Message?

A

A Confirmation Message is an acknowledgment from nodes to the transaction coordinator.

92
Q

What is Data Assignment?

A

Data Assignment is the distribution of data across nodes in a database.

93
Q

What is Node Availability?

A

Node Availability refers to the status of nodes being operational during transactions.

94
Q

Who are Database Administrators?

A

Database Administrators are professionals managing and optimizing database performance.

Aka root account

95
Q

What are Distributed Database Characteristics?

A

Distributed Database Characteristics indicate nodes are not visible to users or programmers.

96
Q

What is Two-phase locking?

A

Two-phase locking controls lock acquisition in transactions.

97
Q

What is Two-phase locking Phase 1?

A

In Phase 1, the coordinator notifies the nodes of updates.

98
Q

What is Two-phase locking; Phase 2?

A

In Phase 2, the coordinator receives confirmations and instructs commits.

99
Q

What is a Commit message?

A
  • A Commit message instructs nodes to finalize updates.
  • The commit message flows from the coordinator to the nodes, confirming the update.
100
Q

What is a Distributed transaction?

A

A Distributed transaction involves multiple nodes for data updates.

101
Q

What is a Local transaction?

A

A Local transaction updates data on a single node only.

102
Q

What is Temporary inconsistency?

A

Temporary inconsistency occurs when nodes are updated at different times, causing discrepancies.

103
Q

What is a Debit Node?

A

A Debit Node is responsible for processing debit transactions.

104
Q

What is a Credit Node?

A

A Credit Node is responsible for processing credit transactions.

105
Q

What is the CAP Theorem?

A

The CAP Theorem limits simultaneous consistency, availability, and partition tolerance.

106
Q

What is a Network Partition?

A

A Network Partition is a failure; preventing nodes from communicating with each other.

107
Q

What is a Partition-Tolerant Database?

A

A Partition-Tolerant Database continues functioning despite network partitions.

108
Q

What is a Consistent Database?

A

A Consistent Database requires data across nodes to match and conform.

109
Q

What is an SQL Update Statement?

A

An SQL Update Statement is a command to modify existing data in a database.

110
Q

What is an SQL Select Statement?

A

An SQL Select Statement is 𝓪 𝓬𝓸𝓶𝓶𝓪𝓷𝓭 𝓽𝓸 retrieve data from a database.

111
Q

What is Percentage Availability?

A

Percentage Availability is the proportion of time a database is responsive to users.

112
Q

What is a Node in a distributed database?

A

A Node is an individual unit within a distributed database system.

113
Q

What is Table Data Partition?

A

Table Data Partition is a subset of data within a database table.

114
Q

What is a Live Node?

A

A Live Node is a node that actively responds to queries.

115
Q

What is a Replicated Database?

A

A Replicated Database maintains multiple copies on different devices.

116
Q

What is a Simple Backup?

A

A Simple Backup involves backing up one replica while others are active.

117
Q

What is Enhanced Security?

A

Enhanced Security refers to improved data protection through multiple replicas.

118
Q

What is Shared Nothing Architecture?

A

Shared Nothing Architecture is where nodes do not share storage.

119
Q

What is Shared Storage?

A

Shared Storage allows multiple nodes to access common storage.

120
Q

What is Shared Memory?

A

Shared Memory is an architecture where processors share a common memory.

121
Q

What is Complex Server Administration?

A

Complex Server Administration involves managing update propagation across replicas.

122
Q

What is Replication?

A

Replication is used in parallel and distributed databases.

123
Q

What are Frequent Reads?

A

Frequent Reads are a common scenario for using database replication.

124
Q

What are Infrequent Updates?

A
  • Infrequent Updates occur less often than reads.
  • The frequency of data reads is usually much higher than the frequency of data updates.
125
Q

What is Temporary Inconsistency in replication?

A

Temporary Inconsistency is an acceptable state during data replication.

126
Q

What is a Catalog in databases?

A

A Catalog is a directory of information describing database objects.

127
Q

What is a Central Catalog?

A

A Central Catalog is catalog information stored on a single node.

128
Q

What is Group Replication?

A

Group Replication is a technique allowing updates from any node in the group.

129
Q

What do Double-Headed Arrows indicate?

A

Double-Headed Arrows indicate bidirectional communication between nodes and the cloud.

130
Q

What are Animation Captions?

A

Animation Captions are visual aids explaining the database update process.

131
Q

What is Transaction Information?

A

Transaction Information is data sent to nodes before commit to prevent conflicts.

132
Q

What is a Database Management System?

A

A Database Management System is software for managing databases and processing queries.

133
Q

What are Replicated Databases?

A

Replicated Databases maintain copies of data across nodes.

134
Q

What is a Central Catalog?

A

A Central Catalog contains the entire catalog on a single node.

135
Q

What are Local Transactions?

A

Local Transactions are transactions affecting only one node’s data.

136
Q

What is an Out-of-Date Replica?

A

An Out-of-Date Replica is not updated with the latest changes.

137
Q

What is an Error Display in databases?

A

An Error Display advises resubmitting queries if outdated.

138
Q

What is Operational Data?

A

Operational Data is used for daily business functions.

139
Q

What is Analytic Data?

A

Analytic Data is used for business analysis and planning.

140
Q

What is Volatility in data?

A
  • Volatility refers to the rate of change in operational data.
  • Volatility refers to data whose value is undefined and changes quickly
141
Q

What is Extracted Data?

A

Extracted Data is pulled from operational databases for processing.

142
Q

What is Data Mart Derivation?

A

Data Mart Derivation is the creation of a data mart from operational or warehouse data.

143
Q

What are Performance Issues?

A

Performance Issues are problems affecting the speed of database queries.

144
Q

What is Data Refresh Frequency?

A

Data Refresh Frequency refers to regular intervals at which data is updated in the warehouse.

145
Q

What is Query Conflict?

A

Query Conflict refers to discrepancies in reports generated by simultaneous queries.

146
Q

What is Metric System Conversion?

A

Metric System Conversion is changing measurements to metric for consistency.

147
Q

What is a system Catalog?

A
  1. A collection of tables within the database that store metadata about the database itself.
  2. Includes information about tables, columns, data types, constraints, and other structural details.
148
Q

What is a Learning Management System?

A

A software application used to create, deliver, track, and manage online courses and training programs

149
Q

What is a Temporary Database?

A

A Temporary Database is short-term storage for data processing.

150
Q

What is Data Standardization?

A

Data Standardization applies consistent formats and definitions to data.

151
Q

What is a Fact Table?

A

A Fact Table stores quantitative data for analysis.

152
Q

What is a Dimension Table?

A

A Dimension Table contains descriptive attributes related to facts.

153
Q

What is Summary Data?

A

Summary Data is precomputed data stored for fast access.

154
Q

What is a Dimensional Model?

A

A Dimensional Model is a structure used to organize data in a data warehouse.

155
Q

What is Nonvolatile Data?

A
  1. Nonvolatile Data remains unchanged (via power outage) until updated or replaced.
  2. It’s stored in non-volatile storage mediums, such as hard drives. flash drives, and optical discs
156
Q

What is Query Submission?

A

Query Submission is the process of requesting information from a data warehouse.

157
Q

What are Ad Hoc Queries?

A

Ad Hoc Queries are spontaneous queries for specific data analysis needs.

158
Q

What are Operational Systems?

A

Operational Systems are designed for executing business transactions.

159
Q

What is Analysis Paralysis?

A

Analysis Paralysis is overthinking that hinders timely decision-making.

160
Q

What are Reporting and Analytics?

A

Reporting and Analytics are processes for interpreting data, to inform business decisions.

161
Q

Who are BI Consumers?

A

BI Consumers are individuals providing requirements for data analysis.

162
Q

What is Data Profiling?

A

Data Profiling is assessing data quality of candidate sources.

163
Q

What are Dimensional Models?

A

Dimensional Models are structures designed for analytical data representation.

164
Q

What is Data Extraction?

A

Data Extraction is the process of retrieving data from sources.

165
Q

What is Data Cleansing?

A

Data Cleansing involves removing inaccuracies from data sets.

166
Q

What is Data Replication?

A

Data Replication is copying data from one database to another.

167
Q

What is Change Data Capture?

A

Change Data Capture is tracking changes in data sources.

168
Q

What are Computational Requirements?

A
  • Computational Requirements are resources needed for processing data aggregations.
  • These include the hardware (CPU, GPU, memory, storage) and software (operating system, databases, algorithms) that enable data manipulation and analysis.
169
Q

What are Parallel Machine Architectures?

A

Parallel Machine Architectures are systems designed to handle large data processing.

170
Q

What is Storage Space Calculation?

A

Storage Space Calculation is determined by dimensions and their sizes.

171
Q

What is Data Migration?

A

Data Migration is moving data from one system to another.

172
Q

What is a Staging Area?

A

A Staging Area is infrastructure for aggregating raw data sets.

173
Q

What is Data Validation?

A

Data Validation ensures data accuracy and quality before processing.

174
Q

What is Data Cleaning?

A

Data Cleaning involves removing inaccuracies and inconsistencies from data.

175
Q

What is Data Transformation?

A

Data Transformation modifies data into a suitable format for analysis.

176
Q

What is Data Requirements Analysis?

A

Data Requirements Analysis identifies data needed for business intelligence applications.

177
Q

What is Automated Extraction?

A

Automated Extraction refers to tools that automate data retrieval processes.

178
Q

What are SQL Queries?

A

SQL Queries are structured queries for data extraction.

179
Q

What are Adapters?

A

Adapters are connectors for different data sources.

180
Q

What are Code Generation Techniques?

A

Code Generation Techniques are methods for creating executable data extraction programs.

181
Q

What is a Source System?

A

A Source System is the original system from which data is extracted.

182
Q

What is Warehouse Population?

A

Warehouse Population is filling the data warehouse with processed data.

183
Q

What is Metadata?

A

Metadata is data that describes other data.

184
Q

What is Renormalization?

A

Renormalization is reverting denormalized data back to normalized form.

185
Q

What is Aggregation?

A

Aggregation is summarizing data for reporting or analysis.

186
Q

What is Audit Information?

A

Audit Information is data used to verify integrity and accuracy.

187
Q

What are Row Counts?

A

Row Counts refer to the total number of rows in a dataset.

188
Q

What is Null Conversion?

A

Null Conversion transforms various null representations into a standard form.

189
Q

What is Data Augmentation?

A

Data Augmentation involves adding provenance information to data records.

190
Q

What does the architecture of a database system describe?

A

The internal components and the relationships between components

This includes understanding how different parts of the database system interact with each other.

191
Q

What is the role of the query processor in a database system?

A

Interprets queries, creates a plan to modify the database or retrieve data, and returns query results to the application

The query processor optimizes queries for efficiency.

192
Q

How does the storage manager function in a database system?

A

Translates the query processor instructions into low-level file system commands that modify or retrieve data

It also uses indexes to quickly locate data due to the varying sizes of databases.

193
Q

What is the function of the transaction manager?

A

Ensures transactions are properly executed and prevents conflicts between concurrent transactions

It also restores the database to a consistent state in case of failure.

194
Q

What is contained in the log file of a database?

A

A complete record of all inserts, updates, and deletes processed by the database

The transaction manager writes log records before applying changes for recovery purposes.

195
Q

What is the purpose of the catalog in a database system?

A

A directory of tables, columns, indexes, and other database objects

Other components use catalog information to process and execute queries.

196
Q

True or False: All database systems support transactions.

A

False

Some database systems do not support transactions and therefore lack a transaction manager.

197
Q

Fill in the blank: The database sizes range from _______ to many terabytes.

A

megabytes

198
Q

What does the transaction manager do in the event of a failure?

A

Uses log records to restore the database

This ensures that the database can return to a consistent state after a failure.

199
Q

What happens to a database transaction if a server failure occurs before completion?

A

The database must reverse any partial changes

This is necessary to maintain data integrity.

200
Q

What is query optimization in the context of a database?

A

The process of ensuring the most efficient instructions are executed on the data

It is performed by the query processor.

201
Q

What might vary significantly between different database systems?

A

Capabilities and component details

For example, the physical structure of data on storage media affects the storage manager’s implementation.

202
Q

What are the main components of a database system?

A

A query processor, storage manager, transaction manager, log, and catalog.

203
Q

What role does the query processor play in a database system?

A

It receives queries from applications and performs query optimization using information from the catalog.

204
Q

How does the storage manager interact with the query processor?

A

It translates the query processor’s instructions into file-system commands and uses an index to locate requested data quickly.

205
Q

What does the transaction manager do in a database system?

A

It logs insert, update, and delete queries, sending the result back to the application.

206
Q

Does the query processor has direct access to the database data on storage media.

A

No, all access to the database data must go through the storage manager

207
Q

Without query optimization, the storage manager cannot $ ?

A
  1. The storage manager can retrieve the data with efficient or inefficient instructions
  2. Query optimization ensures the most efficient instructions are sent to the storage manager.
208
Q

The catalog allows the storage manager to $ ?

A

Catalogs allow quick locations to the requested data.

209
Q

What must be done in order to correctly restore database changes? 2

A
  1. All inserts, updates, and deletes must be logged in order to correctly restore database changes.
  2. Reads are not needed to restore database changes, however, and thus are not logged.
210
Q

What is metadata in the context of a database?

A

Data about the database, such as column names and the number of rows in each table.

211
Q

Where is metadata stored in a database system?

A

In the catalog.

212
Q

The _______ helps locate data quickly by providing information about where on the storage media certain pieces of data can be found.

A

[index]

213
Q

What types of queries does the transaction manager log?

A

Insert, update, and delete queries.

214
Q

What is the purpose of query optimization?

A

To ensure the most efficient instructions are sent to the storage manager.

215
Q

What is a concurrent execution?

A
  1. Concurrent execution is when multiple processes are active at the same time, rather than one after the other.
  2. It can also refer to the execution of multiple transactions at the same time