Chapter 6 Database Architecture Flashcards

1
Q

What are the components of MySQL architecture?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the four layers of MySQL architecture?

A

The four layers are Tools Layer, Query Processor, Connectors, and Storage Engine.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the Tools Layer in MySQL architecture?

A

The Tools Layer includes user interfaces and applications for database interaction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does the Query Processor do?

A

The Query Processor manages connections and compiles SQL queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are Connectors in MySQL?

A

Connectors are APIs that link applications to the query processor.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the purpose of APIs in MySQL?

A

APIs are older programming interfaces for database interaction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is MySQL Workbench?

A

MySQL Workbench is a desktop application for managing MySQL databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are Utility Programs in MySQL?

A

Utility Programs are tools for database administration and management.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the Command-Line Client?

A

The Command-Line Client is an interactive tool for processing SQL queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are Database Transactions?

A

Database Transactions are operations that ensure data integrity and consistency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is System Data in MySQL?

A

System Data in MySQL is the information stored in the mysql system schema. This schema contains tables with metadata about the server itself, including:

  • Data Dictionary Tables: Store information about database objects like tables and columns.
  • System Tables: Used for operational purposes like user accounts and server logs.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do users interact with User Data in MySQL?

A

User Data is created and managed by database users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are High-End Installations?

A

High-End Installations are advanced setups requiring additional Enterprise features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a Connection in MySQL?

A

A Connection specifies the database name and credentials.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does the Connection Manager do?

A

The Connection Manager creates and manages connections to databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a Query Compilation?

A

Query Compilation generate a detailed execution plan for queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the CSV Storage Engine?

A

The CSV Storage Engine stores data in comma-separated values format.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the MEMORY Storage Engine?

A

The MEMORY Storage Engine stores data in RAM for fast access.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is a Storage Engine?

A

A Storage Engine manages how data is stored in tables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Who is a Database Administrator?

A

A Database Administrator manages database configurations and storage engines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is Block Replacement?

A

Block Replacement is the process of removing stale blocks from the buffer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is a Data Update?

A

A Data Update is the modification of existing data in a block.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is Buffer Capacity?

A

Buffer Capacity is the maximum number of blocks the buffer can hold.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is Block Discarding?

A

Block Discarding is removing blocks from the buffer to free space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is the File System Layer?

A

The File System Layer organizes data stored on physical storage media.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What are Database Queries?

A

Database Queries are requests for data retrieval or manipulation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Multi-Tier Architecture is used how in management?

A

Multi-Tier Architecture groups computers in layers for application management.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is the Top Tier in architecture?

A

The Top Tier interacts directly with end-users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What does the Bottom Tier manage?

A

The Bottom Tier manages resources like databases and email.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What functions does the Middle Tier execute?

A

The Middle Tier executes functions like user authorization and business logic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is Web Architecture?

A

Web Architecture is a multi-tier architecture for web applications.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What role do Web Browsers play in web architecture?

A

Web Browsers are the top tier managing user interaction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What do Web Servers do?

A

Web Servers are the middle tier generating web pages and handling requests.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is the function of Application Servers?

A

Application Servers process requests and communicate with databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What does the Services Tier provide?

A

The Services Tier provides database and authentication services.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is Single-tier architecture?

A

Single-tier architecture contains one layer of software components.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What does Multi-tier architecture contain?

A

Multi-tier architecture contains two or more interconnected software tiers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What is On-premise software?

A

On-premise software is installed and run on customer computers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What are Cloud services?

A

Cloud services are software provided over the internet for a fee.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is Infrastructure-as-a-Service (IaaS)?

A

IaaS involves renting virtual computing resources from cloud providers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What does Platform-as-a-Service (PaaS) provide?

A

PaaS provides a platform for developers to create and manage applications

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What is Software-as-a-Service (SaaS) & Platform as a Service?

A
  1. SaaS is about delivering ready-to-use software applications (like renting a furnished apartment)
  2. PaaS provides a platform for developers to create and manage applications (like leasing land to build a house).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What is Elastic Compute Cloud (EC2)?

A

EC2 is Amazon’s IaaS offering for virtual computing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What is Microsoft Azure?

A

Microsoft Azure is Microsoft’s PaaS offering with various cloud services.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What is Salesforce?

A

Salesforce is a SaaS for sales management and customer relations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

What is a Cloud database?

A

A Cloud database is offered as a PaaS cloud service.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What is a Virtual machine (VM)?

A

A VM emulates a complete computing environment on a server.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Who is a Customer in cloud services?

A

A Customer is an end-user accessing cloud services.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What is Storage media?

A

Storage media are physical devices for storing data in the cloud.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What is a Network?

A

A Network connects multiple computers for data exchange.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

What is an Operating system?

A

An Operating system is software managing computer hardware and resources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

What is a Database?

A

A Database is a structured collection of data for storage and retrieval.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What is a Server in a computer network?

A

A Server is a computer providing data or services to other computers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

What is a Shared memory computer?

A

A Shared memory computer has processors that share the same memory and storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

What is a Shared storage computer?

A

A Shared storage computer has processors that share storage but have private memory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

What is a Shared nothing computer?

A

A Shared nothing computer has processors that share neither memory nor storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

What are Parallel computers?

A

Parallel computers have multiple processors working under a single OS.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

What is a Local area network?

A

A Local area network consists of cables connecting devices within a small area.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

What is the Ethernet protocol?

A

The Ethernet protocol is a common communication protocol for local area networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

What are Data transmission risks?

A

Data transmission risks are potential vulnerabilities when sending data online.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

What is a corp Cloud service provider?

A

Cloud service providers are companies offering database management in the cloud.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

What is Scalable storage?

A

Scalable storage is adjustable storage capacity based on customer needs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

What are Peak processing loads?

A

Peak processing loads refer to high demand periods requiring additional resources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

What is a Wide Area Network?

A

A Wide Area Network spans multiple facilities across large distances.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

What is a Node in networking?

A

A Node is a computer connected in a network.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

What is Shared Memory?

A

Shared Memory is when multiple processors share both memory and storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

What is Shared Storage?

A

Shared Storage is when multiple processors share storage but not memory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
68
Q

What is Shared Nothing?

A

Shared Nothing means each processor has private memory and storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
69
Q

What is the Ethernet Protocol?

A

The Ethernet Protocol is a standard for connecting computers in a network.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
70
Q

What is a Parallel Database?

A

A Parallel Database runs on parallel computers or clusters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
71
Q

What is a Distributed Database?

A

A Distributed Database operates on multiple computers via a wide area network.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
72
Q

What is Query Decomposition?

A

Query Decomposition is breaking queries into parts for concurrent execution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
73
Q

What is a Local Area Network?

A

A Local Area Network covers a small geographic area.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
74
Q

What are Technical Challenges in databases?

A

Technical Challenges arise from slow wide area networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
75
Q

What is a Unified View of Data?

A

A Unified View of Data provides consistent data representation for users and programmers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
76
Q

What is Data Location Impact?

A

Data Location Impact refers to the significance of data placement on query processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
77
Q

What are Remote Queries?

A

Remote Queries access data from different locations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
78
Q

What is Cluster Management Software?

A

This software coordinates nodes in a cluster.

Node: computer on network

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
79
Q

What are Memory Access Limitations?

A

Memory Access Limitations indicate local networks can’t support shared memory access.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
80
Q

What is Data Visibility?

A

Data Visibility means only administrators see physical data locations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
81
Q

What is Concurrency in databases?

A

Concurrency is the simultaneous execution of processes or queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
82
Q

What is Query Execution Speed?

A

Query Execution Speed is the time taken to process database queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
83
Q

What is Employee Data Storage?

A

Employee Data Storage refers to local nodes storing data for geographically distributed employees.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
84
Q

What are Communication Protocols?

A

Communication Protocols are rules for data exchange between networked devices.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
85
Q

What is a Processor?

A

A Processor is a component that executes instructions and processes data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
86
Q

What is the Two-Phase Commit protocol?

A

The Two-Phase Commit protocol ensures all nodes commit or rollback updates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
87
Q

What is a Transaction Coordinator?

A
  • A transaction coordinator is a software component that manages distributed transaction processes.
  • It ensures that all parts of a transaction are completed successfully or that the entire transaction is rolled back if there is an error.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
88
Q

What is a Local Log?

A

A Local Log is temporary storage for pending updates on nodes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
89
Q

What is a Commit Message?

A

A Commit Message is an instruction to finalize updates in a database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
90
Q

What is Rollback?

A

Rollback is the reversal of changes made during a transaction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
91
Q

What is a Confirmation Message?

A

A Confirmation Message is an acknowledgment from nodes to the transaction coordinator.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
92
Q

What is Data Assignment?

A

Data Assignment is the distribution of data across nodes in a database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
93
Q

What is Node Availability?

A

Node Availability refers to the status of nodes being operational during transactions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
94
Q

Who are Database Administrators?

A

Database Administrators are professionals managing and optimizing database performance.

Aka root account

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
95
Q

What are Distributed Database Characteristics?

A

Distributed Database Characteristics indicate nodes are not visible to users or programmers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
96
Q

What is Two-phase locking?

A

Two-phase locking controls lock acquisition in transactions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
97
Q

What is Two-phase locking Phase 1?

A

In Phase 1, the coordinator notifies the nodes of updates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
98
Q

What is Two-phase locking; Phase 1 / 2?

A
  1. Phase 1 (Growing): Transactions acquire all necessary locks.
  2. Phase 2 (Shrinking): Transactions release all locks and commit.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
99
Q

What is a Commit message?

A
  • A Commit message instructs nodes to finalize updates.
  • The commit message flows from the coordinator to the nodes, confirming the update.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
100
Q

What is a Distributed transaction?

A

A Distributed transaction involves multiple nodes for data updates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
101
Q

What is a Local transaction?

A

A Local transaction updates data on a single node only.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
102
Q

What is Temporary inconsistency?

A

Temporary inconsistency occurs when nodes are updated at different times, causing discrepancies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
103
Q

What is a Debit Node?

A

A Debit Node is responsible for processing debit transactions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
104
Q

What is a Credit Node?

A

A Credit Node is responsible for processing credit transactions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
105
Q

What is the CAP Theorem?

A

The CAP Theorem limits simultaneous consistency, availability, and partition tolerances.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
106
Q

What is a Network Partition?

A

A Network Partition is a failure; preventing nodes from communicating with each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
107
Q

What is a Partition-Tolerant Database?

A

A Partition-Tolerant Database continues functioning despite network partitions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
108
Q

What is a Consistent Database?

A

A Consistent Database requires data across nodes to match and conform.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
109
Q

What is an SQL Update Statement?

A

An SQL Update Statement is a command to modify existing data in a database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
110
Q

What is an SQL Select Statement?

A

An SQL Select Statement is π“ͺ 𝓬𝓸𝓢𝓢π“ͺ𝓷𝓭 𝓽𝓸 retrieve data from a database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
111
Q

What is Percentage Availability?

A

Percentage Availability is the proportion of time a database is responsive to users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
112
Q

What is a Node in a distributed database?

A

A Node is an individual unit within a distributed database system.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
113
Q

What is Table Data Partition?

A

Table Data Partition is a subset of data within a database table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
114
Q

What is a Live Node?

A

A Live Node is a node that actively responds to queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
115
Q

What is a Replicated Database?

A

A Replicated Database maintains multiple copies on different devices.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
116
Q

What is a Simple Backup?

A

A Simple Backup involves backing up one replica while others are active.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
117
Q

What is Enhanced Security?

A

Enhanced Security refers to improved data protection through multiple replicas.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
118
Q

What is Shared Nothing Architecture?

A

Shared Nothing Architecture is where nodes do not share storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
119
Q

What is Shared Storage?

A

Shared Storage allows multiple nodes to access common storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
120
Q

What is Shared Memory?

A

Shared Memory is an architecture where processors share a common memory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
121
Q

What is Complex Server Administration?

A

Complex Server Administration involves managing update propagation across replicas.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
122
Q

What is Replication?

A

Replication is used in parallel and distributed databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
123
Q

What are Frequent Reads?

A

Frequent Reads are a common scenario for using database replication.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
124
Q

What are Infrequent Updates?

A
  • Infrequent Updates occur less often than reads.
  • The frequency of data reads is usually much higher than the frequency of data updates.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
125
Q

What is Temporary Inconsistency in replication?

A

Temporary Inconsistency is an acceptable state during data replication.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
126
Q

What is a Catalog in databases?

A

A Catalog is a directory of information describing database objects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
127
Q

What is a Central Catalog?

A

A Central Catalog is catalog information stored on a single node.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
128
Q

What is Group Replication?

A

Group Replication is a technique allowing updates from any node in the group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
129
Q

What do Double-Headed Arrows indicate?

A

Double-Headed Arrows indicate bidirectional communication between nodes and the cloud.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
130
Q

What are Animation Captions?

A

Animation Captions are visual aids explaining the database update process.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
131
Q

What is Transaction Information?

A

Transaction Information is data sent to nodes before commit to prevent conflicts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
132
Q

What is a Database Management System?

A

A Database Management System is software for managing databases and processing queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
133
Q

What are Replicated Databases?

A

Replicated Databases maintain copies of data across nodes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
134
Q

What is a Central Catalog?

A

A Central Catalog contains the entire catalog on a single node.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
135
Q

What are Local Transactions?

A

Local Transactions are transactions affecting only one node’s data.

136
Q

What is an Out-of-Date Replica?

A

An Out-of-Date Replica is not updated with the latest changes.

137
Q

What is an Error Display in databases?

A

An Error Display advises resubmitting queries if outdated.

138
Q

What is Operational Data?

A

Operational Data is used for daily business functions.

139
Q

What is Analytic Data?

A

Analytic Data is used for business analysis and planning.

140
Q

What is Volatility in data?

A
  • Volatility refers to the rate of change in operational data.
  • Data volatility is about the information. It’s about how often the meaning of the data changes.
141
Q

What is Extracted Data?

A

Extracted Data is pulled from operational databases for processing.

142
Q

What is Data Mart Derivation?

A

Data Mart Derivation is the creation of a data mart from operational or warehouse data.

143
Q

What are Performance Issues?

A

Performance Issues are problems affecting the speed of database queries.

144
Q

What is Data Refresh Frequency?

A

Data Refresh Frequency refers to regular intervals at which data is updated in the warehouse.

145
Q

What is Query Conflict?

A

Query Conflict refers to discrepancies in reports generated by simultaneous queries.

146
Q

What is Metric System Conversion?

A

Metric System Conversion is changing measurements to metric for consistency.

147
Q

What is a system Catalog?

A
  1. A collection of tables within the database that store metadata about the database itself.
  2. Includes information about tables, columns, data types, constraints, and other structural details.
148
Q

What is a Learning Management System?

A

A software application used to create, deliver, track, and manage online courses and training programs

149
Q

What is a Temporary Database?

A

A Temporary Database is short-term storage for data processing.

150
Q

What is Data Standardization?

A

Data Standardization applies consistent formats and definitions to data.

151
Q

What is a Fact Table?

A

A Fact Table stores quantitative data for analysis.

152
Q

What is a Dimension Table?

A

A Dimension Table contains descriptive attributes related to facts.

153
Q

What is Summary Data?

A

Summary Data is precomputed data stored for fast access.

154
Q

What is a Dimensional Model?

A

A Dimensional Model is a structure used to organize data in a data warehouse.

155
Q

What is Nonvolatile Data?

A
  1. Nonvolatile Data remains unchanged (via power outage) until updated or replaced.
  2. It’s stored in non-volatile storage mediums, such as hard drives. flash drives, and optical discs
156
Q

What is Query Submission?

A

Query Submission is the process of requesting information from a data warehouse.

157
Q

What are Ad Hoc Queries?

A

Ad Hoc Queries are spontaneous queries for specific data analysis needs.

158
Q

What are Operational Systems?

A

Operational Systems are designed for executing business transactions.

159
Q

What is Analysis Paralysis?

A

Analysis Paralysis is overthinking that hinders timely decision-making.

160
Q

What are Reporting and Analytics?

A

Reporting and Analytics are processes for interpreting data, to inform business decisions.

161
Q

Who are BI Consumers?

A

BI Consumers are individuals providing requirements for data analysis.

162
Q

What is Data Profiling?

A

Data Profiling is assessing data quality of candidate sources.

163
Q

What are Dimensional Models?

A

Dimensional Models are structures designed for analytical data representation.

164
Q

What is Data Extraction?

A

Data Extraction is the process of retrieving data from sources.

165
Q

What is Data Cleansing?

A

Data Cleansing involves removing inaccuracies from data sets.

166
Q

What is Data Replication?

A

Data Replication is copying data from one database to another.

167
Q

What is Change Data Capture?

A

Change Data Capture is tracking changes in data sources.

168
Q

What are Computational Requirements?

A

Computational Requirements are resources needed for processing data aggregations. Which includes:

  1. The hardware (CPU, GPU, memory, storage)
  2. software (operating system, databases, algorithms) that enable data manipulation and analysis
169
Q

What are Parallel Machine Architectures?

A

Parallel Machine Architectures are systems designed to handle large data processing.

170
Q

What is Storage Space Calculation?

A

Storage Space Calculation is determined by dimensions and their sizes.

171
Q

What is Data Migration?

A

Data Migration is moving data from one system to another.

172
Q

What is a Staging Area?

A

A Staging Area is infrastructure for aggregating raw data sets.

173
Q

What is Data Validation?

A

Data Validation ensures data accuracy and quality before processing.

174
Q

What is Data Cleaning?

A

Data Cleaning involves removing inaccuracies and inconsistencies from data.

175
Q

What is Data Transformation?

A

Data Transformation modifies data into a suitable format for analysis.

176
Q

What is Data Requirements Analysis?

A

Data Requirements Analysis identifies data needed for business intelligence applications.

177
Q

What is Automated Extraction?

A

Automated Extraction refers to tools that automate data retrieval processes.

178
Q

What are SQL Queries?

A

SQL Queries are structured queries for data extraction.

179
Q

What are Adapters?

A

Adapters are connectors for different data sources.

180
Q

What are Code Generation Techniques?

A

Code Generation Techniques are methods for creating executable data extraction programs.

181
Q

What is a Source System?

A

A Source System is the original system from which data is extracted.

182
Q

What is Warehouse Population?

A

Warehouse Population is filling the data warehouse with processed data.

183
Q

What is Metadata?

A

Metadata is data that describes other data.

184
Q

What is Renormalization?

A

Renormalization is reverting denormalized data back to normalized form.

185
Q

What is Aggregation?

A

Aggregation is summarizing data for reporting or analysis.

186
Q

What is Audit Information?

A

Audit Information is data used to verify integrity and accuracy.

187
Q

What are Row Counts?

A

Row Counts refer to the total number of rows in a dataset.

188
Q

What is Null Conversion?

A

Null Conversion transforms various null representations into a standard form.

189
Q

What is Data Augmentation?

A

Data augmentation is a statistical technique that creates new data from existing data to add to data records.

190
Q

What does the architecture of a database system describe?

A

The internal components and the relationships between components

This includes understanding how different parts of the database system interact with each other.

191
Q

What is the role of the query processor in a database system?

A
  1. Interprets queries
  2. creates a plan to modify the database or retrieve data
  3. Returns query results to the application

The query processor optimizes queries for efficiency.

192
Q

How does the storage manager function in a database system?

A

Translates the query processor instructions into low-level file system commands that modify or retrieve data

It also uses indexes to quickly locate data due to the varying sizes of databases.

193
Q

What is the function of the transaction manager?

A

Ensures transactions are properly executed and prevents conflicts between concurrent transactions

It also restores the database to a consistent state in case of failure.

194
Q

What is contained in the log file of a database?

A

A complete record of all inserts, updates, and deletes processed by the database

The transaction manager writes log records before applying changes for recovery purposes.

195
Q

What is the purpose of the catalog in a database system?

A

A directory of tables, columns, indexes, and other database objects

Other components use catalog information to process and execute queries.

196
Q

True or False: All database systems support transactions.

A

False

Some database systems do not support transactions and therefore lack a transaction manager.

197
Q

Fill in the blank: The database sizes range from _______ to many terabytes.

198
Q

What does the transaction manager do in the event of a failure?

A

Uses log records to restore the database

This ensures that the database can return to a consistent state after a failure.

199
Q

What happens to a database transaction if a server failure occurs before completion?

A

The database must reverse any partial changes

This is necessary to maintain data integrity.

200
Q

What is query optimization in the context of a database?

A

The process of ensuring the most efficient instructions are executed on the data

It is performed by the query processor.

201
Q

What might vary significantly between different database systems?

A

Capabilities and component details

For example, the physical structure of data on storage media affects the storage manager’s implementation.

202
Q

What are the main components of a database system?

A

A query processor, storage manager, transaction manager, log, and catalog.

203
Q

What role does the query processor play in a database system?

A

It receives queries from applications and performs query optimization using information from the catalog.

204
Q

How does the storage manager interact with the query processor?

A

It translates the query processor’s instructions into file-system commands and uses an index to locate requested data quickly.

205
Q

What does the transaction manager do in a database system?

A

It logs insert, update, and delete queries, sending the result back to the application.

206
Q

Does the query processor has direct access to the database data on storage media.

A

No, all access to the database data must go through the storage manager

207
Q

Without query optimization, the storage manager cannot $ ?

A
  1. The storage manager can retrieve the data with efficient or inefficient instructions
  2. Query optimization ensures the most efficient instructions are sent to the storage manager.
208
Q

The catalog allows the storage manager to $ ?

A

Catalogs allow quick locations to the requested data.

209
Q

What must be done in order to correctly restore database changes? 2

A
  1. All inserts, updates, and deletes must be logged in order to correctly restore database changes.
  2. Reads are not needed to restore database changes, however, and thus are not logged.
210
Q

What is metadata in the context of a database?

A

Data about the database, such as column names and the number of rows in each table.

211
Q

Where is metadata stored in a database system?

A

In the catalog.

212
Q

The _______ helps locate data quickly by providing information about where on the storage media certain pieces of data can be found.

213
Q

What types of queries does the transaction manager log?

A

Insert, update, and delete queries.

214
Q

What is the purpose of query optimization?

A

To ensure the most efficient instructions are sent to the storage manager.

215
Q

What is a concurrent execution?

A
  1. Concurrent execution is when multiple processes are active at the same time, rather than one after the other.
  2. It can also refer to the execution of multiple transactions at the same time
216
Q

What is query optimization?

A

A crucial aspect of database management focused on enhancing the efficiency and speed of data retrieval operations.

It involves techniques and strategies aimed at minimizing resources for executing queries while maximizing result quality.

217
Q

What is the first step in the query optimization process?

A

The analysis of queries to understand their execution plans.

Execution plans are the specific steps the database engine takes to retrieve requested data.

218
Q

How does indexing contribute to query optimization?

A

Allows for quicker data access and reduces the amount of data the database must scan.

Proper indexing speeds up response times significantly.

219
Q

What are optimal join methods in query optimization?

A

Techniques for combining data from multiple tables, including:
* Nested loops
* Hash joins
* Merge joins

Selecting the appropriate join method based on data characteristics is vital for performance.

220
Q

Why are statistics about data distribution and cardinality important?

A

They help the query optimizer make better decisions regarding execution plans.

Keeping these statistics updated is essential for maintaining optimal performance as data evolves.

221
Q

What is query rewriting?

A

A strategy where queries are reformulated to improve performance without altering the returned results.

This can include simplifying expressions and consolidating multiple queries.

222
Q

How do caching mechanisms enhance query performance?

A

By storing previously retrieved results to quickly return cached results instead of recalculating them.

This saves time and resources during query execution.

223
Q

What is the overall goal of query optimization?

A

To ensure that queries run as efficiently as possible, improving performance and user satisfaction in data-driven applications.

224
Q

True or False: Query optimization involves only the analysis of queries.

A

False.

Query optimization also involves techniques like indexing, join methods, query rewriting, and caching.

225
Q

What is the memory hierarchy?

A

A structured arrangement of different types of memory in a computer system, organized by speed and cost.

226
Q

What are registers in the memory hierarchy?

A

Small, fast storage locations within the CPU that hold the most frequently used data and instructions for quick access.

227
Q

What is the function of the L1 cache?

A

Stores copies of frequently accessed data from main memory, allowing for quick retrieval.

228
Q

What is the difference between L1 and L2 cache?

A

L1 is on-chip and smaller, while L2 is off-chip and larger, serving as an intermediary between L1 cache and main memory.

229
Q

What is the primary storage area in a computer called?

A

Main Memory (DRAM).

230
Q

What types of devices are considered local secondary storage?

A

Hard Drives (HDDs) and Solid State Drives (SSDs).

231
Q

What is remote secondary storage?

A

Storage located on networks, often accessed via the internet, providing access to large amounts of data stored off-site.

232
Q

Fill in the blank: Each level in the memory hierarchy provides a trade-off between _______ and cost.

233
Q

What is the function of local secondary storage?

A

Holds data and programs that are not currently in use; slower than main memory.

234
Q

What is the role of access patterns in memory hierarchy?

A

Programs are designed to access data in a way that takes advantage of the hierarchy, using faster memory for frequently accessed data.

235
Q

What manages database connections and compiles SQL queries?

A

The query processor

The query processor is responsible for managing connections and translating SQL into low-level storage engine instructions.

236
Q

What authorizes a user to access a database?

A

A connection

Connections are essential for user access to databases.

237
Q

What is MySQL Workbench?

A

A desktop application that enables database administrators and users to interact with the database

MySQL Workbench does not directly interact with the query processor.

238
Q

How does the file system interact with the query processor?

A

Indirectly, via the storage engine

The file system does not interact directly with the query processor.

239
Q

Is the query processor aware of data addresses and structures on storage media?

A

False

The query processor executes instructions compiled from SQL queries without knowledge of data addresses.

240
Q

What does the storage engine execute?

A

Low-level instructions generated by the query processor from SQL queries

This execution is crucial for the actual data operations within the database.

241
Q

What do storage engines that support transactions manage?

A

Concurrency and recovery

This management is essential for maintaining data integrity in transactional systems.

242
Q

What are the five categories of utility programs?

A

Installation, client, administrative, developer, miscellaneous tools

243
Q

How many tools do utility programs include approximately?

A

Approximately 30 tools

244
Q

Who are the primary users of utility programs?

A

Database administrators or programmers

245
Q

What is one example function of utility programs?

A

Upgrade existing databases to a new MySQL release

Other functions include backing up databases, importing data, inspecting log files, and administering database servers.

246
Q

Fill in the blank: Utility programs are intended primarily for _______.

A

[database administrators or programmers]

247
Q

What are application programs built by?

A

Programmers using MySQL

248
Q

Are application programs part of MySQL?

249
Q

What is Workbench?

A

A desktop application with broad capabilities

250
Q

What can Workbench do?

A

Assign data access privileges and monitor database performance

Many Workbench capabilities are administrative functions.

251
Q

What is mysql_config_editor?

A

A tool in the administrative group of utility programs

252
Q

What does mysql_config_editor help database administrators do?

A

Configure MySQL

253
Q

Are MySQL APIs older or newer than Connectors?

A connection is a link between tools and the query processor.

254
Q

What are many Connectors built on top of?

255
Q

What does the query parser do?

A

Checks each SQL statement for correct syntax and converts the statement to an internal representation

The query parser is essential for ensuring that SQL commands are valid before execution.

256
Q

What is the role of the connection manager?

A

Processes connection requests from the tools layer

The connection manager is responsible for establishing links between the user applications and the database.

257
Q

What information does a database connection include?

A
  • Database name
  • Server address
  • Logon credentials

These elements are necessary for successful communication with the database.

258
Q

What is the cache in the context of database management?

A

An area of main memory that contains recent queries, execution plans, and results for possible reuse

Caching improves performance by reducing the need to reprocess frequent queries.

259
Q

What is the function of the cache manager?

A

Stores reusable information in main memory

The cache manager optimizes access to frequently used data, enhancing overall system efficiency.

260
Q

Fill in the blank: The cache is an area of main memory that contains recent queries, execution plans, and _______.

261
Q

True or False: The connection manager handles the actual execution of SQL statements.

A

False

The connection manager is involved in managing connections, not executing queries.

262
Q

What does the file system layer consist of?

A

Data stored on storage media and organized in files

The file system layer is integral to managing how data is stored and organized in a database.

263
Q

What are the three types of data contained in a database file system?

A
  • User data
  • Log files
  • Data dictionary

These three types of data serve specific functions within the database management system.

264
Q

What does user data include in a database?

A
  • Tables
  • Indexes

User data is crucial for storing actual information and facilitating efficient data retrieval.

265
Q

What is the purpose of log files in a database?

A

To contain a detailed, sequential record of each change applied to a database

Log files are essential for the recovery system to restore data after failures.

266
Q

How does the recovery system utilize log files?

A

To restore data in the event of a transaction, system, or storage media failure

Log files help ensure data integrity and reliability.

267
Q

What is a catalog in the context of a relational database?

A

A directory of tables, columns, keys, indexes, and other objects

The catalog, or data dictionary, is critical for managing database structure.

268
Q

What term does MySQL use for its data dictionary?

A

β€˜Data dictionary’

MySQL’s implementation includes a structured approach to managing metadata.

269
Q

How many tables does the MySQL data dictionary contain?

A

Roughly 30 tables

These tables provide comprehensive information about the database schema.

270
Q

What does the β€˜tables’ data dictionary table describe?

A

All tables

This table provides metadata about the structure of each table in the database.

271
Q

What information does the β€˜table stats’ data dictionary table contain?

A

Table statistics, such as the number of rows in each table

This information is useful for optimization and performance tuning.

272
Q

What does the β€˜columns’ data dictionary table describe?

A

All columns

This table details the structure and attributes of each column in the database.

273
Q

What information does the β€˜foreign keys’ data dictionary table provide?

A

All foreign keys

This table is important for understanding relationships between tables.

274
Q

What does the β€˜indexes’ data dictionary table describe?

A

All indexes

Indexes are crucial for improving query performance.

275
Q

What does the β€˜routines’ data dictionary table contain?

A

All stored procedures and stored functions

This table helps manage executable database logic.

276
Q

What information does the β€˜triggers’ data dictionary table describe?

A

All triggers

Triggers are automated actions that occur in response to certain events in the database.

277
Q

Can data dictionary tables be accessed directly with SELECT, INSERT, UPDATE, and DELETE queries?

A

No

Direct access is restricted to maintain the integrity of the data dictionary.

278
Q

How can the contents of data dictionary tables be accessed?

A

Indirectly

This is done through specific commands like SHOW, which compile as SELECT queries.

279
Q

What does the SHOW COLUMNS command generate?

A

A SELECT query against the columns table

This command is used to retrieve column information for a specific table.

280
Q

What type of query does CREATE generate against dictionary tables?

A

INSERT

This reflects the addition of new objects to the database schema.

281
Q

What type of query does ALTER generate against dictionary tables?

A

UPDATE

This reflects modifications to existing database objects.

282
Q

What type of query does DROP generate against dictionary tables?

A

DELETE

This reflects the removal of objects from the database schema.

283
Q

How can multiple nodes can be updated independently with local transactions?

A
  1. The transaction coordinator notifies participating nodes of required updates.
  2. Nodes commit immediately and confirm with the transaction coordinator.
  3. If a node is unavailable, the transaction coordinator repeats the update message until confirmation is received.
284
Q

What are updates done in a distributed transactions and in separate local transactions called?

A
  1. Updates in separate local transactions are asynchronous.
  2. Updates in a distributed transaction are synchronous.
285
Q

What must most distributed databases always function as?

A

Partition-tolerant

This means they can continue to operate even when there are network issues.

286
Q

In a distributed database, what do consistency and availability guarantee?

A

Either consistency or availability, but not both

This is a key principle in distributed database design.

287
Q

What does a consistent database conform to?

A

All rules at all times

Consistency ensures that data integrity is maintained across all nodes.

288
Q

Give an example of a rule that may govern data in a distributed database.

A

Foreign key values on one node must match primary key values on another node

This ensures relational integrity across nodes.

289
Q

What must be true about copies of data on multiple nodes in a consistent database?

A

They must be identical

This is crucial for maintaining data consistency.

290
Q

What must live nodes do in an available database?

A

Respond to queries at all times

This means that even if some nodes are down, the database remains operational.

291
Q

What occurs when a network error prevents nodes from communicating?

A

A network partition forms

This can disrupt the functioning of distributed databases.

292
Q

Why do distributed databases occasionally experience network partitions?

A

Because nodes are connected by wide area networks that occasionally fail

This vulnerability is inherent in distributed systems.

293
Q

What characterizes a partition-tolerant database?

A

It continues to function when a network partition occurs

This is essential for maintaining operational integrity in distributed systems.

294
Q

What is the process for updating replicas in a database on a single node?

A

It is a simple process where either storage solutions handle replication internally or the database updates all replicas in a single local transaction.

295
Q

What is required for synchronizing replicas in a database?

A

Synchronizing replicas does not necessitate any special database features.

296
Q

What complicates the process of updating replicated data in a distributed database?

A

It is more complicated because updating all replicas in a distributed transaction is generally slower and fails if any replica is down.

297
Q

What is the primary/secondary method in managing replicated data?

A

It assigns one node as the primary where all updates are made initially, and secondary nodes are updated afterward.

298
Q

What happens if the primary node fails in the primary/secondary method?

A

The system automatically identifies a new primary node to maintain availability.

299
Q

How does the group replication method work?

A

Updates can be applied to any node within a group, which shares transaction details with other nodes before committing.

300
Q

What occurs if a conflict arises during the group replication method?

A

An algorithm decides which transaction will proceed and which will be rolled back.

301
Q

What can influence the complexity of the algorithm used in group replication?

A

It can range from simple, allowing the first transaction to commit, to more complex solutions.

302
Q

What happens during a network partition in group replication?

A

Processing will be paused temporarily.

303
Q

How does participation activity differ among relational databases?

A

Support for primary/secondary and group replication methods can differ.

304
Q

Which database supports both primary/secondary and group replication methods?

A

MySQL with the InnoDB storage engine.

305
Q

What advantage does primary/secondary replication provide for read queries?

A

Each node can handle read queries independently without needing to connect with other nodes across the wide area network.

306
Q

What is the process of reading data from operational systems and writing it to a temporary database called?

A

β€˜extraction’

This is the initial step in the data processing pipeline.

307
Q

What is β€˜cleansing’ in the context of data management?

A

Correcting errors in operational data and converting to a standard format

Cleansing ensures data quality and consistency.

308
Q

What issue arises from data in different operational systems?

A

Incompatible or missing keys

This can complicate data integration efforts.

309
Q

What is one example of data integration?

A

Creating uniform primary and foreign keys

This facilitates consistent data relationships across systems.

310
Q

What is the term for converting data from a design optimized for operations to a design optimized for analytics?

A

β€˜restructuring’

Restructuring is crucial for effective data analysis.

311
Q

What do dimensional designs in data warehouses usually include?

A

Date and time dimension tables

These tables are essential for tracking historical data.

312
Q

How many rows would the date dimension table contain if an organization tracks data for 100 years?

A

36,500 rows

This is calculated as 100 years x 365 days per year.

313
Q

How many rows does the time dimension table contain?

A

1,440 rows

This is calculated as 24 hours x 60 minutes per hour.

314
Q

What do fact tables contain in relation to date and time dimensions?

A

Foreign keys referencing date, time, or both dimensions

This establishes the time of a fact.

315
Q

What is a dimension hierarchy?

A

A sequence of columns where each column has a one-many relationship to the next column

This structure helps in organizing data in a meaningful way.

316
Q

What is a data lake?

A

An analytic database of raw, unprocessed data copied from multiple data sources.

Data lakes share characteristics of both data warehouses and federated databases.

317
Q

What role does the global query processor play in a federated database?

A

It decomposes federated queries and merges the results into a federated result.

Individual databases process decomposed queries independently.

318
Q

When is referential integrity enforced in InfoSphere?

A

When primary and foreign keys are in the same database.

Referential integrity is not enforced when they are in different databases.

319
Q

What is a federated database?

A

A collection of two or more participating databases under a coordinating software layer.

The participating databases are autonomous and heterogeneous.

320
Q

Define an autonomous database.

A

A database that operates independently of other participating databases.

It can be administered and queried as if it were not part of a federated database.

321
Q

What does heterogeneous mean in the context of databases?

A

Databases that run under different database systems or have incompatible schema.

This may include inconsistent primary and foreign keys or similar tables with different designs.

322
Q

What is the function of middleware in a federated database?

A

Middleware serves as the coordinating software layer between application programs and database software.

Many federated database middleware products are available.

323
Q

Name two examples of federated database middleware products.

A
  • InfoSphere Federated Server from IBM
  • WebLogic Server from Oracle
324
Q

What is a global catalog in a federated database?

A

A directory of participating database objects, such as tables, columns, and indexes.

It is one of the components found in most federated database middleware products.

325
Q

What does a database wrapper do?

A

Converts decomposed queries to the appropriate syntax for each participating database.

326
Q

True or False: A federated database can include databases with compatible schemas only.

A

False

Federated databases may include heterogeneous databases with incompatible schemas.

327
Q

What is the typical nature of web click data?

A

Rapidly collected and high volume

328
Q

How is social media data typically characterized?

A

Unstructured

329
Q

What type of data analysis is often appropriate for high-volume, unstructured data?

A

Data lake supported by data scientists

330
Q

What solution can make three operational databases appear as one?

A

Federated database

331
Q

What types of data are usually highly structured and suitable for fact tables?

A

Sales revenue and manufacturing costs

332
Q

What do analysis of revenue and cost by product line and region suggest?

A

Dimension tables

333
Q

What is required for quarterly reports?

A

Periodic extracts at a fixed period

334
Q

What do the requirements for periodic extracts suggest?

A

Data warehouse or data mart

335
Q

What can easily handle data that is local to one software layer and not persisted on storage media?

A

Embedded database

336
Q

When is an embedded database not appropriate?

A

If the data must be shared or persisted