RDBMS Flashcards by Karthik H

“What is database replication?”

“Replication is the process of copying and maintaining database instances in multiple locations to ensure high availability and fault tolerance.”

How well did you know this?

Not at all

Perfectly

“What is the difference between master-slave and master-master replication?”

“In master-slave replication, one master handles writes and multiple slaves handle reads. In master-master replication, multiple masters can handle both reads and writes.”

How well did you know this?

Not at all

Perfectly

“Which type of replication introduces potential data conflicts?”

“Master-master replication, since multiple nodes can perform write operations, leading to synchronization challenges.”

How well did you know this?

Not at all

Perfectly

“What is the main drawback of synchronous replication?”

“It ensures strong consistency but can introduce higher latency due to the need for immediate replication.”

How well did you know this?

Not at all

Perfectly

“Which replication method prioritizes performance over consistency?”

“Asynchronous replication, as it allows data to be written to replicas with some delay.”

How well did you know this?

Not at all

Perfectly

“How does replication improve read-heavy workloads?”

“By directing read queries to slave databases, reducing load on the master.”

How well did you know this?

Not at all

Perfectly

“What is database sharding?”

“Sharding is the process of dividing a large database into smaller, more manageable pieces called shards, each containing a subset of the data.”

How well did you know this?

Not at all

Perfectly

“How is data distributed in sharding?”

“Data is distributed based on a sharding key, such as user ID or geographic region.”

How well did you know this?

Not at all

Perfectly

“What are the primary benefits of sharding?”

“It enables horizontal scaling, reduces query response times, and allows parallel processing.”

How well did you know this?

Not at all

Perfectly

“What challenges can arise from sharding?”

“Uneven data distribution (hotspots), complex query routing, and expensive re-sharding operations.”

How well did you know this?

Not at all

Perfectly

“What is the difference between sharding and replication?”

“Sharding partitions data across multiple nodes, while replication duplicates the same data across multiple nodes.”

How well did you know this?

Not at all

Perfectly

“What is a common strategy for choosing a sharding key?”

“A key with even distribution across data, such as user IDs hashed to different shards.”

How well did you know this?

Not at all

Perfectly

“What is database federation?”

“Federation involves breaking a database into multiple independent databases that operate separately but can be queried together if needed.”

How well did you know this?

Not at all

Perfectly

“How does federation differ from sharding?”

“In federation, each database operates independently, while in sharding, shards are part of the same logical database.”

How well did you know this?

Not at all

Perfectly

“What are the advantages of database federation?”

“Flexibility to use different database technologies, independent scaling, and better isolation between different datasets.”

How well did you know this?

Not at all

Perfectly

“What are the drawbacks of database federation?”

Study These Flashcards

“More complex query logic and increased operational overhead for managing multiple databases.”

“Give an example of a real-world use case of federation.”

Study These Flashcards

“An e-commerce platform where products, users, and orders are stored in separate databases.”

“What is denormalization?”

Study These Flashcards

“Denormalization is the process of adding redundant data to reduce the number of joins and improve read performance.”

“What is the main advantage of denormalization?”

Study These Flashcards

“Faster read queries by reducing the number of required joins.”

“What is the main disadvantage of denormalization?”

Study These Flashcards

“Increased storage requirements and potential data inconsistency.”

“When should denormalization be used?”

Study These Flashcards

“In read-heavy applications where performance is critical.”

“How does denormalization impact data integrity?”

Study These Flashcards

“It increases the risk of data inconsistency due to duplicated information.”

“What is SQL tuning?”

Study These Flashcards

“SQL tuning is the process of optimizing SQL queries to improve performance and reduce resource usage.”

“What is an index in SQL?”

Study These Flashcards

“An index is a database structure that improves the speed of data retrieval.”

"Why should you avoid over-indexing?"

"Over-indexing increases storage overhead and slows down write operations."

"What does the `EXPLAIN` statement do?"

"It shows the execution plan of a query, helping identify performance bottlenecks."

"What is query caching?"

"Query caching stores frequently accessed query results to reduce database load."

"What is connection pooling?"

"Connection pooling reuses database connections to reduce overhead from creating new connections."

"How does replication improve fault tolerance?"

"It ensures data is available on multiple servers, reducing the risk of data loss if one server fails."

"What are primary keys and foreign keys?"

"A primary key uniquely identifies a record, while a foreign key links to a primary key in another table."

"What is the difference between vertical and horizontal scaling?"

"Vertical scaling increases hardware resources on a single server, while horizontal scaling adds more servers."

"What is the CAP theorem?"

"The CAP theorem states that a distributed system can only provide two out of three: Consistency, Availability, and Partition tolerance."

"What are some techniques to improve query performance?"

"Using indexing, partitioning, caching, and optimizing SQL queries."

"What is the difference between OLAP and OLTP databases?"

"OLTP is optimized for transactional queries, while OLAP is optimized for analytical queries."

"What is the difference between a clustered and non-clustered index?"

"A clustered index determines the physical order of data, while a non-clustered index does not."

"What is the role of a database optimizer?"

"The database optimizer chooses the most efficient way to execute a query."

"What is a database deadlock?"

"A deadlock occurs when two transactions hold locks that the other needs, causing a cycle of waiting."

"What is ACID compliance in databases?"

"ACID stands for Atomicity, Consistency, Isolation, and Durability, ensuring reliable transactions."

RDBMS Flashcards

(38 cards)