Systems Design: Basics, Load Balancing, Caching Flashcards

1
Q

What are functional vs. non-functional requirements?

A

Functional requirements are the requirements that define what a system is supposed to do. They describe the various functions that the system must perform.

Non-functional requirements describe how the system performs a task, rather than what tasks it performs. They are related to the quality attributes of the system, such as performance, scalability, and availability.

2
Q

What kinds of estimations might you need to make in a systems design interview?

A

In system design interviews, there are several types of estimations you may need to make:

Load estimation: Predict the expected number of requests per second, data volume, or user traffic for the system.

Storage estimation: Estimate the amount of storage required to handle the data generated by the system.

Bandwidth estimation: Determine the network bandwidth needed to support the expected traffic and data transfer.

Latency estimation: Predict the response time and latency of the system based on its architecture and components.

Resource estimation: Estimate the number of servers, CPUs, or memory required to handle the load and maintain desired performance levels.

3
Q

Suppose you’re asked to design a social media platform with 100 million daily active users (DAU) and an average of 10 posts per user per day. To estimate the load, you’d calculate the total number of posts generated daily:

A

100 million DAU * 10 posts/user = 1 billion posts/day

1 billion posts/day / 86,400 seconds/day ≈ 11,574 requests/second
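
The arithmetic above can be sketched as a quick back-of-envelope script (the figures are the hypothetical ones from the card):

```python
# Back-of-envelope load estimation (hypothetical figures from the card).
dau = 100_000_000          # daily active users
posts_per_user = 10        # posts per user per day
seconds_per_day = 86_400

posts_per_day = dau * posts_per_user          # 1,000,000,000 posts/day
avg_rps = posts_per_day / seconds_per_day     # ~11,574 requests/second

print(f"{posts_per_day:,} posts/day, ~{avg_rps:,.0f} req/s average")
```

Note this is an average; peak traffic is often a small multiple of the average, so capacity planning usually budgets for peak, not mean.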

4
Q

Storage Estimation

Consider a photo-sharing app with 500 million users and an average of 2 photos uploaded per user per day. Each photo has an average size of 2 MB. To estimate the storage required for one day’s worth of photos, you’d calculate:

A

500 million users * 2 photos/user * 2 MB/photo = 2,000,000,000 MB/day ≈ 2 PB/day

5
Q

Bandwidth Estimation

For a video streaming service with 10 million users streaming 1080p videos at 4 Mbps, you can estimate the required bandwidth:

A

10 million users * 4 Mbps = 40,000,000 Mbps = 40 Tbps

6
Q

Latency Estimation

Suppose you’re designing an API that fetches data from multiple sources, and you know that the average latency for each source is 50 ms, 100 ms, and 200 ms, respectively. If the data fetching process is sequential, you can estimate the total latency as follows:

A

50 ms + 100 ms + 200 ms = 350 ms

If the data fetching process is parallel, the total latency would be the maximum latency among the sources:

max(50 ms, 100 ms, 200 ms) = 200 ms
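
Both cases reduce to a one-liner over the per-source latencies:

```python
# Sequential vs. parallel fan-out latency across three data sources.
latencies_ms = [50, 100, 200]

sequential = sum(latencies_ms)   # 350 ms: each fetch waits for the previous one
parallel = max(latencies_ms)     # 200 ms: bounded by the slowest source

print(f"sequential: {sequential} ms, parallel: {parallel} ms")
```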

7
Q

Resource Estimation

Imagine you’re designing a web application that receives 10,000 requests per second, with each request requiring 10 ms of CPU time. To estimate the number of CPU cores needed, you can calculate the total CPU time per second:

A

10,000 requests/second * 10 ms/request = 100,000 ms/second

Assuming each CPU core can handle 1,000 ms of processing per second, the number of cores required would be:

100,000 ms/second / 1,000 ms/core = 100 cores
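
The same calculation as a sketch (figures are the hypothetical ones from the card):

```python
# CPU core estimate: total CPU time demanded per second / capacity per core.
requests_per_second = 10_000
cpu_ms_per_request = 10
ms_per_core_per_second = 1_000   # one core provides ~1,000 ms of CPU time per second

total_cpu_ms = requests_per_second * cpu_ms_per_request   # 100,000 ms/second
cores = total_cpu_ms / ms_per_core_per_second             # 100 cores

print(f"{cores:.0f} cores at 100% utilization")
```

In practice you would target well under 100% utilization and add headroom for peaks, so the real provisioned number would be larger.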

8
Q

Estimation example: designing a messaging service.

A

Number of users: Estimate the total number of users for the platform. This can be based on market research, competitor analysis, or historical data.

Messages per user per day: Estimate the average number of messages sent by each user per day. This can be based on user behavior patterns or industry benchmarks.

Message size: Estimate the average size of a message, considering text, images, videos, and other media content.

Storage requirements: Calculate the total storage needed to store messages for a specified retention period, taking into account the number of users, messages per user, message size, and data redundancy.

Bandwidth requirements: Estimate the bandwidth needed to handle the message traffic between users, considering the number of users, messages per user, and message size.

9
Q

Designing a video streaming platform

A

Number of users: Estimate the total number of users for the platform based on market research, competitor analysis, or historical data.

Concurrent users: Estimate the number of users who will be streaming videos simultaneously during peak hours.

Video size and bitrate: Estimate the average size and bitrate of videos on the platform, considering various resolutions and encoding formats.

Storage requirements: Calculate the total storage needed to store the video content, taking into account the number of videos, their sizes, and data redundancy.

Bandwidth requirements: Estimate the bandwidth needed to handle the video streaming traffic, considering the number of concurrent users, video bitrates, and user locations.

10
Q

When designing a large system, what things do you need to consider?

https://www.designgurus.io/course-play/grokking-the-system-design-interview/doc/system-design-basics

A
  • What are the different architectural pieces that can be used?
  • How do these pieces work with each other?
  • How can we best utilize these pieces: what are the right tradeoffs?
11
Q

What are the key characteristics of distributed systems?

A

Scalability, Reliability, Availability, Efficiency, and Manageability

12
Q

What is scalability?

A

Scalability is the capability of a system, process, or network to grow and manage increased demand. Any distributed system that can continuously evolve to support a growing amount of work is considered scalable.

13
Q

What is reliability?

A

Reliability refers to the ability of a system to continue operating correctly and effectively in the presence of faults, errors, or failures. In simple terms, a distributed system is considered reliable if it keeps delivering its services even when one or several of its software or hardware components fail.

A related concept is Fault Tolerance, which is the system’s ability to continue operating (possibly at a reduced level) even when one or more of its components fail. In other words, it is the property that allows a system to absorb or recover from faults without total breakdown.

14
Q

Reliability vs Fault Tolerance

A

Scope:

Reliability focuses on the end-to-end correctness and consistency of the entire system’s operation over time.
Fault tolerance focuses on the system’s ability to continue operating when individual components fail.

Perspective:

Reliability is primarily a user-centric concept: Can the system consistently meet the user’s expectations over time?
Fault tolerance is more of a system-centric concept: How does the system handle internal failures or component breakdowns?

Measurement:

Reliability is often measured in terms of uptime, error rates, or mean time between failures (MTBF).
Fault tolerance is often measured by how quickly and effectively the system detects, isolates, and recovers from failures (e.g., failover times).

15
Q

What is efficiency?

A

Two standard measures of a distributed system's efficiency are response time (or latency), the delay to obtain the first item, and throughput (or bandwidth), the number of items delivered in a given unit of time (e.g., a second).

These correspond to the following two unit costs:
* Number of messages globally sent by the nodes of the system, regardless of message size.
* Size of messages, representing the volume of data exchanged.

16
Q

What is availability?

A

By definition, availability is the proportion of time a system remains operational to perform its required function over a specific period. It is often expressed as a percentage of uptime (e.g., 99.9%).

17
Q

Serviceability or Manageability

A

Serviceability or manageability is the simplicity and speed with which a system can be repaired or maintained.

18
Q

What layer does AWS’s ALB operate on and what is its use case?

A

Layer 7 (the Application Layer of the OSI model). Designed for HTTP and WebSocket traffic.

19
Q

What is AWS’s Elastic Load Balancer?

A

Elastic Load Balancer (ELB)
This is the umbrella term for AWS’s load balancing service, which includes the Application Load Balancer (ALB), Network Load Balancer (NLB), and Gateway Load Balancer (GWLB). Initially, it referred to the Classic Load Balancer (CLB), which is now deprecated for new deployments.

20
Q

What layer does the Network Load Balancer work at and what are its use cases?

A

Layer: Operates at Layer 4 (Transport Layer of the OSI model).

Use Case: Designed for TCP/UDP and TLS traffic with ultra-high performance and low latency requirements.

21
Q

What is OSI and what are its seven layers?

A

OSI (Open Systems Interconnection) model:
Layer 7: Application Layer
Layer 6: Presentation Layer
Layer 5: Session Layer
Layer 4: Transport Layer
Layer 3: Network Layer
Layer 2: Data Link Layer
Layer 1: Physical Layer

22
Q

What are the different algorithms load balancers can use to determine where to direct traffic?

A

Least Connection Method
Least Response Time Method
Least Bandwidth Method (measured in Mbps)
Round Robin Method
Weighted Round Robin Method
IP Hash

https://www.designgurus.io/course-play/grokking-the-system-design-interview/doc/load-balancing
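
Two of the methods above can be sketched in a few lines (the `Server` class and server names here are hypothetical, purely for illustration):

```python
import itertools

class Server:
    """Hypothetical backend server tracked by the load balancer."""
    def __init__(self, name):
        self.name = name
        self.active_connections = 0

servers = [Server("a"), Server("b"), Server("c")]

# Round Robin: hand out servers in a fixed rotating order.
rr = itertools.cycle(servers)
def round_robin():
    return next(rr)

# Least Connection: pick the server with the fewest active connections.
def least_connections():
    return min(servers, key=lambda s: s.active_connections)
```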

23
Q

What principle do caches make use of?

A

Locality of reference principle: recently requested data is likely to be requested again.

24
Q

What kinds of cache implementations are there?

A

In-memory caching: stores data in the main memory of the computer, which is faster to access than disk storage.

Disk caching: stores data on the hard disk, which is slower than main memory but faster than retrieving data from a remote source.

Database caching: stores frequently accessed data in the database itself, reducing the need to access external storage.

CDN caching: stores data on a distributed network of servers, reducing the latency of accessing data from remote locations.

https://www.designgurus.io/course-play/grokking-the-system-design-interview/doc/caching

25
Q

What is a cache?

A

A temporary storage location for data or computation results, typically designed for fast access and retrieval.

26
Q

What is a Cache hit?

A

When a requested data item or computation result is found in the cache.

27
Q

What is a cache miss?

A

When a requested data item or computation result is not found in the cache and needs to be fetched from the original data source or recalculated.

28
Q

What is cache eviction?

A

The process of removing data from the cache, typically to make room for new data or based on a predefined cache eviction policy.

29
Q

What is cache staleness?

A

When the data in the cache is outdated compared to the original data source.

30
Q

What is in-memory caching commonly used for and how might it be implemented?

A

Caching API responses, session data, and web page fragments using a cache library like Memcached or Redis, or implementing custom caching logic within the application code

https://www.designgurus.io/course-play/grokking-the-system-design-interview/doc/caching

31
Q

Web server vs app server

A

A web server serves static content (HTML, CSS, JavaScript, images) and typically handles cross-cutting HTTP concerns such as reverse proxying, caching, TLS termination, and load balancing. An application server runs the business logic and generates dynamic content, usually sitting behind the web server.

32
Q

What is disk caching and why would you use it?

A

Disk caching is useful for data that is too large to fit in memory or for data that needs to persist between application restarts. This type of caching is commonly used for caching database queries and file system data.

33
Q

What is server side caching? What is client side caching? What’s an example of both?

A

Server side caching occurs on a server and can include full-page caching, fragment caching, query result caching, precomputed results, and object caching.

Client-side caching occurs in the client, such as a phone application or browser. It stores frequently accessed data, such as images, CSS, or JavaScript files, to reduce the need for repeated requests to the server. Examples of client-side caching include browser caching and local storage.

34
Q

What is CDN caching and what is it used for?

A

CDN caching stores data on a distributed network of servers, reducing the latency of accessing data from remote locations. This type of caching is useful for data that is accessed from multiple locations around the world, such as images, videos, and other static assets. CDN caching is commonly used for content delivery networks and large-scale web applications.

35
Q

What is DNS caching and what is it used for?

A

DNS cache is a type of cache used in the Domain Name System (DNS) to store the results of DNS queries for a period of time (the record’s TTL). It allows repeated lookups to return IP addresses quickly without querying upstream DNS servers again.

36
Q

What are the three primary caching strategies?

A
  1. Write-through cache: Under this scheme, data is written into the cache and the corresponding database simultaneously.
  2. Write-around cache: This technique is similar to write-through cache, but data is written directly to permanent storage, bypassing the cache.
  3. Write-back cache: Under this scheme, data is written to cache alone, and completion is immediately confirmed to the client. The write to the permanent storage is done after specified intervals or under certain conditions.
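
A minimal sketch contrasting write-through and write-back, using plain dicts as stand-ins for the cache and the backing store (all names here are illustrative):

```python
cache, storage = {}, {}
dirty = set()   # keys written to the cache but not yet flushed (write-back)

def write_through(key, value):
    cache[key] = value
    storage[key] = value      # both writes complete before acknowledging

def write_back(key, value):
    cache[key] = value        # acknowledge immediately; storage lags behind
    dirty.add(key)

def flush():
    """Persist dirty entries; in a real system this runs periodically."""
    for key in dirty:
        storage[key] = cache[key]
    dirty.clear()
```

The risk of write-back is visible in the sketch: until `flush()` runs, a crash loses everything in `dirty`.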
37
Q

What are the tradeoffs with write-through caching strategy?

A

Complete data consistency between the cache and the storage. Also, this scheme ensures that nothing will get lost in case of a crash, power failure, or other system disruptions.

Since every write operation must be done twice before returning success to the client, this scheme has the disadvantage of higher latency for write operations.

38
Q

What are the trade-offs with write around caching strategy?

A

It reduces the cache being flooded with write operations that will not subsequently be re-read, but has the disadvantage that a read request for recently written data will create a “cache miss” and must be read from slower back-end storage and experience higher latency.

39
Q

What are the tradeoffs with write-back caching strategy?

A

It results in low latency and high throughput for write-intensive applications; however, this speed comes with the risk of data loss in case of a crash or other adverse event, because the only copy of the written data is in the cache until it is synced to the database.

40
Q

What are the main cache invalidation methods?

A
  1. Purge
  2. Refresh
  3. Ban
  4. TTL
  5. Stale-while-revalidate
41
Q

Describe Purge cache invalidation method.

A

The purge method removes cached content for a specific object, URL, or a set of URLs. It’s typically used when there is an update or change to the content and the cached version is no longer valid. When a purge request is received, the cached content is immediately removed, and the next request for the content will be served directly from the origin server.

Appropriate when invalid cache items need to be removed immediately, for example when a product price is updated.

42
Q

Describe the refresh cache invalidation method.

A

Fetches requested content from the origin server, even if cached content is available. When a refresh request is received, the cached content is updated with the latest version from the origin server, ensuring that the content is up-to-date. Unlike a purge, a refresh request doesn’t remove the existing cached content; instead, it updates it with the latest version.

43
Q

Describe ban cache invalidation.

A

The ban method invalidates cached content based on specific criteria, such as a URL pattern or header. When a ban request is received, any cached content that matches the specified criteria is immediately removed, and subsequent requests for the content will be served directly from the origin server.

This is usually initiated by external events or triggers.

44
Q

Describe TTL cache invalidation.

A

This method involves setting a time-to-live value for cached content, after which the content is considered stale and must be refreshed. When a request is received for the content, the cache checks the time-to-live value and serves the cached content only if the value hasn’t expired. If the value has expired, the cache fetches the latest version of the content from the origin server and caches it.
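
The TTL check can be sketched as a minimal in-process cache (names are illustrative; a real cache would also re-fetch from the origin on expiry):

```python
import time

cache = {}  # key -> (value, expires_at)

def put(key, value, ttl_seconds):
    cache[key] = (value, time.monotonic() + ttl_seconds)

def get(key):
    entry = cache.get(key)
    if entry is None:
        return None               # cache miss
    value, expires_at = entry
    if time.monotonic() >= expires_at:
        del cache[key]            # stale: treat as a miss
        return None
    return value
```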

45
Q

Describe stale-while-revalidate cache invalidation.

A

This method is used in web browsers and CDNs to serve stale content from the cache while the content is being updated in the background. When a request is received for a piece of content, the cached version is immediately served to the user, and an asynchronous request is made to the origin server to fetch the latest version of the content. Once the latest version is available, the cached version is updated. This method ensures that the user is always served content quickly, even if the cached version is slightly outdated.

46
Q

What are two cache read strategies?

A

Read-aside (cache-aside) and read-through caching.

47
Q

Describe read aside caching.

A

Also known as lazy loading or cache-aside. The application (client) directly interacts with both the cache and the underlying data source.

On a cache miss:
The application retrieves the data from the underlying source (e.g., a database), then stores the fetched data in the cache for future use.

On a cache hit:
The application retrieves the data directly from the cache.

The cache is explicitly managed by the application.
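
The miss/hit flow can be sketched as follows (`fetch_from_db` is a hypothetical stand-in for the real data source):

```python
cache = {}

def fetch_from_db(key):
    """Hypothetical backing-store lookup."""
    return f"value-for-{key}"

def get(key):
    if key in cache:
        return cache[key]        # cache hit: serve directly from the cache
    value = fetch_from_db(key)   # cache miss: go to the source...
    cache[key] = value           # ...and populate the cache for next time
    return value
```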

48
Q

Describe the pros and cons of read aside caching.

A

Advantages:
Flexibility: The application has full control over caching logic.
Efficient Use of Cache: Only frequently accessed data is loaded into the cache.

Disadvantages:
Increased Complexity: The application must handle cache misses and updates explicitly.
Risk of Stale Data: Requires additional logic to handle cache invalidation or updates.

49
Q

Describe read through caching

A

The application interacts only with the cache.

On a cache miss the cache automatically retrieves the data from the underlying source.
The retrieved data is then stored in the cache for future use.

On a cache hit the data is served directly from the cache.
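
A minimal sketch, assuming the loader callback stands in for the real data source; note the application only ever talks to the cache:

```python
class ReadThroughCache:
    def __init__(self, loader):
        self._data = {}
        self._loader = loader    # called automatically on a miss

    def get(self, key):
        if key not in self._data:
            self._data[key] = self._loader(key)   # the cache fills itself
        return self._data[key]

# The application never calls the loader directly:
cache = ReadThroughCache(loader=lambda key: f"value-for-{key}")
```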

50
Q

Analyze the pros and cons of read through caching

A

Advantages:
Simplified Code: The application doesn’t handle cache misses or source retrieval. All that logic is handled by the cache instead of the application.
Consistent Access: The cache handles all data interactions, ensuring centralized management.

Disadvantages:
Limited Control: The application has less flexibility in how the cache operates.
Higher Dependency: Relies heavily on the cache implementation for performance and correctness.

Use Cases:
Systems where simplicity and centralized cache management are preferred.
Frequently accessed data that benefits from automated retrieval and storage in the cache.

51
Q

List and describe cache eviction policies.

A

First In First Out (FIFO): The cache evicts the block that was added first (the oldest entry), without regard to how often or how many times it was accessed before.

Last In First Out (LIFO): The cache evicts the block that was added most recently, without regard to how often or how many times it was accessed before.

Least Recently Used (LRU): Discards the least recently used items first.

Most Recently Used (MRU): Discards, in contrast to LRU, the most recently used items first.

Least Frequently Used (LFU): Counts how often an item is needed. Those that are used least often are discarded first.

Random Replacement (RR): Randomly selects a candidate item and discards it to make space when necessary.
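
LRU, the most common of these policies, can be sketched with Python’s `OrderedDict` (a minimal illustration, not a production cache):

```python
from collections import OrderedDict

class LRUCache:
    """OrderedDict keeps keys in access order: least recently used at the front."""
    def __init__(self, capacity):
        self.capacity = capacity
        self.data = OrderedDict()

    def get(self, key):
        if key not in self.data:
            return None
        self.data.move_to_end(key)          # mark as most recently used
        return self.data[key]

    def put(self, key, value):
        if key in self.data:
            self.data.move_to_end(key)
        self.data[key] = value
        if len(self.data) > self.capacity:
            self.data.popitem(last=False)   # evict the least recently used key
```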

52
Q

Cache eviction vs cache invalidation

A

Invalidation typically requires explicit management: it is usually manual, triggered by specific events such as a data update.

Eviction is policy-based and automatic, driven by capacity limits.