Caching Flashcards

1
Q

What are L1, L2, and L3 caches in computer hardware, and how do they differ?

A

L1 cache is the smallest and fastest; it is built into each CPU core and stores frequently accessed data and instructions. L2 cache is larger but slower than L1, located on the CPU die or, in older designs, on a separate chip. L3 cache is larger and slower still, and is often shared between CPU cores.
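The lookup order described above can be sketched in code: each level is checked in order of increasing size and latency, mirroring L1 → L2 → L3 → RAM. The level contents and cycle counts below are illustrative placeholders, not real hardware figures.

```python
# (level name, cost in arbitrary cycles, cached addresses) -- all invented
levels = [
    ("L1", 4, {"a"}),
    ("L2", 12, {"a", "b"}),
    ("L3", 40, {"a", "b", "c"}),
]
RAM_COST = 200

def access(address):
    """Return (where the address was found, cumulative lookup cost)."""
    cost = 0
    for name, latency, contents in levels:
        cost += latency          # every level checked adds its latency
        if address in contents:
            return name, cost
    return "RAM", cost + RAM_COST  # missed every cache level

print(access("a"))  # hit in L1
print(access("c"))  # misses L1 and L2, hits L3
print(access("z"))  # misses all levels, falls through to RAM
```

Note how a full miss pays the latency of every level it checked plus the RAM access, which is why cache hit rates matter so much.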

2
Q

What is the role of the Translation Lookaside Buffer (TLB) in hardware caching?

A

The TLB stores recently used virtual-to-physical address translations, enabling the CPU to quickly translate memory addresses, reducing data access time.
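A toy model of this mechanism: a small dictionary of recent virtual-page-to-physical-frame translations, with a simulated page-table walk on a miss. The page size and table contents here are made up for illustration.

```python
PAGE_SIZE = 4096
page_table = {0: 7, 1: 3, 2: 9}      # virtual page -> physical frame (invented)
tlb = {}                              # the fast translation cache
stats = {"hits": 0, "misses": 0}

def translate(vaddr):
    page, offset = divmod(vaddr, PAGE_SIZE)
    if page in tlb:                   # fast path: translation already cached
        stats["hits"] += 1
    else:                             # slow path: page-table walk, then cache it
        stats["misses"] += 1
        tlb[page] = page_table[page]
    return tlb[page] * PAGE_SIZE + offset

translate(100)        # page 0: TLB miss, walks the page table
translate(200)        # page 0 again: TLB hit
print(stats)          # {'hits': 1, 'misses': 1}
```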

3
Q

What is the function of the page cache and file system caches at the operating system level?

A

Page cache, managed by the OS and residing in main memory, stores recently used disk blocks. File system caches like inode cache speed up file operations by reducing disk accesses.

4
Q

How do web browsers and Content Delivery Networks (CDNs) utilize caching at the application front end?

A

Web browsers cache HTTP responses for faster data retrieval. CDNs cache static content like images and videos on edge servers to speed up content delivery.

5
Q

What is the role of caching in load balancers and messaging infrastructure like Kafka?

A

Load balancers can cache responses to reduce back-end server load. Kafka caches messages on disk, allowing consumers to retrieve them at their own pace based on retention policy.

6
Q

How do distributed caches like Redis and full-text search engines like Elasticsearch use caching?

A

Redis stores key-value pairs in memory for high performance. Elasticsearch indexes data for efficient document and log search.
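The Redis half of this answer can be sketched as a tiny in-memory key-value store with per-key expiry, loosely in the spirit of Redis `SET` with an expiry option. This is a hedged toy, not the real Redis API.

```python
import time

class KVCache:
    def __init__(self):
        self._data = {}               # key -> (value, expiry timestamp or None)

    def set(self, key, value, ttl=None):
        expires = time.monotonic() + ttl if ttl is not None else None
        self._data[key] = (value, expires)

    def get(self, key):
        item = self._data.get(key)
        if item is None:
            return None
        value, expires = item
        if expires is not None and time.monotonic() >= expires:
            del self._data[key]       # lazily expire stale keys on read
            return None
        return value

cache = KVCache()
cache.set("session:42", "alice", ttl=30)
print(cache.get("session:42"))        # "alice" while the TTL has not elapsed
```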

7
Q

What caching mechanisms are found within databases?

A

Databases use write-ahead logs (WAL) for fast, durable writes, buffer pools to cache data pages in memory, materialized views for precomputed query results, transaction logs for recording updates, and replication logs for propagating changes across a database cluster.

8
Q

Why is caching data essential in system architecture?

A

Caching is crucial for optimizing system performance and reducing response time across various layers and applications in a computing system.

9
Q

What is the significance of the L1 cache being integrated into the CPU?

A

The integration of L1 cache directly into the CPU minimizes latency, allowing for the fastest possible access to frequently used data and instructions, which enhances CPU performance.

10
Q

How does the Translation Lookaside Buffer (TLB) improve CPU efficiency?

A

The TLB improves CPU efficiency by storing recent virtual-to-physical address translations, enabling quick memory address translation and reducing the time required for memory access.

11
Q

Why is the page cache important in an operating system?

A

The page cache is important as it stores disk blocks in main memory, allowing the operating system to quickly serve data from memory rather than the slower process of reading from the disk.

12
Q

What is the advantage of using Content Delivery Networks (CDNs) for caching?

A

CDNs enhance content delivery by caching static web assets like images and videos closer to the user, reducing latency and bandwidth usage, and improving load times for web pages.

13
Q

How do load balancers utilize caching to improve system performance?

A

Load balancers use caching to store responses from back-end servers, allowing them to serve repeated requests more quickly and efficiently, thus reducing the load on back-end servers.
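A minimal sketch of this pattern: identical requests are answered from a response cache instead of being forwarded, so the back end only runs once per distinct request. The "back end" here is a plain function and the names are invented.

```python
backend_calls = {"count": 0}

def backend(path):
    backend_calls["count"] += 1       # stand-in for real back-end work
    return f"response for {path}"

response_cache = {}

def handle(path):
    if path not in response_cache:    # only forward on a cache miss
        response_cache[path] = backend(path)
    return response_cache[path]

handle("/home")
handle("/home")                       # served from cache, no back-end call
print(backend_calls["count"])         # 1
```

Real load balancers also have to decide which responses are safe to cache (typically idempotent GETs) and for how long.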

14
Q

What is the purpose of caching messages in Kafka’s messaging infrastructure?

A

In Kafka, caching messages on disk allows for efficient message handling and retrieval, enabling consumers to process messages at their own pace and ensuring message availability over extended periods.

15
Q

How does caching in Redis differ from traditional database operations?

A

Redis uses in-memory caching for key-value pairs, offering significantly faster read/write performance compared to traditional databases that rely more on disk-based storage.

16
Q

In what ways do databases implement caching to enhance performance?

A

Databases implement caching through mechanisms like write-ahead logs for ensuring data integrity, buffer pools for storing frequently accessed data in memory, and materialized views for quick retrieval of complex query results.

17
Q

Why is the L2 cache slower than the L1 cache?

A

The L2 cache is larger and typically located further from the CPU core compared to the L1 cache. This increased distance and size result in slightly higher latency, making L2 cache slower than L1.

18
Q

How does an inode cache enhance file system performance?

A

The inode cache stores metadata about file system objects (like files and directories), enabling quicker access to this information and reducing the need for frequent disk reads, thus speeding up file system operations.

19
Q

What role does caching play in HTTP response handling by web browsers?

A

When a web browser caches HTTP responses, it stores copies of frequently accessed web resources. This allows the browser to load these resources from the cache rather than fetching them again from the web server, speeding up web page loading.
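The freshness decision a browser makes can be illustrated with a deliberately simplified `Cache-Control: max-age` check; real header handling (ETags, revalidation, `Vary`) is considerably more involved.

```python
def max_age(cache_control):
    """Extract max-age (seconds) from a Cache-Control header value."""
    for directive in cache_control.split(","):
        directive = directive.strip()
        if directive.startswith("max-age="):
            return int(directive.split("=", 1)[1])
    return 0

def is_fresh(stored_at, cache_control, now):
    # a cached response is reusable while its age is under max-age
    return (now - stored_at) < max_age(cache_control)

header = "public, max-age=3600"
print(is_fresh(stored_at=1000, cache_control=header, now=2000))  # True
print(is_fresh(stored_at=1000, cache_control=header, now=5000))  # False
```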

20
Q

How do databases use Write-Ahead Logs (WAL) for caching?

A

In databases, WALs are used to record changes before they are written to the main database. This ensures data integrity and allows for recovery in case of a system crash. The WAL also serves as a form of cache by allowing quick writes to a log, reducing immediate write load on the database.
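The append-then-replay idea can be shown in a few lines: every change is appended to a log before being applied, so state can be rebuilt by replaying the log after a crash. Real database WALs are binary and far more sophisticated than this toy.

```python
wal = []                               # the append-only log
table = {}                             # the "database" state

def write(key, value):
    wal.append((key, value))           # 1. durable append (fast, sequential)
    table[key] = value                 # 2. apply to the main structure

def recover(log):
    """Rebuild state from scratch by replaying the log in order."""
    state = {}
    for key, value in log:
        state[key] = value
    return state

write("x", 1)
write("y", 2)
write("x", 3)                          # a later write overrides the earlier one
print(recover(wal))                    # {'x': 3, 'y': 2}
```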

21
Q

What is the significance of shared L3 cache among CPU cores?

A

Shared L3 cache among CPU cores allows multiple cores to access a larger common cache. This facilitates efficient data sharing and reduces the need for data duplication across cores, enhancing overall CPU cache utilization.

22
Q

How does caching in a full-text search engine like Elasticsearch improve performance?

A

Elasticsearch uses caching to store frequently accessed data and query results. This accelerates search operations by reducing the need to reprocess or reaccess data from the primary storage, leading to faster search response times.

23
Q

What is the function of a buffer pool in a database system?

A

A buffer pool in a database system caches pages of data in memory. This allows quicker access to these pages, reducing disk I/O and improving database query performance.
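A hedged sketch of a buffer pool: a fixed number of "pages" kept in memory with least-recently-used eviction. The fake disk and page contents below are invented for illustration.

```python
from collections import OrderedDict

DISK = {pid: f"page-{pid}-data" for pid in range(10)}   # pretend disk pages
disk_reads = {"count": 0}

class BufferPool:
    def __init__(self, capacity):
        self.capacity = capacity
        self.pages = OrderedDict()     # page id -> data, kept in LRU order

    def get_page(self, pid):
        if pid in self.pages:
            self.pages.move_to_end(pid)          # refresh LRU position
            return self.pages[pid]
        disk_reads["count"] += 1                 # miss: costly disk I/O
        if len(self.pages) >= self.capacity:
            self.pages.popitem(last=False)       # evict least recently used
        self.pages[pid] = DISK[pid]
        return self.pages[pid]

pool = BufferPool(capacity=2)
pool.get_page(1)      # disk read
pool.get_page(2)      # disk read
pool.get_page(1)      # served from the pool
pool.get_page(3)      # disk read; evicts page 2 (least recently used)
print(disk_reads["count"])   # 3
```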

24
Q

How do Content Delivery Networks (CDNs) optimize caching based on geographic location?

A

CDNs cache content in multiple geographically distributed servers (edge servers). This ensures that content is delivered from the nearest server to the user, reducing latency and improving loading times for users in different locations.

25
Q

What is CPU cache and what type of memory does it use?

A

CPU cache is the CPU’s internal memory designed to store copies of data and instructions from RAM that the CPU is likely to use frequently. It uses SRAM (Static RAM), which is faster than DRAM (Dynamic RAM) used in RAM modules because it doesn’t need to be constantly refreshed.

26
Q

How does SRAM differ from DRAM in terms of operation and cost?

A

SRAM, used in CPU cache, doesn’t require constant refreshing, making it faster than DRAM. However, SRAM is more expensive to produce compared to DRAM.

27
Q

Why is CPU cache important for computer performance?

A

CPU cache is crucial because it allows the CPU to access frequently used data quickly. If the needed data is in the cache, the CPU doesn’t have to wait for slower RAM, thus avoiding bottlenecks and enhancing overall computer performance.

28
Q

What would happen if a computer did not have CPU cache?

A

Without CPU cache, a computer would be slower because the CPU would frequently have to wait for data from the slower RAM, creating a performance bottleneck.

29
Q

How are the different levels of CPU cache (L1, L2, and L3) structured and functionally different?

A

L1 cache, the fastest and smallest, is located on the processor and runs at the processor’s speed. L2 cache is larger but slower than L1 and is used when data isn’t found in L1. L3 cache, larger than L2 but slower, is used when data isn’t in L2. L3 is shared across CPU cores, while L1 and L2 are dedicated to individual cores.

30
Q

How has the location of the L2 cache evolved in modern CPUs compared to earlier computers?

A

In earlier computers, the L2 cache was located on a separate chip on the motherboard. In modern CPUs, it is integrated into the processor, which improves its speed and efficiency.

31
Q

Why is the L3 cache referred to as ‘shared cache’?

A

The L3 cache is called ‘shared cache’ because it is shared between all the cores on a CPU. This contrasts with L1 and L2 caches, which are dedicated to individual CPU cores.

32
Q

How do the sizes and speeds of L1, L2, and L3 caches compare?

A

The L1 cache is the smallest and fastest, L2 is larger but slower than L1, and L3 is the largest but the slowest among the three. Each level of cache is designed to balance speed and size to optimize CPU performance.

33
Q

What is the role of CPU cache in reducing processing time?

A

The CPU cache reduces processing time by storing frequently accessed data and instructions close to the CPU. This proximity allows for quicker access compared to fetching data from slower main memory (RAM), thereby speeding up processing.

34
Q

How does CPU cache contribute to the efficiency of multi-core processors?

A

In multi-core processors, the shared L3 cache allows cores to efficiently access and share common data, reducing the need for each core to fetch the same data from RAM. This shared access improves data coherence and overall efficiency in multi-core systems.

35
Q

Why is SRAM more expensive than DRAM despite its smaller size?

A

SRAM is more expensive due to its complex internal structure which requires more transistors per bit of storage compared to DRAM. This complexity provides faster access but increases production costs.

36
Q

How does the CPU determine what data to store in its cache?

A

The CPU uses algorithms to predict which data and instructions it will need next, based on recent accesses and patterns of usage. This predictive approach helps in storing relevant data in the cache for faster access.

37
Q

What happens when the CPU cache does not contain the required data?

A

This situation is known as a ‘cache miss.’ When the required data is not found in the cache, the CPU must fetch the data from the slower main memory (RAM), resulting in increased access time and potential processing delays.
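Hit-versus-miss behavior is easy to observe with Python's built-in memoization: the first call with a given argument is a miss (the function body runs), and repeats are hits served from the cache.

```python
from functools import lru_cache

@lru_cache(maxsize=128)
def expensive(n):
    return n * n          # stand-in for a slow computation or memory fetch

for n in (1, 2, 1, 1, 3):
    expensive(n)

info = expensive.cache_info()
print(info.hits, info.misses)   # 2 3
```

The ratio of hits to total accesses is the hit rate; the closer it is to 1, the less often the slow path (RAM, in the CPU case) is taken.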

38
Q

Can the efficiency of CPU cache affect overall computer performance?

A

Yes, the efficiency of the CPU cache significantly affects overall computer performance. A well-optimized cache reduces the frequency of accessing slower RAM, thereby minimizing bottlenecks and enhancing system speed.

39
Q

What impact does the size of the CPU cache have on computing tasks?

A

The size of the CPU cache impacts how much data can be quickly accessed by the CPU. Larger caches can store more data, potentially reducing the need to access slower main memory and improving performance, especially in data-intensive tasks.

40
Q

Is CPU cache visible to the operating system or software applications?

A

No, the CPU cache is not directly visible or accessible to the operating system or software applications. It is managed internally by the CPU’s hardware and caching algorithms.

41
Q

What is a CDN (Content Delivery Network)?

A

A CDN is a network of servers strategically distributed globally, designed to deliver web content quickly to users by hosting content closer to where the users are. It speeds up the delivery of static and dynamic content over the internet.

42
Q

How does a CDN improve web service performance for users?

A

A CDN improves web service performance by reducing latency. It hosts content on edge servers located close to users, ensuring faster content delivery and improved user experience.

43
Q

What are the key components of a CDN?

A

Key components of a CDN include Points of Presence (PoPs), which are server locations around the world, and edge servers within these PoPs that cache and deliver content to users.

44
Q

What technologies do CDNs use to direct user requests efficiently?

A

CDNs use technologies like DNS-based routing, where each PoP has its own IP address, and Anycast, where all PoPs share the same IP address, to direct user requests to the closest PoP efficiently.
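A hedged sketch of the routing decision itself: given measured latencies from a user's location to each PoP, pick the PoP the DNS response should point at. The PoP names and latency numbers are invented.

```python
# latency in ms from each user city to each PoP -- illustrative values only
pops = {
    "us-east": {"new_york": 5, "london": 80, "tokyo": 160},
    "eu-west": {"new_york": 75, "london": 6, "tokyo": 210},
    "ap-ne":   {"new_york": 170, "london": 200, "tokyo": 4},
}

def nearest_pop(user_city):
    # choose the PoP with the lowest measured latency for this user
    return min(pops, key=lambda pop: pops[pop][user_city])

print(nearest_pop("london"))   # eu-west
print(nearest_pop("tokyo"))    # ap-ne
```

With Anycast the same effect is achieved in the network layer instead: BGP routing delivers packets for the shared IP to the topologically nearest PoP.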

45
Q

What is the role of edge servers in a CDN?

A

Edge servers in a CDN act as reverse proxies with large content caches. They store static content, delivering it quickly to users, and reduce the load on origin servers by requesting content only when it’s not in the cache.

46
Q

How do modern CDNs optimize static content?

A

Modern CDNs can transform static content into more optimized formats, such as minifying JavaScript bundles or converting images into modern formats like WebP or AVIF, improving load times and efficiency.

47
Q

Why is TLS termination at the edge server important in a CDN?

A

Terminating TLS connections at the edge server reduces latency in establishing encrypted TCP connections, as TLS handshakes are resource-intensive and can involve several network round trips.

48
Q

What security benefits do modern CDNs offer?

A

Modern CDNs provide enhanced security, including effective DDoS protection, by utilizing their vast network capacity to absorb and diffuse attack traffic across numerous servers, especially in Anycast-based networks.

49
Q

How does a CDN improve availability and reliability?

A

A CDN’s highly distributed nature, with content replicated in multiple PoPs, enhances system availability and reliability. It can withstand more hardware failures than origin servers, ensuring content is always accessible.

50
Q

Why should developers use a CDN for serving HTTP traffic?

A

Developers should use a CDN for serving HTTP traffic because it significantly enhances performance, security, and availability of web services, leading to better user engagement and retention.

51
Q

What is the significance of load balancing in a CDN?

A

Load balancing in a CDN distributes incoming traffic among various servers within the network, preventing any single server from becoming overwhelmed and ensuring consistent, efficient content delivery.

52
Q

Why are CDNs particularly important for mobile content delivery?

A

CDNs are vital for mobile content delivery because they optimize content for faster loading on mobile devices, which is crucial given the typically slower mobile network speeds and higher sensitivity to latency.

53
Q

How do CDNs contribute to SEO (Search Engine Optimization)?

A

CDNs can improve website speed and user experience, both of which are factors in SEO rankings. Faster websites are more likely to be ranked higher in search engine results, increasing visibility.

54
Q

Can CDNs help in mitigating website downtimes?

A

Yes, CDNs can help mitigate downtimes by distributing content across multiple servers. If one server goes down, others can step in to serve the content, ensuring continuous availability.

55
Q

What is vertical scaling and when is it used?

A

Vertical scaling involves adding more resources like RAM or upgrading the CPU to an existing server. It’s an easy method to increase capacity but has limitations in scalability and resilience.

56
Q

How does horizontal scaling enhance system performance?

A

Horizontal scaling involves adding more servers to handle the workload. Each server handles a subset of requests, which improves performance and scalability, increases fault tolerance, and eliminates single points of failure.

57
Q

What is the role of a load balancer in a horizontally scaled system?

A

A load balancer, acting as a reverse proxy, distributes incoming requests evenly across multiple servers. It prevents any single server from being overloaded, ensuring efficient resource utilization.

58
Q

Why are Content Delivery Networks (CDNs) crucial for serving static files?

A

CDNs, with servers worldwide, are essential for serving static files (like images, videos, HTML, CSS, and JavaScript) as they reduce latency by serving content from the nearest server to the user.

59
Q

How does caching contribute to system efficiency?

A

Caching creates copies of data for faster retrieval, reducing the need for repeated data fetching. It enhances performance at various levels, from browser disk caching to memory and CPU cache.

60
Q

What is TCP/IP and its significance in data communication?

A

TCP/IP (Transmission Control Protocol/Internet Protocol) is a set of networking protocols governing how data is transmitted over the internet. TCP ensures reliable data transmission, forming the basis for many internet protocols.

61
Q

How does the Domain Name System (DNS) facilitate internet browsing?

A

DNS translates human-readable domain names (like neetcode.io) into IP addresses, allowing browsers to locate and access websites. It’s essential for navigating the internet efficiently.

62
Q

What is the purpose of the HTTP protocol in web communication?

A

HTTP (Hypertext Transfer Protocol) is an application-level protocol facilitating the transfer of web content. It simplifies data communication between clients and servers, structuring requests and responses.

63
Q

How do REST and GraphQL differ in API design?

A

REST is a standardized approach making HTTP APIs stateless and uniform. GraphQL allows fetching multiple resources in a single request, reducing over-fetching and under-fetching issues common in REST APIs.

64
Q

What is the difference between SQL and NoSQL databases in data management?

A

SQL databases are relational, structured, and ACID-compliant, ideal for complex queries and relationships. NoSQL databases are non-relational, offering more flexibility and scalability, suitable for varied data types and large-scale applications.

65
Q

What are the advantages and challenges of sharding in databases?

A

Sharding involves distributing data across multiple servers to enhance performance and scalability. The advantage is improved load distribution and scalability. However, it can be complex to implement and manage, especially in maintaining data consistency and balancing the load across shards.
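The core mechanism can be sketched with simple hash-based shard selection: a key's hash decides which shard holds it, spreading load deterministically. Note that changing the shard count remaps most keys, which is one reason real systems often prefer consistent hashing.

```python
import hashlib

NUM_SHARDS = 4

def shard_for(key):
    # hash the key and map it onto one of the shards
    digest = hashlib.sha256(key.encode()).hexdigest()
    return int(digest, 16) % NUM_SHARDS

shards = {i: [] for i in range(NUM_SHARDS)}
for user_id in ("u1", "u2", "u3", "u4", "u5", "u6"):
    shards[shard_for(user_id)].append(user_id)

# every key lands on exactly one shard, deterministically
print(shard_for("u1") == shard_for("u1"))   # True
```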

66
Q

How do leader-follower and leader-leader replications differ in databases?

A

In leader-follower replication, all writes are directed to the leader, which then replicates the data to followers. In leader-leader replication, each replica can handle reads and writes, offering higher availability but increasing the risk of data inconsistency.
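The leader-follower flow can be shown with a toy model: writes go only to the leader, which forwards each change to its followers, while reads can be served by any replica. Real replication is asynchronous and far more involved; this synchronous sketch just illustrates the data flow.

```python
class Replica:
    def __init__(self):
        self.data = {}

leader = Replica()
followers = [Replica(), Replica()]

def write(key, value):
    leader.data[key] = value                  # writes hit the leader first
    for follower in followers:                # then replicate to followers
        follower.data[key] = value

def read(replica, key):
    return replica.data.get(key)              # any replica can serve reads

write("balance", 100)
print(read(followers[0], "balance"))          # 100
```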

67
Q

What is the CAP theorem and its significance in distributed systems?

A

The CAP theorem states that a distributed system cannot simultaneously guarantee all three of Consistency, Availability, and Partition tolerance; when a network partition occurs, it must choose between consistency and availability. It's crucial for understanding the trade-offs in distributed system design.

68
Q

What role do message queues play in system architecture?

A

Message queues decouple different parts of an application, allowing for asynchronous data processing and communication. They help manage load, ensure data integrity during high traffic, and improve system resilience.

69
Q

Why is horizontal scaling favored over vertical scaling in large-scale systems?

A

Horizontal scaling, or adding more servers, is favored for its almost infinite scalability and enhanced fault tolerance. Unlike vertical scaling, it doesn’t have the physical limitations of a single machine and allows for redundancy.

70
Q

What is the role of protocol buffers in system performance?

A

Protocol buffers are a method of serializing structured data. They are more efficient than JSON, as they serialize data into a binary format, which is less storage-intensive and faster to transmit over networks.
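The size advantage can be illustrated without the protobuf library itself: the snippet below packs the same fields into a fixed binary layout using Python's `struct` module, which is not actual Protocol Buffers encoding but shows why a binary format is far more compact than JSON text.

```python
import json
import struct

record = {"id": 12345, "score": 98, "active": True}

as_json = json.dumps(record).encode()
# pack the same fields as: 4-byte int, 1-byte int, 1-byte bool (no padding)
as_binary = struct.pack("<iB?", record["id"], record["score"], record["active"])

print(len(as_json), len(as_binary))   # the JSON text is several times larger
```

Real protobuf adds field tags and varint encoding, so sizes differ from this sketch, but the JSON-versus-binary gap is of the same character.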

71
Q

How do WebSockets enhance real-time communication in web applications?

A

WebSockets allow for bi-directional, real-time communication between clients and servers. Unlike traditional HTTP, they enable a continuous connection, ideal for applications requiring instant data updates like chat apps.

72
Q

What is the significance of the HTTP/2 protocol?

A

HTTP/2 is an evolution of HTTP that enables more efficient use of network resources and reduced latency. It allows multiplexing, where multiple requests and responses can be in flight at the same time over a single TCP connection.

73
Q

How do gRPC and REST differ in inter-service communication?

A

gRPC, a high-performance RPC framework, uses protocol buffers and is ideal for efficient server-to-server communication. REST, based on standard HTTP, is more flexible and easier to use for a wide range of web services.

74
Q

What is the purpose of using DNS A records?

A

DNS A records (Address Records) map a domain name to its corresponding IP address. They are essential for directing user traffic to the correct server when a domain name is entered into a browser.

75
Q

What is the purpose of a reverse proxy in a network architecture?

A

A reverse proxy serves as an intermediary for requests from clients seeking resources from servers. It provides functions like load balancing, authentication, SSL termination, and caching to enhance security, performance, and scalability.

76
Q

How does the concept of ‘fault tolerance’ impact system design?

A

Fault tolerance refers to a system’s ability to continue operating without interruption when one or more of its components fail. Designing for fault tolerance involves redundancy, failover mechanisms, and robust error handling to ensure system reliability and availability.

77
Q

What are the benefits of using stateless servers in a scalable architecture?

A

Stateless servers do not retain user session information, which simplifies scaling and load balancing. They enhance system resilience and reliability, as any server can handle any request, making it easier to add or remove servers without impacting user experience.

78
Q

What is a B-tree, and why is it important in databases?

A

A B-tree is a balanced tree data structure commonly used in databases and file systems for storing and managing large amounts of sorted data. It allows efficient insertion, deletion, and retrieval operations, making it crucial for database indexing and performance.

79
Q

How do ACID properties ensure reliable database transactions?

A

ACID properties (Atomicity, Consistency, Isolation, Durability) ensure reliable transactions in a database. Atomicity guarantees all-or-nothing execution, Consistency ensures data validity, Isolation prevents transaction interference, and Durability means completed transactions are permanently recorded.

80
Q

Why is load balancing essential in a distributed system?

A

Load balancing distributes workloads across multiple computing resources, preventing any single resource from becoming overwhelmed. It ensures optimal resource utilization, maximizes throughput, minimizes response time, and prevents overload.
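The simplest distribution strategy, round robin, can be sketched in a few lines: requests are handed to servers in rotation so no single server takes all the traffic. The server names are placeholders.

```python
import itertools

servers = ["app-1", "app-2", "app-3"]
rotation = itertools.cycle(servers)   # endlessly repeats the server list

def route():
    # hand the next request to the next server in the rotation
    return next(rotation)

assigned = [route() for _ in range(6)]
print(assigned)   # ['app-1', 'app-2', 'app-3', 'app-1', 'app-2', 'app-3']
```

Production balancers layer on health checks, weights, and least-connections or latency-aware policies, but the goal is the same even distribution.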

81
Q

What is a microservices architecture, and how does it benefit large-scale applications?

A

Microservices architecture involves developing an application as a collection of small, independent services. It benefits large-scale applications by enabling better scalability, faster development cycles, and easier maintenance and deployment.

82
Q

How do content transformation and optimization at the edge servers of a CDN enhance web performance?

A

Edge servers in a CDN can transform and optimize content (like compressing images, minifying JS/CSS) closer to the user. This reduces payload sizes and loading times, improving overall web performance and user experience.

83
Q

What is the difference between synchronous and asynchronous communication in distributed systems?

A

Synchronous communication requires the sender to wait for a response, potentially causing delays. Asynchronous communication allows the sender to continue processing other tasks, improving system responsiveness and efficiency.

84
Q

Why are distributed systems often preferred for modern, high-traffic applications?

A

Distributed systems are preferred due to their ability to handle high volumes of traffic and data. They offer scalability, reliability, and availability by distributing load and services across multiple interconnected nodes.