Neetcode System Design Concepts Flashcards

Question 1

Q

How can we vertically scale a single server?

Answer

A

Give it more RAM or CPU (memory or raw computing power)

Question 2

Q

Why is vertically scaling a server limited?

Answer

A

You’ll run out of computing power fast. Won’t be able to keep up with compute demands

Question 3

Q

Why is vertically scaling a single server risky?

Answer

A

It creates a single point of failure if anything happens to that server. All our eggs are in one basket

Question 4

Q

What is the advantage of horizontal scaling vs. vertical scaling?

Answer

A

Horizontal scaling allows us to create replicas of a single server which eliminates our single point of failure problem and allows us to get a lot of computing done with average servers. Don’t need really high power servers to make this effective

Question 5

Q

True or False: Horizontal scaling can scale infinitely in theory

Question 6

Q

Does a horizontally scaled system have better availability than a vertically scaled one? Why or why not?

Answer

A

Yes - a horizontally scaled system allows us to eliminate the single point of failure issue and allows us to route traffic to different servers depending on the health of a server

Question 7

Q

What is a disadvantage to horizontally scaling servers?

Answer

A

It’s much more complicated than vertically scaling. You have to make sure one server doesn’t get overloaded or many servers don’t sit idle. You also have to use load balancers to balance out the load, etc.

Question 8

Q

What is another name for a load balancer?

Answer

A

Reverse-proxy

Question 9

Q

How does a load balancer work from a high-level?

Answer

A

It takes in incoming requests and redirects them to the correct server

Question 10

Q

What are two methods for redirecting traffic in a load balancer?

Answer

A

Round robin & hashing

Question 11

Q

Let’s say we have a global infrastructure, how can we use load balancers to our advantage from a geographic standpoint?

Answer

A

We can use load balancers to route traffic to the nearest location or region

Question 12

Q

If we have a global system, how can we deliver static content like images, HTML, CSS?

Answer

A

We can use content delivery networks (CDN). This allows us to configure how we deliver static content across our network

Question 13

Q

What is a content delivery network?

Answer

A

It’s a network of servers that are located all around the world. These servers can delivery static content like images, videos, HTML/CSS/JS

Question 14

Q

How do CDN’s take files from our server and put them onto the CDN?

Answer

A

It takes the file from our origin server and copies them onto CDN servers

Question 15

Q

True or False: CDN’s can copy data on both a push or pull basis

Answer

A

True - this can be done push or pull

Question 16

Q

What is a general definition of caching?

Answer

A

Creating copies of data so that it can be refetched faster in the future

Question 17

Q

What is the advantage of caching on a machine?

Answer

A

Things like network requests can be expensive time-wise (take a long time) so we can cache this data to disk to reduce that time burden

Question 18

Q

Is reading disk expensive?

Answer

A

Yes, we can copy to memory but that can be expensive too, so usually our operating systems will copy this into a subset of our CPU (L1, L2, L3 CPU cache)

Question 19

Q

What is an IP address?

Answer

A

Every computer is assigned an IP address, which is a unique identifier for any machine. You can think of this as the telephone number for any machine

Question 20

Q

What does the Internet Protocol Suite include?

Answer

A

IP/TCP & UDP

Question 21

Q

What is the purpose of TCP?

Answer

A

Sending data over a network has to have a set of rules, TCP enforces these rules

Question 22

Q

True or False: TCP can fail because it doesn’t resend a request that fails

Answer

A

False - TCP will automatically re-send any requests that fails

Question 23

Q

When you type in a URL, how does the computer know which IP address it belongs to?

Answer

A

DNS (domain name system) solves this. You create an “a” record that points a URL to an IP address of the server

Question 24

Q

Does the operating system have to make a request for the IP address of the server using DNS every time?

Answer

A

No this request is usually cached by the operating system so that it can be referenced later without making the request every time

Question 25

Q

What is HTTP in relation to TCP?

Answer

A

It’s built on top of TCP - TCP is too low-level to be useful for network requests so HTTP was built to handle these

Question 26

Q

What is a general definition of HTTP?

Answer

A

HTTP provides an application-layer protocol which follows the client-server model instead of just using packets like TCP

Question 27

Q

What is the general structure of an HTTP request from the client?

Answer

A

Client will initialize the request with a request header (where it’s going, who it’s from) and a request body (payload or the actual content)

Question 28

Q

What is a REST API?

Answer

A

It’s a standardized HTTP API that makes these requests stateless and consistent

Question 29

Q

What is a 200, 400, and 500 error code in REST API?

Answer

A

200 = successful, 400 = unsucessful request, 500 = internal server error

Question 30

Q

What advantages does GraphQL have over REST API?

Answer

A

You can make a request for specific fields so that you don’t overfetch data or make duplicate requests

Question 31

Q

What is gRPC?

Answer

A

Released by Google in 2016, gRPC is a client to server interaction that creates a performance boost from protocol buffers

Question 32

Q

What is a protocol buffer in a gRPC system?

Answer

A

Improved version of JSON using serialized binaries to send data which is much faster than JSON. Down-side is that it’s not as human readable as JSON

Question 33

Q

True or False: WebSockets are built on top of TCP

Question 34

Q

What is the main advantage of WebSockets?

Answer

A

You can get the change instantly from device to device. Once a message is received, it is immediately sent to the next device

Question 35

Q

If we tried to replicate WebSockets using HTTP, how could be do it?

Answer

A

We’d have to use polling to do this using HTTP which works but is sub-optimal because you have to continually check for changes using polling

Question 36

Q

What is SQL mostly used for on a high-level?

Answer

A

Storing data

Question 37

Q

What are the most common SQL tools?

Answer

A

MySQL, Postgres

Question 38

Q

If we can just store data in a text file on disk, why do we need to use a database?

Answer

A

Databases can more efficiently store data using data structures like B-trees

Databases also have fast retrieval of data using SQL queries

SQL queries allow you to access data that are stored as rows in tables

Question 39

Q

True or False: One issue with SQL is that it is not ACID compliant

Answer

A

False - SQL is ACID compliant

Question 40

Q

What does the acronym ACID stand for?

Answer

A

Atomicity, Consistency, Isolation, Durability

Question 41

Q

What does atomicity mean in the ACID acronym?

Answer

A

Every transaction is all or nothing

Question 42

Q

What does consistency mean in the ACID acronym?

Answer

A

Foreign keys and other constraints will always be enforced

Question 43

Q

What does isolation mean in the ACID acryonym?

Answer

A

Different concurrent transactions won’t interfere with each other (think a queue)

Question 44

Q

What does durability mean in the ACID acronym?

Answer

A

Data is stored on disk, so if a machine is restarted data will still be there

Question 45

Q

What is the main advantage of using NoSQL databases?

Answer

A

Consistency (using foreign keys) makes databases harder to scale, so NoSQL databases remove this relation constraint

Question 46

Q

What are 3 different types of NoSQL databases?

Answer

A

Key-value stores (DynamoDB), Document stores (MongoDB), Graph DB (neo4j)

Question 47

Q

If we’re separating the database using sharding, how do we decide which portion of the data to put on which machine?

Answer

A

We can use a shard key

Question 48

Q

What is the definition of sharding in relation to databases?

Answer

A

Since we no longer have foreign key constraints, we can break up our database and scale horizontally, this is called sharding

Question 49

Q

What is ranged-based sharding?

Answer

A

Ranged-based sharding is using the id of a person in a table as the key

Question 50

Q

What is the advantage to replication over sharding?

Answer

A

Replication is a simpler approach to sharding

Question 51

Q

What is leader-follower replication?

Answer

A

If we want to scale our DB reads, we can make read only copies of our DB - this is leader-follower replication

Question 52

Q

What is the process for leader-follower replication?

Answer

A

Every write gets sent to the leader, who then sends to the followers
Every read could go to a leader or a follower

Question 53

Q

True or False: Leader-leader replication is possible

Answer

A

True - every replica can be read or write but this can lead to inconsistent data

Question 54

Q

What does the acryonym CAP mean in CAP theorem?

Answer

A

Consistency, Availability, Partition (Network)

Question 55

Q

What is the advantage of CAP theorem? What problem does it solve?

Answer

A

It can be complex to keep replicas in sync, so CAP theorem was created to weigh trade-offs with replicated design

Question 56

Q

What is the technical definition of CAP theorem?

Answer

A

Given a network partition in a database, you have to choose between data consistency or data availability

Question 57

Q

How are message queues similar to databases?

Answer

A

They are similar to databases in that they have durable storage, can be replicated for redundancy, or sharded for scalability

Question 58

Q

If we’re overwhelmed with more data than we can process, how can message queues help us solve this problem?

Answer

A

Message queues are perfect for this because we can handle these requests one at a time in a consistent manner. Data can be persisted (held in queue) before it can be processed

Question 59

Q

True or False: Different parts of our app can be decoupled by using message queues