Web & HTTP Flashcards

Question

Replicated Web service

Answer 1

- Device that multiplexes requests across a collection of servers
  - All servers share one public IP
  - Balancer transparently directs requests to different servers
- How should the balancer assign clients to servers?
  - Random / round-robin
    - When is this a good idea?
  - Load-based
    - When might this fail?
- Challenges
  - Scalability (must support traffic for n hosts)
  - State (must keep track of previous decisions)
    - RESTful APIs reduce this limitation
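
The two assignment policies named above can be sketched in a few lines. This is a minimal illustration, not how any particular balancer is built: the class names, the `pick`/`release` methods, and the use of active-connection counts as the "load" signal are all assumptions made for the example.

```python
from itertools import cycle


class RoundRobinBalancer:
    """Hypothetical balancer: hands out servers in a fixed rotation."""

    def __init__(self, servers):
        self._rotation = cycle(servers)

    def pick(self):
        # No per-server state is consulted; every server gets an equal share.
        return next(self._rotation)


class LeastLoadedBalancer:
    """Hypothetical balancer: picks the server with the fewest active requests."""

    def __init__(self, servers):
        # Assumption: "load" is approximated by a count of in-flight requests.
        self.active = {server: 0 for server in servers}

    def pick(self):
        server = min(self.active, key=self.active.get)
        self.active[server] += 1
        return server

    def release(self, server):
        # Called when a request finishes, so the count stays accurate.
        self.active[server] -= 1


rr = RoundRobinBalancer(["a", "b", "c"])
rr_order = [rr.pick() for _ in range(4)]

ll = LeastLoadedBalancer(["a", "b", "c"])
first_three = [ll.pick() for _ in range(3)]
ll.release("b")
after_release = ll.pick()
```

Round-robin needs no knowledge of the servers, which is why it works well when requests cost roughly the same. The load-based variant has to track state, which is the "State" challenge listed above.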

Answer 2

- Advantages
  - Allows scaling of hardware independent of IPs
  - Relatively easy to maintain
- Disadvantages
  - Expensive
  - Still a single point of failure
  - Location!

Answer 3

- For Web pages
  - RTT matters most
  - Where should the server go?
- For video
  - Available bandwidth matters most
  - Where should the server go?
- Is there one location that is best for everyone?
- Impact on user experience
  - Users navigating away from pages
  - Video startup delay
- Impact on revenue
  - Amazon: increased revenue 1% for every 100 ms reduction in page load time (PLT)
  - Shopzilla: 12% increase in revenue by reducing PLT from 6 seconds to 1.2 seconds
- Ping from LON to NYC: ~100 ms

Answer 4

- Goal: satisfy client request without involving origin server
- User sets browser: Web accesses via cache
- Browser sends all HTTP requests to cache
  - Object in cache: cache returns object
  - Else cache requests object from origin server, then returns object to client
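
The hit/miss behavior above can be sketched as a small class. This is an illustration under assumptions: `ProxyCache`, `fetch_from_origin`, and the in-memory dictionary are hypothetical names chosen for the example, and freshness checks and eviction are left out.

```python
class ProxyCache:
    """Hypothetical Web cache: serves stored objects, else asks the origin."""

    def __init__(self, fetch_from_origin):
        # fetch_from_origin is an assumed callable: url -> object bytes.
        self._fetch = fetch_from_origin
        self._store = {}
        self.origin_requests = 0

    def get(self, url):
        if url in self._store:
            # Hit: the origin server is not involved at all.
            return self._store[url]
        # Miss: the cache acts as a client to the origin, then keeps a copy.
        self.origin_requests += 1
        body = self._fetch(url)
        self._store[url] = body
        return body


def fake_origin(url):
    # Stand-in for a real origin server, used only for this example.
    return ("object at " + url).encode()


cache = ProxyCache(fake_origin)
first = cache.get("http://example.com/a")
second = cache.get("http://example.com/a")
```

Here the second request for the same URL is answered from the store, so `cache.origin_requests` stays at 1.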

Answer 5

- Cache acts as both client and server
  - Server for original requesting client
  - Client to origin server
- Typically cache is installed by ISP (university, company, residential ISP)
- Why Web caching?
  - Reduce response time for client request
  - Reduce traffic on an institution's access link
  - Internet dense with caches: enables "poor" content providers to effectively deliver content (so too does P2P file sharing)

Answer 6

Scenario:
- access link rate: 1.54 Mbps
- RTT from institutional router to server: 2 sec
- web object size: 100K bits
- average request rate from browsers to origin servers: 15/sec
- avg data rate to browsers: 1.50 Mbps

Performance:
- access link utilization = .97
- LAN utilization: .0015
- end-end delay = Internet delay + access link delay + LAN delay = 2 sec + minutes + microseconds
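
The utilization figures follow directly from the scenario numbers. The short calculation below reproduces them. The LAN rate is not stated in the scenario, so the 1 Gbps value is an assumption, chosen because it yields the quoted .0015.

```python
ACCESS_LINK_BPS = 1.54e6   # access link rate from the scenario
OBJECT_BITS = 100_000      # web object size: 100K bits
REQUESTS_PER_SEC = 15      # average request rate
LAN_BPS = 1e9              # ASSUMPTION: 1 Gbps LAN (not given in the scenario)

# Average data rate to browsers: 100K bits * 15/sec = 1.50 Mbps.
demand_bps = OBJECT_BITS * REQUESTS_PER_SEC

# Utilization = offered traffic / link capacity.
access_utilization = demand_bps / ACCESS_LINK_BPS
lan_utilization = demand_bps / LAN_BPS

print(f"demand: {demand_bps / 1e6:.2f} Mbps")
print(f"access link utilization: {access_utilization:.2f}")
print(f"LAN utilization: {lan_utilization:.4f}")
```

With the access link this close to full, queueing delay on it dominates, which is why the access link term in the end-end delay is measured in minutes.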

Answer 7

- Goal: don't send object if cache has up-to-date cached version
  - No object transmission delay
  - Lower link utilization
- Cache: specify date of cached copy in HTTP request
  - If-modified-since: <date>
- Server: response contains no object if cached copy is up-to-date:
  - HTTP/1.0 304 Not Modified
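
The server-side decision for a conditional GET can be sketched as follows. This is a simplified illustration, not a full HTTP implementation: the `respond` function and the placeholder body are hypothetical, and only the date comparison is modeled.

```python
from datetime import datetime, timezone
from email.utils import format_datetime, parsedate_to_datetime


def respond(last_modified, if_modified_since=None):
    """Hypothetical handler: return (status code, body or None)."""
    if if_modified_since is not None:
        cached_at = parsedate_to_datetime(if_modified_since)
        if last_modified <= cached_at:
            # Cached copy is up to date: send headers only, no object.
            return 304, None
    # No conditional header, or the object changed since the cached date.
    return 200, b"<object bytes>"


# The object was last changed at this time (example value).
modified = datetime(2015, 9, 9, 9, 23, 24, tzinfo=timezone.utc)

# The cache sends the date of its copy in the If-modified-since header.
header_value = format_datetime(modified, usegmt=True)

unchanged = respond(modified, header_value)
changed = respond(datetime(2015, 9, 10, tzinfo=timezone.utc), header_value)
```

In this sketch `unchanged` carries status 304 with no body, while `changed` carries 200 and the object, since the object was modified after the date of the cached copy.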

Answer 8

Key goal: decreased delay in multi-object HTTP requests
- HTTP/1.1: introduced multiple, pipelined GETs over a single TCP connection
  - server responds in order (FCFS: first-come-first-served scheduling) to GET requests
  - with FCFS, a small object may have to wait for transmission behind large object(s) (head-of-line (HOL) blocking)
  - loss recovery (retransmitting lost TCP segments) stalls object transmission
- HTTP/2 [RFC 7540, 2015]: increased flexibility at server in sending objects to client
  - methods, status codes, most header fields unchanged from HTTP/1.1
  - transmission order of requested objects based on client-specified object priority (not necessarily FCFS)
  - push unrequested objects to client
  - divide objects into frames, schedule frames to mitigate HOL blocking
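
A toy simulation makes the effect of framing on HOL blocking concrete. It is a sketch under assumptions: object sizes are counted in equal-size frames, one frame is sent per time unit, and frames are interleaved in simple round-robin order. Real HTTP/2 scheduling is driven by priorities and is not modeled here; the function names and object sizes are made up for the example.

```python
from collections import deque


def fcfs_completion(sizes):
    """Send each object whole, in request order (HTTP/1.1-style FCFS)."""
    t, done = 0, {}
    for name, frames in sizes.items():
        t += frames
        done[name] = t
    return done


def interleaved_completion(sizes):
    """Send one frame per object in turn (assumed round-robin framing)."""
    queue = deque(sizes.items())
    t, done = 0, {}
    while queue:
        name, remaining = queue.popleft()
        t += 1  # one frame transmitted per time unit
        if remaining == 1:
            done[name] = t
        else:
            queue.append((name, remaining - 1))
    return done


# One large object requested first, followed by three small ones
# (sizes in frames, each assumed to be at least 1).
sizes = {"O1": 8, "O2": 1, "O3": 1, "O4": 1}

fcfs = fcfs_completion(sizes)
framed = interleaved_completion(sizes)
```

Under FCFS the small objects finish at times 9, 10, and 11, stuck behind O1. With interleaved frames they finish at 2, 3, and 4, while O1 is delayed only from 8 to 11.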

Answer 9

- HTTP/2 over a single TCP connection means:
  - recovery from packet loss still stalls all object transmissions
    - as in HTTP/1.1, browsers have an incentive to open multiple parallel TCP connections to reduce stalling and increase overall throughput
  - no security over a vanilla TCP connection
- HTTP/3: adds security and per-object error and congestion control (more pipelining) over UDP
  - more on HTTP/3 in the transport layer

(37 cards)