Final - Provided Review Questions Flashcards
How does scheduling work? What are the basic steps and data structures involved in scheduling a thread on the CPU?
scheduler dispatches tasks from runqueue
•scheduling policy/algorithm decides which task to schedule
Tasks enter the runqueue after:
• creation (fork)
• interrupt
• I/O completion
• expiration of time slice
Scheduler Runs when:
• CPU becomes idle
• Timeslice expires
• New task becomes available
When dispatching a thread:
• context switch to selected thread
• enter user mode
• set PC to next instruction of scheduled task
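A toy sketch of the runqueue/dispatch idea (FIFO policy, made-up types; a real scheduler would additionally context switch and return to user mode):

```c
#include <stddef.h>

/* Toy illustration only: a FIFO runqueue and one dispatch step. */
struct task { int id; struct task *next; };
struct runqueue { struct task *head, *tail; };

/* Tasks are enqueued after creation (fork), an interrupt, I/O
   completion, or timeslice expiration. */
void rq_enqueue(struct runqueue *rq, struct task *t) {
    t->next = NULL;
    if (rq->tail) rq->tail->next = t; else rq->head = t;
    rq->tail = t;
}

/* The "policy" here is plain FIFO: dispatch the task at the head.
   The real dispatch path would then context switch to it, enter
   user mode, and set the PC to the task's next instruction. */
struct task *rq_dispatch(struct runqueue *rq) {
    struct task *t = rq->head;
    if (t) { rq->head = t->next; if (!rq->head) rq->tail = NULL; }
    return t;   /* NULL => no runnable task, CPU idles */
}
```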
Do you understand the tradeoffs associated with the frequency of preemption and scheduling/what types of workloads benefit from frequent vs. infrequent intervention of the scheduler (short vs. long timeslices)?
- For CPU-bound tasks, longer timeslices are better because there are fewer context switches (less overhead). Keeps CPU utilization and throughput high.
- For I/O-bound tasks, shorter timeslices are better: I/O ops get issued sooner, and user-perceived performance improves.
Can you work through a scenario describing some workload mix (few threads, their compute and I/O phases) and for a given scheduling discipline compute various metrics like average time to completion, system throughput, wait time of the tasks…
Throughput = tasks / sec
• total # tasks / (timestamp when all tasks are done)
Avg Completion Time
• sum of times to complete each job / # tasks completed
• (if all tasks arrive at t = 0, this is just the sum of the completion timestamps)
Avg Wait Time
• sum of the timestamps at which each task first started running / # tasks (assuming all tasks arrive at t = 0)
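Worked example (assuming FCFS, three tasks each needing 10 s of CPU, all arriving at t = 0, so they finish at t = 10, 20, 30):
• Throughput = 3 tasks / 30 s = 0.1 tasks/s
• Avg completion time = (10 + 20 + 30) / 3 = 20 s
• Avg wait time = (0 + 10 + 20) / 3 = 10 s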
Can you contrast the MLFQ with the O(1) scheduler? Do you understand what were the problems with the O(1) scheduler which led to the CFS?
O(1) scheduler:
• Two arrays (runqueues):
• Active Array and Expired Array
• Active is used to pick the next task to run
• If a task is preempted and doesn’t finish its timeslice, it is put back in the active array with the remaining amount of its timeslice.
• Once its timeslice expires, it is added to the expired array.
• Once active array is empty, pointers are swapped and expired array becomes active array
The O(1) scheduler was replaced because it hurt the performance of interactive tasks: once on the expired list, a task wouldn’t run again until the active array was exhausted. It also made no fairness guarantees.
CFS - only for non real-time tasks
• Red-Black tree (self balancing tree - all paths to leaves approximately same length)
• ordered by virtual runtime (vruntime - time spent on cpu)
• Always schedules task with least amount of vruntime on the cpu (generally leftmost task)
• periodically update current vruntime
• then compare current vruntime to leftmost node
• schedule whichever is lower
•if swapping to another task, move old task into proper place in tree
• vruntime progresses faster for lower-priority tasks
• vruntime progresses slower for higher-priority tasks
• Uses the same tree for all priorities
- selecting a task => O(1) time
- adding a task => O(log n)
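A rough sketch of the vruntime accounting described above (simplified weights; not the kernel’s actual code):

```c
#include <stdint.h>

/* Simplified: higher weight = higher priority. Actual runtime is scaled
   by NICE_0_WEIGHT / weight, so vruntime advances slower for
   high-priority tasks and faster for low-priority ones, which keeps
   high-priority tasks toward the left of the red-black tree. */
#define NICE_0_WEIGHT 1024u

struct sched_entity { uint64_t vruntime; uint32_t weight; };

void update_vruntime(struct sched_entity *se, uint64_t delta_exec_ns) {
    se->vruntime += delta_exec_ns * NICE_0_WEIGHT / se->weight;
}
```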
What are some performance counters that can be useful in identifying the workload properties (compute vs. memory bound) and the ability of the scheduler to maximize the system throughput.
Use a new metric, CPI (cycles per instruction), to estimate the job type (compute vs. memory bound) and adjust scheduling accordingly. Historic sleep-time info doesn’t work because memory-bound tasks don’t sleep while waiting on memory.
Sources of such information:
• hardware counters – cache misses, IPC, power + energy data
• software interfaces & tools – oprofile, Linux perf
What happens when a process tries to modify a page that’s write protected/how does COW work?
Tries to modify a page that’s write protected (and COW = copy on write):
• When a new process is created (forked), it gets a new virtual address space whose pages point to the same physical locations as the parent’s. This avoids copying EVERYTHING when that may not be needed.
• Those shared pages are then marked write protected.
• If either process tries to write to a write-protected page, the processor traps into the OS, which determines it’s a (COW) page fault, copies that page for the writing process, and updates its VA => PA mapping.
• This allows us to only copy what is necessary instead of the entire process. Saves space and time - like a lazy copy.
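A user-level illustration of the effect (runnable sketch; the kernel-side page copy happens transparently on the child’s first write):

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

int main(void) {
    char *buf = malloc(4096);
    strcpy(buf, "parent data");

    pid_t pid = fork();                /* child shares the parent's pages, marked read-only */
    if (pid == 0) {
        strcpy(buf, "child data");     /* write traps; the kernel copies just this page */
        printf("child sees:  %s\n", buf);
        exit(0);
    }
    waitpid(pid, NULL, 0);
    printf("parent sees: %s\n", buf);  /* still "parent data" */
    return 0;
}
```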
How do we deal with the fact that processes address more memory than physically available? What’s demand paging? How does page replacement work?
Processes won’t use all their memory all the time.
We can save physical page frames out to secondary storage.
Demand Paging:
• pages are swapped in/out between main memory and a swap partition (disk)
Page Replacement:
• if present bit is 0 (not in memory):
• traps into OS
• determines it’s a page fault
•Submits memory request to pull it from disk
• puts it into free location in physical memory (DRAM)
• Updates page table with new free location (VA is unchanged).
• program counter is reset so the instruction re-issues the memory access; this time the page will be present
How does address translation work? What’s the role of the TLB?
Address Translation:
• Virtual Address is mapped to a PA via the page table.
•VA consists of N bit offset (LSBs), then further chunks which represent indexes into the page table, page directory, table of directories, etc.
• Offset is ultimately used to get to the memory location inside the physical page.
TLB - cache of valid VA to PA translations. If a translation is present in the cache, there’s no need to walk the page table in memory.
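A small example of splitting a 32-bit VA into page number and offset (assuming 4 kB pages and a single-level table for simplicity):

```c
#include <stdint.h>
#include <stdio.h>

#define PAGE_SHIFT 12                       /* 4 kB pages -> 12 offset bits */
#define PAGE_SIZE  (1u << PAGE_SHIFT)

int main(void) {
    uint32_t va     = 0x00403a10;           /* example virtual address */
    uint32_t vpn    = va >> PAGE_SHIFT;     /* index into the page table */
    uint32_t offset = va & (PAGE_SIZE - 1); /* offset inside the physical page */

    /* with a page table: PA = (PFN looked up via vpn) << PAGE_SHIFT | offset */
    printf("VPN = 0x%x, offset = 0x%x\n", (unsigned)vpn, (unsigned)offset);
    return 0;
}
```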
Do you understand the relationships between the size of an address, the size of the address space, the size of a page, the size of the page table…
- address size: 2^32 or 2^64 (architecture)
- address space size / page size = # virtual pages (e.g., 2^32 / page size)
- page size often 4kB (8kB, 2MB, 4MB, 1GB also options)
Example:
• address size = 2^32
• page size = 4kB
• (2^32 / 2^12) entries * 4 B per entry = size of page table in memory = 4MB, PER PROCESS!
• If 64bit architecture, that number becomes enormous
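For comparison, a hypothetical flat table over a full 64-bit address space with 4 kB pages and 8 B entries: 2^64 / 2^12 = 2^52 entries * 8 B = 2^55 B = 32 PB per process - which is why flat page tables aren’t used there.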
Do you understand the benefits of hierarchical page tables? For a given address format, can you workout the sizes of the page table structures in different layers?
Benefits:
• Don’t need page table entries for the ENTIRE virtual address space - inner tables are allocated only for regions actually in use.
Format:
• [P1 | P2 | Offset] - offset concept remains the same, but each P# is an offset into each inner table.
• 2^(# offset bits) = page size
• 2^(Most inner P) is the number of pages per page table
• 2^(next level outward P) is the number of page tables
• Then directories, etc…
Tradeoffs:
+ smaller inner page tables mean finer granularity of coverage
- more memory access required for translation (increase translation latency)
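Worked example (assuming a 32-bit address split as [P1 = 10 | P2 = 10 | offset = 12]):
• page size = 2^12 = 4 kB
• pages per inner page table = 2^10 = 1024, so each inner table covers 2^10 * 2^12 = 4 MB of the address space
• number of inner page tables the outer directory can point to = 2^10 = 1024
• a process actually using only ~8 MB needs the outer directory plus about 2 inner tables, not entries for the whole 4 GB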
For processes to share memory, what does the OS need to do? Do they use the same virtual addresses to access the same memory?
The OS sets aside a region of physical memory and maps it into each process’s virtual address space. The virtual addresses in each process do not have to be the same.
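For example, with POSIX shared memory (the name "/demo_shm" and the size are arbitrary):

```c
#include <fcntl.h>
#include <stdio.h>
#include <string.h>
#include <sys/mman.h>
#include <unistd.h>

int main(void) {
    /* the OS backs this object with physical pages; every process that
       maps it gets its own virtual addresses for the same memory */
    int fd = shm_open("/demo_shm", O_CREAT | O_RDWR, 0600);
    ftruncate(fd, 4096);

    char *p = mmap(NULL, 4096, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
    strcpy(p, "hello from one process");     /* visible to any other process that maps it */
    printf("mapped at %p\n", (void *)p);     /* this address may differ per process */

    munmap(p, 4096);
    close(fd);
    shm_unlink("/demo_shm");
    return 0;
}
```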
What are the tradeoffs between message-based vs. shared-memory-based communication?
Tradeoffs:
• Message based is simple and has an established interface, and is not very error prone. But you must copy the data from user space into the kernel and then back into the second process’s user space, and you must context switch 4 times to do it.
• Shared memory requires heavy initial setup costs - mapping the shared memory into each process - but incurs no overhead beyond that. The space is shared between the two processes. It is much more error prone for developers, however, because there are no pre-determined signaling mechanisms.
What are different ways you can implement synchronization between different processes (think what kinds of options you had in Project 3).
- Shared Memory
- Mutexes + condition variables
- semaphores
- Message Queues
- Sockets
To implement a synchronization mechanism, at the lowest level you need to rely on a hardware atomic instruction. Why? What are some examples?
Because you need an operation that will both test the variable and then set it as a single operation, otherwise you could get race conditions. If not atomic, then multiple processors could see a lock was free and ‘acquire’ the lock and move into the critical section.
If it’s an atomic instruction, only one processor can ‘test and set’ the variable at a time.
Examples of atomic instructions: test_and_set, read_and_increment, compare_and_swap. (Spinlocks and test_and_test_and_set are lock constructions built on top of these.)
Note: On SMP systems, atomic instructions go to the memory instead of the cache so they can be ordered/synchronized. Therefore, they take much longer and generate coherence traffic regardless of change.
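For example, a simple spinlock built on a test-and-set style atomic, using C11 atomics:

```c
#include <stdatomic.h>

static atomic_flag lock = ATOMIC_FLAG_INIT;

/* atomically read the old value and set the flag in one step;
   if the old value was already set, someone else holds the lock */
void spin_lock(void)   { while (atomic_flag_test_and_set(&lock)) { /* spin */ } }
void spin_unlock(void) { atomic_flag_clear(&lock); }
```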
Why are spinlocks useful? Would you use a spinlock in every place where you’re currently using a mutex?
Spinlocks are useful when the critical section is very short. This avoids having to context switch and check conditions/mutexes, etc. (many cycles of work when the critical section may be really short).
No - really depends on what the critical section looks like.
It’s also important to note that spin-waiting can cause other processes to slow by consuming communication bandwidth (Anderson)
Do you understand why is it useful to have more powerful synchronization constructs, like reader-writer locks or monitors? What about them makes them more powerful than using spinlocks, or mutexes and condition variables?
They are less error prone and more expressive. Monitors can have enter/exit code that is implicitly executed for you so you don’t forget to do it, which prevents difficult-to-trace errors. They provide synchronization constructs whose granularity better matches the type of critical section being protected. Example: reader/writer locks are specific to the types of access they protect.
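For example, pthread reader/writer locks let many readers proceed concurrently while writers get exclusive access:

```c
#include <pthread.h>

static pthread_rwlock_t rwlock = PTHREAD_RWLOCK_INITIALIZER;
static int shared_counter;

int read_counter(void) {                  /* many readers may hold this at once */
    pthread_rwlock_rdlock(&rwlock);
    int v = shared_counter;
    pthread_rwlock_unlock(&rwlock);
    return v;
}

void increment_counter(void) {            /* writers exclude readers and other writers */
    pthread_rwlock_wrlock(&rwlock);
    shared_counter++;
    pthread_rwlock_unlock(&rwlock);
}
```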
Which lock types are best under low and high load (test_and_set, queuing, test_and_test_and_set, etc)?
Ultimately:
• High load:
  • queueing lock is BEST
  • static delay better than dynamic
• Low load:
  • test_and_test_and_set is best
  • queueing lock is worst
What are the basic differences in using programmed I/O vs. DMA support?
Programmed I/O:
• No additional HW support needed; interacts directly with the device’s command and data registers; takes multiple CPU store instructions; better for small data transfers.
DMA:
• Relies on DMA Controller; Less store instructions but DMA configuration is more complex and can take more overall CPU cycles; better for large data transfers.
So for a 1500 B packet with 8-byte device registers:
• DMA: 1 CPU store to request the packet transmission + 1 DMA configuration operation
• Programmed I/O: 1 CPU store for the command + 1500 B / 8 B ≈ 188 stores to the data register
DMA configuration is more complex (it can take more CPU cycles than a store), so for smaller transfers programmed I/O can still take fewer operations overall than DMA.
DMA controller requires data to be in memory buffer until transfer is complete - they are pinned - non-swappable
Do you understand the relationship between the various data structures (block sizes, addressing scheme, etc.) and the total size of the files or the file system that can be supported on a system?
Data Structures and Relationships:
• inodes point to file metadata + all the blocks that represent the file
• inodes have:
  • direct blocks
  • single indirect
  • double indirect
  • and so on...
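Worked example (assuming, e.g., 1 kB blocks, 4 B block pointers - so 256 pointers per block - and 12 direct pointers in the inode):
• direct blocks: 12 * 1 kB = 12 kB
• single indirect: 256 * 1 kB = 256 kB
• double indirect: 256 * 256 * 1 kB = 64 MB
• triple indirect: 256^3 * 1 kB = 16 GB
So block size, pointer size, and levels of indirection together bound the maximum file size.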
For the virtual file system stack, we mention several optimizations that can reduce the overheads associated with accessing the physical device. Do you understand how each of these optimizations changes how or how much we need to access the device?
- buffer/caching - reduce # accesses
- in main memory
- read/write from cache
- periodically flush to disk (fsync)
- I/O Scheduling - reduce disk head movement
- maximize sequential vs random access
- write block 25, write block 17 => write 17, 25
- prefetching - increase cache hits
- leverages locality
- eg, read block 17 => read also 18, 19
- journaling/logging - reduce random access
- “describe” write in log: block, offset, value …
- periodically apply update to proper disk locations
What’s hosted vs. bare-metal virtualization? What’s paravirtualization, why is it useful?
Hosted:
• Host OS has a VMM module that invokes drivers as needed
• Host owns all hardware
• KVM (Kernel Based VM)
Bare-Metal:
• A hypervisor (vmm) controls resource allocation to all of the VM’s present and controls access to the underlying hardware.
• Has a privileged service VM to deal with devices and device access
• ESX, Xen
Paravirtualization:
• A modified version of the OS so that it’s aware it is being virtualized.
• Makes explicit calls to hypervisor (hypercalls)
What were the problems with virtualizing x86? How does protection of x86 used to work and how does it work now?
Certain sensitive instructions would not trap into the hypervisor and instead failed silently, so the hypervisor never knew they had been issued.
Protection:
• Old Way:
• Only 4 rings, no support for root/non-root
• 17 instructions that were sensitive but not privileged, so they did not cause a trap and instead failed silently
• e.g., setting the interrupt enable/disable bit in a privileged register failed silently when issued from the ring 1 guest OS, so the VMM didn’t know the OS was trying to set it, and the OS assumed it had worked
- New Way:
- Rewrite binary of guest VM so it never issues those 17 instructions (dynamic binary translation)
- dynamically inspect code block. if bad instructions not found, execute at HW speeds, if found, emulate desired behavior (possibly avoiding trap)
- cache translated block to improve efficiency, only analyze kernel code
What is passthrough vs. a split-device model?
Passthrough:
• VMM-level driver configures device access permissions
+ guest VM has exclusive access to a device, and is only one who can access it and they have direct access.
- device sharing is difficult - no concurrent sharing
- VMM must have exact type of device as what VM expects (since it’s got direct access, very specific)
- VM migration is tricky b/c it must have same hardware and state at destination
Hypervisor-Direct:
• VMM intercepts all device accesses
• emulates device operation
+ device independent
- latency of device operations
- exposed to device driver ecosystem complexities/bugs in hypervisor
Split-Device:
•access control split between front-end (guest) and back end (service VM or host)
•back end is regular device driver that host/linux etc would use
• front end driver MUST be modified. Basically a wrapper that creates messages that are sent to the service VM instead of issuing requests directly to the hardware.
• limited to paravirtualized guests that can install special front end device drivers
+ eliminate emulation overhead
+ allow for better management of shared devices - b/c all requests are arbitrated by the service VM
What are the various design points that have to be sorted out in implementing an RPC runtime (e.g., binding process, failure semantics, interface specification… )? What are some of the options and associated tradeoffs?
RPC Requirements:
•Has synchronous call semantics
• error handling (type checking, packet bytes interpretation)
•cross machine conversion (endian-ness)
•higher level protocol (access control, fault tolerance, different transport protocols)
RPC Structure:
• Client calls fn that looks like a normal function
• This fn actually calls a stub that creates message and sends the request to the server
• server stub receives msg and deserializes/unpacks it
• server stub calls actual implementation and then process goes in reverse to return result
RPC Steps:
•register - server ‘registers’ procedures, arg types, location, etc so it can be discovered by clients
• binding - find and ‘binds’ to desired server that implements desired function
• client makes RPC
• client stub ‘marshals’ args (serialized into buffer)
• send - client sends message to server
• receive - server receives msg, passes to server stub
• stub ‘unmarshals’ args (extract args and create data structs)
• actual call: server stub calls the local implementation
• result: server computes results and follows steps backwards
RPC Design Points:
• Binding - how to find the server
• IDL - how to talk to the server and package data
• Pointers - disallow vs serialize pointed data
• Partial failures - special error notifications
What’s specifically done in Sun RPC for these design points – you should easily understand this from your project?
Binding - per-machine registry daemon (portmapper); the client talks to the registry on the target machine to find the server.
IDL - XDR (language agnostic specification and encoding)
Pointers - allowed and serialized
Partial Failures - retries; return as much info as possible
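A minimal client-side sketch (SQUARE_PROG, SQUARE_VERS, square_1 and square.h stand in for whatever rpcgen generated from your .x file - they are hypothetical names, not your actual project’s):

```c
#include <rpc/rpc.h>
#include <stdio.h>
#include "square.h"   /* assumed rpcgen output: SQUARE_PROG, SQUARE_VERS, and
                         the client stub  int *square_1(int *argp, CLIENT *clnt); */

int main(void) {
    /* binding: contact the registry (portmapper) on the target machine */
    CLIENT *clnt = clnt_create("localhost", SQUARE_PROG, SQUARE_VERS, "tcp");
    if (clnt == NULL) { clnt_pcreateerror("localhost"); return 1; }

    int arg = 5;
    int *res = square_1(&arg, clnt);        /* stub marshals the arg, sends, waits for the reply */
    if (res == NULL)
        clnt_perror(clnt, "call failed");   /* partial failure surfaces as an error */
    else
        printf("square(5) = %d\n", *res);

    clnt_destroy(clnt);
    return 0;
}
```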