7 - Paging Flashcards

1
Q

Outline how paged virtual memory works

A

Aim: allow a process to exist in non-contiguous memory

Diagram p3

  • CPU generates logical addresses (page, offset).
  • The page table is searched for the entry corresponding to page number p.
  • If the valid bit is set, the frame number f is read off and a physical address (frame number, offset) is formed.
  • The word at that physical address is fetched over the memory bus into a register, and is thus available to the process.

Paging scheme:

  1. Divide physical memory into frames, small fixed-size blocks.
  2. Divide logical memory into pages, blocks of the same size (typically 4kB)
  3. Each CPU-generated address is a page number p with page offset o.
  4. Page table contains associated frame number f
  5. Usually have many more page numbers than frame numbers, so also record whether the mapping is valid (valid bit). (A toy translation sketch follows.)
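A minimal C sketch of this translation, under toy assumptions (16-bit logical addresses, 4 kB pages; the flat page_table array and all names are illustrative, not from the handout):

```c
#include <stdint.h>
#include <stdio.h>

/* Toy parameters: 16-bit logical addresses, 4 kB pages => 16 pages. */
#define PAGE_SHIFT  12
#define OFFSET_MASK ((1u << PAGE_SHIFT) - 1)
#define NUM_PAGES   (1u << (16 - PAGE_SHIFT))

typedef struct { uint32_t frame; int valid; } pte_t;
static pte_t page_table[NUM_PAGES];

/* Translate logical (p, o) to physical (f, o); -1 signals a fault. */
static int64_t translate(uint16_t laddr)
{
    uint32_t p = laddr >> PAGE_SHIFT;   /* page number p */
    uint32_t o = laddr & OFFSET_MASK;   /* page offset o */
    if (!page_table[p].valid)
        return -1;                      /* valid bit clear: trap to OS */
    return ((int64_t)page_table[p].frame << PAGE_SHIFT) | o;
}

int main(void)
{
    page_table[3] = (pte_t){ .frame = 7, .valid = 1 };
    printf("0x%llx\n", (long long)translate(0x3ABC)); /* prints 0x7abc */
    printf("%lld\n",  (long long)translate(0x5000));  /* prints -1     */
    return 0;
}
```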
2
Q

WHY is hardware support required for paging and state some issues relating to the selection of page sizes.

A
  • Every memory access requires a read of the page table.
  • If the address arithmetic had to be performed in software on every memory access, execution would be far too slow.
  • Page size is typically defined by the hardware.
  • For hardware support to be effective, require the page size to be a power of two, e.g. ranging from 0.5 kB to 8 kB.
  • Power-of-two page sizes: can perform fast logical masking and shifting to get the page number and page offset, with no need for expensive arithmetic operations (see the sketch below).
  • e.g. a logical address space of 2^m bytes and a page size of 2^n bytes gives 2^(m - n) pages, so require (m - n) bits to specify the page number and n bits to specify the offset within the page.

?? Error in handout - page offset should be n bits ??
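A small C sketch of the power-of-two trick above: the shift/mask forms are equivalent to divide/modulo but far cheaper (assumes n = 12, i.e. 4 kB pages; the address value is arbitrary):

```c
#include <stdint.h>
#include <assert.h>

int main(void)
{
    const unsigned n = 12;              /* page size 2^12 = 4 kB        */
    uint32_t addr = 0xDEADBEEF;         /* an arbitrary logical address */

    /* Power-of-two page size: page number and offset fall out with a
     * single shift and a single mask ...                              */
    uint32_t p_fast = addr >> n;
    uint32_t o_fast = addr & ((1u << n) - 1);

    /* ... which is equivalent to general division and modulo, the
     * "expensive arithmetic" the hardware avoids.                     */
    uint32_t p_slow = addr / (1u << n);
    uint32_t o_slow = addr % (1u << n);

    assert(p_fast == p_slow && o_fast == o_slow);
    return 0;
}
```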

3
Q

Relationship between paging and dynamic relocation

A
  • Paging is itself a form of dynamic relocation: simply change the page table to reflect the movement of a page in memory.
  • This is similar to using a set of base + limit registers for each page in memory.
4
Q

What are some pros of paging

A

Clear separation between user (process) and system (OS) view of memory usage

How:

  1. Process sees single logical address space; OS does the hard work
  2. A process cannot address memory it doesn’t own: it cannot reference a page it has no mapping for
  3. OS can map system resources into the user address space, e.g. an IO buffer
  4. OS must keep track of free memory; typically in a frame table
  5. Easier for the OS to allocate memory, which no longer needs to be contiguous
  6. No external fragmentation (in physical memory)
5
Q

What are some cons of paging

A

Adds overhead, e.g. to context switching. How:

  1. Per process page table must be mapped into hardware on context switch
  2. The page table itself may be large and extend into physical memory
  3. Although no external fragmentation in physical memory, get internal fragmentation because a process may not use all of final page.
6
Q

What hardware support is used to implement paging?

A
Page tables (PTs) rely on hardware support:
Case 1: a set of dedicated relocation registers, e.g. PDP-11.
Case 2: keep the PT in memory; then only one MMU register is needed, the PTBR (page table base register).
Case 3: add a TLB (translation lookaside buffer).
7
Q

Describe how paging COULD be implemented using dedicated set of relocation registers

A

This is the simplest / most basic way of implementing the page table in hardware:
- What: keep a set of dedicated relocation registers
- One register per page
- OS loads the registers on context switch
- e.g. PDP-11: 16-bit addresses and 8 kB pages; 8 kB = 2^13 bytes, so #pages = 2^(16 - 13) = 8, hence 8 page table registers
- Every single memory reference goes through the PT, so these must be fast registers
BUT:
1. This is OK for small PTs; if we have millions of pages, we cannot store such a large page table in registers on the CPU.

8
Q

Describe how PTBR could be used to implement page table

A

Set of dedicated relocation registers OK if PT is small, but if we have many pages e.g. millions, cannot store in registers.

Solution : Keep PT in memory, then one MMU register needed, the PTBR (page table base register).

  • Context switches => OS switches only the PTBR
  • Problem: PTs may still be very big.
    => Keep a PT length register (PTLR) to indicate the size of the PT, so that out-of-range page numbers can be trapped as invalid and small tables need not be allocated at full size.
  • Problem: need to refer to memory twice for every “actual” memory reference.
    => Solution is to use a TLB (translation lookaside buffer).
9
Q

Factors involved in determining size of page

A
  • The architecture sets this …
    Considerations
  • Smaller pages => less internal fragmentation (a process may not use all of its final page).
  • But there is significant per-page overhead to using small pages: more pages means more PTEs and hence larger page tables, plus disk IO is more efficient with larger pages.
  • Typically 4 kB (in this tradeoff, memory lost to internal fragmentation is cheaper than the time overhead of small pages).
10
Q

Explain what problem the TLB is supposed to solve

A
  • When the PT is stored in memory (the PTBR/PTLR solution), we need to refer to main memory twice for every “actual” memory reference: (1) read the page table entry, (2) read the addressed word itself.
  • The TLB maintains a cache of recently accessed page numbers and their corresponding frame numbers in physical memory, exploiting locality of reference to avoid the doubled access time of reading the page table in the common case.
11
Q

Diagram showing TLB operation

A
Diagram p7 in §7
  1. When memory is referenced, present the TLB with the logical memory address
  2. If the PTE is present, get an immediate result
  3. Otherwise, make a memory reference to the PTs and update the TLB
  4. The latter case is much slower than a direct memory reference
12
Q

Explain some issues with TLB use

A

As with any cache - what do we do when it’s full.

  1. If full, discard entries, typically via an LRU policy
  2. Context switches require a TLB flush to prevent the next process using the wrong PTEs
  3. May mitigate this cost by tagging entries with a process / address-space ID, so a full flush is not needed

How are entries shared?

13
Q

TLB performance and 80-20 issue

A
  1. TLB performance is measured using hit ratio h = the proportion of times a PTE is found in the TLB.
  2. If t := TLB search time, and M := memory access time,
    TLB hit: (t + M)
    TLB miss: (t + 2M)
    So effective memory access time:

(h)(t + M) + (1-h)(t + 2M)

assuming the page table is stored in main memory.

Due to locality of reference, hit ratios may be high, e.g. 80%, but there is not much to be gained from increasing the hit ratio further (the 80-20 issue; worked example below).
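A short worked check of this formula, using the t = 20 ns, M = 100 ns figures that appear in the final card of this deck:

h = 0.80: 0.8 × (20 + 100) + 0.2 × (20 + 200) = 96 + 44 = 140 ns
h = 0.98: 0.98 × 120 + 0.02 × 220 = 117.6 + 4.4 = 122 ns

So raising the hit ratio from 80% to 98% improves the effective access time by only about 13%.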

14
Q

Explain why a multilevel page table is necessary.

A
  1. Most modern systems can support very large (2^32 bytes, 2^64 bytes) address spaces => very large page tables
  2. Don’t want to keep whole page table in main memory
  3. Solution is to split PT into several sub parts e.g. two parts, then page the page table.
15
Q

Explain how 2-level paging works (Diagram)

A
  1. Divide the page number into two parts, e.g. a 20-bit page number with a 12-bit page offset.
  2. Then divide the page number into outer and inner parts of 10 bits each.
  3. The MMU takes a logical address and:
    - uses the first ten bits of the page number, added to the page table base register, to find the correct entry in the process’s first-level page table;
    - reads off the location of the second-level page table, then indexes it with the next ten bits of the virtual address to obtain the frame number;
    - returns the value at the offset given by the last 12 bits within that frame.
(See the address-splitting sketch below.)
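A minimal C sketch of just the address-splitting step for this 10/10/12 layout (field positions assumed from the description above; the address value is arbitrary):

```c
#include <stdint.h>
#include <stdio.h>

/* Split a 32-bit logical address into a 10-bit outer index, a 10-bit
 * inner index, and a 12-bit page offset. */
int main(void)
{
    uint32_t laddr = 0x12345678;
    uint32_t outer = (laddr >> 22) & 0x3FF;  /* index into L1 page table */
    uint32_t inner = (laddr >> 12) & 0x3FF;  /* index into L2 page table */
    uint32_t off   =  laddr        & 0xFFF;  /* offset within the frame  */
    printf("outer=%u inner=%u offset=0x%03x\n", outer, inner, off);
    return 0;
}
```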
16
Q

Describe how the VAX architecture used paging

A

VAX
1. 32 bit architecture with 512 byte pages
2. Logical address space divided into 4 sections of 2^30 bytes.
3. Top 2 address bits designate section
4. Next 21 bits designate page within section.
5. Final 9 bits designate page offset
6. For a VAX with 100 pages, a one-level PT would be 4 MB; with sectioning it’s 1 MB. (?)

17
Q

Explain why two level paging is still not enough for 64 bit architectures

A
  1. For 4 kB pages, need 2^52 entries in a one-level page table (4 kB = 2^12 bytes, so 2^(64 - 12) = 2^52 pages)
  2. For a 2-level PT with a 32-bit outer page number, the outer PT alone would need 2^32 entries × 4 bytes = 16 GB
  3. Even some 32-bit machines have more than 2 levels: SPARC (32-bit) has a 3-level paging scheme, and the 68030 has 4-level paging
18
Q

Describe how X86 implements paging

A

Diagram p13 §7

  1. Page sizes of 4 kB or 4 MB. A lookup first goes to the page directory, indexed using the top 10 bits.
  2. The page directory address is stored in an internal processor register.
  3. The lookup (usually) yields the address of a page table.
  4. The next ten bits of the logical address index the page table, retrieving the page frame address.
  5. Finally, use the low 12 bits as the page offset.
  6. Note that the page directory and page tables are deliberately exactly one page each: 1024 four-byte entries = 4 kB, so the tables fit exactly into frames and can themselves be paged.
19
Q

Describe (DIAGRAMMATICALLY) how X86 implements paging

A

p 13 §7.

  1. Take the virtual address, use the top ten bits as the index to the PAGE DIRECTORY (a level 1 page table). This results in the address of a page table as well as other bits…
    - Each page table entry consists of four bytes, of which 20 bits are the L2 page table address (PTA)
  2. Next 10 bits used as index to the page table located at PTA - retrieves the page frame address (PFA)
  3. Use the low 12 bits of virtual address as the page offset.
20
Q

State protection issues associated with page table entries

A

We associate protection bits with each page kept in page tables and TLB (!)

e.g. FrameNumber - K R W X V D:
- Read permission (R)
- Write permission (W)
- Execute permission (X)
- Kernel mode only (K)
- Valid bit (V)
- Dirty / modified bit (D)
(See the PTE layout sketch below.)
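A sketch of how such bits might be packed into a 32-bit PTE using C bitfields; the exact layout is illustrative, not that of any particular architecture:

```c
#include <stdint.h>

/* Illustrative 32-bit PTE: 20-bit frame number plus protection bits.
 * Real architectures define their own layouts. */
typedef struct {
    uint32_t frame  : 20;  /* frame number                      */
    uint32_t kernel : 1;   /* K: accessible in kernel mode only */
    uint32_t read   : 1;   /* R: read permission                */
    uint32_t write  : 1;   /* W: write permission               */
    uint32_t exec   : 1;   /* X: execute permission             */
    uint32_t valid  : 1;   /* V: mapping valid                  */
    uint32_t dirty  : 1;   /* D: page modified                  */
    uint32_t unused : 6;
} pte_t;
```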

21
Q

Comment on protection using paging

A
  1. As the virtual address goes through the paging hardware, the protection bits can be checked: hardware-supported protection.
  2. Though this only gives page granularity, not byte granularity: assigning different permissions to different data means placing them in different pages => internal fragmentation.
  3. Any attempt to violate protection causes a hardware trap to OS code to handle.
  4. The entry in the TLB will have a valid / invalid bit indicating whether the page is mapped into the process address space; if invalid, trap to the OS handler, which may map the page or signal an error.
  5. Can do lots of interesting things here, particularly with regard to sharing and virtualisation.
22
Q

Explain the usefulness of shared pages

A
  1. Another advantage of paged memory is code / data sharing e.g.
    - Sharing binaries : editor, compiler
    - Libraries: shared objects (.so files), DLLs
23
Q

Explain how shared memory could be implemented using paging

A
  1. Implemented as two logical addresses which map to one physical address (two page table entries sharing one frame)
  2. If code is re-entrant (i.e. stateless, non-self-modifying) it can easily be shared between users, since no user ever changes the shared physical copy
  3. Otherwise, can use a copy-on-write technique…
    => Mark the page as read only in all processes
    => If a process tries to write to the page, it will trap to the OS fault handler
    => Can then allocate a new frame, copy the data, and create a new page table mapping (see the sketch below)
    => May use this for lazy data sharing too
  4. Evaluation:
    - Requires additional book-keeping in the OS, but worth it, e.g. many hundreds of MB of shared code even in a single-user system
    - How do unikernels change this?
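A hedged C sketch of the copy-on-write fault path in point 3; alloc_frame(), copy_frame(), and the refcount array are hypothetical OS helpers, not from the notes:

```c
#include <stdint.h>

#define NUM_FRAMES 1024

typedef struct { uint32_t frame : 20, write : 1, valid : 1; } pte_t;
struct proc { pte_t page_table[256]; };

static int frame_refcount[NUM_FRAMES];   /* sharers per physical frame */

/* Hypothetical helpers assumed to exist elsewhere in the OS. */
uint32_t alloc_frame(void);
void     copy_frame(uint32_t dst, uint32_t src);

/* Called from the fault handler when a process writes to a read-only
 * shared page. */
void cow_fault(struct proc *p, uint32_t page)
{
    uint32_t old = p->page_table[page].frame;
    if (frame_refcount[old] == 1) {
        p->page_table[page].write = 1;   /* last sharer: just unprotect */
        return;
    }
    uint32_t copy = alloc_frame();       /* allocate a new frame        */
    copy_frame(copy, old);               /* copy the shared data        */
    frame_refcount[old]--;
    frame_refcount[copy] = 1;
    p->page_table[page].frame = copy;    /* new, private, writable map  */
    p->page_table[page].write = 1;
}
```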
24
Q

Why does virtual addressing allow us to implement virtual memory?

A

Virtual addressing allows us to introduce the idea of virtual memory
=> Already have valid or invalid page translations; introduce “non-resident”
designation and put such pages on a non-volatile backing store
=> Processes access non-resident memory just as if it were “the real thing”

25
Q

What are the benefits of virtual memory?

A

Virtual Memory (VM) has several benefits:
1. Portability:
=> programs work regardless of how much actual memory present;
=> programs can be larger than physical memory
2. Convenience:
=> programmer can use e.g. large sparse data structures with impunity;
=> less of the program needs to be in memory at once, thus potentially
more efficient multi-programming, less IO loading/swapping program into
memory
3. Efficiency:
=> no need to waste (real) memory on code or data which isn’t used (e.g., error handling)

26
Q

Explain how virtual memory can be implemented with demand paging

A
  1. Programs (executables) reside on disk
  2. To execute a process, we load pages in on demand IE as and when they are referenced.
  3. Demand segmentation is also possible, but rare (e.g. Burroughs, OS/2) as it’s more difficult: segments have variable size, so replacement must find a contiguous region large enough, reintroducing placement and external fragmentation problems.
27
Q

Explain in details how demand paging works

A

When loading a new process for execution

  1. Create its address space (page tables, etc)
  2. Mark PTEs as either invalid (not part of the address space) or non-resident.
  3. Add the PCB to scheduler

Then, whenever the OS receives a page fault, check PTE:

=> If due to invalid reference, kill process
=> Else, due to non-resident page, so “page in” the desired page (separate question as to how).

28
Q

Explain how a demand paging system “pages in”, and when

A
  • OS receives page fault due to non-resident page.
    Pages in desired page:
    1. Find a free frame in memory
    2. Initiate disk IO to read in desired page
    3. When IO finished, modify PTE for this page to show that it is now valid.
    4. Restart the process at the faulting instruction
29
Q

Demand paging : issues

A
  1. Handling makes the fault invisible to the process ∴ logical transparency

BUT:

  1. Requires care to save process state correctly on the fault, so that the faulting instruction can be transparently restarted
  2. Can be particularly awkward on a CPU with pipelined decode, as partially executed instructions must be wound back to a precise state (e.g. MIPS, Alpha)
  3. CISC CPU: a single instruction can move lots of data, possibly across pages, so cannot simply restart the instruction; rely on help from microcode (e.g. to test addresses before writing), and can possibly use temporary registers to store moved data
  4. Instructions / data could span multiple pages ∴ multiple page faults per instruction, but this is infrequent due to locality of reference
  5. Pure demand paging (described previously) causes lots of page faults when a process begins ∴ many real systems explicitly load core parts of the process first
30
Q

Why do we need page replacement?

A
  1. To page in from disk, need a free frame of physical memory to hold data we’re reading in - but size of physical memory limited.

Two options:

  1. Discard unused pages if the total demand for pages exceeds the physical memory size
  2. Or swap out an entire process to free some frames
31
Q

Describe : page fault handling with page replacement

A

Page fault occurs due to non-resident page
1. Locate the desired page on disk
2. Select a free frame for the incoming page:
2a. If there is a free frame, use it; otherwise select a victim page to free
2b. Then write the victim page back to disk (unless it is unmodified)
2c. Finally mark the victim as invalid / non-resident in its process’s page tables
3. Read desired page into the now free frame
4. Restart the faulting process from where it left off.

… thus, having no free frames effectively doubles the page fault service time.
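A hedged C sketch of this fault path (it also covers the page-in flow of the previous cards); all helper functions are hypothetical stand-ins for the rest of the OS:

```c
#include <stdint.h>

typedef struct { uint32_t frame : 20, valid : 1, dirty : 1; } pte_t;

/* Hypothetical helpers assumed provided elsewhere in the OS. */
int    find_free_frame(void);          /* returns -1 if none free     */
int    choose_victim(void);            /* replacement policy          */
pte_t *pte_of_frame(int frame);        /* reverse map frame -> PTE    */
void   write_to_disk(int frame);
void   read_from_disk(int frame, pte_t *pte);

/* Handle a fault on a non-resident page. */
void page_in(pte_t *pte)
{
    int frame = find_free_frame();
    if (frame < 0) {                   /* no free frame: must evict    */
        frame = choose_victim();
        pte_t *victim = pte_of_frame(frame);
        if (victim->dirty)
            write_to_disk(frame);      /* write back only if modified  */
        victim->valid = 0;             /* mark victim non-resident     */
    }
    read_from_disk(frame, pte);        /* disk IO to read desired page */
    pte->frame = (uint32_t)frame;
    pte->valid = 1;                    /* now resident                 */
    /* ... finally, restart the faulting process where it left off.   */
}
```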

32
Q

Adding a dirty bit to the PTE to improve the efficiency of page replacement : HOW?

A

Add dirty bit to a page’s PTE

  1. If page not modified (dirty bit not set) no need to write out when its frame is evicted.
  2. If page is marked read only e.g. contains binary executable code, no need to write out either (unmodifiable).
33
Q

Page replacement : how do we choose our victim frame ? State objectives.

A

Importance of choosing a good victim frame:
=> A key factor in an efficient VM system: evicting a page that we’ll need in a few
instructions time can get us into a really bad condition
Aim
=> We want to ensure that we get few page faults overall, and that any we do get
are relatively quick to satisfy

In general, page replacement algorithms:
=> All aim to minimise the page fault rate
=> Candidate algorithms are evaluated by (trace-driven) simulation using reference strings

34
Q

Page replacement algorithms : First in first out : HOW??

A
  1. Keep a FIFO queue of pages
  2. Discard from the head

35
Q

Evaluate : FIFO page replacement

A
  1. Performance is hard to predict BECAUSE we have no idea whether the replaced page will be used again or not:
    eviction is independent of how heavily the page is used.

Generally, simple but inefficient, e.g.:

  2. Can discard a page currently in use, causing an immediate fault and the next page in the queue to be replaced => system slowdown.
  3. Possible to get more faults as the number of available frames increases: Belady’s Anomaly. (Need some questions on this!)
36
Q

State OPT / MIN page replacement algorithm

A
  1. Replace the page which will not be used again for the longest period of time.

How:
=> Cannot be implemented in general, since we cannot know how long a page will remain unused!

Why:
=> Provides good baseline for other algorithms - how close they can get to theoretical best performance.

37
Q

Page replacement : least recently used : LRU : state

A
  1. Replace page which hasn’t been used for longest amount of time.
  2. Equivalent to OPT with time running backwards (???)
38
Q

Evaluate : LRU page replacement

A
  1. Assumes the past is a good predictor of the future
  2. Can still end up replacing pages that are about to be used
  3. Generally considered quite good, but may require substantial hardware assistance, since the LRU ordering must be updated on every single memory reference
  4. BUT how do we determine the LRU ordering? (See the following cards on implementation.)
39
Q

Describe how to implement LRU using counters

A
  1. Give each PTE a time-of-use field and give the CPU a logical clock (counter)
  2. Whenever a page is referenced, its PTE is updated to clock value
  3. Replace page with smallest time value
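A C sketch of the counter scheme, with hypothetical per-page arrays standing in for PTE fields; on real hardware the on_reference update would have to happen on every memory access, which is exactly the impractical part (next card):

```c
#include <stdint.h>

#define NUM_PAGES 256

static uint64_t logical_clock;            /* CPU's logical clock       */
static uint64_t time_of_use[NUM_PAGES];   /* per-PTE time-of-use field */
static int      resident[NUM_PAGES];

/* Done (conceptually by hardware) on every reference to a page. */
void on_reference(int page) { time_of_use[page] = ++logical_clock; }

/* Victim = resident page with the smallest time-of-use (linear scan). */
int choose_victim(void)
{
    int victim = -1;
    for (int p = 0; p < NUM_PAGES; p++)
        if (resident[p] &&
            (victim < 0 || time_of_use[p] < time_of_use[victim]))
            victim = p;
    return victim;
}
```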
40
Q

Evaluate : implementing LRU with counters

A
  1. Requires a search to find minimum counter value
  2. Adds a write to memory (PTE) on every memory reference
  3. Must handle clock overflow
  4. Impractical on a standard processor to add extra memory writes to every single memory reference made.
41
Q

Describe how to implement a LRU page replacement algorithm using a page stack.

A
  1. Maintain a stack of pages (doubly linked list) with MRU (most recently used) page on top
  2. Discard from bottom of stack
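A C sketch of the stack scheme; note the (up to) six pointer updates per reference, which the next card evaluates:

```c
#include <stddef.h>

struct page { struct page *prev, *next; };

static struct page *mru;   /* top of stack: most recently used */
static struct page *lru;   /* bottom of stack: the victim      */

/* Move a resident page to the top of the stack on each reference. */
void touch(struct page *p)
{
    if (p == mru) return;
    if (p->prev) p->prev->next = p->next;   /* 1: unlink             */
    if (p->next) p->next->prev = p->prev;   /* 2: unlink             */
    if (p == lru) lru = p->prev;            /* keep bottom pointer   */
    p->prev = NULL;                         /* 3: now topmost        */
    p->next = mru;                          /* 4: old top below us   */
    if (mru) mru->prev = p;                 /* 5: back-link old top  */
    mru = p;                                /* 6: update top pointer */
}
```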
42
Q

Evaluate LRU page replacement using page stack

A
  1. Requires changing (up to) 6 pointers per [new] reference
  2. This is very slow without extensive hardware support
  3. Also impractical on a standard processor
43
Q

Describe how to approximate LRU by NRU replacement. State what hardware support is required.

A
  1. Have a reference bit in the PTE
  2. Reference bit initially zeroed by OS when first paged in.
  3. R set by hardware whenever the page is referenced.
  4. After some time has passed, consider those pages with the bit set to 1 to be ACTIVE and implement NRU (not recently used) replacement:
    => Periodically (e.g. every 20 ms) scan the page table and clear all reference bits
    => When choosing a victim to evict, prefer pages with clear reference bits
    => If we also have a modified / dirty bit in the PTE, can use that too

Priorities (see the sketch below):
(Referenced and dirty) => bad choice
(Referenced but not dirty) => probably code in use
(Not referenced but dirty) => next best, requires write back
(Not referenced, not dirty) => best choice to replace
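A C sketch of NRU victim selection using the four classes above (the arrays are hypothetical stand-ins for PTE bits):

```c
#include <stdint.h>

#define NUM_PAGES 256

static uint8_t referenced[NUM_PAGES];  /* R bit, set by hardware     */
static uint8_t dirty[NUM_PAGES];       /* D bit, set on modification */
static uint8_t resident[NUM_PAGES];

/* Class 0 = !R,!D (best victim), 1 = !R,D, 2 = R,!D, 3 = R,D (worst). */
int choose_victim(void)
{
    int best = -1, best_cls = 4;
    for (int p = 0; p < NUM_PAGES; p++) {
        if (!resident[p]) continue;
        int cls = (referenced[p] << 1) | dirty[p];
        if (cls < best_cls) { best_cls = cls; best = p; }
    }
    return best;                       /* -1 if nothing is resident  */
}
```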

44
Q

How could a system improve the accuracy of approximate LRU by using extra bits….

A

Instead of just a single bit, the OS:

  1. Maintains an 8-bit value per page, initialised to zero.
  2. Periodically (e.g. 20 ms) shift reference bit onto higher order bit of the byte AND clear the REFERENCE bit.

Replacement: select the lowest-valued page (or one of …) to replace

  • Keeps history for the last 8 clock sweeps
  • Interpreting the bytes as unsigned integers, the LRU page is the one with the minimum value: more recently referenced pages have higher-order bits set
  • May not be unique, but this gives a candidate set from which to pick a victim (see the sketch below)
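A C sketch of this aging scheme (arrays again stand in for per-page PTE state):

```c
#include <stdint.h>

#define NUM_PAGES 256

static uint8_t age[NUM_PAGES];        /* 8-bit history per page       */
static uint8_t ref_bit[NUM_PAGES];    /* set by hardware on reference */

/* Run every clock sweep (e.g. 20 ms): shift the reference bit onto
 * the high-order bit of the age byte, then clear the reference bit. */
void clock_sweep(void)
{
    for (int p = 0; p < NUM_PAGES; p++) {
        age[p] = (uint8_t)((age[p] >> 1) | (ref_bit[p] << 7));
        ref_bit[p] = 0;
    }
}

/* LRU candidate = page with the minimum age value (may not be unique). */
int choose_victim(void)
{
    int victim = 0;
    for (int p = 1; p < NUM_PAGES; p++)
        if (age[p] < age[victim]) victim = p;
    return victim;
}
```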
45
Q

Explain further improvement to approximate LRU using second chance FIFO

A

How:

  1. Store pages in queue as per FIFO
  2. Before discarding the head, check reference bit.
  3. If reference bit = 0, discard; ELSE clear the reference bit and give the page a second chance (add it to the tail of the queue). See the sketch below.
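A C sketch of second chance in its circular-buffer ("clock") form, which is equivalent to re-queueing a page at the tail of the FIFO:

```c
#include <stdint.h>

#define NUM_PAGES 8

static uint8_t ref_bit[NUM_PAGES];    /* set by hardware on reference  */
static int     fifo[NUM_PAGES];       /* resident pages in FIFO order  */
static int     head;                  /* oldest page: candidate victim */

/* Guaranteed to terminate within one full cycle: if every bit is set,
 * all get cleared and the original head is evicted (plain FIFO). */
int choose_victim(void)
{
    for (;;) {
        int p = fifo[head];
        if (ref_bit[p] == 0) {
            head = (head + 1) % NUM_PAGES;  /* advance past the victim;
                                               caller reuses the slot  */
            return p;
        }
        ref_bit[p] = 0;                     /* second chance granted   */
        head = (head + 1) % NUM_PAGES;
    }
}
```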
46
Q

Evaluate improved approximate LRU using second chance FIFO

A
  1. Guaranteed to terminate after at most one cycle; in the worst case, when all pages are referenced, second chance devolves into plain FIFO
  2. A page given a second chance is the last to be replaced, so recently referenced pages survive the current sweep
47
Q

Explain TLB OPERATION

A

When memory is referenced we present the TLB with a logical memory address

  1. If the PTE is present we get an immediate result (parallelised search / readoff)
  2. Otherwise we make a memory reference to PTs and update the TLB…
  3. Latter case is definitely slower than the direct memory reference.

Nice diagram p9 of §7 (another question, sketch / explain)

48
Q

Explain and sketch the diagram on p9 of §7 (TLB operation)

A
  1. CPU generates logical address (page number, offset)
  2. The logical address (p, o) is sent in parallel to the TLB, which contains a cache of recently used page numbers and their corresponding frame numbers in physical memory; the lookup is content-addressable (associative) => fast
  3. If a match is found in the TLB then we can use the frame number and offset to access memory only once to get the value we want.
  4. If a match is not found, we access the page table (itself in memory) to get the frame number and make a further access to get the value stored at the offset within the frame in physical memory.
  5. The page-frame mapping is inserted into the TLB so that (cf. locality of reference) if the same page is accessed again we can get its frame number very quickly.
49
Q

Discuss issues with using a TLB

A

As with any cache:
1. What to do when it’s full?
1a. Discard entries, typically by an LRU (least recently used) policy
1b. Context switches require a TLB flush to prevent the next process using the wrong PTEs
1c. May mitigate this cost through process tags on TLB entries, avoiding a full flush
2. How are entries shared?

50
Q

Explain application of 80-20 rule in TLB performance

Assume TLB search time of 20 ns,
Memory access time of 100 ns,
hit ratio of 80% vs 98 %

A

TLB performance is measured in terms of a hit ratio
=> Hit ratio h = proportion of times a PTE is found in the TLB
=> Effective memory access time = h × (TLBtime + MAT) + (1 - h) × (TLBtime + 2 × MAT)
Case 1 (h = 0.80): 0.8 × 120 + 0.2 × 220 = 140 ns
Case 2 (h = 0.98): 0.98 × 120 + 0.02 × 220 = 122 ns
Only a ~13% improvement, which may not be worthwhile if locality of reference already makes the hit ratio high (the 80-20 principle).