Memory Flashcards

1
Q

To protect against the meltdown attack, the kernel address space is largely hidden from user mode and an explicit address space switch takes place when entering the kernel. Explain which hardware feature can reduce the cost of this switch and how
it works.

A

The cost can be reduced by using tagged TLBs. With tagged TLBs, every address space is assigned a tag and only TLB entries with the current tag are used for translation, thus eliminating the need to flush the TLB on kernel entries and exits.

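As an illustration, the tag can be modeled as part of the TLB lookup key, so entries of other address spaces simply never match and nothing has to be flushed on a switch. A simplified sketch (tags and frame numbers are made up, not real MMU behavior):

```python
# Simulated tagged TLB: entries survive address space switches because
# lookups only match the currently active tag.
class TaggedTLB:
    def __init__(self):
        self.entries = {}  # (tag, vpn) -> pfn

    def insert(self, tag, vpn, pfn):
        self.entries[(tag, vpn)] = pfn

    def lookup(self, tag, vpn):
        # Only entries carrying the current tag take part in translation.
        return self.entries.get((tag, vpn))

tlb = TaggedTLB()
tlb.insert(tag=1, vpn=0x10, pfn=0x2A)   # user address space
tlb.insert(tag=2, vpn=0x10, pfn=0x99)   # kernel address space
# Switching tags needs no flush: both translations stay cached.
assert tlb.lookup(1, 0x10) == 0x2A
assert tlb.lookup(2, 0x10) == 0x99
```
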
2
Q

Which property of a forward page table determines the maximum size of the physical
address space?

A

The size of the physical address space is determined by the width of the page frame number field.

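A quick back-of-the-envelope check; the 40-bit PFN width and 4-KiB pages below are assumed example values:

```python
# Physical address space = number of addressable frames * frame size.
pfn_bits = 40          # assumed width of the PFN field in a PTE
page_size = 4096       # 4-KiB frames
phys_space = (1 << pfn_bits) * page_size
assert phys_space == 1 << 52   # 40-bit PFN + 12-bit offset -> 2^52 bytes
```
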
3
Q

Linear page table

A

A linear page table is a single-level array of PTEs indexed by the virtual page number: each page table entry (PTE) maps one virtual page to a physical page frame in memory.
It is suited for systems with small or mostly filled virtual address spaces.

4
Q

Inverted page table

A

Each entry corresponds to a physical page frame in memory and contains information such as the virtual page number and the ID of the process that the page frame belongs to.
It makes sense when the physical address space is considerably smaller than the virtual address space.

5
Q

Which three conditions must apply so that external fragmentation can occur for a set
of allocations?

A

Different allocation lifetimes
Different allocation sizes
No relocation of previous allocations

6
Q

Name an address space segment that is never part of an ELF file

A

Heap
Stack
Dynamically created virtual memory areas

7
Q

ELF File

A

Executable and Linkable Format:
text section, data section, BSS section, symbol table, relocation table, dynamic linking table

8
Q

Explain a method for the OS to determine that a memory area has been written to when using page-based memory management on x86. Specify the accuracy with which the location, time, and number of accesses can be determined.

A

Dirty bit: On a write access, the MMU sets the dirty bit in the page table entry (PTE) of the target page. By periodically inspecting and resetting dirty bits, the OS can detect that at least one write access to a page (i.e., page granularity) has taken place since the last reset. However, it is not possible to derive the exact offset within the page, the number of accesses, or the exact point in time.

Page faults: If the pages of the respective memory area are write-protected, page faults can be used to detect write accesses. However, page faults are much more costly, and single-stepping or emulation is needed to allow the attempted write access to actually be performed. This leads to excessive runtime overhead. This method provides the exact addresses, points in time, and number of accesses.

9
Q

All elements are stored sequentially on separate 4-KiB pages in a hypothetical data structure. To sort the data structure, the virtual addresses of the elements should
be adjusted through the page tables instead of copying the elements back and forth
in memory. Discuss the advantages and disadvantages of this approach

A

+ We only have to adjust two PTEs to exchange two elements in the data structure. This is especially advantageous if the elements are large and we save copy time.

- Since the respective TLB entries have to be invalidated, this method leads to a large number of TLB misses and subsequent page table walks. At the same time, page table structure caches may be less effective as the page tables have been modified.

10
Q

Name two properties of a forward page table that determine the size of the virtual address space

A

Number of levels
Size of page
Size of PTEs

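For a concrete check, plugging in the x86-64 4-level layout (4-KiB pages, 8-byte PTEs, four levels) gives the well-known 2^48-byte virtual address space:

```python
page_size = 4096
pte_size = 8
levels = 4
entries_per_level = page_size // pte_size      # 512 PTEs per page-table page
pages = entries_per_level ** levels            # mappable virtual pages
vas = pages * page_size
assert vas == 1 << 48                          # 256 TiB
```
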
11
Q

What happens to dynamically-allocated memory after program termination, if it has not been freed with free() before?

A

The memory is automatically freed by the OS in the course of releasing the underlying memory pages

12
Q

What is meant by Bélády’s anomaly

A

With FIFO page replacement, it is possible to construct a reference string that performs worse with N+1 page frames than with N page frames.

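The anomaly can be reproduced with a short FIFO simulation using the classic reference string:

```python
from collections import deque

def fifo_faults(refs, nframes):
    """Count page faults under FIFO replacement."""
    frames, queue, faults = set(), deque(), 0
    for page in refs:
        if page in frames:
            continue
        faults += 1
        if len(frames) == nframes:
            frames.discard(queue.popleft())   # evict the oldest page
        frames.add(page)
        queue.append(page)
    return faults

# Classic reference string demonstrating Belady's anomaly:
refs = [1, 2, 3, 4, 1, 2, 5, 1, 2, 3, 4, 5]
assert fifo_faults(refs, 3) == 9
assert fifo_faults(refs, 4) == 10   # more frames, yet more faults
```
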
13
Q

Explain for which allocation pattern it makes sense to use arena allocation. What
advantages does it offer in this case?

A

Arena allocation is suitable for peak allocation patterns, where many allocations are made without any intermediate frees and the objects are freed all at once at the end. Using arena allocation in this case saves metadata and time: an allocation is just a pointer increment, and freeing returns the whole arena in one call.

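A bump-pointer arena can be sketched in a few lines (simplified model, no alignment handling):

```python
# Minimal arena (bump-pointer) allocator sketch: allocation is a pointer
# increment; freeing releases the whole arena at once.
class Arena:
    def __init__(self, size):
        self.size, self.offset = size, 0

    def alloc(self, nbytes):
        if self.offset + nbytes > self.size:
            raise MemoryError("arena exhausted")
        addr = self.offset
        self.offset += nbytes          # the only per-allocation bookkeeping
        return addr

    def release(self):
        self.offset = 0                # frees every allocation in one call

arena = Arena(1024)
a, b = arena.alloc(100), arena.alloc(200)
assert (a, b) == (0, 100)
arena.release()
assert arena.alloc(50) == 0            # memory is reusable afterwards
```
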
14
Q

How are memory-mapped files basically implemented in an operating system? In
particular, explain how the consistency of the mapping is ensured in the presence of
writes from foreign processes (e.g., with the write() system call).

A

The OS loads the file into a system-wide file cache in kernel memory. Memory mappings of the file are established by mapping the same cache pages into the respective user address spaces. To guarantee consistency, writes via the write() syscall are directed to those same cache pages in kernel memory. The OS periodically flushes the data to the actual file on disk to keep the on-disk representation consistent (write-back cache).

15
Q

Explain two disadvantages of simple base and limit registers compared to segment tables.

A
  • With base and limit registers, all memory of a process needs to be contiguous in physical memory, as there is only one pair of registers describing the virtual address space layout. Segment tables, on the other hand, can map different areas of physical memory independently.
  • Sharing memory is very difficult with base and limit registers, as the memory regions have to overlap in physical memory. It becomes impossible to share memory between more than two processes without breaking isolation.
  • Base and limit registers generally do not allow assigning specific protections to certain areas of a process.
16
Q

Why can the heap conceptually act as a stack, but not vice versa?

A

The lifetime of data in a stack frame is limited to the thread's execution of the corresponding function, whereas heap allocations can have arbitrary lifetimes. A stack's LIFO discipline can therefore be emulated on the heap, but arbitrary heap lifetimes cannot be forced into the stack's LIFO order.

17
Q

Give two reasons why it makes sense to maintain CPU-local lists for recently freed memory areas in a memory allocator.

A
  • As the lists are CPU-local, they do not need synchronization when memory blocks are allocated from them, which makes allocation faster.
  • The memory areas are more likely to be hot in the local CPU cache, so that accesses do not cause cache misses (even for the first writes).
18
Q

Explain the advantage of tagging in a TLB.

A

With tagging support in the TLB, the OS does not have to flush the whole TLB when an address space switch occurs. This allows (some) translations to remain cached in the TLB and reduces the TLB miss rate (and thus memory access latency) when switching between address spaces.

19
Q

Explain for which scenarios linear and inverted page table types are suitable

A

Linear page table: It is particularly suited for systems with small or mostly filled virtual address spaces. In these cases, there is no benefit in using multiple levels for translation due to the small size and missing sparseness. Instead, the additional levels only increase translation costs and waste memory.

Inverted page table: It makes sense if the physical address space is considerably smaller than the virtual address space. This way, less memory is wasted on page tables

20
Q

A process calls fork(). Describe for the parent and the child process, how each address space has to be configured to allow copy-on-write (COW).

A

Both the parent and the child process map to the same page frames. The shared pages therefore need to be marked read-only in both address spaces. This way, a write access triggers a page fault and the OS is invoked, which can create a private copy.

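The COW mechanics can be modeled as a toy sketch (page tables as dicts, frames as a dict; all names and numbers are hypothetical, not a real kernel API):

```python
# Toy copy-on-write: parent and child share frames; every shared mapping is
# write-protected, and a write fault triggers a private copy.
frames = {0: b"hello"}                       # frame id -> contents
next_frame = 1

def fork_mapping(parent):
    # Child gets the same frames; both mappings become read-only.
    child = {vpn: (frame, False) for vpn, (frame, _) in parent.items()}
    for vpn, (frame, _) in parent.items():
        parent[vpn] = (frame, False)
    return child

def write(mapping, vpn, data):
    global next_frame
    frame, writable = mapping[vpn]
    if not writable:                         # page fault: break the sharing
        frames[next_frame] = frames[frame]   # copy the frame contents
        mapping[vpn] = (next_frame, True)
        frame, next_frame = next_frame, next_frame + 1
    frames[frame] = data

parent = {7: (0, True)}
child = fork_mapping(parent)
write(child, 7, b"world")                    # only the child's copy diverges
assert frames[parent[7][0]] == b"hello"
assert frames[child[7][0]] == b"world"
```
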
21
Q

Briefly explain the term ASID.

A

ASID is the abbreviation for address space identifier and it associates a TLB entry with an address space to avoid having to flush all TLB entries on a context switch.

22
Q

What is the difference between a software- and a hardware-managed TLB?

A

With a software-managed TLB, the hardware does not walk the page table in memory on a TLB miss. Instead, the OS has to resolve the TLB miss and manually insert the TLB entry.

23
Q

Explain the terms demand paging and pre-paging. Which method is expected to use more memory? Why can this method nevertheless be advantageous?

A

With demand paging, pages are loaded into the address space only on their first access. In contrast, pre-paging loads all contents into memory at the process start. Consequently, with pre-paging, the probability is higher that more memory is used. Nevertheless, the method avoids page faults at runtime and can thus be faster.

24
Q

Give two disadvantages of segment-based memory management compared to page-based memory management.

A
  • Segments need to be kept contiguous in physical memory.
  • Segments can only be swapped in and out as a whole.
  • Segmentation may suffer from external fragmentation.
25
Q

An already selected heap page should be swapped out from the address space of process A and made available to process B as a free page. Explain which basic steps are necessary for this operation. Assume that the kernel has its own mapping
of the physical page.

A

(1) Invalidate the user mapping in process A (unset the P-bit in the corresponding PTE)
(2) Flush the TLB entry
(3) Write the page to the swap file via the kernel mapping and record the swap offset (e.g., in the invalidated PTE)
(4) Clear the page via the kernel mapping
(5) Create a new user mapping in process B (configure the PTE with the correct frame number and set the P-bit)

25
Q

What is meant by the term thrashing?

A

The system is under heavy memory pressure and busy swapping pages in and out.
CPU utilization is low as processes wait for pages to be fetched from secondary storage.

26
Q

Why can the calculation of working sets be useful in thrashing?

A

The working set comprises all pages that are currently needed by a process to make progress. In situations of high memory pressure, this information can drive page replacement decisions so that less recently accessed pages are swapped out first. In critical situations, the working set can be used to select processes for suspension.

27
Q

Discuss the advantages and disadvantages of bitmaps versus singly-linked lists (one page per bit and list node) for managing free memory pages in terms of memory consumption and algorithmic complexity for allocating a page.

A

Memory consumption: The bitmap is very compact (i.e., 1 bit per page), but the list nodes for free memory can usually be placed in the free memory itself. So in this case the list can be more efficient.

Algorithmic complexity: Finding free memory in the bitmap is an O(n) operation with n being the number of pages in the system. For a singly-linked list it is O(1).
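Both strategies can be contrasted in a small sketch (page numbers are made up):

```python
# Free-page tracking: a bitmap scan is O(n), a free-list pop is O(1).
bitmap = [False] * 6 + [True] + [False] * 9   # True = free, 1 bit per page

def bitmap_alloc(bm):
    for i, free in enumerate(bm):             # linear scan over all pages
        if free:
            bm[i] = False
            return i
    return None

free_list = [6]                                # node could live in the free page itself

def list_alloc(fl):
    return fl.pop() if fl else None            # constant-time

assert bitmap_alloc(bitmap) == 6
assert list_alloc(free_list) == 6
```
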

28
Q

How could memory organization and address translation be adapted in principle to save memory for sparse address spaces (no calculation necessary)? The size of the address space should remain the same.

A

The page size could be reduced while increasing the number of levels.

29
Q

Explain the term demand paging. What is the disadvantage of this method?

A

With demand paging, pages in the virtual address space are actually allocated in physical memory and filled with contents only on their first access. A disadvantage is that programs may experience many page faults at startup, which can be less efficient from a performance perspective than pre-paging.

39
Q

Explain the terms external fragmentation and internal fragmentation.

A

Internal fragmentation:
If a memory manager only offers memory blocks of fixed sizes, the difference between the requested size and block size cannot be used and is called internal fragmentation.

External fragmentation:
The free space in physical memory may be too scattered to allow the allocation of a segment of a certain length, even if enough total free space is available. The sum of the currently not usable memory is called external fragmentation.

40
Q

State one way to reduce internal fragmentation when using paging and give two disadvantages which are caused by this measure.

A

Internal fragmentation can be reduced by choosing a smaller page size.
* A system with a smaller page size uses more pages, so the size of the page table increases
* The TLB reach (the amount of memory that the TLB can keep track of) is reduced
* More virtual-to-physical translations are required, which leads to additional overhead
* Spatial locality might be decreased. This can hurt cache performance and result in higher access times when using an HDD
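The effect of the page size on internal fragmentation is easy to quantify for a single allocation (example sizes assumed):

```python
# Internal fragmentation of one allocation: the unused tail of its last page.
def internal_frag(alloc_size, page_size):
    return -alloc_size % page_size

assert internal_frag(5000, 4096) == 3192   # 2 pages of 4 KiB for 5000 B
assert internal_frag(5000, 1024) == 120    # smaller pages waste less...
# ...but 5000 B now needs 5 PTEs instead of 2, growing the page table.
```
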

41
Q

Explain, why the operating system cannot choose an arbitrary format for TLB entries on a system with a software-managed TLB.

A

The TLB entry format is specified by the instruction set architecture (ISA) of the respective CPU. Even with a software-managed TLB, the TLB is a hardware component specified by the CPU manufacturer. The format of the page table entries, however, can be chosen by the operating system.

42
Q

Name and explain one advantage and one disadvantage of physical tagging compared to virtual tagging.

A

+ No ambiguity (homonym problem): a virtual address might point to different physical addresses at different points in time. When using physical tags, homonyms can be detected, which means that the cache does not have to be flushed on each context switch.
+ Synonyms (multiple virtual addresses point to the same physical address) can be detected by checking all entries of the cache.
+ Easy write-back: if a cache entry needs to be written back to main memory, no additional TLB lookup is required.

- A TLB lookup is required to translate the virtual address into the physical address that is used for tagging. This translation takes time and can increase the latency of the cache.
43
Q

Which coherence problem arises when using a virtually indexed, physically tagged cache?

A

Aliasing (coherence problem with synonyms). Virtual addresses that point to the same physical address might be mapped to different cache sets.

44
Q

Give two advantages and two disadvantages of page-based virtual memory compared to direct physical addressing.

A

+ Isolation (process ↔ process, process ↔ kernel)
+ Sharing (e.g., libraries)
+ Memory overcommitment can be implemented
+ Easier allocation of physical memory (i.e., fragmentation allowed)

- More complex hardware necessary (MMU + TLB)
- Higher access latency (TLB lookup, page table walk on TLB miss)
- Context switching costs increase due to necessary TLB flushes (strictly speaking, this is only a disadvantage of operating systems with multiple virtual address spaces)
- Kernel entries become necessary, as only the OS is allowed to manipulate the address space
- More complex to implement
- Memory overhead for page tables

45
Q

What general tasks does an operating system have to perform in order to implement page-based virtual address spaces? Assume a system with a hardware-managed TLB.

A
  • Allocation / management of page tables
  • Implement a page fault handler
  • Implement page allocation, loading, and replacement
  • Implement address space switching
46
Q

What information can be used on modern systems to easily determine the working set of a process?

A

The working set is the set of pages accessed in the last delta time frame. If the hardware supplies a reference bit, the OS can periodically read and reset the bit to determine which pages have been accessed in the given scan interval.

47
Q

The first time a page is accessed in a system with demand paging, a page fault occurs. Why?

A

In a system with demand paging, pages are allocated and filled only on the first access. To detect the first access, the page is marked invalid (no access) in the page table.

48
Q

During the subsequent handling of the page fault, no free physical memory is available. Give the necessary steps to resolve the page fault without terminating processes. Explicitly state any necessary interactions with the hardware.

A
  • Find victim page
  • Invalidate user mode references to the page frame (in all page tables)
  • Write back contents, if necessary
  • Flush/invalidate entries in TLB
  • Load new contents into page frame
  • Update mapping in current page table
  • Restart instruction
49
Q

Explain the basic principle of a slab memory allocator.

A

The allocator forms a so-called slab cache from one or multiple slabs, each being pages of contiguous physical memory. The cache is split into chunks of equal, fixed size. Allocations can only be made with this chunk size.

50
Q

For which scenario is the slab allocator particularly suited?

A

Slab allocators are commonly used for allocating (kernel) objects of the same size.
This is because (1) the allocator effectively minimizes fragmentation, which is usually a problem with many small allocations, and (2) by reusing freed objects, some initialization overhead may be saved.

51
Q

Nowadays, data structures are often optimized to a length of 64 bytes. What is the
meaning of this number?

A

On many processors, 64 bytes is the cache line size (or a multiple thereof)

52
Q

What type of cache miss cannot occur in a fully associative cache?

A

Fully associative caches are not susceptible to conflict misses: in a conflict miss, the requested entry has previously been evicted from the selected set for capacity reasons although there was room left in other sets. As fully associative caches possess a single set only, conflict misses cannot occur.

53
Q

When does a process in Linux experience a segmentation fault, also known as an access violation in Windows?

A

A segmentation fault or access violation is an exception that is raised by the operating system if a process attempts to access a virtual address that is either not assigned or for which the process does not possess the required access permissions.

54
Q

Which operation mainly determines the speed of the fork() system call?

A

Copying the process’s address space

55
Q

Give and explain a technique that is used on modern systems to considerably increase the performance of fork().

A

Copy-on-write creates page tables that share the page frames between the parent and child processes, without having to copy the data itself. To maintain isolation the pages are marked read-only in both processes. When either process attempts to modify a page, the sharing is broken and a private writable copy of the page frame is created.

56
Q

Name the page replacement algorithm presented in the lecture which makes use of
the temporal locality of page accesses. Why is it only approximated in practice?

A

Least Recently Used (LRU). LRU is usually approximated, because the page tables are missing timestamps that are updated on each access.
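Exact LRU is cheap to express in software precisely because every access can update an ordering, which page-table hardware does not record; an illustrative simulation:

```python
# Exact LRU: every access updates an ordering -- trivial in software,
# but page-table hardware keeps no such per-access timestamp.
def lru_faults(refs, nframes):
    order, faults = [], 0          # order[-1] = most recently used
    for page in refs:
        if page in order:
            order.remove(page)     # the "timestamp update" on each access
        else:
            faults += 1
            if len(order) == nframes:
                order.pop(0)       # evict the least recently used page
        order.append(page)
    return faults

assert lru_faults([1, 2, 3, 1, 4], 3) == 4   # page 2 (LRU) is evicted for 4
```
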

57
Q

The system is extended with a cache. Give two reasons why the copy performance
increases.

A
  • A cache usually consists of cache lines that are larger than a single CPU word
    (e.g., 64 bytes). Thus when reading the buffer, RAM accesses occur with larger
    width, which increases the memory access bandwidth.
  • Having a cache allows the CPU to prefetch (read-ahead) data from the source.
  • When writing to the destination the cache buffers the write accesses, thus reducing the latency for each write operation. Just like when reading, the memory
    access bandwidth increases.
58
Q

Explain the term PFN.

A

PFN is the abbreviation for Page Frame Number and denotes the index of a physical
page frame

59
Q

Which areas usually existing in the user address space of a process are not initialized by the OS with content from a file?

A

Heap, Stack, and BSS.

60
Q

A process writes to a virtual page of such a memory area for the first time. The system uses demand paging. Describe a way to keep the latency of the page fault low.

A

When a process accesses anonymous memory for the first time, the operating system must (1) find a free physical page (this may include evicting other pages), and (2) clear the page by filling it with zeros. The operating system can keep the latency low by maintaining a pool of zeroed pages from which the allocation can be satisfied directly.
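The zero-page pool idea can be sketched as follows (frame numbers and the slow path are made up for illustration):

```python
# Keeping a pool of pre-zeroed frames moves the expensive clearing out of
# the page-fault path (hypothetical sketch, not a real kernel interface).
zero_pool = [10, 11, 12]          # frames zeroed in the background/idle time

def handle_first_touch(fault_vpn, page_table):
    if zero_pool:
        frame = zero_pool.pop()   # fast path: no clearing needed now
    else:
        frame = 13                # slow path: find some frame...
        # ...and zero it inside the fault handler (the latency we avoid)
    page_table[fault_vpn] = frame
    return frame

pt = {}
assert handle_first_touch(0x40, pt) == 12
assert pt[0x40] == 12
```
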

61
Q

Explain how knowledge on the working sets of a system can be used by the OS to
avoid thrashing. Assume that LRU page replacement is used and that it shall not be
modified.

A

Thrashing occurs when the system does not have enough physical memory to satisfy the working sets of the running processes. This causes constant swapping of pages and severely inhibits progress. The working set of a process is the set of pages accessed during the last measurement interval and can be used as an indicator of the true memory demand of the process for the next measurement interval.
If the working sets of all running processes do not fit into memory, the operating system may select a process with a large working set and temporarily suspend it, so other processes can make progress.

62
Q

Give another advantage as well as a disadvantage of a linear page table

A

Advantage: small latency and fewer memory accesses during translation.
Disadvantage: not suited for sparse address spaces of large size due to high memory consumption.

63
Q

How is the size of the addressable physical memory determined by the page table in
the given system?

A

The physically addressable memory is determined only by the width of the PFN field in a page table entry.

64
Q

What does the TLB reach describe? Give two ways to increase it.

A

The TLB reach describes the amount of virtual memory for which a TLB can store translations at any point in time.
The TLB reach can be increased by using a larger TLB or by increasing the page size.
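Both knobs show up directly in the TLB-reach formula (entry counts and page sizes below are example values):

```python
# TLB reach = number of entries * page size covered per entry.
def tlb_reach(entries, page_size):
    return entries * page_size

assert tlb_reach(64, 4096) == 256 * 1024        # 64 entries, 4-KiB pages
assert tlb_reach(128, 4096) == 512 * 1024       # larger TLB doubles the reach
assert tlb_reach(64, 2 * 1024 * 1024) == 128 * 1024 * 1024  # 2-MiB pages
```
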

65
Q

Does a software-managed TLB allow a custom format for page table entries? Justify
your answer.

A

Yes. The page table is walked in software, and that software can convert arbitrary page table entries into the format expected by the TLB.

66
Q

A program constantly allocates memory without freeing it. How can fragmentation
still occur? Give an example.

A

Whether fragmentation occurs depends on the memory allocation policy. A buddy allocator, for example, produces internal fragmentation for allocations of any size other than a power of two (2^n), irrespective of the allocation order or deallocation of memory.
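The buddy behavior can be illustrated with the round-up-to-power-of-two rule (sizes are example values):

```python
# A buddy allocator rounds every request up to the next power of two,
# so requests that are not powers of two always waste the difference.
def buddy_block(size):
    block = 1
    while block < size:
        block <<= 1
    return block

assert buddy_block(4096) == 4096      # power of two: no internal waste
assert buddy_block(5000) == 8192      # 3192 bytes lost to internal
                                      # fragmentation, even with no frees
```
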