Memory Flashcards

1
Q

What is a logic level?

A

In the underlying analogue circuit, 1s and 0s are separated by threshold voltages; a logic level is the range of voltages read as a 0 or a 1. So whilst 0 and 1 are convenient abstractions, they do not reflect reality.

2
Q

What are the main criteria for designing a memory system?

A

Rapid data access, using as little space as possible.

3
Q

What assumptions must be made to model digital discrete circuits as networks?

A

The nodes are unidirectional, instantaneous, isopotential nets interconnecting devices.

4
Q

How are digital ‘networks’ analysed?

A

Events (logic transitions) are generated on the outputs at some future time in response to events asserted on the inputs.

As such, analysis is centred on an event queue: a list of events ordered by time.

5
Q

How is the system state of a discrete system represented?

A

The state can be summarised as the logic values in the nets and the event queue, i.e. events that will happen.

6
Q

What happens when you assert an input?

A

Logic values don’t propagate instantaneously. Although there will be an output change, for a brief time the pending event is hidden from the device, but it is still part of the state.

7
Q

How are logic drivers modelled?

A

Logic drivers can be modelled as a voltage source with an ‘internal’ impedance.

8
Q

What effect does modelling the driven gate as a resistor have?

A

Since the driven gate is a resistor, it means that the voltage value on the interconnect is loaded. This means that if multiple gates read from the same interconnect, they may pull down the voltage value, leading to a misrepresentation of the intended logic value.

9
Q

What does fanout measure?

A

Fanout is a measure of the max number of gates an output can drive before the voltage value is pulled down to the point where the voltage may be incorrectly read.

10
Q

What happens when interconnects are driven at high speeds?

A

The interconnect begins to behave more like a transmission line: the signal can be sent, reach the destination and reflect. The reflected signal returns after bouncing off the origin, leading to a spurious second logic event.

11
Q

What is the forbidden zone in circuits?

A

The forbidden zone refers to voltages that are either

between the logic thresholds (V0,threshold < V < V1,threshold),

or out of range (< 0 V or > Vcc).
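As a rough illustration of reading a voltage against such thresholds, here is a minimal Python sketch; the supply rail and threshold voltages are assumed values chosen only for illustration, not taken from the cards.

```python
# Hypothetical values chosen only for illustration.
VCC = 5.0           # supply rail
V0_THRESHOLD = 0.8  # highest voltage still read as logic '0'
V1_THRESHOLD = 2.0  # lowest voltage read as logic '1'

def classify(v: float) -> str:
    """Map an input voltage to a logic value, flagging the forbidden zone."""
    if v < 0.0 or v > VCC:
        return "forbidden (out of range)"
    if v <= V0_THRESHOLD:
        return "0"
    if v >= V1_THRESHOLD:
        return "1"
    return "forbidden (between thresholds)"

print(classify(0.4), classify(3.3), classify(1.5))
# -> 0 1 forbidden (between thresholds)
```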

12
Q

How does CMOS deal with voltages in the forbidden zone?

A

CMOS devices appear to behave as simple threshold devices, so it can be assumed that voltages below the ‘0’ threshold are treated as logic ‘0’ whilst voltages above the ‘1’ threshold are treated as logic ‘1’. For voltages in between, manufacturers make no promises.

13
Q

How does TTL deal with the forbidden zone?

A

Holding the input voltage at around 1 V (in the forbidden zone) will lead to an oscillation of around 1 MHz on the output; the exact value depends on the system.

Thus a maximum transit time may be specified, as a measure of how fast you are expected to pass through the zone.

14
Q

Describe the logic states high and low

A

These are ‘disconnected but still have a value’. They are capable of being read, like a 0 or 1, but will not fight if connected to a forcing value. 0, 1 and X are the low-impedance equivalents of L, H and Z.

15
Q

Describe how the effective output impedance relates to logic values?

A

The effective output impedance of the driving gates: for ‘0’, ‘1’ and ‘X’, Rforce is as low as can be made practical; the corresponding fanout is high because the low value of Rforce allows Vhigh (or Vlow) to be asserted easily on the driven gates. For ‘H’, ‘L’ and ‘Z’, the output impedance (Rweak) is quite high, so that if any other gate asserts a ‘0’, ‘1’ or ‘X’, it easily overcomes the high-impedance version.

16
Q

How are events generated?

A

Output events are generated in response to changes (events) on the inputs.

17
Q

How do you determine the difference between 1 and high?

A

Measuring the voltage alone is insufficient to tell the difference between ‘1’ and ‘H’. You need some means of loading the signal and measuring d(signal)/d(load).

18
Q

Describe high impedance Z in computing

A

The voltage value is usually stored on some parasitic capacitance and will decay with time. This might sound a bit tenuous, but it is the fundamental underlying physical principle of DRAM.

A ‘control’ or bus-driver gate has (conventional) data inputs, plus a ‘control’ input, which may be used to put the output into a high impedance state.

19
Q

Describe the conflict logic state

A

This is an abstract logic state that unambiguously indicates a problem. It may be derived, for example, if two non-high impedance gates are connected together.

If they both assert the same value, there is no problem: the drive capability is increased. If the two gates attempt to assert different low-impedance states (‘0’ and ‘1’, for example), then the resultant logic value will depend on the relative values of the two output impedances, but more importantly the current flowing through the two output circuits can be arbitrarily high - the physical behaviour goes way outside the modelling intent, and in some logic families device destruction will result. Even if a device can withstand such an insult transiently, its lifetime will be considerably compromised. Ideally, a simulation would tell you this.

20
Q

Describe X (Don’t Know) State

A

This is a low-impedance (forcing) signal, but we don’t know what the value is. Again, it cannot occur in reality: a ‘0’ is a ‘0’ and a ‘1’ is a ‘1’, but an ‘X’ will turn out to be either a ‘0’ or a ‘1’.

Its existence may or may not indicate a problem; if it propagates through a system, it is indicative of subtle (i.e. bad) design.

Like the conflict, you cannot measure it directly, but also like the conflict, you want to know if you have one.

21
Q

Describe U, uninitialised

A

This is again something you cannot measure in reality; it is a way for a simulation to say “nothing that has ever happened has caused me to compute a value for this signal”. Which begs the question: “why is it here?”

22
Q

Should you have more logic states?

A

Yes - VHDL's std_logic has nine distinct logic states, for example. The more states, the more information you can extract about your design, for only a modest increase in compute.

23
Q

What is the source of the Z persistence?

A

Most of the ‘unseen’ components that make up a system - nets, pins and so on - have a parasitic capacitance associated with them. This capacitance stores charge, which is the source of the persistence.

24
Q

Effect of capacitance on a logic driver

A

To assert a logic value usually means the driver has to inject or remove that charge. This takes time and requires energy. It is easier, from a practical perspective, to produce a driver that is good at either injection or removal, as opposed to both.

25
What are the goals for memory design
* To make the user believe processes are executing simultaneously. * To make it appear to each process that infinite memory is available and can be accessed infinitely quickly.
26
Describe the memory hierarchy
The memory hierarchy refers to the ordering of memory by speed and capacity, i.e. which memory is consulted first: Disk→Memory→Cache→Register (increasing speed, decreasing capacity).
27
What is swapping in Memory?
• The OS stores a queue of processes waiting to execute. • As space becomes available, processes are loaded. When a process finishes, it is removed. • A process may be swapped out if stalled (e.g. waiting for I/O).
28
Describe the idea of optimum swapping
Optimum swapping refers to maintaining a balance between too little and too much swapping. Too much, and the processor spends too much time on memory transfers. Too little, and stalled processes aren't swapped out when they should be.
29
What is partitioning?
Partitioning is simply a method of splitting memory into pieces. There are two types, fixed size and variable size.
30
Describe Fixed Size Partitioning
Fixed-size memory partitions: Easy to administer (logarithmic distribution is usually best)
31
Describe Variable Size Partitions
Variable-size memory partitions: memory is allocated as required. Fragmentation makes it harder to load incoming processes, which leads to memory wastage.
32
Define Process Topology
Refers to what processes are running at what time.
33
Describe Physical Address
Physical addresses refer to absolute locations in memory.
34
Describe Logical Addresses
Logical addresses are relative to the address of the beginning of the program.
35
What is paging?
Paging is the process of dividing the memory of a process into fixed-size pages, whilst physical memory is in turn divided into frames of the same size.
36
How are pages mapped onto frames
Pages can be mapped non-contiguously to frames (i.e. not placed in consecutive frames), so paging does not leave memory gaps, unlike partitioning.
37
Pages and Logical Addressing
Logical addressing is more complicated: • For a process, a single offset is no longer enough. • Each process has a page table. • Logical addresses now refer to a page, and the page table translates the logical page to the physical frame, from which a base address is derived.
38
Why are page tables useful?
They are vital for logical addressing: a process is divided into multiple pages, and each page requires a corresponding page-table entry so that its logical addresses can be mapped to wherever the page is actually stored.
39
How is swapping used with pages?
Pages, not processes, are swapped in and out. Thus, we've created the illusion of "infinite" memory from the perspective of the process. However, a paging supervisor is required to control the swap rate.
40
Describe Mapping Logical to physical addressing
Logical addresses, which are used by software to reference sections of memory used by a process, function differently in a paging system. Logical addresses refer to a page number, which is used as a key in the page table. The page table identifies the frame, from which a base address can be derived. The base address is combined with the offset to obtain the full physical address.
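A minimal Python sketch of this translation; the page size and page-table contents are invented for illustration.

```python
PAGE_SIZE = 4096  # bytes per page/frame (assumed; a power of two)

# Hypothetical page table: logical page number -> physical frame number.
page_table = {0: 7, 1: 3, 2: 12}

def translate(logical_address: int) -> int:
    """Split a logical address into (page, offset) and rebuild the physical address."""
    page = logical_address // PAGE_SIZE
    offset = logical_address % PAGE_SIZE
    frame = page_table[page]            # a missing entry would mean a page fault
    return frame * PAGE_SIZE + offset   # frame base address + offset

print(hex(translate(0x1ABC)))  # page 1 -> frame 3, giving 0x3ABC
```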
41
What is Page Thrashing? Why is this bad?
Page thrashing is when page faults happen too often. A page fault occurs when a process wants a page that has been swapped out, and the OS needs to go and fetch it. This is a problem because the processor spends a lot of time fetching and remapping pages to frames, as opposed to computing.
42
What happens if the page table is swapped out?
Each logical memory access can then require two physical accesses: first, fetch the relevant page table entry from memory; then look up the data using it. This is slow, so a Translation Lookaside Buffer (TLB) is used. The TLB is a cache that stores page table entries.
43
What is the Translation Lookaside Buffer?
A cache which stores recent page table entries; it is used because page tables, like pages, can be swapped out and are slow to consult in memory.
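A small sketch of how a TLB short-circuits the page-table lookup; the structures and access counts are illustrative only, not a model of real TLB hardware.

```python
tlb = {}                    # hypothetical TLB: page -> frame
page_table = {0: 7, 1: 3}   # page table held in main memory

def translate(page):
    """Return (frame, extra_memory_accesses) for a page lookup."""
    if page in tlb:                  # TLB hit: no page-table access needed
        return tlb[page], 0
    frame = page_table[page]         # TLB miss: one extra access to walk the page table
    tlb[page] = frame                # cache the entry for next time
    return frame, 1

print(translate(1))  # (3, 1) first lookup walks the page table
print(translate(1))  # (3, 0) repeat lookup is served from the TLB
```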
44
Purpose of virtual memory
To pre-emptively get the required data into the most accessible place, without the user having to care.
45
Where can a page table be?
* The TLB * Main memory * Disk
46
Where can the data be stored in?
* Memory cache * Main memory * Disk
47
Describe Segmentation
Segmentation is another memory management mechanism. The application splits memory into segments of variable size at compile time (unlike paging, where the user only sees one contiguous block). • The compiler produces addresses of the form {segment, offset}. • The linker reconciles this with the memory subsystem. • Each segment has different access rights and privileges: - Processes can be run in isolation, safely (segmentation fault). - Self-modification can be prevented (Harvard). - Data/code can be shared between processes.
48
How common is segmentation
Most x86 machines use segmentation with paging.
49
How is segmentation implemented with paging?
Segmentation can be combined with paging. The compiler divides the program into segments at compile time, and the paging supervisor divides each segment into pages at run time. A page table exists for each segment, and entries are stored in the translation lookaside buffer as before.
50
Aims of Cache Systems
• Provide access speed close to that of the fastest memory. • Provide capacity close to that of the most capacious memory. • The cache contains a local copy of parts of the main memory. • If the CPU wants some memory in the cache, it is returned quickly. • If the CPU wants some memory, not in the cache, the cache fetches it.
51
What do caches store?
The cache contains a local copy of parts of the main memory.
52
Are there levels to cache?
Yes, the levels of caches depend on access speed and memory size.
53
Structure of Memory Vs Cache
Memory is divided into blocks; caches are divided into lines. Blocks and lines both store words, and we assume lines and blocks are the same size. Each line has a tag identifying the block it currently holds. Memory has more blocks than the cache has lines.
54
Can caches be filled and fetched from at the same time?
Yes, caches can be filled and fetched from in parallel, as these typically involve different processes.
55
Are cache misses errors?
No; at times they must happen, such as the first time a program or process is run - if it has not run before, its data will not yet be stored in the cache.
56
Are caches always shared?
No, caches are often split into levels based on the speed of access and size of memory. In addition, these levels may be divided into instruction caches and data caches.
57
Define Block in memory
Block: A set of words in main memory.
58
Define Line in Memory
Line: A tagged entry in the cache, which holds a set of words.
59
Define tag in memory
Tag: An ID associated with the cache line and its contents.
60
Define cache hit
**Cache hit**: Process wants to read from memory, and the cache has it, resulting in a faster lookup compared to a cache miss.
61
Define Cache Miss
**Cache miss**: Process wants to read from memory, but the cache does not have it, so the cache has to fetch it, resulting in a slower lookup than a cache hit.
62
Why are caches separated?
In part because different processes require different access patterns, so it is more efficient for different caches to correspond to different access patterns. A cache can then be designed around the type of access it serves (e.g. instruction fetches vs data).
63
What are tradeoffs to consider for caches
Should the cache behave more like a register or like memory, i.e. access speed vs storage size?
64
What are mapping functions?
Caches are smaller than memory, so blocks need to be mapped onto cache lines. A mapping function determines which cache line a given memory block corresponds to. There are three types: - Direct - Associative - Set Associative
65
What is direct mapping?
• Each block of main memory maps onto only one line; a set of blocks is designated to the same line.
66
Benefits and flaws of direct mapping
• Quick to calculate which line a given block should be placed in: cache line ID = block number (in main memory) % number of lines in the cache. • Cache thrashing: a high cache miss rate occurs when the process alternately addresses different blocks that map to the same line, so they repeatedly evict each other. With direct mapping this can easily happen.
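A one-line illustration of the direct-mapping formula in Python; the cache size is an assumed value.

```python
NUM_LINES = 8  # number of cache lines (assumed)

def line_for_block(block_number: int) -> int:
    """Direct mapping: cache line ID = block number % number of lines."""
    return block_number % NUM_LINES

# Blocks 2, 10 and 18 all contend for line 2, so alternating
# accesses to them evict each other and cause thrashing.
print([line_for_block(b) for b in (2, 10, 18)])  # [2, 2, 2]
```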
67
How to break direct mapping?
To cause thrashing, make alternating references to blocks that map to the same line. The blocks are never stored in the cache at the same time, since each line can only hold one memory block.
68
What is associative mapping?
• Any block can go into any cache line. • The tag is the address of the block that's currently loaded. • To check if the cache holds a block, the cache needs to (simultaneously) compare all cache tags with the desired block address. • Expensive lookup, but flexible for a greater variety of access patterns.
69
How to break associative mapping?
Break it with a loop over blocks, where the number of blocks accessed in the loop is greater than the number of lines in the cache.
70
What is set-associative mapping?
* Compromise between direct and associative. * The cache is divided into a set of subcaches. * Each memory block can go into only one subcache (a la direct), but it can go into any line of that subcache (a la associative). * As subcache size → cache size (i.e. only one subcache), the cache becomes **fully associative.** * As subcache size → 1 (i.e. each subcache holds only one line), the cache becomes **fully direct.**
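A sketch of a set-associative lookup in Python; the set count, associativity, and eviction choice (oldest loaded) are assumptions made for illustration.

```python
NUM_SETS = 4  # number of subcaches/sets (assumed)
WAYS = 2      # lines per set (assumed)

# cache[set_index] holds the blocks currently stored in that set.
cache = [[] for _ in range(NUM_SETS)]

def access(block: int) -> bool:
    """Set-associative lookup: True on a hit; on a miss the block is loaded."""
    s = block % NUM_SETS       # the one set the block may live in (direct part)
    if block in cache[s]:      # compare against every line in the set (associative part)
        return True
    if len(cache[s]) >= WAYS:  # set full: evict a line (here, the oldest loaded)
        cache[s].pop(0)
    cache[s].append(block)
    return False

# Blocks 5, 13 and 21 all map to set 1; with 2 ways, two of them can coexist.
print([access(b) for b in (5, 13, 5, 21)])  # [False, False, True, False]
```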
71
What are replacement algorithms?
Replacement algorithms determine which line is overwritten when a new block is loaded. Algorithms include: - Least recently used - First-in, first-out - Least frequently used - Random
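As an illustration of one of these policies, here is a minimal least-recently-used (LRU) sketch in Python; the capacity is an assumed value.

```python
from collections import OrderedDict

CAPACITY = 4             # number of cache lines (assumed)
cache = OrderedDict()    # block -> data, ordered from least to most recently used

def access(block: int) -> str:
    """Least-recently-used replacement: evict the line untouched for longest."""
    if block in cache:
        cache.move_to_end(block)       # mark as most recently used
        return "hit"
    if len(cache) >= CAPACITY:
        cache.popitem(last=False)      # evict the least recently used line
    cache[block] = f"data for block {block}"
    return "miss"

print([access(b) for b in (1, 2, 3, 4, 1, 5)])
# -> ['miss', 'miss', 'miss', 'miss', 'hit', 'miss']; block 2 is evicted for block 5
```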
72
What is cache coherence?
The problem of keeping main memory and the cache synchronised.
74
What problems are associated with cache coherence?
**Read access:** No problem with a single access pathway, but DMA exists. **Write access:** If a cache line has been written to (dirty), it must be pushed to the main memory before it's replaced.
75
What are the solutions to cache coherence?
**Write Through**: Update main memory whenever a cache line is written. **Write Back**: Update main memory whenever a cache line is evicted.
76
What is write through?
* All write operations are performed on the cache and main memory in parallel. * All other modules with cache access must monitor writes to main memory to maintain coherency. This leads to substantial memory traffic!
77
What is Write Back?
* When a line is modified, a "dirty" bit is set, and the main memory is only updated when a "dirty" line is replaced. * All access through one cache.
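A sketch of the write-back idea in Python; the cache capacity and the choice of victim line are assumptions made for illustration.

```python
CAPACITY = 2       # number of cache lines (assumed)
main_memory = {}   # block -> data
cache = {}         # block -> (data, dirty_bit)

def write(block, data):
    """Write-back policy: modify only the cache line and set its dirty bit."""
    if block not in cache and len(cache) >= CAPACITY:
        victim, (vdata, dirty) = next(iter(cache.items()))  # evict the oldest-loaded line
        if dirty:
            main_memory[victim] = vdata   # a dirty line reaches memory only on eviction
        del cache[victim]
    cache[block] = (data, True)           # write to the cache and mark the line dirty

write(1, "A"); write(2, "B"); write(3, "C")
print(main_memory)  # {1: 'A'}: block 1 was written back only when it was evicted
```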
78
What are the solutions to cache coherency for multiprocessor machines?
Bus watching, hardware transparency, and non-cacheable segments.
79
Describe Bus watching
A solution to cache coherency in multiprocessors. In a write-through system, any write to main memory causes the immediate removal (invalidation) of cache lines containing the clobbered data.
80
Describe Hardware transparency
A solution to cache coherency in multiprocessors. Used for write-through systems: each write to main memory causes an immediate update of all affected cache lines, rather than deleting and reloading the entire cache.
81
What are non-cacheable segments
A solution to cache coherency in multiprocessors: prevent caching of certain regions of (shared) memory.
82
Large Vs Small Line Sizes
Smaller blocks/lines: better exploitation of locality, more fetch/replace operations, but less data transferred per operation. Larger lines: fewer fetch operations are required; however, fewer lines fit in the cache. In general, line size is between 8 and 32 bytes.
83
What is cache granularity?
Whether and how to use cache levels, i.e. how the cache levels are staggered.
84
Why use multilevel caches?
Different Caches are suitable for different processes.
85
Why use a unified or a split cache? (Split meaning split between instructions and data, not in storage size.)
* Unified cache has a better overall hit rate because the code/data imbalances even out. * For fast (pipeline) architectures, a Harvard cache eliminates data/code bus contention (in a pipeline, data and code can be requested out of order).
86
How do caches and virtual memory interact?
• Cache systems and virtual memory systems operate largely orthogonally to each other - the virtual memory system exists “after” the caching system. Addresses may come from virtual memory; however, caches are not aware of pages, they simply see memory blocks.
87
How do caches relate to Translation Lookaside Buffer(TLB)?
• The interactions between cache systems and the translation lookaside buffer (TLB) are more complicated; either the tags in the cache hold - physical addresses (in which case they need to be translated by the TLB prior to lookup), or - logical addresses (in which case cache tags need to be kept coherent with the state of the TLB).
88
How do disks relate to memory analysis?
* They typically have their own processor and memory. * They also have a layered cache system - certain "bits of magnetic space" are easier to get at than others. * Spatial locality really matters. * They typically have their own RAM caches.
89
How is the response generated?
The response is generated as a sequence of events appearing on the output of each gate
90
How has the performance of modern systems affected the analysis of systems
We can no longer easily assume that voltage signals reach gates at the same time: a difference in path length causes a significant time difference.
91
The voltage at receiver equation
Vr = (Vw · Rr) / (Rw + Rr)
92
What happens if Rr → infinity
Vr → Vw, since Rr >> Rw
93
What happens if Rr → 0?
Vr → 0, since Rw >> Rr
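A quick numeric check of the divider equation and its two limits from the cards above, with assumed values for Vw and Rw:

```python
def receiver_voltage(vw: float, rw: float, rr: float) -> float:
    """Potential divider from the cards: Vr = (Vw * Rr) / (Rw + Rr)."""
    return (vw * rr) / (rw + rr)

VW, RW = 5.0, 100.0                     # assumed driver voltage and output impedance
print(receiver_voltage(VW, RW, 1e6))    # Rr >> Rw: ~5.0 V, logic level intact
print(receiver_voltage(VW, RW, 10.0))   # Rr << Rw: ~0.45 V, logic level corrupted
```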
94
How can the voltage/logic level be corrupted?
The voltage can be pulled down towards 0 if Rr → 0, e.g. when too many driven gates load the net.
95
What is TTL?
Transistor–transistor logic (TTL) is a logic family built from bipolar junction transistors. Its name signifies that transistors perform both the logic function (the first "transistor") and the amplifying function (the second "transistor"), as opposed to resistor–transistor logic (RTL) or diode–transistor logic (DTL).
96
Can boolean algebra always apply to a bus?
No; when the nodes are driven in different directions, simple ones and zeros don't describe the outputs of the bus, so we must also consider the impedances.
97
What should you do if multiple sources are driving a bus?
With multiple sources, set all other drivers to weak states (i.e. Rweak, high impedance), so that the source with the low forcing impedance dominates the bus.
98
What is a passive pull-up/pull-down device?
Basically a resistor: a device that, without extra work, will pull the voltage of the net up or down. It is designed so that this voltage is easily overridden by active drivers on the net.
99
What are the benefits of passive pullup/pulldown devices?
They boost the efficiency of pull-up/pull-down behaviour: the active driver can be smaller and more efficient, as it only needs to either remove charge or add charge, not both.
100
How are pull up/ pull down incorporated into a device?
The effect is created simply by connecting a resistor between the net and either ground (for passive pull-down) or the supply rail (for passive pull-up). Often, rather than a resistor, a MOS transistor with the gate and source shorted is used; this requires no extra processing steps and makes better use of the space.
101
How is the virtual driver designed?
1: Two inputs the same, output:=input 2: Forcing 0 and forcing 1 is a conflict 3: Forcing 0 and X might be a conflict 4: Conflicts propagate: anything and a conflict is a conflict 5: Z is always overridden ?: 1 for pull up, 0 for pull-down
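A sketch of such a resolution function in Python; the exact value set and the treatment of rule 3 ('might be a conflict' is resolved as a conflict here) are assumptions, not the course's definitive table.

```python
FORCING = {"0", "1", "X"}  # low-impedance (forcing) values; 'L'/'H' are weak, 'Z' high-impedance, 'C' conflict

def resolve(a: str, b: str) -> str:
    """Pairwise resolution of two driver values, following the rules above."""
    if a == b:                          # rule 1: two identical drivers agree
        return a
    if "C" in (a, b):                   # rule 4: conflicts propagate
        return "C"
    if a in FORCING and b in FORCING:   # rules 2-3: differing forcing values fight
        return "C"
    if a == "Z":                        # rule 5: Z is always overridden
        return b
    if b == "Z":
        return a
    if a in FORCING:                    # a forcing value overrides a weak 'L'/'H'
        return a
    if b in FORCING:
        return b
    return "X"                          # e.g. weak 'L' against weak 'H': unknown

print(resolve("1", "Z"), resolve("0", "1"), resolve("0", "H"))  # 1 C 0
```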
102
What are the logic families? Features
* RTL - resistor transistor logic (now obsolete) * DTL - diode transistor logic (now obsolete) * TTL - transistor transistor logic (extremely common, becoming less so) * ECL - emitter coupled logic (fast, heavy on power) * MOS - metal oxide semiconductor (low power) * CMOS - complementary metal oxide semiconductor (even less power) * BiCMOS - Bipolar CMOS (fast, low power, good driving capability) * GaAs - Gallium Arsenide (very fast) * SiGe - Silicon Germanium heterojunction (currently fastest)
103
What is the fastest current logic family?
* Fastest logic made to date: * SiGe bipolar heterojunction stacked current logic * Clocks at 80GHz * fT 450GHz * Gate delay 6 ps
104
Flaws of SiGe bipolar heterojunction stacked current logic
•The logic thresholds are different for each input to each gate
105
Describe Static RAM Cell
● Large (poor packing density) ● Fast ● Power hungry ● Persistent (the data is stable as long as the power is maintained) Bistable memory.
106
Describe Dynamic RAM Cell
Holds its bit value for roughly 1 millisecond. * Highest packing density of all memory topologies * Three components form the cell * Two are parasitic
107
Describe Response of non-inverting amplifier
Ideally a linear increase; however, the output saturates beyond a certain input as the output voltage approaches the supply voltage.
108
Describe the response of inverting analogue amplifier.
A negative linear slope; the output saturates near the supply voltage at one extreme and near 0 V at the other.
109
The equation for output of inverting analogue amplifier
110
What happens when we combine two amplifiers back to back
If we connect them back-to-back, so that the input of each is driven by the output of the other, we can superpose the transfer characteristics; if the stage gains are high enough, the characteristics cross at more than one point, giving two stable equilibrium states (a bistable).
111
What happens if the stage gain when combining the two amplifiers is not high enough?
If the gains are insufficient for the transfer characteristics to intersect like this, we get a situation where there exists just one stable equilibrium point, at the single crossing point. (The transition between these two states occurs when the slope of the linear region = 1, i.e. Rb = βRc)
112
Define Paging
Paging is a mechanism to separate data and can be used to implement a virtual memory system. It is **architecture-sympathetic**, i.e. architecture-side memory management.
113
Define Segmentation
Segmentation is a memory management mechanism by which data/instructions can be split into segments to divide programs in a user-sympathetic manner.
114
How are multiple pages implemented?
A page directory is implemented: it stores the page tables and is never swapped out. It is stored in the OS block.
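A minimal two-level lookup sketch in Python; the directory contents, page size, and entries-per-table are invented for illustration.

```python
PAGE_SIZE = 4096         # assumed
PAGES_PER_TABLE = 1024   # assumed entries per page table

# Hypothetical page directory (never swapped out): directory index -> page table,
# where each page table maps a page index to a frame number.
page_directory = {
    0: {0: 7, 1: 3},
    1: {0: 12},
}

def translate(logical_address: int) -> int:
    """Two-level lookup: directory -> page table -> frame, then add the offset."""
    page = logical_address // PAGE_SIZE
    offset = logical_address % PAGE_SIZE
    directory_index = page // PAGES_PER_TABLE
    table_index = page % PAGES_PER_TABLE
    frame = page_directory[directory_index][table_index]
    return frame * PAGE_SIZE + offset

print(hex(translate(0x1ABC)))  # page 1 -> directory 0, entry 1 -> frame 3 -> 0x3ABC
```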