Lecture 11 - Vector Processors Flashcards

1
Q

Vector Registers

A

Each holds a fixed number of elements and has lots of ports so vector operations can be performed in parallel.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Vector functional units

A

Fully pipelined so we can start processing a new vector element on each clock cycle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Vector load-store Unit

A

Data move between registers and memory at rate of one word per cycle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Scalar Registers

A

Provide data as input to vector functional units.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Advantages of vector architectures

A

Exploit data-level parallelism to improve performance.

Potentially reduce complexity and reduce energy per operation because:

  • Vector operations specify lots of independent operations which are simple to execute in parallel.
  • Less switching activity in datapath.
  • Drastic reduction in number of instructions.
  • Regular patterns of access to register file and memory.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Vector Datapath

A

Partition register file and functional units into multiple lanes as always operating on same element from different vectors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Initiation Rate

A

How many elemental operations are completed per cycle for each vector operation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Start-up time.

A

Time before first result. As functional units are pipelined, the first result will take pipeline depth clock cycles to be produced.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Vector Length Register

A

Controls length of any vector operation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Strip Mining

A

Operate on odd-size piece of the vector size then complete MVL sized pieces until complete.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Vector Stride

A

Need ability to read elements from memory separated by a fixed distance (stride). This only requires load/store unit to be modified.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Chaining

A

Start next dependent instruction as soon as individual elements of source vectors become available. (Read-after-write hazards)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Tailgating

A

Overwrite elements of one vector register as soon as they have been used by prior register. (Write-after-read hazards)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Vector-mask Control

A

Use a boolean vector of length MVL to control execution of a vector instruction. The vector operation turns into no-op at elements where mask bit is clear.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Scatter and Gather

A

Store and load vector indexed instructions to support loops which make indirect accesses to arrays.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

SIMD ISA Extensions

A

Add SIMD instructions to standard ISA by introducing new SIMD registers and functional units. This is done by supporting data parallel operations on wide registers e.g. treating 128-bit register as 8 16-bit values.

17
Q

Stream Processors

A

Organise application into streams and kernels. This a natural fit for many modern media-processing applications.