Lecture 11 Flashcards by Gauvain Robert

cudaDeviceSynchronize()

is called from the host.

It blocks the host until all previously launched kernels (and other pending device work) have finished.
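
A minimal sketch of why the host needs this call: a kernel launch returns immediately, so results are only safe to read after synchronizing. The kernel fillIndex and the helper fillAndReadLast are illustrative names, not part of the lecture, and managed memory is assumed to be available on the device.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Illustrative kernel (not from the lecture): thread i writes i into out[i].
__global__ void fillIndex(int *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = i;
}

// Launches the kernel, waits for it, and returns the last element written.
// Returns -1 if a CUDA call fails.
int fillAndReadLast(int n)
{
    int *data = nullptr;
    // Managed memory is accessible from both host and device.
    if (cudaMallocManaged(&data, n * sizeof(int)) != cudaSuccess) return -1;

    // The launch is asynchronous: control returns to the host immediately.
    fillIndex<<<(n + 255) / 256, 256>>>(data, n);

    // Block the host until the kernel is done; only then is data safe to read.
    if (cudaDeviceSynchronize() != cudaSuccess) {
        cudaFree(data);
        return -1;
    }

    int last = data[n - 1];
    cudaFree(data);
    return last;
}

int main()
{
    printf("last element: %d\n", fillAndReadLast(1024));
    return 0;
}
```

The return value of cudaDeviceSynchronize() is also a convenient place to catch errors raised by the kernel itself.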

cudaStreamSynchronize()

waits, on the host, until all kernels (and other queued work) in the given stream have finished.
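
A sketch of the difference from a device-wide wait, assuming two independent streams: synchronizing on one stream does not wait for the other. The names scale, enqueueScale and twoStreamsDemo are hypothetical helpers for this example only.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Illustrative kernel: multiplies every element by factor.
__global__ void scale(float *x, float factor, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) x[i] *= factor;
}

// Queues copy-in, kernel, copy-out in stream s. Nothing here blocks the host.
void enqueueScale(float *host, float *dev, int n, float factor, cudaStream_t s)
{
    size_t bytes = n * sizeof(float);
    cudaMemcpyAsync(dev, host, bytes, cudaMemcpyHostToDevice, s);
    scale<<<(n + 255) / 256, 256, 0, s>>>(dev, factor, n);
    cudaMemcpyAsync(host, dev, bytes, cudaMemcpyDeviceToHost, s);
}

// Returns true if both streams produced the expected values.
bool twoStreamsDemo()
{
    const int n = 1 << 16;
    const size_t bytes = n * sizeof(float);
    float *hA = nullptr, *hB = nullptr, *dA = nullptr, *dB = nullptr;
    cudaMallocHost(&hA, bytes);   // pinned host memory for asynchronous copies
    cudaMallocHost(&hB, bytes);
    cudaMalloc(&dA, bytes);
    cudaMalloc(&dB, bytes);
    for (int i = 0; i < n; ++i) { hA[i] = 1.0f; hB[i] = 1.0f; }

    cudaStream_t s1, s2;
    cudaStreamCreate(&s1);
    cudaStreamCreate(&s2);
    enqueueScale(hA, dA, n, 2.0f, s1);
    enqueueScale(hB, dB, n, 3.0f, s2);

    cudaStreamSynchronize(s1);    // waits for s1 only; s2 may still be running
    bool okA = (hA[0] == 2.0f);
    cudaStreamSynchronize(s2);    // now the work in s2 has finished as well
    bool okB = (hB[0] == 3.0f);

    cudaStreamDestroy(s1);
    cudaStreamDestroy(s2);
    cudaFree(dA);
    cudaFree(dB);
    cudaFreeHost(hA);
    cudaFreeHost(hB);
    return okA && okB;
}

int main()
{
    printf("streams ok: %s\n", twoStreamsDemo() ? "yes" : "no");
    return 0;
}
```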

__syncthreads()

is used inside a kernel.

Each thread waits here until all threads in its block have reached this point.
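
A small sketch of the usual pattern, assuming a single block of at most 256 threads: every thread writes to shared memory, the barrier makes those writes visible to the whole block, and only then do threads read each other's values. reverseBlock, reverseOnGpu and MAX_N are illustrative names.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

#define MAX_N 256   // assumed upper bound on the block size for this sketch

// Reverses an array of n elements in place using one thread block.
__global__ void reverseBlock(int *d, int n)
{
    __shared__ int tile[MAX_N];
    int t = threadIdx.x;

    tile[t] = d[t];          // each thread stores its own element
    __syncthreads();         // wait until every thread in the block has stored
    d[t] = tile[n - 1 - t];  // now it is safe to read another thread's element
}

// Host helper; requires 1 <= n <= MAX_N.
void reverseOnGpu(int *host, int n)
{
    int *dev = nullptr;
    cudaMalloc(&dev, n * sizeof(int));
    cudaMemcpy(dev, host, n * sizeof(int), cudaMemcpyHostToDevice);
    reverseBlock<<<1, n>>>(dev, n);
    // This blocking copy on the default stream runs after the kernel finishes.
    cudaMemcpy(host, dev, n * sizeof(int), cudaMemcpyDeviceToHost);
    cudaFree(dev);
}

int main()
{
    int a[8] = {0, 1, 2, 3, 4, 5, 6, 7};
    reverseOnGpu(a, 8);
    for (int i = 0; i < 8; ++i) printf("%d ", a[i]);
    printf("\n");
    return 0;
}
```

Without the barrier, a thread could read tile[n - 1 - t] before the thread responsible for that slot has written it. The barrier must be reached by all threads of the block, so it should not sit inside a branch that only some threads take.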

cudaEventCreate

Creates and initializes an event variable (of type cudaEvent_t).

cudaEventRecord

Places a marker (the event) in a stream's work queue; the marker is time-stamped when the device reaches it.

cudaEventSynchronize

Blocks the host until the given event has been recorded, i.e. until its marker has received a value.

cudaEventElapsedTime

Gets the elapsed time, in milliseconds, between two recorded events.
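
The four event calls on the preceding cards are typically used together to time work on the GPU. The following is a sketch under simple assumptions (default stream, one kernel); busyKernel and timeKernelMs are hypothetical names chosen for this example.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Illustrative kernel that gives the GPU something to do.
__global__ void busyKernel(float *x, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) x[i] = x[i] * 2.0f + 1.0f;
}

// Returns the kernel's run time in milliseconds, measured with events.
float timeKernelMs(int n)
{
    float *d = nullptr;
    cudaMalloc(&d, n * sizeof(float));
    cudaMemset(d, 0, n * sizeof(float));

    cudaEvent_t start, stop;
    cudaEventCreate(&start);          // initialize the event variables
    cudaEventCreate(&stop);

    cudaEventRecord(start, 0);        // marker before the kernel (stream 0)
    busyKernel<<<(n + 255) / 256, 256>>>(d, n);
    cudaEventRecord(stop, 0);         // marker after the kernel

    cudaEventSynchronize(stop);       // wait until the stop marker has a value

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);   // difference in milliseconds

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(d);
    return ms;
}

int main()
{
    printf("kernel took %f ms\n", timeKernelMs(1 << 20));
    return 0;
}
```

Because the markers are processed by the device in queue order, the measured interval covers the kernel itself rather than the host-side launch call.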

Coalescing memory

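Always access global memory "in order".
If the threads of a warp access consecutive global memory addresses in order of thread number, the hardware can combine the accesses into fewer memory transactions, which improves performance.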
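
A rough sketch for comparing an in-order access pattern with a scattered one. The kernel copyWithStride and the helper timeCopyMs are illustrative names; the stride of 33 is an arbitrary odd value chosen so that, for a power-of-two n, every element is still copied exactly once. Actual timings depend on the GPU, so none are claimed here.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Thread i copies element (i * stride) mod n.
// stride == 1: neighboring threads touch neighboring addresses (coalesced).
// stride > 1:  neighboring threads touch addresses far apart (not coalesced).
__global__ void copyWithStride(const float *in, float *out, int n, int stride)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        int j = (int)(((long long)i * stride) % n);
        out[j] = in[j];
    }
}

// Times one copy of n floats with the given stride, in milliseconds.
float timeCopyMs(int n, int stride)
{
    float *in = nullptr, *out = nullptr;
    cudaMalloc(&in, n * sizeof(float));
    cudaMalloc(&out, n * sizeof(float));
    cudaMemset(in, 0, n * sizeof(float));

    int blocks = (n + 255) / 256;
    copyWithStride<<<blocks, 256>>>(in, out, n, stride);   // warm-up launch

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);
    cudaEventRecord(start, 0);
    copyWithStride<<<blocks, 256>>>(in, out, n, stride);   // timed launch
    cudaEventRecord(stop, 0);
    cudaEventSynchronize(stop);

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(in);
    cudaFree(out);
    return ms;
}

int main()
{
    const int n = 1 << 22;
    printf("stride 1  (in order):  %f ms\n", timeCopyMs(n, 1));
    printf("stride 33 (scattered): %f ms\n", timeCopyMs(n, 33));
    return 0;
}
```

On most GPUs the scattered version is expected to be noticeably slower, since each warp then needs many separate memory transactions instead of a few.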

CUDA can be coupled more closely to ________

OpenGL
