General Knowledge Flashcards

1
Q

Explain what a tightly coupled workload is

A

If you are spanning multiple instances, you are tightly coupled if those instances have to talk to one another in order to get the job done. The failure of one node usually leads to the failure of the entire calculation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Explain what a loosely coupled workload is

A

Each instance can process independently, and maybe once in a while they write to disk to check point data with one another. But they don’t actually have to actively communicate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Explain what a FLOP is

A

FLoating-point Operations Per Second. They’re used as a measure of how fast a processor is; how many operations they can do on floating point numbers every second. Floating point numbers are basically numbers that have a decimal place in them somewhere, as opposed to an integer which is just a round number on its own. A more powerful processor can do more operations more quickly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is FPGA

A

Field Programmable Gate Array.
A regular processor is hard-wired; it’s versatile and there’s lots you can do with it, but on the hardware level, you can’t change anything. When you program for a standard CPU, you’re limited by the way the hardware was laid out, because different arrangements are better at different things (think of how a GPU is better than a CPU at certain tasks).
A “pure” FPGA is a chip that can be changed on the hardware level. You use software to change the way the chip operates, so, in essence, you can build your own chip for whatever application you’re doing. When you’re writing some types of code, there are optimizations that you simply can’t make on a regular CPU because the manufacturer designed it to be versatile.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a scheduler

A

Managed which jobs run where and when. Used to share the HPC system amongst many users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an MPI

A

Message Passing Interface. Can be operated only on homogeneous system. Used to exchange data between workers. MPI handles process-to-process communications, while OpenMP manages threads in a given process. In a distributed system environment consisting of processes that have their own memory locally, a programming model that implements communication between processes only by sending and receiving messages. Processes do not share memory space and do not allow one process to directly access another process’s memory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the purpose of checkpointing

A
  • HPC has limited hardware redundancy;
    • If nodes supporting a HPC application fail
      ○ the application eventually fails
      ○ Therefore, applications need to periodically save their application state (e.g. check-pointing) to guard against losing a lot of work
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain the difference between a Management Node and a Master Node

A

Usually the master or head node is where your users will login and run jobs. So it’s sort of like a bastion to the compute cluster. This is generally where you would run the scheduler as well, but sometimes you want to put the scheduler on a different node because you don’t want users on that head node to be mucking about and saturating the CPU/memory and breaking your services so you put it on the management node. The management node is also where you can tap in and operate on the storage without root squashing, you can also provide directory services to tap in here.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a workflow

A

It’s rare that an application is run by itself. Usually, a set of applications are run in a series of steps to form a complete workflow

How well did you know this?
1
Not at all
2
3
4
5
Perfectly