Modellering Tentaplugg Flashcards

1
Q

A performance metric is used for what?

A

It is used to describe, measure and extrapolate information from data. Measure effectiveness and performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does a performance metric contain?

A

Count of how many times events occur, duration of time intervals, size of parameters. It also derives a value from these parts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

A good performance metric has what characteristics?

A

Allows accurate and detailed comparisons, leads to correct conclusions, is well understood by everyone and has a quantitative basis. Also, a good metric helps avoid erroneous conclusions.

ADC | Accurate, Detailed, Correct

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a congestion collapse?

A

Offered load increases, whilst the work done decreases. A network becomes HIGHLY congested, leading to a significant drop in performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Congestion collapses occur frequently in complex systems, but why does it happen?

A

Might happen because:
- The cost per job increases with the load
- Impatience, where jobs leave before completing
- Rejection, where jobs get rejected before completing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do we avoid congestion collapses?

A

With TCP congestion control and/or admission control in web servers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a latent congestion collapse?

A

Think of it as a hidden congestion collapse, that has not yet collapsed. It is a fuse, ready to be set off. Some sort of bottleneck is preventing the collapse, but when resources gets added more and more, it might tip over and reveal the congestion collapse.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is response time?

A

The elapsed time from request, to response

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is throughput?

A

Jobs / Operations completed per time unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is bandwidth?

A

Bits per second

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Ad hoc metrics?

A

Metrics that will be defined for specific purposes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is jitter?

A

Variation in time delay

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is delay?

A

Time taken to complete an action

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is resolution?

A

Smallest increment between two possible measurements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is accuracy?

A

How close the mean of the distribution of measurements is to the true value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is precision?

A

The level of repeatability and consistency in measurements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What does variability tell us? And what is variability also known as?

A

It tells us how far apart all the measured values are from eachother.

Variability is known as spread, or scatter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How can we quantify variability?

A

Histograms: Equally sized buckets of observed data, with 6-7 buckets minimum
Box plots: A type of graph where each box has a line, showing max val, min val, quartiles and mean
Sample variance:
Standard deviation:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is variance?

A

A measurement that represents the deviation of observed values, from expected values. A high variance indicates greater dispersion between values, whilst a low variance suggests that the datapoints are closer to the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is standard deviation?

A

A statistic that measures the dispersion of a dataset relative to its mean, and is calculated as the square root of the variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is a QQplot?

A

A graphical tool used to compare the distribution of a dataset, to a theoretical distribution, or another dataset. It helps assess whether or not the data come from a population with a known or assumed distribution, such as a normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Describe Full System Virtualization

A

FSV involves creating a complete and independent virtual instance of an entire computer system (this includes operating system, hardware components, software, etc.). This environment mimics the original system closely.

23
Q

What does FSV allow the user to do? One point, important

A

It allows the user to run multiple operating systems on a single machine.

24
Q

Describe Lightweight System Virtualization

A

LSV involves creating isolated environments (called containers) that share the same operating system kernel as the host system. These containers are lightweight, portable and efficient in resource utilization.

25
Q

What are the LSV isolated environments known as?

A

Containers.

26
Q

Does mininet use LSV or FSV?

A

LSV

27
Q

Simulation is the process of what?

A
  1. Designing a model of a real system
  2. Conducting experiments with this model
28
Q

Why do we simulate?

A

To gain an understanding of the behavior of the system.

29
Q

What is a continuous-time model?

A

A model where state changes occur continuously across time

30
Q

What is a discrete-time model?

A

A model where state changes occur only at specific times

31
Q

Describe the Monte Carlo Simulation

A

It involves generating numerous random inputs within specified ranges and using them to simulate the system’s outcomes. When this has ran several times, it provides a probalistic view of possible outcomes and helps in making informed decisions.

Note that the name comes from Monte Carlo Casino, making it easy to remember its’ relevance in chance and randomness.

32
Q

What’s the best and worst with conducting Live Experiments?

A

They provide high accuracy but are expensive, and less flexible.

33
Q

In what does emulation strike a balance?

A

It strikes a balance in cost, accuracy and flexibility.

34
Q

What’s the best and worst with conducting simulations?

A

They are really cost effective, and highly flexible, but may have accuracy limitations based on the model

35
Q

What is analytical modelling?

A

A strictly theoretical analysis based on a mathematical and/or statistical approach.

36
Q

What are all the 4 types of means?

A

Arithmetic mean, harmonic mean, geometric mean and weighted mean

37
Q

When should you use arithmetic mean?

A

Arithmetic mean is used when you want to find a typical or average value of a set of numbers, where all values have equal importance.
Ex: Used when calculating average exam scores in class

38
Q

When should you use harmonic mean?

A

Used when you want to calculate an average that considers rates, ratios, or speeds, especially when dealing with reciprocals or inversely proportional quantities. Commonly used in physics, economics and engineering for averaging rates or speeds.
Ex: Calculating average speed in a round-trip journey.

39
Q

When should you use geometric mean?

A

Used when you want to calculate a central value for a set of numbers that are subject to exponential growth or decay. Could be used in finance to calculate average growth rates, such as investment returns over multiple periods.
Ex: Calculating average annual growth rates of investments.

40
Q

When should you use weighted mean?

A

Used when you want to calculate a central value for a set of numbers, but each value has a different level of importance or weight. It’s used in various scenarios where certain values carry more significance.
Ex: Calculating the GPA of a student with different credit weights for courses.

41
Q

Är du självmordsbenägen?

A

Ja

42
Q

Why do queues exist?

A

Since jobs share many resources and only one can access at a time, then the other must wait. While waiting, they queue up.

43
Q

What is the Kendall Notation, and what does it describe?

A

The Kendall Notation is the standard system used to describe and classify a queueing node.

44
Q

What can be said about the Kendall Notation: M/M/3/20/1500/FCFS?

A
  • Exponentially distributed arrivals
  • Exponentially distributed service times
  • Three servers
  • Capacity 20 (queue size is 20 - 3 = 17)
  • Population is 1500 total
    -Service discipline is FCFS
45
Q

What is the formula of Little’s Law?

A

Mean jobs in system = arrival rate * mean response time

46
Q

In the Kendall Notation, what does “M” and “D” stand for?

A

M = Exponential / Memoryless / Poisson distribution
D = Deterministic distribution

47
Q

What is the Simpson’s Paradox?

A

Simpson’s Paradox, also known as confounding, is a phenomenon in statistics where a trend appears in different groups of data, but it reverses or disappears when the groups are combined.
In laymans terms, the overall pattern can be misleading or contradictory when looking at the data as a whole versus when it’s broken down into subgroups.
This can happen when the subgroups have different sizes or weights that influence the combined results.

48
Q

What is verification and validation in systemmodelling?

A

To validate a model is to analyse if it is a correct representation of the real system.

To verify a model is to see if the model behaves as expected.

49
Q

How can validation be performed?

A

Validation can be performed by comparing the model to other performance evaluation approaches. For example, a few measurements could be done on the real system and compared to the model to check its validity.

50
Q

How can verification be performed?

A

Verification can be performed by doing some different tests on the simulation. Slightly changed inputs should yield a slight change of output.

51
Q

What is ANOVA?

A

It’s a statistical technique used to analyze the differences among various groups or treatments in an experiment. In simple terms, ANOVA helps you determine whether there are significant differences between the groups you’re studying and if those differences are due to the factors you’re testing.

It assesses whether the differences among groups are greater than what you would expect from random chance. In other words, it tells you if the variations you see are likely due to the factors you’re investigating and not just random fluctuations.

52
Q

How is ANOVA performed?

A

First do an equal amount of repeated measurements on a number of different systems. Store the measurements in a table

The table has columns representing a system each, and rows representing a repetition each.

1.Calculate group means: Find the average (mean) of the data for each group. This tells you the typical performance for each system.

  1. Calculate overall mean: Find the average of all the data points combined, regardless of their group. This is the overall average performance.
  2. Calculate variability: Determine how much each data point varies from the group mean, and then square these differences.
  3. Perform the ANOVA test: This involves comparing the variability within each group (due to random chance) to the variability between groups (due to the different treatments). You can use statistical software or tables to do this.
  4. Determine statistical significance: ANOVA will provide a p-value. If this value is less than a predetermined significance level (usually 0.05), it suggests that there are significant differences between the groups.
53
Q

How are ANOVA contrasts useful?

A

The use of contrast gives the ability to compare a subset of the systems to see if there is a significant difference within the subset.

54
Q

What is a Discrete-Event Simulation?

A

A DES is time based, where the incrementation of time will cause events to execute. An example of this is a simulation of processes arriving at a server. The processes are arriving with a specified rate, which is dependent on the time.