Benchmarking Flashcards

Question 1

Q

What are the differences between real and synthetic workload?

Answer

A

Real workload:
List of requests observed during normal operation of a real system.
This is uncontrollable and harder to reproduce.

Synthetic workload:
List of requested used for controlled performance testing.
This is repeatable and controllable, it should represent the real workload.

Question 2

Q

What should be taken in mind, when selecting a workload?

Answer

A

Services Exercised
Level of detail
Representativeness
Timeliness
Loading level
Repeatability
...

Question 3

Q

What is the definition of a unidirectional effect?

Answer

A

Effects that only increase as the level of a factor increases, or vice-versa. Or else, it is impossible to retrieve consistent information about that factor.

Question 4

Q

What is the main difference between a simple design, varying one factor at a time, and a full factorial design?

Answer

A

When using a simple design we can’t capture interactions between factors, which is unrealistic. A full factorial design, on the other hand, computes every possible combination of levels in each factor.

Question 5

Q

What is the main goal of performing a 2k-p Fractional Factorial Design?

Answer

A

The main goal is to reduce the number of experiments done using a Full Factorial Design. This way, it is possible to run the experiments with several factors confounded. This can be used to test if the interactions of some factors is indeed negligible without having to compute all the combinations. The main side effect is not being able to determine the effect of all factors individually.

Question 6

Q

How should confoundings be chosen?

Answer

A

Ideally, we should choose significant effects with insignificant ones.

Question 7

Q

What are the most common mistakes when designing plots?

Answer

A

▪ Excess information
▪ Multiple scales
▪ Using symbols in place of text
▪ Poor scales
▪ Using lines incorrectly
▪ Non-zero origins
▪ Three quarters rule: Highest point should be ¾ of scale (or more)
▪ Two related measures on the same graph
▪ Omitting confidence intervals
▪ Histogram cell size
▪ CDF vs histogram to compare several data sets

Question 8

Q

How can we measure the impact of a factor in a given system after computing the sign table?

Answer

A

Now it is important to quantify the impact of each factor and their interaction. This is measured by the proportion of the variance.

Question 9

Q

What is a test workload?

Answer

A

List of requests used to analyse the performance of a SUT. Can be either a Real Workload or a Synthetic Workload.

Real workload typically cannot be repeated, and therefore, is generally not suitable for use as a test workload.

Question 10

Q

What are application benchmarks used for?

Answer

A

Application benchmarks or macro-benchmarks are used to evaluate the performance of a System Under Test (SUT) as a whole

Question 11

Q

What are exercisers used for?

Answer

A

Exercisers or micro benchmarks are used to evaluate a specific Component Under Test (CUT)

Question 12

Q

What is a benchmark model composed of?

Answer

A

Metrics -> Criteria used to evaluate the performance of the system
Factors -> Parameters that are varied in the performance study
Levels -> Values taken by each factor

Question 13

Q

What is the goal of a designing a proper experiment?

Answer

A

Run the least number of experiments that allow for strong conclusions

Question 14

Q

What is a systematic service characterization?

Answer

A

Identify service provided by major subsystem
List factors affecting performance
List metrics that quantify demands and performance
Identify workload provided to that service

Question 15

Q

Describe the 3 averaging techniques to characterize workload parameters

Answer

A

Mean -> More affected by outliers than median or mode
Median -> Sort the observations in increasing order, take the observation in the middle of the series. More resistant to outliers
Mode -> Plot histogram of observations. Choose the midpoint of the bucket where the histogram peaks. For categorical variables, the most frequently occurring

Question 16

Q

What are the main techniques to characterize workload parameters?

Answer

Study These Flashcards

A

Averaging -> present a single number that summarizes the parameter values observed.
Dispersion -> The average alone is not sufficient if there is a large variability in the data.
Percentiles -> Specify how observations fall into buckets.

Benchmarking Flashcards

(16 cards)