01. Flow Basic Flow Metrics Basics Flashcards

1
Q

What are the different types of Averages?

A
  1. Mean
  2. Median
  3. Mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is an average

A
  1. It is the values that are most representative/typical of the dataset.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do you calculate the mean

A

Add all the numbers together, and then divide by how many numbers there are.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are outliers

A

An extremely high or low values that stands out from the rest of the dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do outliers change the representation of the dataset

A
  1. They skew the representation depending on the outliers.
  2. Outliers “pull” the data to the left or right
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do outliers affect the mean?

A

Outliers pull the mean higher or lower, skewing the data to the left or right.

Mean won’t give you the best representation of what a typical value is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does it mean if the data is skewed to the right

A

Data that is skewed to the right has a “tail” of high outliers that trail off to the right.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does it mean if the data is skewed to the left?

A

Data that is skewed to the left has a “tail” of high outliers that trail off to the left.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does it mean if the data is Symmetric

A

If the data is symmetric, the Mean, Median and Mode are in the middle.
No outliers pull the mean in either direction, and the data has the same shape on either side of the centre.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Which averages are not affected as much by outliers?

A
  1. Median
  2. Mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is the Median found?

A
  1. Line up all the values in ascending order.
  2. If there are an odd number of values, the
    median is the one in the middle.
  3. If there are an even number of values,
    add the two middle ones together and
    divide by two.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Where is the Mean and Median in a right-skewed dataset

A

If the data is skewed to the right, the mean is to the right of the median (higher).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the Mode?

A
  1. The mode of a set of data is the most popular value, the value with the highest frequency
  2. The mode has to be in the data set.
  3. It’s the only average that works with categorical data.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Where is the Mean and Median in a Left-skewed dataset

A

If the data is skewed to the left, the mean is to the left of the median (lower).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Can a dataset have more than one Mode?

A
  1. If there is more than one value with the highest frequency, then each one of these values is a mode.
  2. If the data appears to represent more than one trend or set of data, we can provide a mode for each set.
  3. This would be a bimodal dataset.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the Three steps for finding the mode?

A
  1. Find all the distinct categories or values in your data set.
  2. Write down the frequency of each value or category.
  3. Pick the one(s) with the highest frequency to get the mode.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Flow metrics

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What metrics can be used when analysing flow

A
  1. Range
  2. Quartiles
  3. Percentiles
  4. MAD - Median of the absolute
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Define Range.

A
  1. We can use the range to measure Variability / Spread?
  2. The range lets us know how the data varies
  3. The more variability, the less predictable the source of the data is
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the Use of range in Flow Analysis?

A

Shows full Spread / Width of flow times.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How do we measure the range?

A
  1. The range measures the spread/width of a dataset.
  2. It’s given by Upper bound - Lower bound.
  3. Where the upper bound is the highest value, and the lower bound is the lowest value.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What are the Advantages of Range

A
  • Simple to calculate
  • Highlights extreme variation
23
Q

What are the disadvantages of Range

A
  1. Sensitive to outliers
  2. It doesn’t tell whether the values are clustered or spread out. It ignores whether values are bunched near the median or scattered all over.
  3. It can be misleading if you assume it reflects “overall variation.”
  4. It only uses two data points—the smallest and largest—and doesn’t care what happens in between.
  5. It can give a sense of less predictability because of the wide range
24
Q

When is the Range used?

A

When you want a quick view of a variation.

25
Q

How can we negate the impact of outliers when analysing dispersion?

A

Use the range within the dataset that does not include the outliers

26
Q

Quartiles

27
Q

Define Quartiles

A

Quartiles specifically refer to the values that split the data into quarters.

The lowest quartile is the lower or first quartile (Q1).

The highest quartile is known as the upper quartile or third quartile (Q3).

The quartile in the middle (Q2) is the median, as it splits the data in half.

28
Q

How are Quartiles created?

A
  1. First, line up the values in ascending order and then split the data into four equally sized chunks, each containing one-quarter of the data.
29
Q

How is the Interquartile calculated?

30
Q

What is the Interquartile range?

A

Interquartile range = Upper quartile – Lower quartile

The interquartile range is much less sensitive to outliers.

It’s another way in which we can compare different sets of data.

31
Q

Can Quartiles show variability in your data?

A

Quartiles also show you how spread out your data is:

The Interquartile Range (IQR) = Q3 - Q1 tells you the range of the middle 50% of your data.

This is a very robust measure of variation — it ignores outliers.

32
Q

Quartiles can help you see what in Flow Analysis

A

Quartiles include the median (Q2)

  1. What’s in the middle of your data
  2. A more robust centre than the mean if your data is skewed

Quartiles help you see the centre — especially in skewed distributions where the mean is misleading.

33
Q

When using Quartiles, what are the signs of variability

A
  1. If Q1 and Q3 (Interquartile Range (IQR)) are close together → low spread (consistent process)
  2. If they’re far apart → high spread (variable process)
34
Q

How are the quartiles used in Flow Analysis?

A

Quartiles help understand central tendency and spread.

  1. Q2 (median) gives a strong view of the middle/typical value.
  2. Q1 and Q3 show how tightly or loosely the values are packed around the centre.

Together, they give a powerful, outlier-resistant summary of your data.

35
Q

What are the Advantages of quartiles

A
  1. Resistant to outliers
  2. Good for skewed data
36
Q

What are the disadvantages of quartiles?

A

IQR (from quartiles) gives you a summary of the middle, but it:

Doesn’t measure how far values deviate from the median (like MAD or SD)

Ignores the edges of your data, where rare but critical events happen

37
Q

What question do quartiles not address

A
  1. How far do values deviate from the centre - MAD or standard deviation
  2. How bad the worst-case flow delays can get - Percentiles (90th, 95th, max)
  3. Are there rare but extreme cases - Look at the tails explicitly or use box plots with outliers
39
Q

What are Percentiles

A

A percentile indicates that:

“X% of the data points are less than or equal to this value.” Thus:

  1. The 95th percentile represents the threshold at which 95% of the dataset is below.
  2. The top five percent surpass that value, typically representing delays, exceptions, or outliers.
  3. Percentiles give you a target you can manage and evidence to show stakeholders.
40
Q

What are Percentiles used for?

A
  1. Setting realistic service level targets (SLA).
  2. Setting expectations you can communicate to customers or stakeholders.
  3. Setting performance goals your team can aim for.
  4. Giving visibility into the worst-case (or best-case) scenarios — especially the ends or “tails” of the distribution
41
Q

What are the different methods for calculating percentiles

A
  1. Linear Interpolation Between Ranks
  2. Nearest Rank Method
42
Q

What is the difference between Linear Interpolation & Nearest Rank percentiles?

A
  1. Interpolation is more statistically smooth and useful in continuous data analysis.
  2. Nearest-rank is simpler and easy to explain manually,
43
Q

How would you calculate the Nearest Rank Method for Percentiles?

44
Q

If you have 125 numbers and want to find the 10th
percentile.

A
  1. start by calculating 10 × 125 ÷ 100. This gives you a value of 12.5.
  2. Rounding this number up gives you 13, which means that the 10th percentile is the number at position 13.
45
Q

Flow example is 95% of items done in under 10 days

46
Q

What are the advantages of percentiles?

A
  1. Great for setting expectations
  2. Can capture tail behaviour
47
Q

Why are percentiles great for setting expectations?

A

Percentiles are great tools for clearly stating what results to expect, even when the data is noisy, messy, or not normally distributed.

48
Q

What does it mean when we say that a Service Level Agreement (SLA) specifies a 90th percentile cycle time of 10 days?

A

90% of requests are resolved in 10 days or less.

49
Q

Why do percentiles matter

A
  1. Percentiles reflect actual performance for most people
  2. You can base contracts, dashboards, or SLOs on them
  3. You don’t get tricked by a few extreme cases (like averages do)
50
Q

What is MAD?

A
  1. MAD = Median of the Absolute Deviations from the Median
  2. Indicates How tightly the values cluster around the median.
  3. Good for: Processes where you monitor consistency or “typical” deviations from the norm.
  4. One number gives you a sense of core variation, regardless of shape.

MAD = median(lead_time_i - median(lead_times))

51
Q

How to Use MAD in Practice

A
  1. Calculate the Median of your flow metric (lead time, cycle time, etc.)
  2. Compute the absolute deviations from the median
  3. Find the median of those absolute deviations → That’s your MAD
52
Q

Why Use MAD in Flow Analysis?

A
  1. Robust: Not affected by outliers (unlike standard deviation)
  2. Great for skewed or non-normal data (typical in lead times, cycle times)
  3. Highlights consistency in process performance
  4. One number gives you a sense of core variation, regardless of shape.
53
Q

Find the MAD for the following cycle time.

Cycle times (days): [5, 6, 7, 8, 9, 12, 100]

A

Cycle times (days): [5, 6, 7, 8, 9, 12, 100]

  1. Median = 8
  2. Absolute deviations = [3, 2, 1, 0, 1, 4, 92]
  3. MAD = Median of deviations = 2

MAD = 2 tells us that most values deviate from the median by just 2 days,

54
Q

Can Mad be used with control charts