01. Flow Basic Flow Metrics Basics Flashcards

Question

How can we negate the impact of outliers when analysing dispersion?

Answer 1

Use the range within the dataset that does not include the outliers

Answer 2

Quartiles specifically refer to the values that split the data into quarters. The lowest quartile is the lower or first quartile (Q1). The highest quartile is known as the upper quartile or third quartile (Q3). The quartile in the middle (Q2) is the median, as it splits the data in half.

Answer 3

1. First, line up the values in ascending order and then split the data into four equally sized chunks, each containing one-quarter of the data.

Answer 4

Interquartile range = Upper quartile – Lower quartile The interquartile range is much less sensitive to outliers. It’s another way in which we can compare different sets of data.

Answer 5

Quartiles also show you how spread out your data is: The Interquartile Range (IQR) = Q3 - Q1 tells you the range of the middle 50% of your data. This is a very robust measure of variation — it ignores outliers.

Answer 6

Quartiles include the median (Q2) 1. What's in the middle of your data 2. A more robust centre than the mean if your data is skewed Quartiles help you see the centre — especially in skewed distributions where the mean is misleading.

Answer 7

1. If Q1 and Q3 (Interquartile Range (IQR)) are close together → low spread (consistent process) 2. If they’re far apart → high spread (variable process)

Answer 8

Quartiles help understand central tendency and spread. 1. Q2 (median) gives a strong view of the middle/typical value. 2. Q1 and Q3 show how tightly or loosely the values are packed around the centre. Together, they give a powerful, outlier-resistant summary of your data.

Answer 9

1. Resistant to outliers 2. Good for skewed data

Answer 10

IQR (from quartiles) gives you a summary of the middle, but it: Doesn’t measure how far values deviate from the median (like MAD or SD) Ignores the edges of your data, where rare but critical events happen

Answer 11

1. How far do values deviate from the centre - MAD or standard deviation 2. How bad the worst-case flow delays can get - Percentiles (90th, 95th, max) 3. Are there rare but extreme cases - Look at the tails explicitly or use box plots with outliers

Answer 12

A percentile indicates that: "X% of the data points are less than or equal to this value." Thus: 1. The 95th percentile represents the threshold at which 95% of the dataset is below. 2. The top five percent surpass that value, typically representing delays, exceptions, or outliers. 3. Percentiles give you a target you can manage and evidence to show stakeholders.

Answer 13

1. Setting realistic service level targets (SLA). 2. Setting expectations you can communicate to customers or stakeholders. 3. Setting performance goals your team can aim for. 4. Giving visibility into the worst-case (or best-case) scenarios — especially the ends or “tails” of the distribution

Answer 14

1. Linear Interpolation Between Ranks 2. Nearest Rank Method

Answer 15

1. Interpolation is more statistically smooth and useful in continuous data analysis. 2. Nearest-rank is simpler and easy to explain manually,

Answer 16

1. start by calculating 10 × 125 ÷ 100. This gives you a value of 12.5. 2. Rounding this number up gives you 13, which means that the 10th percentile is the number at position 13.

Answer 17

1. Great for setting expectations 2. Can capture tail behaviour

Answer 18

Percentiles are great tools for clearly stating what results to expect, even when the data is noisy, messy, or not normally distributed.

Answer 19

90% of requests are resolved in 10 days or less.

Answer 20

1. Percentiles reflect actual performance for most people 2. You can base contracts, dashboards, or SLOs on them 3. You don't get tricked by a few extreme cases (like averages do)

Answer 21

1. MAD = Median of the Absolute Deviations from the Median 2. Indicates How tightly the values cluster around the median. 2. Good for: Processes where you monitor consistency or "typical" deviations from the norm. 3. One number gives you a sense of core variation, regardless of shape. MAD = median(lead_time_i - median(lead_times))

Answer 22

1. Calculate the Median of your flow metric (lead time, cycle time, etc.) 2. Compute the absolute deviations from the median 3. Find the median of those absolute deviations → That’s your MAD

Answer 23

1. Robust: Not affected by outliers (unlike standard deviation) 2. Great for skewed or non-normal data (typical in lead times, cycle times) 3. Highlights consistency in process performance 4. One number gives you a sense of core variation, regardless of shape.

Answer 24

Cycle times (days): [5, 6, 7, 8, 9, 12, 100] 1. Median = 8 2. Absolute deviations = [3, 2, 1, 0, 1, 4, 92] 3. MAD = Median of deviations = 2 MAD = 2 tells us that most values deviate from the median by just 2 days,

01. Flow Basic Flow Metrics Basics Flashcards

(54 cards)