Block 2 Flashcards

1
Q

attribute data?

A

discrete integers, a quantity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

types of data collection?

A
  • direct observations (needed for quality calcs)

- questions (market research)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Types of data?

A

-attribute data

variable data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Variable data?

A

continuous values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

precision?

A

how reproducible a value is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

accuracy?

A

how close to the true value it is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

ways to describe data?

A
  • frequency distribution
  • measures of central tendency
  • measures of dispersion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Ways to sort data?

A
  • categorically (eg.blood type)

- grouped (eg. 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

relative frequency?

A

Out of 1, the proportion of the data in the specific group/value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

types of histogram?

A
  • frequency histogram
  • relative frequency histogram
  • cumulative frequency histogram
  • relative cumulative frequency histogram
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

determining class width?

A

range/number of classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

determining the number of cells in a histogram?

A

<100 →5-9 cells
100-500 → 8-17 cells
>500 → 15-20 cells

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Graphs for distribution?

A
  • histogram
  • bar graph
  • polygon of data
  • cumulative frequency distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

distribution graph with mean closer to the left?

A

skewed right / positively skewed (more values right of the mean)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

two peaks?

A

bimodal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

high peak/kurtosis?

A

leptokurtic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

low peak/kurtosis?

A

Platykurtic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

negative kurtosis?

A

flatter than a normal distribution with same mean and standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

aspects of a distribution?

A
  • location (mean)
  • spread
  • shape (skew)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

3 measures of central tendency?

A
  • Average
  • Median
  • Mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

average of ungrouped data?

22
Q

average of grouped data?

A

sum of (frequency*midpoint) / total frequency

23
Q

average with different sized groups?

A
weighted average
sum of (frequency*average of each group) / total frequency
24
Q

Finding the median value for grouped data

A
  • half the range
  • count up to find the group
  • find how many values intro the group it is
  • divide by the group frequency and multiply by the group interval
25
Mode?
peak | can have many or none
26
Measures of dispersion?
- range - standard variation - variance
27
Range?
difference between max and min in the dataset
28
Standard deviation?
sum of differences between value (Xi) and average (Xbar) squared, divided by number of values (n) -1, square rooted
29
Problems with range?
less accurate with more observed values, more likely to get an outlier
30
When to use standard deviation?
went n>10 | otherwise, use range
31
Measures of distribution shape?
- skewness - Kurtosis - Coefficient of Variation
32
Skewness?
a3=0 →symmetrical a3>0 skewed to the right a3<0 skewed to the left +1 or -1 is big
33
How to determine if a skewness value is reliable?
n>100 | needs to be unimodal
34
What does Kurtosis value (a4) mean?
a4=3 → normal distribution a4>3 → more peaked a4<3 → less peaked
35
Correlation variation / Coefficient of Variation
``` standard deviation (s) *100%/average (Xbar) units for s and the mean cancel, thus the units are % ```
36
Coefficient of Variation vs standard variation?
relative to the mean - more relevant
37
population?
set of items of interest
38
Sample?
subset of a population
39
sample statistics?
- Average (Xbar) | - Sample standard deviation (s)
40
population parameter?
- Mean (Xbar0 or lowercase mu) | - Standard deviation (So or lowercase sigma
41
statistic vs parameter?
statistic for a sample | parameter for a population
42
average and standard distribution of standard normal curve?
``` average (mu) = 0 standard distribution (sigma) = 1 ```
43
finding the percentage above/below a value on a normal curve?
transfer to standard normal distribution then use the tables
44
How check for a normal distribution?
- visual inspection of a histogram (unimodal, symmetrical, tapering tails) - skewness = 0 - Kurtosis = 3
45
Probability plots?
- order values small to large (and rank from 1 for smallest) - use plotting position equation to find percentile - plot measured value over percentile and draw line of best fit
46
Chi-square goodness of fit?
-compares observed value to expected value
47
Tests for normality?
- probability plots | - chi-square goodness of fit
48
simplest way to find a cause and effect relationship?
scatter diagram
49
drawing a "straight line fit" mathmatically?
use equations for find gradient (m) and y intercept (a)
50
coefficient of correlation?
``` goodness of fit to a line of best bit. between -1&1 1=all points on +gradient line -1=all points on -gradient line 0=no correlation ```