Basic Statistical Concepts Flashcards

1
Q

Stages in transforming data into information and evidence

A

1) Data
2) Information
3) Evidence
4) Knowledge
5) Decisions
5) Impact

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

To change data into information you must

A

compile, manage and analyze

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

to turn information into evidence you must

A

integrate, interpret and evaluate the information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

making evidence into knowledge

A

format for presentation to planners and stakeholders

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Knowledge to Decisions

A

Influence plans and decisions (planners and policy makers)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Making decisions have impact

A

implement decisions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Turn impact to data

A

monitor indicators for change

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Arithmetic average of a distribution

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Equation for mean

A

Mean = (Total score/ sample size)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Only measure that can be manipulated alebraically

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Most sensitive to skew and outliers

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

the value in the distribution that occurs most frequently

A

mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Always located at the peak of the distribution when in graphs

A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Insensitive to extreme values or outliers

A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Middle value in an ordered array of data

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Divides the upper half of the distribution from the lower half

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Hardly affected by outliers

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Measurement of symmetry and the extent to which a distribution curve leans

A

Skewness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Arrangement of mean median and mode when a graph is positively skewed

A

Mode < Median < Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

In a graph, when the tail extends to the right

A

Positively skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Skewness when higher values have lower frequencies

A

Positively skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

In graphs, skewed to the left

A

Negatively skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

In graphs, tail extends to the left

A

Negatively skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Arrangement of mean, median and mode when graph is positively skewed

A

Mode > Median > Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Higher values have higher frequencies

A

negatively skewed

26
Q

no skew, balanced tails

A

Bell curve

27
Q

Two parameters that determine the normal curve distribution

A

mean

standard deviation

28
Q

Measure of variability for the distance away from the mean

A

Standard deviation

29
Q

Formula for SD from the mean

A

SD from the mean [(X-Mean)/SD)

30
Q

Percentage of area from -1SD to +1SD

A

68%

31
Q

Percentage of area outside of -1SD to +1SD

A

32%

32
Q

Percentage of the area between -2SD and +2SD

A

95%

33
Q

What percentage of the observations would be more than +-2SD away from the mean?

A

5%

34
Q

What percentage of observations would lie within 3SD of the mean?

A

99.7%

35
Q

1 SD above and below the mean

A

68%

36
Q

2 SD above and below the mean

A

95%

37
Q

3 SD above and below the mean

A

99.7

38
Q

The SD of a sampling distribution that determines whether a sample mean will be higher or lower than the population mean

A

Standard error

39
Q

Equation for SE

A
SE = (s/ sqrt n)
s= sample SD
n= sample size
40
Q

Range within which the true magnitude of effects lies with a certain degree of assurance or confidence

A

Confidence interval

41
Q

Disprove prevailing hypothesis, always set up as hypothesis of no effect (null)

A

Refutationalism/ Falsification

42
Q

Determine sample size from confidence interval

A

large sample size –> more stable estimate –> narrow CI

43
Q

Three threats to validity

A

Sampling variability
Confounding variables
Bias

44
Q

Methods of evaluating sample variability

A

Hypothesis testing

Interval estimation

45
Q

Samples drawn at random from a population may give varying measurements of statistical data

A

Sampling variability

46
Q

Used to test for validity and address sampling variability

A

hypothesis testing

47
Q

Steps of hypothesis testing

A

1) Set up two hypothesis
2) Prove null hypothesis wrong
3)

48
Q

Hypothesis wherein there is no relationship between exposure and outcome

A

Null hypothesis

49
Q

Hypothesis wherein there is a relationship between exposure and outcome

A

Alternative hypothesis

50
Q

Alternative hypothesis wherein direction is determined

A

One tailed

51
Q

Alternative hypothesis wherein direction is not determined

A

Two tailed

52
Q

Used to determine whether the hypotheses may be accepted or regected

A

Test statistic

53
Q

A statistical test assumed to have normal distribution, used to determine whether two population means are different given a large sample size

A

Ztest

54
Q

Equation for Ztest

A

z=

55
Q

Numerator of Z

A

magnitude of the difference between groups

56
Q

Denominator of Z

A

the variability of the estimate

57
Q

You want a large test statistic

A

big difference between groups, large sample size or

smaller variability of the estimate

58
Q

The probability of obtaining the observed result due to chance

A

P value

59
Q

You want a low p value

A

Larger test statistic

60
Q

Study results may say treatments differ but the truth is that hey do not

A

Type 1 error false positive

61
Q

Study results say treatments do not differ but the truth is they do

A

Type 2 false negative