Math IB Stats Flashcards

1
Q

Discrete data

A

Fixed to certain values; no gaps between data values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Continuous data

A

Not fixed to certain values; can occupy a continuous range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Reliable data

A

If you can repeat the data and obtain similar results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sufficient data

A

When there is enough data to support your conclusions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Population

A

The entire group that you want to draw conclusions about

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Sample

A

Subset of population; group of individuals from the population that will give info about the population as a whole

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Sampling technique: convenience

A

Most easily accessible members of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Sampling technique: simple random

A

Randomly choose members - equal chance for everybody

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Sampling technique: systematic

A

Pick at a fixed interval — eg every 6th person

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Sampling technique: stratified

A

Divide group into groups (stratas) based on shared characteristics, then sample from the groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Sampling technique: quota

A

Stratified sampling, but sample from each stratum is proportional and to the size of each stratum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Bar chart

A

For discrete data; has gaps in between the bars

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Histogram

A

No gaps in between bars; for continuous data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Skew

A

Where the majority of the data is located (shape)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Histogram skew

A

Left (-): most data is on right side - left tail
Normal: equally distributed
Right (+): most data is on left side - left tail

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Mode

A

Value that occurs the most

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Modal

A

For grouped data — can’t find mode so we would say the modal range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Bimodal; no mode

A

Bimodal - 2 modes in set of data
No mode - all numbers appear only once

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Mean

A

The average

20
Q

Median

A

The middle data value when data set is arranged in order of size (if even data set — median is avg of two middle numbers)

21
Q

Range

A

Max - min

22
Q

Quartiles

A

Divides data into quarters
- 1st: 25% of data below it
- 2nd: the median and has 50% of data below and above
-3rd: 75% below

23
Q

Interquartile range (IQR)

A

Difference between Q3 and Q1

24
Q

Lower quartile

A

Q1

25
Q

Middle quartile

A

Q2

26
Q

Upper quartile

A

Q3

27
Q

Outlier for boxplot

A

Outliers are 1.5xIQR above Q3 or below Q1

28
Q

Cumulative frequency

A

The sum of all previous frequencies up to the current point

29
Q

Percentile

A

A value below which a certain percentage of observations lie

30
Q

Percentile rank

A

Calculate by dividing #of values below ___ by total # of values

31
Q

Variance

A

How far a data point is spread from the mean (sigma squared)

32
Q

Standard deviation

A

Square root of variance

33
Q

Bivariate data

A

Study of relationships between to sets of data

34
Q

Correlation

A

When change in x corresponds to change in y

35
Q

Causation

A

When one event is the result of the a occurrence of another event

36
Q

Pearson product moment correlation coefficient (r)

A

Measure of the correlation strength between two variables.
Between -1 and 1 ( can equal)

37
Q

R value

A

0 is weakest. 1 is strongest.
(-) values mean there is a negative correlation.

38
Q

Line of best fit

A

Straight line drawn though the center of a group of points plotted on a scatter diagram

39
Q

Interpolation

A

Predictions inside the domain your data points are in

40
Q

Extrapolation

A

Predictions outside the domain of your data

41
Q

Draw line of best fit

A

Find mean point which line will go through. Equal number of points above and below line.

42
Q

Residual

A

The vertical distance between data pints and a graph of a regression line

43
Q

Least square regression line

A

Has the smallest possible value for the sum of squares of the residual

44
Q

Regression line y on x

A

Y=ax+b
A = change on y for each change in x
B = y int

45
Q

Binomial distribution elements:

A
  • fixed number of trials
  • only two outcomes, success or failure
  • constant probability each trial
  • trials are independent
46
Q

Probability - with replacement

A

Elements in sample space remains unchanged (e.g if you pull a card out of a deck, you put it back)

47
Q

Probability - without replacement

A

Items are not returned to the sample space (e.g if you pull a card out from a deck, you leave it out, changing probabilities for next time)