Statistics 2 Flashcards

1
Q

What is a measure of location?

A

Single value that describes a position within a data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

If this value is describing the centre of the data?

A

This is a measure of central tendency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Mode/modal class?

A

Value/class that occurs most often.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Median?

A

Middle value when all values ordered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Mean calculated by?

A

Sum of data values (∑x)/ number of data values (n)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

When is the mode an appropriate measure?

A

-Qualitative/quantitative data
-Single mode/bimodal data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Inappropriate to measure mode?

A

Each value only occurs once.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Median usage?

A

Quantitative data only.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Advantage of median vs mean?

A

Not affected by extreme values, so can be used in data with such values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Mean usage?

A

-Utilises all pieces of data, giving a true measure of the data.
-Used for quantitative data only.
-Is affected by extreme values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

For data values of a frequency table, mean calculated by?

A

Frequency Density (Midpoint x Frequency) ∑xf/
Frequencies (∑f

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Median?

A

-Describes the middle of the data set, splitting the data into 2 50% halves.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

For effects on measure by a new data value, how is this evaluated?

A

Compare the previous value with the new one, if its larger, it increases etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Lower quartile?

A

1/4 of the way through the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Upper quartile?

A

3/4s of the way through the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Percentiles?

A

-Split data into 100 parts.
(e.g. 10th percentile is 10/100 (1/10) of the way through the data).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Calculate lower quartile for discrete data?

A

-n/4
-If integer, lower quartile halfway between this data point + next above.
-If not integer, round up and utilise this data point.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Upper quartile of discrete data?

A

-3/4 of n
-If integer, upper quartile halfway between this data point + one above.
-If not integer, round up and utilise this data point.

19
Q

If data is presented in a group frequency table, how can medians, quartiles and percentiles be estimated?

A

Using process of linear interpolation.

20
Q

Why is there assumption involved in process of linear interpolation?

A

Assumed that data values are evenly distributed within each class/range.

21
Q

Lower + upper quartile + median in grouped continuous/cumulative frequency data calculation?

A

Q1: n/4th value

Q2: n/2th value

Q3: 3n/4th value

22
Q

Measure of spread?

A

Measure of how spread out data is.

23
Q

Range?

A

Difference between largest and smallest values of data set.

24
Q

IQR?

A

Difference between the upper quartile and lower quartile, Q3-Q1.

25
Q

Range qualities?

A

-Takes into account all data values.
-Can be affected by extreme values.

26
Q

IQR qualities?

A

-Not affected by extreme values.
-Only considers spread of the middle 50% of the data.

27
Q

If one data set has a higher IQR than another, what is the difference in data?

A

Higher IQR= increased variability in data (its more variable).

28
Q

Inter-percentile range?

A

Difference between values of two given percentiles.

29
Q

Frequently used IPR?

A

10-90th, as not affected by extreme values whilst still interpreting spread of 80% of data in the calculation.

30
Q

If asked to interpret the meaning of the value (like the IPR etc).

A

Just utilise the meaning of the measure (e.g. for mean, 50% of data larger, 50% smaller.)

31
Q

Variance?

A

-Another measure of spread of data.
-Utilises the fact that each data point deviates from the mean by the amount of (the value-the mean) x-x-bar.

32
Q

Variance known as.

A

Mean of sqaures minus squares of mean.

33
Q

Variance eqns.

A

Refer to book.

34
Q

Standard deviation?

A

Square root of variance.

35
Q

Variance eqns for grouped frequency table.

A

Refer to book.

36
Q

If data in a grouped frequency table?

A

Using linear interpolation, you can calculate estimates for the variance and standard deviation of the data, with the midpoint of each class interval used in calculations.

37
Q

Coding?

A

Way of simplifying statistical calculations.

38
Q

Coding formula?

A

y=x-a
b

39
Q

Why are values coded?

A

When coded, they create a new data value which is easier to work with.

40
Q

For the mean of coded data:

A

y-bar (coded mean)=x-bar (mean) - a/b (coding)

41
Q

For the mean of original data?

A

X-bar=by-bar + a

42
Q

For the standard deviation of the coded data?

A

sigma-y (coded standard deviation) =sigma-x (new standard deviation) /b

43
Q

Standard deviation of original data?

A

sigma-x=b x sigma-y.

44
Q

If asked to formulate code, and values decreased by 20% e.g, how is this interpreted.

A

Like normal
0.8(… etc.)