Intro Flashcards

1
Q

What is a measure of central tendency?

A

A measure that gives a general idea of the centre of the distribution of the data

2
Q

If you have nominal data, what measure would you use to find the central tendency?

A

Mode

3
Q

Advantage and 2 disadvantages of mode?

A

Advantage: not affected by extremely high or low values (outliers)

Disadvantages:

  • Some distributions have no mode (e.g. a uniform distribution)
  • Ignores the distribution of observations, so non-modal values carry no weight
4
Q

Problems with the mode? (2)

A

Some distributions have more than one mode (bimodal)

With ordinal, interval or ratio (OIR) data, the modal score may not be central to the distribution as a whole
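
A minimal Python sketch of the points above about the mode, using the standard library's `statistics.multimode`. The data sets are invented for illustration only.

```python
from statistics import multimode

# Hypothetical nominal data: favourite colour of eight respondents.
colours = ["red", "blue", "blue", "green", "red", "blue", "green", "red"]

# "red" and "blue" both occur three times, so the data are bimodal.
colour_modes = multimode(colours)

# Hypothetical uniform data: every value occurs once, so every value
# ties for most frequent and there is no useful mode.
uniform_scores = [1, 2, 3, 4]
uniform_modes = multimode(uniform_scores)

print(colour_modes)   # ['red', 'blue']
print(uniform_modes)  # [1, 2, 3, 4]
```

`multimode` returns every value that ties for the highest count, which makes both the "more than one mode" and the "no mode" cases visible.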

5
Q

Four types of measurement of data?

A

Nominal
Ordinal
Interval
Ratio

Mnemonic: NOIR

6
Q

What is nominal data (3) and example?

A

Observations of a qualitative variable are measured and recorded as labels or names.
Data is classified into categories and can’t be sorted into an order.
The only operations permitted are classifying and counting.

Example: gender

7
Q

2 characteristics of nominal data?

A

Mutually exclusive

Exhaustive (each object must appear in one of the categories)

8
Q

What is ordinal data (2) and example?

A

Data are arranged in an order, but differences between values are meaningless.
The most advanced mathematical operation permitted on this data is ranking the categories.

E.g. level of education:
GCSE
A level
Degree

9
Q

What is interval data (2) and example?

A

Meaningful amounts of differences between data values can be determined.
No absolute zero score

E.g. temperature in Celsius or shoe size

10
Q

Why can’t you say ‘100 degrees Celsius is twice as hot as 50 degrees Celsius’?

A

Because 0 degrees Celsius is not a true zero: it marks the freezing point of water, not the absence of heat. Differences between Celsius values are meaningful, but ratios are not.
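
A short Python sketch of why the ratio fails, converting both temperatures to kelvin (a ratio scale with a true zero). The function name `celsius_to_kelvin` is made up for this illustration.

```python
def celsius_to_kelvin(celsius):
    """Convert a Celsius temperature to kelvin."""
    return celsius + 273.15

hot = celsius_to_kelvin(100)   # 373.15 K
warm = celsius_to_kelvin(50)   # 323.15 K

# On the ratio scale, 100 degrees C is only about 1.155 times 50 degrees C.
ratio = hot / warm
print(round(ratio, 3))  # 1.155

# The difference is the same size on both scales (about 50 degrees),
# which is why intervals are meaningful on an interval scale.
difference = hot - warm
```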

11
Q

What is ratio data and example?

A

An extension of interval data that includes an inherent (true) zero point, so ratios between values are meaningful.

E.g. weight, age, temperature in kelvin

12
Q

Advantages of median? (3)

A

Not affected by outliers
Unique (each data set has exactly one median)
Can be computed for ordinal, interval and ratio data

13
Q

What is a parameter?

A

A measurable characteristic of a population

14
Q

What is a statistic?

A

A measurable characteristic of a sample

15
Q

Characteristics of mean? (4+ formula)

A

Unique (only one per dataset)
Requires interval or ratio data
Every single score affects it
Sum of deviations from mean is always zero

Σ(xi - xbar) = 0
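
As a quick check of the last property, a small Python sketch with an invented data set shows the deviations from the mean cancelling out.

```python
from statistics import mean

# Hypothetical scores, chosen only for illustration.
scores = [2, 4, 4, 5, 10]

x_bar = mean(scores)                       # 5
deviations = [x - x_bar for x in scores]   # [-3, -1, -1, 0, 5]
total_deviation = sum(deviations)

print(total_deviation)  # 0
```

With data that are not whole numbers the sum may differ from zero by a tiny floating-point rounding error.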

16
Q

What is the least squares principle? (1+formula)

A

If the differences between each score and the mean are squared and then added, the resultant sum is the minimum possible, i.e. smaller than the sum of squared differences from any other value.

Σ(xi - xbar)^2 = min
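
A minimal Python sketch of the principle, comparing the sum of squared differences around the mean with the same sum around a few other candidate values. The data and the helper name `sum_of_squares` are assumptions for illustration.

```python
from statistics import mean

def sum_of_squares(data, centre):
    """Sum of squared differences between each score and `centre`."""
    return sum((x - centre) ** 2 for x in data)

# Hypothetical scores, chosen only for illustration.
scores = [2, 4, 4, 5, 10]
x_bar = mean(scores)  # 5

around_mean = sum_of_squares(scores, x_bar)   # 9 + 1 + 1 + 0 + 25 = 36
around_median = sum_of_squares(scores, 4)     # 4 + 0 + 0 + 1 + 36 = 41

# No other candidate centre gives a smaller sum than the mean does.
candidates = [3, 4, 4.5, 5.5, 6, 7]
mean_is_minimum = all(
    around_mean <= sum_of_squares(scores, c) for c in candidates
)
print(around_mean, around_median, mean_is_minimum)  # 36 41 True
```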

17
Q

Disadvantage of mean?

A

Affected by extreme values (outliers)
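
A small Python sketch, using invented figures, of how a single outlier pulls the mean while leaving the median unchanged.

```python
from statistics import mean, median

# Hypothetical values; the second list replaces the largest value
# with an extreme outlier.
typical = [20, 22, 25, 27, 30]
with_outlier = [20, 22, 25, 27, 200]

mean_typical = mean(typical)            # about 24.8
mean_outlier = mean(with_outlier)       # about 58.8
median_typical = median(typical)        # 25
median_outlier = median(with_outlier)   # 25

print(mean_typical, mean_outlier)
print(median_typical, median_outlier)
```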

18
Q

Three common data shapes?

A

Mean = median: symmetrical distribution

Mean > median: positive skew

Mean < median: negative skew

19
Q

Coefficient of skewness equation?

A

sk = (3(xbar - median))/s

s is standard deviation
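
A minimal Python sketch of the formula above. It assumes s is the sample standard deviation (`statistics.stdev`, which divides by n - 1); the card does not say which version is intended. The data and the function name `skewness_coefficient` are invented for illustration.

```python
from statistics import mean, median, stdev

def skewness_coefficient(data):
    """sk = 3 * (mean - median) / s, with s taken as the sample std dev."""
    return 3 * (mean(data) - median(data)) / stdev(data)

# Hypothetical scores: mean 5, median 4, sample standard deviation 3.
scores = [2, 4, 4, 5, 10]

sk = skewness_coefficient(scores)
print(sk)  # 1.0, positive because the mean is greater than the median
```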

20
Q

2 equations for frequency distribution?

A

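Number of classes: choose the smallest k such that 2^k > N (k is the no. of classes, N is the no. of observations)
Class interval determined by:

i = (H - L)/k, then round up (H is the highest value, L is the lowest value)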
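
The two rules can be sketched in Python as below. The function names and the sample figures are assumptions for illustration, and rounding up to the next whole number is only one choice; in practice the interval is often rounded up to a convenient value.

```python
import math

def number_of_classes(n):
    """Smallest k such that 2**k > n, where n is the number of observations."""
    k = 1
    while 2 ** k <= n:
        k += 1
    return k

def class_interval(high, low, k):
    """(high - low) / k, rounded up to the next whole number."""
    return math.ceil((high - low) / k)

# Hypothetical data set: 80 observations ranging from 12 to 98.
k = number_of_classes(80)        # 2**6 = 64 is too small, 2**7 = 128 > 80
i = class_interval(98, 12, k)    # 86 / 7 is about 12.29, rounded up

print(k, i)  # 7 13
```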

21
Q

Upper and lower limit of frequency distribution classes?

A

Must include all values in data

22
Q

Histogram axis labels?

A
X - class intervals
Y - class frequencies

BARS TOUCH

23
Q

What is a frequency polygon?

A

A graph of straight lines joining the points plotted at each class midpoint and class frequency, i.e. the midpoints of the tops of adjacent histogram bars

24
Q

If there is an extra column on the left-hand side of a stem and leaf diagram, what is it?

A

Cumulative frequency

25
Q

What goes on the axes of a bar chart?

A
X - classes (categories)
Y - class frequencies
26
Q

What is a line chart good for showing?

A

Change over time