Statistics 1 Flashcards

1
Q

Mathematical Model

A

A simplification of a real world situation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Uses of a mathematical model (2)

A

1) Make predictions about a real world problem

2) To obtain an improved understanding of the situation through analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why mathematical models are useful (2)

A

1) They enable predictions to be made

2) They are quicker, easier and cheaper to use than the real situation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Disadvantages of a mathematical model (3)

A

1) They only work for a restricted range of values.
2) They only give a partial description of the real situation (does not include all aspects)
3) The model may only work in certain situations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Stages of designing a mathematical model (7)

A

1) A real world problem is observed
2) A mathematical model is devised
3) The mathematical model is used to make predictions about the expected behaviour of the real world problem
4) Experimental data is colelcted from teh real world
5) A comparison is made between the predicted and observed outcomes
6) Statistical tests are used to assess how well the model describes the real world
7) The mathematical model is refined, if necessary, to improve the match of predicted outcomes with (experimental) data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are statistical tests (2)

A

1) Used to assess how well a mathematical model matches a real world situation
2) An objective measure of deciding if the differences between the model’s predictions and the experimental data are within acceptable limits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Quantitative variables

A

Variables associated with numerical observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Qualitative variables

A

Variables associated with non-numerical observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Continuous variable

A

A variable that can take any value in a given range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Discrete variable

A

A variable that can take only specific values in a given range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Median/IQR (what it shows, Adv, Disadv)

A

Use: To show how spread out the middle 50% of observations are
Adv: Not affected by extreme values; used when the data is skewed
Disadv: Does not use all of the data and therefore not a ‘true’ measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why use mean/s.d.

A

These measures are a true measure of the data (use all the pieces of data) but are affected by extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Advantages of using a stem and leaf diagram (4) (Kangaroos eat quince tomorrow)

A

1) Keeps the shape of the data
2) Enables the shape of the distribution of the data to be revealed
3) Quartiles can easily be found from the diagram
4) Two sets of data can be compared by using back-to-back diagrams

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Disadvantage of using a stem and leaf diagram (1)

A

Can be time-consuming to do

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is another name for the group encompassing mean and median

A

Measures of location

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is another name for the group encompassing IQR and standard deviation

A

Measures of spread

17
Q

S.d./Mean (Use, Adv, Disadv)

A

When used: Used when the data are symmetrical/when the data size is not small
Adv: Uses all the pieces of data
Disadv: Affected by extreme values

18
Q

Give an example of all three of the measures of skewness (measures of location, quartiles, equation, boxplot)

A

Boxplot (self-evident)
Measures of location
- mode = median = mean –> symmetrical
- mode < median < mean –> positive skew
- mode > median > mean –> negative skew

Quartiles
Q2 - Q1 = Q3 - Q2 –> symmetrical
Q2 - Q1 < Q3 - Q2 –> positive skew
Q2 - Q1 > Q3 - Q2 –> negative skew

Equation
3(mean - median)/s.d. --> if value is large, skew = positive
19
Q

EQ: ‘Why should a histogram be used?’

A

A: The data are continuous

20
Q

Advantages of a histogram (2)

A

1) Gives a good pictures of how data are distributed
2) Enables you to see:
i) Rough location
ii) General shape of the data
iii) How spread out the data are

21
Q

What is the difference between a histogram and a bar chart? (2)

A

1) H: No gaps between the bars

2) H: Area of the bar is proportional to the frequency

22
Q

What are the three possible statuses for distribution

A

Symmetrical, positive skew, negative skew

23
Q

Define experiment

A

A repeatable process that gives rise to a number of outcomes

24
Q

Define event

A

A collection (or set) of one or more outcomes

25
Q

Define sample space

A

The set of all possible outcomes of an experiment

26
Q

Define discrete random variable

A

A value obtained by taking a measurement from an experiment in the real world

27
Q

Define random variable

A

A variable whose value is the outcome of an experimetn

28
Q

Equation for F(x)

A

P(X ≤ x)

29
Q

Equation for E(X)

A

Σ(xp(x))

30
Q

E(Xn)

A

ΣxnP(X = x)

31
Q

Var(X)

A

E(X2) – (E(X))^2

32
Q

E(aX + b)

A

aE(X) + b

33
Q

Var(aX + b)

A

a^2Var(X)

34
Q

s.d.(aX + b)

A

a(s.d.(X))