Topic 1 - Data, Samples and Descriptive Statistics Flashcards

1
Q

Define population

A

The entire/complete group we are interested in

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define sample

A

Smaller group on which data is collected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define parameter

A

Numerical measures which describe specific characteristics of a POPULATION (remember p parameter for p population)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define statistics

A

Numerical measures computed from a sample (remember s statistics for s sample)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How many areas can statistical analysis be divided into?

A

2 areas

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the areas that statistical analysis can be divided into?

A

1) Descriptive statistics
2) Inferential statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are descriptive statistics?

A

Numerical and graphical methods used to summarise and present info in a meaningful way

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are inferential statistics?

A

When you make inferences or predictions about the population based on the findings from your sample
e.g. things like hypothesis testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Describe the general process in statistics

A

1) From the population you collect data to form your sample
2) Using descriptive statistics you can describe the sample (statistics)
3) Using the statistics found you can use inferential statistics to make estimates about the parameters (features) of the population
4) This can then allow you to draw conclusions about the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is simple random sampling?

A

Each unit of population has an equal probability of being chosen to be part of sample- each individual chosen randomly (entirely by chance)
- good, unbiased representation of population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the variations of simple random sampling?

A

1) With replacement- sample unit drawn and then returned to population after characteristics recorded
2) Without replacement- sample drawn is not returned to population after characteristics recorded- … each unit can be only be drawn once (NOTE here sample values not independent as sample drawn can affect next sample- as you cannot draw the unit you drew previously)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How could you describe a distribution of a histogram or potentially even other graphical representations of data?

A

1) Symmetric distribution- if there is symmetry or balance when data is presented
2) Positively skewed- skewed to the right- where distribution is higher to the left and lower on the right
3) Negatively skewed- skewed to the left- where distribution is higher to the right and lower on the left

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How many ways can you summarise the data of one variable?

A

2 ways

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the ways in which you can summarise the data of one variable and briefly describe or give examples of each way?

A

1) Frequency distribution e.g. ordered list or table
2) Histograms- graph summarising the data in a frequency distribution- vertical axis either frequency, relative frequency or percentage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

In how many ways can you display the relationship between two variables?

A

2 general ways

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How can you display the relationship between two variables?

A

1) Categorical (qualitative) variables e.g. cross tables
2) Numerical (quantitative) variables e.g. scatter plots/diagrams

17
Q

What is the Simpson’s Paradox?

A

For example say you have 2 people who sell 2 products A and B
The aggregate data (sales of both products looked at together where you see the total sales and sales effectiveness as a percentage)- the aggregate data shows the person 2 is more affective in sales

BUT when you look at the sales for the 2 products separately, you see that person 1 is more effective in selling each individual product than person 1

… the 2 contradicting pictures are known as the Simpson’s paradox

See last slide of topic 01 pre recorded lecture if need be for visual

18
Q

Give examples of descriptive statistics

A

4 main types:
1) Measures of Frequency- count, percent, frequency
2) Measures of Central Tendency- mean, median or mode
3) Measures of Variation or Dispersion- range, variance, standard deviation
4) Measures of Position- percentile ranks, quartile ranks

19
Q

Give examples of inferential statistics

A

1) Hypothesis Testing
2) Point Estimate

20
Q

In a positively skewed distribution how does the mean, median and mode compare?

A

In a positively skewed distribution (skewed to the right- where distribution is higher on the left and lower on the right side)
REMEMBER the order is always mode, median mean OR mean, median, mode (mode is always the highest peak BUT we are interested in the x-axis position of that measure of central tendency) … as it is mode, median, mean from left to right, the mode is the smallest, then the median is slightly larger and the mean is the largest out of the 3

21
Q

In a negatively skewed distribution how does the mean, median and mode compare?

A

In a negatively skewed distribution (skewed to the left- where distribution is higher on the right and lower on the left side)
REMEMBER the order is always mode, median, mean OR mean, median, mode (mode is always the highest peak BUT we are interested in the x-axis position of that measure of central tendency) … as it is mean, median, mode from left to right, the mode is the smallest, then median is slightly larger and the mean is the largest out of the 3

22
Q

What is an independent event?

A

When the first draw doesn’t effect the second
I.e. draw is done with replacement so that all units can be drawn including the one/ones drawn previously

23
Q

What is a dependent event?

A

When the first draw does effect the second … the second does depend on the first
I.e. draw is done without replacement so that a unit drawn previously cannot be drawn again- also the total number of units that can be drawn falls