Lecture 8 Flashcards
What is a variable
Any quantity that can be measured
Finish the sentence “In a dataset there will be ___ of a variable for each individual in the sample”
Observations
What’s a central tendency
The typical value of a variable
What is dispersion
How far from the typical value the individual observations of a variable are
What is an association?
How a variable relates to another variable
What are inferential statistics?
Stats used to make predictions about parameters of the population based on two factors
What are parameters?
Characteristics
What estimates the parameters?
Statistics computed from a sample
What is probability?
The chance that a particular event will occur
What is sampling distribution
The probability that we obtain the parameters observed in our sample
What is hypothesis testing?
The data supporting our beliefs about the population
When do we use descriptive statistics?
To summarise sample data
What do we use statistical inference?
To generalise about population parameters
What determines or influences what statistical methods we can apply?
The level of measurement of the data
What’s descriptive statistics for?
To summarise the key features of data.
- To make it understandable for human readers
- To identify characteristics
- To identify patterns
- To provide basis for further analysis
What are measure of Central tendency?
Mean (x̄), median (M), mode (Z)
What are measure of central tendency
Single number that represents the ‘typical’ value of a variable (an average: mean, median, mode)
How would you visualise data?
In frequency tables i.e. Bar charts and Histograms
What is skewness?
Distributions that have a relatively higher proportion of values at the low (left) or high (right) end of the range (on the graph)
Where can you visualise skewness best?
Comparing values of means, median and mode in histograms
What does a normal distribution look like?
Evenly spread above and below the mean (bell shape)
Which side does a positive skew lean towards?
Right
Which way does a negative skew lean towards?
Left
What is the mean the best representation of?
The average in most cases of continuous data