Hypotheses, Descriptive Statistics and Presenting Data Flashcards
What should we consider when choosing a plot type for presenting data?
The data type of each variable (numerical or categorical)
What plot type is used for a numerical (continuous or discrete) variable to see how the data is distributed?
Histogram
What types of numerical variables can we use a histogram for?
We can use a histogram for both continuous and discrete numerical variables
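As a rough illustration of what a histogram does, the sketch below groups values into equal-width bins and counts them. The data, the bin width and the helper name `histogram_counts` are assumptions made up for this example, not part of the original cards.

```python
from collections import Counter

def histogram_counts(values, bin_width):
    """Count how many values fall in each bin [left, left + bin_width)."""
    # Floor division maps every value to the left edge of its bin.
    counts = Counter((v // bin_width) * bin_width for v in values)
    return dict(sorted(counts.items()))

# Hypothetical discrete data: number of siblings reported by 10 students
siblings = [0, 1, 1, 2, 2, 2, 3, 3, 4, 6]
bin_width = 2
counts = histogram_counts(siblings, bin_width)

# Text version of the plot: one '#' per observation in the bin
for left_edge, n in counts.items():
    print(f"[{left_edge}, {left_edge + bin_width}): {'#' * n}")
```

The same function accepts continuous values (floats), which is why a histogram suits both kinds of numerical variable.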
What type of plot should we use for presenting two numerical variables?
We should use a scatterplot
Can we use a scatterplot for both continuous and discrete numerical variables?
Yes, we can use a scatterplot for both continuous and discrete numerical variables
How should we arrange the variables on a scatterplot?
- independent variable = x axis
- dependent variable = y axis
What type of plot should we use for presenting one categorical and one numerical variable?
A boxplot
What types of categorical variables can we use a boxplot for?
- nominal
- ordinal
- binary
What types of numerical variables can we use a boxplot for?
- continuous
- discrete
How should we arrange the variables on a boxplot?
- categorical variable = x axis
- numerical variable = y axis
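A boxplot draws one box per level of the categorical variable, based on a five-number summary of the numerical variable. The sketch below computes that summary per group. The data and group names are hypothetical, the quartile method ("inclusive") is one of several conventions, and many plotting tools end the whiskers at 1.5 times the interquartile range rather than at the minimum and maximum used here.

```python
from statistics import quantiles

def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) for one group."""
    q1, median, q3 = quantiles(values, n=4, method="inclusive")
    return (min(values), q1, median, q3, max(values))

# Hypothetical data: a numerical score for each level of a categorical variable
scores_by_group = {
    "control": [4, 5, 6, 7, 8],
    "treated": [6, 7, 9, 10, 13],
}

# One summary per category, i.e. one box per position on the x axis
summaries = {
    group: five_number_summary(values)
    for group, values in scores_by_group.items()
}
for group, summary in summaries.items():
    print(group, summary)
```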
What type of plot should we use for presenting two categorical variables?
If there are only a small number of levels (groups), a table is appropriate; otherwise, use a grouped bar chart
How many levels (groups) can a table accommodate for presenting two categorical variables?
A table can accommodate a small number of levels (groups)
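A minimal sketch of such a table, assuming two made-up categorical variables (smoking status and exercise level) with two levels each. It counts every combination of levels and prints the counts as a two-way table. The same counts could feed a grouped bar chart when there are more levels.

```python
from collections import Counter

# Hypothetical survey responses: (smoking status, exercise level)
responses = [
    ("smoker", "low"), ("smoker", "low"), ("smoker", "high"),
    ("non-smoker", "low"), ("non-smoker", "high"),
    ("non-smoker", "high"), ("non-smoker", "high"),
]

# Count each combination of levels
table = Counter(responses)
rows = sorted({status for status, _ in responses})
cols = sorted({level for _, level in responses})

# Print a two-way table: rows are one variable, columns are the other
print(f"{'':12}" + "".join(f"{c:>6}" for c in cols))
for r in rows:
    print(f"{r:12}" + "".join(f"{table[(r, c)]:>6}" for c in cols))
```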
What type of plot should we use for presenting two categorical variables when there are many levels (groups)?
A grouped bar chart
Can we use a grouped bar chart for other types of variables, such as numerical variables?
No, a grouped bar chart is specifically for presenting two categorical variables with many levels (groups). For numerical variables, we would use other types of plots such as histograms, boxplots, or scatterplots
What is the difference between descriptive and inferential statistics?
- Descriptive statistics = summarising a collection of data, graphically or numerically, and performing exploratory analyses (e.g. plots)
- Inferential statistics = estimating parameters and their confidence limits, as well as hypothesis testing
What is the purpose of descriptive statistics?
To summarise and describe the important features of a dataset (e.g. central tendency and spread of the data)
What is the purpose of inferential statistics?
To make inferences about a larger population based on a sample of data
What are some examples of exploratory analyses in descriptive statistics?
Creating:
- histograms
- scatterplots
- boxplots
to visualise the distribution of variables and the relationships between them.
What is hypothesis testing in inferential statistics?
Testing a hypothesis about a population parameter based on a sample of data, by determining how likely the observed results (or more extreme ones) would be if the null hypothesis (i.e. no difference or no relationship) were true.
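One way to make this idea concrete is an exact permutation test, sketched below. It is only one of many possible tests and is not named in these cards; the data and function names are hypothetical. Under the null hypothesis of no difference between groups, the group labels are interchangeable, so the sketch tries every relabelling and counts how often the difference in means is at least as large as the one observed.

```python
from itertools import combinations

def mean(values):
    return sum(values) / len(values)

def permutation_counts(group_a, group_b):
    """Return (extreme, total) over all ways of relabelling the pooled data."""
    pooled = group_a + group_b
    observed = abs(mean(group_a) - mean(group_b))
    indices = range(len(pooled))
    extreme = 0
    total = 0
    for chosen in combinations(indices, len(group_a)):
        chosen_set = set(chosen)
        a = [pooled[i] for i in chosen]
        b = [pooled[i] for i in indices if i not in chosen_set]
        total += 1
        # Two-sided: count differences at least as large in either direction
        if abs(mean(a) - mean(b)) >= observed:
            extreme += 1
    return extreme, total

# Hypothetical measurements from two small groups
group_a = [12, 14, 15, 17]
group_b = [8, 9, 10, 13]

extreme, total = permutation_counts(group_a, group_b)
p_value = extreme / total
print(f"{extreme} of {total} relabellings are at least as extreme (p = {p_value:.3f})")
```

A small p-value means the observed difference would be unusual if the null hypothesis were true.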
What does central tendency refer to in descriptive statistics?
A measure of the "middle" or "typical" value of a dataset.
- e.g. mean, median, and mode
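A short sketch of these three measures using Python's standard `statistics` module, on a small made-up dataset:

```python
from statistics import mean, median, mode

# Hypothetical dataset
data = [2, 3, 3, 5, 7]

data_mean = mean(data)      # sum of the values divided by their count
data_median = median(data)  # middle value once the data is sorted
data_mode = mode(data)      # most frequent value

print(data_mean, data_median, data_mode)
```

The mean is pulled towards the larger values (5 and 7), while the median and mode are not, which is why the three measures can disagree for skewed data.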
What does spread refer to in descriptive statistics?
How spread out or dispersed the data is
- e.g. range, variance, and standard deviation
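A short sketch of these measures using Python's standard `statistics` module, on a small made-up dataset. It assumes the data is a sample, so `variance` and `stdev` divide by n - 1; `pvariance` and `pstdev` would be the counterparts for a whole population.

```python
from statistics import stdev, variance

# Hypothetical sample
data = [2, 3, 3, 5, 7]

data_range = max(data) - min(data)  # distance between the extremes
sample_variance = variance(data)    # average squared deviation, using n - 1
sample_sd = stdev(data)             # square root of the sample variance

print(data_range, sample_variance, sample_sd)
```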