Statistics (2) Flashcards
What is the purpose of descriptive statistics?
To explore and compare data meaningfully, assess major differences, determine data distribution shape, check for missing or unusual data, see data noise, and verify data fit for further testing
Descriptive statistics provides a summary of the data but does not allow for objective decisions regarding hypotheses.
What are some key functions of descriptive statistics?
- Explore and compare data meaningfully
- Assess major differences between conditions/variables
- Determine the shape of data distributions
- Check for missing data or outliers
- See the amount of noise in the data
- Verify data fit for further statistical testing
These functions help in understanding the basic characteristics of the data before applying inferential statistics.
True or False: Descriptive statistics can help us make objective decisions about our alternative hypothesis.
False
For objective decisions regarding the alternative hypothesis, inferential statistics is required.
Fill in the blank: Descriptive statistics allows us to check for _______ or unusual data.
missing data
Identifying missing data and outliers is critical for ensuring the integrity of data analysis.
What is needed to arrive at an objective decision about the alternative hypothesis?
Inferential statistics
Inferential statistics allows researchers to make predictions or inferences about a larger population based on sample data.
What are descriptive statistics used for?
Descriptive statistics allow us to:
* Look at measures of central tendency, dispersion, and variation
* Organise and aggregate or disaggregate data in a meaningful way
* Get a ‘feel’ for any relevant patterns
* Present data graphically or in a tabular format
Descriptive statistics summarize data without making inferences about a larger population.
What do inferential statistics allow us to do?
Inferential statistics allow us to:
* Test hypotheses about distributions
* Determine whether differences or relationships are statistically meaningful
* Express whether we can retain or reject the null hypothesis
Inferential statistics make predictions or generalizations about a population based on a sample.
Fill in the blank: Descriptive statistics focus on analyzing _______ data.
[observed]
Fill in the blank: Inferential statistics are used to determine if differences or relationships are statistically _______.
[meaningful]
True or False: Descriptive statistics can present data graphically.
True
True or False: Inferential statistics provide a summary of data without making predictions.
False
What is the normal curve also known as?
Standard normal distribution
What are the important measures that tend to be in the center of the distribution?
Mean, median, and mode
What do the mean, median, and mode represent in a distribution?
Numbers that are representative of the distribution as a whole
Fill in the blank: The mean, median, and mode are measures of _______.
Central tendency
True or False: The mode is the measure that represents the least common value in a distribution.
False
What is the significance of the center of the distribution?
It is where important measures tend to be found
What is the mean in statistics?
The mean is the average of a set of numbers, calculated by adding all items in a set and dividing by the number of items.
The mean is commonly used to represent general performance in statistics.
What types of data is the mean primarily used with?
The mean is used mostly with interval and ratio data.
Interval data is numerical data where the difference between values is meaningful, while ratio data has a true zero point.
How is the mean mathematically represented?
X = (Σxi) / N
Where X is the mean, Σxi is the sum of all items in the set, and N is the number of items.
True or False: The mean can only be calculated for integer values.
False
The mean can be calculated for both integers and decimals.
What is the median?
The middle of a set of values if arranged from smallest to largest.
The median is particularly useful in non-normal distributions or with extreme scores.
When is the median most useful?
When you have a non-normal distribution, extreme scores, or ordinal data.
Ordinal data refers to data that can be ranked but not measured.
What is the mode?
The most commonly occurring number in a set of data.
The mode is most frequently used with nominal data.
When is the mode most frequently used?
With nominal data.
Nominal data is categorical data without a specific order.
What do measures of variability or dispersion indicate?
They indicate how the data varies and the spread of data.
What additional information can measures of variability provide?
They can provide insight into the amount of ‘noise’ in the data set.
List common measures of dispersion.
- Range
- Interquartile range
- Mean absolute deviation
- Variance
- Standard deviation
True or False: The interquartile range (IQR) is a common measure of dispersion.
True
Fill in the blank: Common measures of dispersion include range, interquartile range, mean absolute deviation, ______, and standard deviation.
[variance]
What is the range in descriptive statistics?
The difference between the smallest and largest value in a distribution