W1 Flashcards
What are histograms?
Histograms visualize the distribution of a dataset, providing a graphical representation of the frequency of values within certain ranges.
Increasing the number of bins in the histogram enhances resolution, allowing for a more detailed view of the data distribution.
What does kurtosis mean?
Kurtosis refers to the peakedness or flatness of the distribution curve in a histogram.
High kurtosis indicates a high peak with more extreme values, while low kurtosis indicates a wider distribution.
Define dataset?
A dataset is a collection of data acquired for a specific purpose, which may relate to multiple experiments or hypotheses.
Each dataset typically consists of multiple variables, each representing different attributes being measured.
Define variable?
A variable is a number that can vary depending on an attribute being measured.
Multiple variables are typically measured from each participant or observation, often represented as columns in a data file.
What’s nominal data?
Nominal data, also known as categorical data, has no inherent order or magnitude between different categories.
Each category is distinct and cannot be ranked or ordered.
What’s ordinal data?
Ordinal data has a natural order between categories, but the magnitude of differences between categories is not interpretable.
Common examples include Likert scales where responses are ordered but the difference between each response option may not be uniform.
What’s interval data?
Interval data has ordered categories with interpretable magnitudes, but zero does not have a meaningful interpretation.
Examples include temperature measurements where the difference between 20°C and 30°C is the same as the difference between 30°C and 40°C.
What’s ratio data?
Ratio data, like interval data, has ordered categories with interpretable magnitudes, but zero is directly interpretable and ratios between values can be meaningful.
Examples include measurements of money or reaction times.
What is discrete data?
Discrete data consists of numerical values that belong to a fixed set of distinct values, often representing counts or whole numbers.
What’s continuous data?
Continuous data represents variables that can take any value within a certain range, allowing for infinite possibilities.
Interval and ratio data are examples of continuous data.
What’s a null hypothesis?
The null hypothesis states that there is no effect or relationship between variables in a statistical analysis.
Statistical tests are conducted to either reject or fail to reject the null hypothesis based on the evidence provided by the data.
What’s the p-value?
The p-value is the probability that a particular test statistic could occur if the null hypothesis is true.
A lower p-value suggests stronger evidence against the null hypothesis, leading to its rejection in favor of an alternative hypothesis.