Definitions Flashcards
Histogram
A histogram is a graph that presents continuous numerical data in bars. The height of a rectangle (the vertical axis) represents the distribution frequency of a variable (the amount, or how often that variable appears).
T-test
T-tests are to test whether the mean differences between groups are different. An independent samples t-test is used to assess the mean differences between two independent tests.
Degrees of freedom
Degrees of freedom show us how much the data can vary before it effects our results.
sampling distribution of mean differences
The sampling distribution of mean differences refers to the distribution of differences in means obtained from multiple samples taken from the same population
p-value
the probability of observing a difference in sample means as extreme as (or more extreme than) the one obtained in the current study, assuming the null hypothesis is true.
relative frequency
Relative frequency is the proportion or percentage of occurrences of a specific category or data point relative to the total number of occurrences in the dataset.
It provides a way to understand the distribution of data in terms of proportions rather than absolute counts.
cumulative frequency
Cumulative frequency is the frequency of all of the occurrences up till the one you have chosen. This is calculated as the sum of all the frequencies up until the chosen point.
Standard Deviations
Standard deviation is a statistical measure of the dispersion or spread of a set of data points. It quantifies how much individual data points differ from the mean (average) of the data set. In other words, it indicates the average deviation of each data point from the mean.
Statistical inference
Statistical inference is the process of making predictions, decisions, or generalizations about a population based on sample data from that population. It involves using statistical techniques to draw conclusions about parameters or characteristics of a population by analyzing sample data.
standard error of the mean
The standard error of the mean is a measure of the variability or dispersion of sample means around the population mean.
It indicates how much the sample mean is expected to vary from one sample to another.
z-scores
They represent the number of standard deviations a data point is from the mean of the distribution. This standardization process allows for the comparison of data points from different distributions or with different units of measurement.
Z-scores are used to standardise data points so that the mean is 0 and the standard deviation is 1.
Pearson Bivariate Correlation Coefficient
and range
used to assess the strength and the direction of the relationship between two variables, IV and DV.
The range is between -1 and 1.
R-squared
indicates the proportion of the variance in the dependent variable (outcome) that is explained by the independent variable(s) (predictor(s)).
It measures the goodness of fit of the regression model to the observed data.
R-squared ranges from 0 to 1.
R-squared value indicate in a regression model
When there is only one independent variable in a simple linear regression model, the correlation coefficient between the independent variable and the dependent variable is equivalent to the square root of the R-squared value.
confidence interval for a correlation coefficient
confidence interval for a correlation coefficient involves assessing the range within which we can reasonably estimate the true value of the correlation in the population.