Statistics Flashcards
measures of central tendency
a single number that describes the general location of a set of scores
mean
a measure of central tendency; average; add up all scores and divide by number of scores; “weighted” mean: weighted by frequency; the sum of all the distances (or deviations) of all scores from the mean is zero
median
a measure of central tendency; score corresponding to 50th percentile; “middle number”; when the distance of scores is symmetric, the mean and median will be equal; in a positively skewed distribution, the mean is larger (and vice versa)
mode
simplest measure of central tendency; score that occurs most often
frequency distribution
think how “frequent” data is “distributed”, a form of organizing data to make it make better sense; the number of observations within a given interval; shows the “frequency” of occurrence of each possible outcome of a repeatable event observed many times; for ex. - election results, test score listed by percentile; can be in table form, graphed as a histogram or pie chart
scatter plot
use when you have 2 variables that pair well together to view their relationship and see if there is a positive or negative correlation; x, y data is required for this type of graph (2 variables - independent and dependent); related term - correlation
summation notation
convenient/simple form of shorthand used to give concise expression for values of a variable; 8 rules
order of operations
PEMDAS (parentheses, exponents, multiplication/division, addition/subtraction)
percentile rank
single number that gives the percent of cases in the specific reference group scoring at or below that score; for ex. - the 5th percentile is the score in a distribution whose percentile rank is 5; state wide test - categorize test scores to put into; for ex. - can be pictured in a histogram
z-scores/transformation
numerical measurement that describes a value’s relationship to the mean of a group of values; measured in terms of standard deviations from the mean; if a z score is 0, it indicates that the data point’s score is identical to the mean; purpose - allows us to to calculate the probability of a score occurring within our normal distribution and enables us to compare two scores that are from different normal distributions
one-sample t-test
used to test the statistical difference between a sample mean and a known or hypothesized value of the mean in the population; use your findings to find z-score and compare to population mean; used when the population mean is unknown
dependent sample t-test
aka paired sample t-test; used to determine whether the mean difference between two sets of observations is zero; used when you want to find the statistical difference between two points, times or conditions; each subject is measured twice, resulting in pairs of observations; involves repeated measures (think “within” subjects); for ex. - pre-post observations where “d” is the time in between
independent sample t-test
aka two sample t-test; compares the means of two independent groups in order to determine whether there is statistical evidence that the associated population means are significantly different; assume that the dependent variable should be normally distributed within each population; variances of the two populations should be approximately equal (homogeneity of variance); the null hypothesis would be that there is not a difference between the 2 groups; think “between” subjects
correlation
a measure of how 2 or more variables are related to one another; important to always distinguish between correlation (A and B) - have relationship - and causation (A led to B); correlations range from -1 and +1 (otherwise there’s an error in your math); positive values reflect a positive correlation (as one score increases, scores on the other should also increase); negative values reflect a negative correlation (as one score increases, scores on the other should decrease); for ex., linear correlation, Pearson’s correlation coefficient (r), correlation using raw scores, spearman rank-order correlation; P (rho) = population correlation coefficient
Pearson’s r (correlation) - calculated thru SPSS
measure of the strength of association between 2 variables; the nearer the scatter of points is to a straight line, the higher the strength of association; for ex. - knowing the relationship between age and blood pressure; does not matter what measurement units are being used; r = 1 is a perfect positive correlation, r = -1 is a perfect negative correlation