research: statistics Flashcards
frequency distribution
Tabulation of the number of observations (or number of participants) per distinct response for a particular variable. It is presented in table format, rows indicating each distinct response and columns presenting the frequency for which that response occurred
frequency polygon
A line graph of the frequency distribution that is used to visually display data that are ordinal, interval, or ratio. The X-axis typically indicates the possible values, and the Y-axis typically represents the frequency count for each of those values.
histogram
A graph of connecting bars that shows the frequency of scores for a variable. Taller bars indicate greater frequency or number of responses. Histograms are used with quantitative and continuous variables (ordinal, interval, or ratio).
bar graph
A graph that displays nominal data. Each bar represents a distinct (noncontinuous) response, and the height of the bar indicates the frequency of that response.
central tendency
Measures of the typical or middle value of the data set. Measures of central tendency include the mean, median, and mode
Variability
A measure of the spread in a distribution of scores or data points. The more dispersed the data points, the more variability a distribution has. The three main indicators of variability are range, standard deviation, and variance.
interquartile range
The distance between the 75th percentile and the 25th percentile (i.e., the range of the middle 50% of the data). The interquartile range may be a more accurate estimate of variability when dealing with outliers or extreme values, as it eliminates the top and bottom quartiles
standard deviation
The most frequently reported indicator of variability for interval or ratio data.
sum of squares (SS)
The sum of the squared deviation scores, computed by subtracting the mean from each data point (deviation scores), squaring each deviation score, and adding them together.
variance
A type of variability equal to the standard deviation squared
Skewness
An asymmetrical distribution in which the data points do not cluster systematically around a mean. Distributions can be positively skewed, with a greater number of data points clustering around the lower end, or negatively skewed, with a greater number of data points clustering around the higher end of the distribution.
Kurtosis
The degree of peakedness of a distribution. Distributions can be mesokurtic (normal curve), leptokurtic (tall and thin), and platykurtic (flat and wide).
Inferential Statistics
Statistical procedures that are used to draw inferences about a population from a sample.
degrees of freedom
An important concept used in inferential statistics, that refers to the number of independent variables (IVs) “free to vary.” Computing df depends on the statistical test used.
Correlation coefficient
A numerical index that represents the relationship between two variables. Index values range from .00 to 1.00 in both the positive and negative directions, with +1.00 indicating a perfect positive relationship and a -1.00 indicating a perfect negative relationship. Four commonly used types of correlation coefficients include (a) Pearson product moment correlation coefficient (commonly referred to as Pearson r), (b) Spearman r (for comparing rank-order variables), (c) biserial correlation coefficients (comparing one continuous and one dichotomous or dummy coded variable), and (d) point biserial correlation coefficients (relating one continuous and one true dichotomous variable). A correlation provides information about the relationship between two variables, including whether there is a relationship at all, the direction of that relationship, and the strength of the relationship.