Statistics and Research Design Flashcards

Question

Parametric and Non-Parametric Tests

Answer 1

Parametric tests are inferential statistical tests that are used when the data to be analyzed represent an interval or ratio scale and when certain assumptions about the population distribution(s) have been met - i.e., when scores on the variable of interest are normally distributed and when there is homoscedasticity (population variances are equal). An advantage of the parametric tests is that they are more "powerful" than the nonparametric tests. They include the Student's t-test and the analysis of variance. Nonparametric tests are inferential statistical tests used to analyze nominal or ordinal data (or interval or ratio data when the assumptions for a parametric test have not been met). They include the chi-square test, the Mann-Whitney U test, and the Wilcoxon matched-pairs test.

Answer 2

Path analysis is a structural equation (causal) modeling technique that is used to verify a pre-defined causal model or theory. It involves translating the theory into a path diagram, collecting data on the variables of interest (the observed variables), and calculating and interpreting path coefficients.

Answer 3

When using probability sampling, each element in the target population has a known chance of being selected for inclusion in the sample. Methods of probability sampling include simple random sampling, stratified random sampling, and cluster sampling. In contrast to simple random sampling and stratified random sampling (which involve selecting individuals from the population), cluster sampling involves selecting units or groups of individuals from the population (e.g., schools, hospitals, clinics.)

Answer 4

Random assignment involves randomly assigning subjects to treatment groups and is sometimes referred to as "randomization." It is considered the "hallmark" of true experimental research because it enables an investigator to conclude that any observed effect of an IV on the DV is due to the IV rather than to error. (Random assignment must not be confused with random selection, which refers to randomly selecting subjects from the population.)

Answer 5

Random error is error that is unpredictable (random). Sampling error and measurement error are types of random error.

Answer 6

The randomized block ANOVA is the appropriate statistical test when blocking has been used as a method for controlling an extraneous variable (i.e., when the extraneous variable is treated as an independent variable). It allows an investigator to statistically analyze the main and interaction effects of the extraneous variable.

Answer 7

Regression analysis is used to predict a score on one criterion based on the person's obtained score on one predictor. It involves identifying the location of the regression line ("line of best fit") and using the equation for that line, the regression equation, to make predictions. The least squares criterion is used to locate the regression line so that the amount of error in prediction is minimized.

Answer 8

The rejection region of a sampling distribution contains the sample values (e.g., means) that are unlikely to be obtained simply as the result of sampling error. When an inferential statistical test indicates that the obtained sample value falls in the rejection region, the null hypothesis is rejected and the alternative hypothesis is retained. The size of the rejection region is defined by alpha. The retention region is the region of a sampling distribution that contains the values that are likely to be obtained simply as the result of sampling error. When an inferential statistical test indicates that an obtained sample value is in the retention region, the null hypothesis is retained and the alternative hypothesis is rejected. The retention region is equal to one minus alpha.

Answer 9

The sampling distribution of the mean is the distribution of sample means that would be obtained if an infinite number of equal-size samples were randomly selected from the population and the mean for each sample was calculated. The sampling distribution is normally-shaped, its mean is equal to the population mean, and its standard deviation (the standard error of the mean) is equal to the population standard deviation divided by the square root of the sample size. The sampling distribution is used in inferential statistics to determine how likely it is to obtain a particular sample mean given the population mean, the population standard deviation, the sample size, and the level of significance.

Answer 10

The four scales of measurement are one way to categorize the various ways of measuring variables. From least to most "mathematically sophisticated," the scales are nominal, ordinal, interval, and ratio. A nominal scale yields "frequency data" (the frequency of observations in each nominal category). Ordinal, interval, and ratio scales provide scale values or scores.

Answer 11

A correlation coefficient for two or more variables can be squared to obtain a measure of shared variability. For example, if the correlation between X and Y is .50, this means that 25% of variability in Y is shared with (or is accounted for by) variability in X.

Answer 12

Single-subject designs include at least one A (baseline) and one B (treatment) phase and include multiple measurements of the DV at regular intervals during each phase. The AB design includes a single baseline phase and a single treatment phase. The reversal designs include, at a minimum, two baseline phases and one treatment phase (e.g., an ABA or ABAB design), with the treatment being withdrawn ("reversed") during the second and subsequent baseline phases. Use of the multiple-baseline design involves sequentially applying a treatment to different "baselines" (e.g., to different behaviors, settings, tasks, or subjects).

Answer 13

Skewed distributions are asymmetrical distributions in which the majority of scores are located on one side of the distribution. In a positively skewed distribution, most scores are in the low side of the distribution but a few scores are in the high (positive) side and the mean is greater than the median which, in turn, is greater than the mode. In a negatively skewed distribution, the majority of scores are in the high side of the distribution, but a few are in the low (negative) side and the mode is greater than the median, which is greater than the mean.

Answer 14

The standard deviation is a measure of dispersion (variability) of scores around the mean of the distribution. It is the square root of the variance and is calculated by dividing the sum of the squared deviation scores by N (or N - 1) and taking the square root of the result.

Answer 15

Statistical power refers to the probability of rejecting a false null hypothesis. Power cannot be directly controlled but is increased by having a large sample, maximizing the effects of the IV, increasing the size of alpha, and reducing error.

Answer 16

Systematic error is predictable error. Extraneous (confounding) variables are a source of systematic error that affects the relationship between independent and dependent variables.

Answer 17

Trend analysis is a type of analysis of variance that is used to assess linear and nonlinear trends when the independent variable is quantitative.

Answer 18

A Type I error occurs when a true null hypothesis is rejected. The probability of making a Type I error is equal to alpha, which is set by the investigator prior to collecting or analyzing the data. A Type II error occurs when a false null hypothesis is retained. The probability of making a Type II error is equal to beta (which is usually unknown).

Answer 19

Within-subjects designs are experimental research designs in which each subject receives, at different times, each level of the IV (or combinations of the IVs) so that comparisons on the DV are made within subjects rather than between groups. The single-group time-series design is a type of within-subjects design.

Statistics and Research Design Flashcards

(43 cards)