Research Flashcards

1
Q

What are the benefits of using a one-way ANOVA?

A

An advantage of the one-way ANOVA (or any ANOVA) is that it helps control the “experimentwise error rate” (i.e., the probability of making a Type I error). If alpha is set at .05 for this study, for instance, the probability of making a Type I error would be help at 5%. In contrast, if the individual t-tests were conducted, each at the .05 level, the probability of making a Type I error would be much higher.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What will allow you to control the effects of an extraneous variable?

A

The analysis of covariance is useful for this purpose.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Will adding 12 points increase the mean and other measures of central tendency by 12 points? Will this have any effect on the standard deviation or other measures of variability.

A

Yes

No effect on SD or any other measure of variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q
  1. The _____ product moment correlation coefficient is the appropriate correlation coefficient when both variables are measured on a continuous scale.
  2. The __________ is used to correlate two variables that are measured in terms of ranks.

The _______ is used to determine the correlation between two dichotomous variables.

The ___________ is appropriate when one variable is continuous and the other is a true dichotomy.

A

Pearson r

Spearman rho (rho = Ranks) Correlation between height and shoe size

phi coefficient (living or dead)

point biserial correlation coefficient (time and dead/alive)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Decreasing the level of significance from .05 to .01 makes it more difficult to reject the null hypothesis and, therefore, also _______ power.

Decreasing the susceptibility of the dependent variable measure to measurement (random) error would ________power by ensuring that the measure is able to accurately detect the effects of the independent variable.

When appropriately used, a one-tailed test is ______powerful than a two-tailed test since it places the entire rejection region in only one tail of the sampling distribution rather than splitting it up between the two tails.

A

Decreases

Increase

More

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are some bivariate techniques? And why are they used?

What bivariate summarizes the degree of association between variables with a single number?

How do you determine the amount of shared variability or coefficient of determination?

When should a bivariate coefficient be squared?

When is the correlation NOT squared? and should be interpreted as a ?

What are some correlation Coefficients? and what are their variables?

Which is used when a relationship between variables is non-linear?

A

Bivariate techniques describe or summarize the degree of association between two variables and include:
Scattergram
Correlational Coefficients:
-Correlation Coefficients

The correlation coefficient is squared to determine the amount of shared variability/ coefficient of determination: in other words- the Squared correlation coefficient indicates the proportion of variability in Y that is explained or accounted for by the variability in X
A Bivariate Correlation Coefficient should be squared to obtain a measure of shared variability Only when it indicates the degree of freedom between two variables.

A correlation coefficient is not squared when it is a reliability coefficient (reality) which is a correlation of a measure with it’s self.
-direct measure of “true score variability”

-Pearson r (interval or ratio: interval or ratio)
-Spearman rho (Rank-order; Rank ordered)
-Contingency (nominal, Nominal)
Point Biserial (True Dichotomy; interval or ratio)
Biserial ( Artificial Dichotomy; interval or ratio)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A factorial research design is any design that includes ________ “factors” (independent variables)?

A

two or more

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Pearson r and other correlation coefficient range in value from ____. The magnitude of the coefficient indicates the relationship’s strength the sign (+ or -) indicates the relationships __________. Where a _____ means the value of Y increases as the value of X increases. A _______ correlation, the value of Y decreases as the value of X __________.

A

-1.0 to + 1.0

Direction

Positive
Negative
Increases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the three assumptions that must be met in order to produce an accurate correlation?

A

-Linearity
_Unrestricted range: where data is collected from people who are heterogeneous with regard to the characteristics measured by X and Y
- Homoscedasticity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

________ is used to predict Y from a single X, and an assumption is that the relationship between X and Y is ______ (i.e., the relationship can be described by a straight line (regression line “line of best fit”). What technique is used to locate a regression line in a scatterplot?

A

Regression analysis

linear

Least squares criterion-which locates the line so that the amount of error in prediction is minimized.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Statistical power refers to the ability to reject a false hypothesis and is effected by the size of the _____-.
Therefore when a statistical test enables an investigator to reject a false null hypothesis it is said to have statistical power:
What are the methods to increase statistical power?
_____

A

Alpha

  • Decreasing the susceptibility of the dependent variable measure to measurement (random) error would increase power by ensuring that the measure is able to accurately detect the effects of the independent variable. (minimize error)
  • Increase Alpha to .05 over .01.
  • When appropriately used, a one-tailed test is more powerful than a two-tailed test since it places the entire rejection region in only one tail of the sampling distribution rather than splitting it up between the two tails.
  • Maximizing differences between treatment groups does help increase power.
  • use a parametric test
  • increase sample size
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The value of the Pearson-product moment correlation coefficient ranges from -1.0 to +1.0, and the corresponding proportion of variance (which is referred to as the coefficient of determination) is computed by ______ the correlation coefficient.

In addition to the Pearson r, what are some other coefficients?

What determines statistical significance?

A

squaring

Spearman rho (rank order/..)
Phi (Tru Dic/ Tru dic.)
Tetrachoric (Art. Dic/Art. Dic)
Contingency (Nom./Nom)
Point Biserial (True Dic,/interval or ratio)
Biserial (art. Dic/ interval or ratio)
Eta (interval or ratio/,,,)

-compare coeff. to appropriate critical value which is determined by Alpa and sample size.
Small Sample; need a large coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do you interpret a Correlation Coefficient?

A
  • Directly in terms of degree of association (-1 or +1). Correlations are not causal but
  • When a correlation represents the degree of association between two different variables, it can be squared to represent a coefficient of determination which provides a measure of shared variability.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does a regression (Regression Analysis, Multiple Regression) do?
Name some types of multiple regressions.

When is a analysis of variance used? What is the advantage of using a ANOVA?

When is an ANOVA better than a t-test?\

Name some other ANOVAs

What analysis of variance is used for a within subjects design when different levels of the IV or combination of the levels of two or more IV’s are subsequently administered (time) to each subject.

How is an F ratio caluculated using a one way ANOVA?

A

It predicts
Multiple regression: is a multivariate technique that is used when two or more continuous or discrete predictors will be used to predict status on a single continuous criterion.

ANOVA: to compared two or more means:
Advantage: it compares two group means while holding the probability of making a type 1 error at the level of significance set by the investigator. – helps to control experimentwise error rate>

t test and ANOVA are comparable when comparing two means, but an ANOVA is the stat. of choice when 2 or more means are compared. Further, an ANOVA is more complex and analyzes the variability around the mean.

Factorial ANOVA- 3 IV
Randomized Block ANOVA: treats extraneous variable as a IV
-Analysis of Covariance: ANCOVA: combines analysis of -variance and regression analysis: removes extraneous variables from the DV
-Repeated measure ANOVA: within subject design Levels ect. of IV is administered to each subject
-Mixed (Split-Plot) ANOVA: mixed design - at least one IV is between-groups and other within-subjects
Trend Analysis: evaluate the shape or form of the relationship –statistically significant linear or non-linear
-Multivariate Analysis of Variance (MANOVA) : 1+ IV and 1+DV

-Repeated measures ANOVA

An F-ratio is calculated.. When using the one-way ANOVA to determine if an independent variable has had a significant effect on a dependent variable, an F-ratio is calculated by dividing the mean square between (MSB) by the mean square within (MSW).
Divide MSB by MSW (manage social behavior by a Masters of Social work)
–MSB provides an estimate of variability due to (treatment plus error), while MSW provides an estimate of variability due to (error only.) When the independent variable has had an effect, MSB will be larger than MSW and the F-ratio will be larger than 1.0, and the larger the F-ratio, the more likely that the effect of the independent variable is statistically significant.
( the bigger the behavior MSB the larger the F-ratio and the more statistically significant the “issue”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Cohen’s d and eta squared are commonly used for?

Which is used to measure the difference between two groups( experimental and control) in terms of SD.

And which indicates the percent of variance in the outcome variable that is accounted for by variance in the treatment?

A

Effect size.

Cohen d
Eta

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How should significant main and interaction effects be interpreted when using a factorial ANOVA to assess the effects of two independent variables on a dependent variable?

What type of design is used when the effects of different levels of an IV are assessed by administering each level to a different group of subjects and comparing the status or performance of the groups on the DV.

What design : all levels of the IV are administered sequentially to all subjects?

Which design combines between groups and within subjects by including at least one between-groups IV and one within-subjects IV and involve measuring the DV across trial/time where trail/time is an additional IV and is within-subjects?

What is main and interaction effect?

A

Main effects refer to the effects of one independent variable on the dependent variable when considered alone, while interaction effects refer to the effects of one independent variable at different levels of another independent variable.

When the interaction is significant, this means that the effects of one independent variable differ for different levels of another independent variable. Thus, it is not possible to conclude that the independent variable has consistent main effect. Therefore you need to interpret the main effects with caution since the interaction is significant

  • Between-Groups Design
  • Within-Subjects Design
  • Mixed Design

Main IV on DV and disregards the effects of all other IV
Interaction: refers to the effects of two or more IV considered together and occurs when the effects of an IV differ at different levels of another IV

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Measures of Central Tendency in Skewed Distribution
Positive=
Negative=

A

Positive (Mode, Md. Mean -Mean is the highest)

Negative (Mean, Md. Mo. ) Mode is greater than the md. which is greater then the M)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Name the measures of variability or spread?

How is the variance calculated?
How is the Standard Deviation Calculated?

A

Range
Variance (M squared)
Standard Deviation

Standard deviation the is square root of the variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

a study in which each participant received only one level of Variable A but all levels of Variable B

A

Split plot or Mixed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What do multivariate techniques do?

These techniques are categorized as Dependence Method and Interdependence methods. What is the difference?

What techniques predicts status on a variable?

What multivariate techniques teat a casual Model or theory?

What multivariate techniques are used for the purpose of data reduction and why is this important

A

they investigate the relationships among three or more variables. -Multivariate data analysis is a set of statistical models that examine patterns in multidimensional data by considering, at once, several data variables. It is an expansion of bivariate data analysis, which considers only two variables in its models. As multivariate models consider more variables, they can examine more complex phenomena and find data patterns that more accurately represent the real world.

-Dependence has a distinct independent and dependent variables (predictor and criterion)
Interdependence does not and includes several data reduction techniques. dependence and interdependence. Dependence relates to cause-effect situations and tries to see if one set of variables can describe or predict the values of other ones. Interdependence refers to structural intercorrelation and aims to understand the underlying patterns of the data.

  • Predictors: Multiple regression (2+ discrete or continuous IV (predictor) and 1 DV (criterion)
  • Canonical Correlation ( 2+ IV and 2 DV (criterion)
  • Discriminant Function Analysis (2+ continuous predictors and 1 Nominal criterion) A discriminant analysis (also known as discriminant function analysis) involves using scores on two or more predictors to predict an individual’s membership in a criterion group - i.e., it is used when the criterion is measured on a nominal scale.
  • Logistic Regression: extension of discriminant but assumes relationships are curvilinear

Test a theory: Causal modeling

  • Path analysis
  • LISREL

Data reduction
Factor Anaylsis
Cluster analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Consider as an example the regression model — a method to analyze correlations in data. The non-multivariate case of regression is the analysis between two variables, and it is called a __________ regression. It could be used, for instance, to see how the height of a swimmer correlates to its speed. By doing this type of regression, the analyst could find that taller swimmers tend to swim faster. Although it is right, we know that the height is not the only thing influencing speed, so the bivariate model hardly explains the complete phenomena of swimming.
In contrast, a _________ regression — also called multiple regression — could take into account way more variables: weight, age, carbohydrate intake, protein intake, amount of training hours, amount of resting hours, and many others. In theory, the higher the number of variables, the more accurate the regression can represent the phenomena of swimming, to a point where it could pinpoint the speed of a new swimmer with little error.

A

bivariate regression

Multivariate regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

________ data variables: are always of the numeric type and represent information that can be measured by some scale. Examples include age (20 years), temperature (25 ºC), and profit (US$ 2000). The number specifies the magnitude of the value on a given scale.
_______ variables: categorizes the data, but do not specify its magnitude. Examples include an operational system (Windows, Linux, macOS) and house size (small, medium, large). The list of options that a non-metric variable can assume is called levels or categories. Even when the levels have an inherent order (e.g., a large house is bigger than a small house), it is still a non-metric variable because there is not any magnitude associated (the variable doesn’t say how bigger is the house). Note that non-metric variables can also be numeric when it is not attached to any scale, such as a variable that dictates the id number of objects

A

Metric

Non metric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Most multivariate techniques perform computations that need numbers as inputs, so how can a technique work with non-metric data?

A

The answer is that a non-metric variable can become a dichotomic metric variable. In this conversion, each level becomes a new metric variable that can only have the values 0 (as false) or 1 (as true). For instance, consider a non-metric variable that classifies the Color of a product with the levels: black, white, and gray. The variable can be replaced with two new ones: isColorBlack and isColorWhite. If a product is black, then it assumes the values 1 and 0 respectively, and if it is white, the values 0 and 1. There is no need for a variable for gray products because they can assume the values 0 and 0: if they aren’t white nor black, they can only be gray

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

In dependence techniques of a multivarate tech,, the analyst feeds a model with input data, specifying which variables are independent and which are dependent. The ___________ variables are the ones the model will try to predict or explain (e.g., swimmer speed). The _______ variables (e.g., swimmer height) are the ones the analyst wants to study how much it affects the independent ones.
The goal of all dependence techniques is to establish a cause-effect relationship. The most notable differences between them are the number of independent variables they support and the nature of the variables involved.

A

independent predictor

dependent (criterion)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What multivariate technique is used to predict sales performance of different stores based on its attributes (e.g., number of vendors, number of hours open). Such analysis would lead to a deeper understanding of what makes each store sell more, which could drive administrative changes in the most important attributes towards values that give higher profit.

A

Multiple regression is an option when the analyst stipulates only one dependent variable, which is metric. The result of applying a multiple regression is the degree of impact that each independent variable has on the dependent one. That result also leads to an estimation function, where it accepts values for the independent variables and returns the expected value for the dependent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

The classic example is classification. After processing the data, the model can classify future entries that don’t have labels. For instance, a model could analyze characteristics of music fragments (dependent variables), whereas each piece is assigned to a musical genre (independent variable). If the analyst builds a successful model, it can classify the genre of fragments it never saw before. What type of multivariate technique is being used?

A

(Dependent Variable(s): one non-metric variable
Independent Variables’ Nature: metric)

  • Multiple discriminant analysis is very similar to machine learning classifiers. It is an option when there is only one dependent variable, which is non-metric — also called “class” or “label”. The goal is to understand the characteristic of the data that pertain to each class.
    • A discriminant analysis (also known as discriminant function analysis) involves using scores on two or more predictors to predict an individual’s membership in a criterion group - i.e., it is used when the criterion is measured on a nominal scale.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

A team of aerodynamics engineers who are designing a new aircraft and want to measure if several combinations of engines and wings affect the magnitude of the forces in airplanes (e.g., thrust, drag, lift, weight).
In a simulation environment, the engineers choose three types of engines (E1, E2, E3) and three types of wings (W1, W2, W3) — both the engine type and the wing type are independent variables. They develop several airplanes for all of the engine-wing combinations and launch them in many virtual spaces to collect as much force data as possible (the dependent variables). What type of Multivariate technique would work best in this study? How can the researcher fine tune these results?

A

MANOVA: require many independent variables and many measures

The application of MANOVA in the collected data could reveal that the combination of E1-W2 is significantly worse, while E3-W1 is significantly better. The engineers can see how each engine, each wing, and each combination, impacts on each of the forces. It is not an easy technique to conduct or to interpret but is a rewarding and powerful one.
-The multiple covariance analysis (MANCOVA) can fine-tune the results and reinforce the study’s validity by removing the effects of possible unobserved variables (for example, whether it was raining or not in the simulations). Thus, even if these factors affect the dependent variables, the MANCOVA reduces its impacts to isolate the effect of the treatments as much as possible.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is the difference between internal validity, Face validity, Construct validity, external validity?

A

Internal validity focuses on the causal relationship between independent and dependent variables.

Face validity focuses on whether a test looks like it measures what it is intended to measure.

Construct validity is established when a test measures the intended hypothetical trait.

External validity focuses on the generalizability of one study to other conditions, individuals, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

An investigator uses a factorial ANOVA to assess the effects of two independent variables on a dependent variable and obtains significant main and interaction effects. When interpreting the results of her study, the investigator should interpret the main effect with caution or interpret the interaction since the main effect is significant?

A

interpret the main effects with caution since the interaction is significant.-
When the interaction is significant, this means that the effects of one independent variable differ for different levels of another independent variable. Thus, it is not possible to conclude that the independent variable has consistent main effects. For example, a study might find that, overall, Teaching Method #1 is superior to Teaching Method #2 (i.e., there is a main effect of teaching method). However, there might also be an interaction between teaching method and level of self-esteem - for example, Teaching Method #1 might be more effective for students with high and moderate self-esteem, while Teaching Method #2 is more effective for students with low self-esteem. In this situation, the main effect of teaching method would have to be interpreted with caution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Name some single subject designs?

What is the difference between AB design and Multiple baseline?

What type of single subject design is this:
A psychologist decides to use a single-subject design to assess the impact of an educational intervention designed to increase on-task behavior. When conducting her study, the psychologist will measure the participant’s on-task behavior:

A

AB design
Reversal ABA or ABAB
Multiple Baseline

Multiple baseline: does not require withdrawing a treatment, but involves sequentially applying treatment to different behaviors of the same subject (multiple baseline across behaviors) to the same subject in different setting (multiple baseline across settings) or same behavior across different subjects ( multiple baseline across subjects). In other words a treatment is applied to a baseline which can be a behavior, setting, task or subject)

Multiple baseline: at regular intervals during the baseline and the treatment phases of the study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is an analogue study?

A

when the researcher is studying a phenomenon (therapist-client interactions) under conditions that only resemble or approximate actual clinical conditions.

In clinical research, an analogue study is a study in which the conditions are in some way an analogue (approximation) of actual clinical practice.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

In inferential statistics, there are two types of decision errors - rejecting a true null hypothesis and retaining a false null hypothesis.
A researcher makes a Type II error when he or she? and a Type I error?

A

retains the null hypothesis when it is false: Type II error. (False negative) Telling a pregnant woman that she is not pregnant)

Rejecting a true null hypothesis is referred to as a Type I error. Nothing works hypothesis: say it works but it doesn’t False positive (telling a man he is pregnant)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is a Cohen D and what does a Cohen d of .50 mean?

A

Cohen’s d is a measure of effect size. It indicates the difference between the means of two groups in terms of standard deviations.
A Cohen’s d of .50 indicates that one group obtained a mean that is one-half standard deviation higher than the mean obtained by the other group.

34
Q

The use of an ABAB design helps determine whether or not an observed change in the dependent variable is due to the independent variable or to an external event. Specifically, if the participant exhibits the same change after the independent variable is administered the second time that he/she exhibited after it was administered the first time, that change can be assumed to have been caused by the independent variable. A researcher would use an ABAB design rather than an AB design in order to control which of the following threats to his study’s validity?

What type is this design?

A

History, maturation

An ABAB design is a single-subject design that involves collecting baseline data, administering the independent variable, removing the independent variable, and then readministering the independent variable.

35
Q

What are these threats?

Diffusion is a threat to?

Attrition is a threat to?

Instrumentation is a threat to? And type of designs control for instrumentation?

A

Diffusion is a threat to a study’s validity when participants in one group (often a no-treatment control group) benefit from the intervention administered to another group - i.e., when participants in the control group inadvertently learn about the treatment or are accidentally exposed to it.

Attrition is a threat to a study’s internal validity. It occurs when participants who drop out of one group differ in a systematic way from those who drop out of another group and this difference affects the study’s results.

Instrumentation is a threat to internal validity and refers to any change in tests or other measuring devices administered to participants during the course of the study that confounds the study’s results. An ABAB design would not help control instrumentation effects.

36
Q

A researcher would use “counterbalancing” to:

A

Counterbalancing involves administering the treatments (different levels of the IV) in different orders to different groups of participants. It helps control multiple treatment interference (also known as carry-over or order effects) that may result when multiple levels of the independent variable(s) are administered to the same participants.

37
Q

A researcher would use what technique when her goal is to evaluate the cause-and-effect or predictive relationships between measured variables and latent factors?

A

Structural equation model is used to explore or confirm hypothesized relationships between both measured and latent variables.

38
Q

When would you use a?

Meta-analysis

Multitrait-multimethod matrix

Discriminant function analysis

Structural equation

A
  • Meta-analysis is used to evaluate an intervention by combining the results of a number of research studies.
  • The multitrait-multimethod matrix is used to evaluate convergent and divergent validity.

Discriminant function analysis is used to classify people into criterion groups based on their scores or status on two or more predictors.

Structural equation model is used to explore or confirm hypothesized relationships between both measured and latent variables.

39
Q

A research participant’s score on the dependent variable is the amount of time (minutes) it took him or her to complete a task. When assigning scores to participants, the researcher discovers that three of the 60 participants did not complete the task, and he assigns them the maximum amount of time given to participants to work on the task. The best measure of central tendency for the data collected in this study would be which of the following?

A

Median

Ordinarily, the best measure of central tendency for interval or ratio data is the arithmetic mean. However, in the situation described in this question, the mean would not be the most accurate measure because the distribution of data includes three scores that are estimates (rather than actual measures) of the value of interest.

The mean is affected by the magnitude of every score in a distribution, but the median is not. Consequently, in this situation, the median would not be affected by the magnitude of the three missing scores and, therefore, would be a more accurate measure of central tendency.

40
Q

As a behavioral observation technique, event sampling (recording) is useful for behaviors that:

A

Methods of behavioral sampling include event sampling, interval recording, and situational sampling. These are described in the Statistics and Research Design chapter of the written study materials.

Event sampling involves observing a behavior each time it occurs and is useful for behaviors that occur infrequently, have a long duration, or leave a permanent record.

41
Q

Name some inferential Statistical tests and explain?

A

The t test tells you how significant the differences between groups are; In other words it lets you know if those differences (measured in means) could have happened by chance.

  • –There are three main types of t-test:
  • -An Independent Samples t-test compares the means for two groups.
  • –A Paired sample t-test compares means from the same group at different times (say, one year apart).
  • — A One sample t-test tests the mean of a single group against a known mean.— The more t-tests ran the bigger the experiment wise error and the more likely a type I error– thus analysis of variance is best

ANOVA: Basically, you’re testing groups to see if there’s a difference between them. Examples of when you might want to test different groups:

A group of psychiatric patients are trying three different therapies: counseling, medication and biofeedback. You want to see if one therapy is better than the others.
A manufacturer has two different processes to make light bulbs. They want to know if one process is better than the other.
Students from different colleges take the same exam. You want to see if one college outperforms the other.

One-way has one independent variable (with 2 levels). For example: brand of cereal, (1IV, 2+ groups, 1 DV)
Two-way has two independent variables (it can have multiple levels). For example: brand of cereal, calories
Factor Analysis

  • Nominal: Chi-square
  • Ordinal: Mann Whitney, Wilcoxon, Kruskal-Willis (2+ groups) .
42
Q

What are groups or levels?

A

What are “Groups” or “Levels”?
Groups or levels are different groups within the same independent variable. For example, your levels for “brand of cereal” might be Lucky Charms, Raisin Bran, Cornflakes — a total of three levels. Your levels for “Calories” might be: sweetened, unsweetened — a total of two levels.

Let’s say you are studying if an alcoholic support group and individual counseling combined is the most effective treatment for lowering alcohol consumption. You might split the study participants into three groups or levels:

Medication only,
Medication and counseling,
Counseling only.
Your dependent variable would be the number of alcoholic beverages consumed per day.

If your groups or levels have a hierarchical structure (each level has unique subgroups), then use a nested ANOVA for the analysis.

43
Q

What are these examples of? You have a group of individuals randomly split into smaller groups and completing different tasks. For example, you might be studying the effects of tea on weight loss and form three groups: green tea, black tea, and no tea.
Situation 2: Similar to situation 1, but in this case the individuals are split into groups based on an attribute they possess. For example, you might be studying leg strength of people according to weight. You could split participants into weight categories (obese, overweight and normal) and measure their leg strength on a weight machine.

A

one way ANOVA : One-way ANOVA between groups: used when you want to test two groups to see if there’s a difference between them.

A one way ANOVA is used to compare two means from two independent (unrelated) groups using the F-distribution. The null hypothesis for the test is that the two means are equal. Therefore, a significant result means that the two means are unequal.

44
Q

Percent correct scores represents what type of scale?

A

Percent correct scores represent a ratio scale of measurement - i.e., they have the properties of order, equal intervals, and an absolute 0 point. Consequently, it is possible to conclude that someone who obtains a score of 50% got twice as many items correct as someone whose score is 25%.

45
Q

What factor threatens external validity only.

A

Anything that threatens a study’s internal validity will also threaten its external validity.
–multiple treatment interference is listed by Campbell and Stanley as a direct threat to external validity.

46
Q

What is a time series design?

A

The time-series design is a type of within-subjects design that involves evaluating the effects of an intervention by comparing multiple quantitative observations of participants before and after they are exposed to the intervention.
EX: For her dissertation research project, a graduate student administered a measure of state anxiety to a group of college students on five consecutive days before and after the students participated in a stress reduction workshop.

47
Q

What is Homoscedasticity?

A

Homoscedasticity is one of the assumptions for use of the Pearson r and most other correlation coefficients. This assumption is met when the range of Y scores is about the same at every value of X.

48
Q

What is the difference between LISREL and Path analysis?

A

In contrast to path analysis models, which predict the causal relationships among measured attributes only, LISREL models incorporate both measured attributes and latent traits.

LISREL, a structural equation (causal) modeling technique, is used to test causal hypotheses about relationships among measured variables and the latent traits those variables are believed to measure.

49
Q

Scales of measure
Nominal:

Ordinal:

Interval:

A

Nominal Scale: unorder categories:

Ordinal: divides into categories and provides information on the order of those categories (Likert scale. Rank)

Interval: property of order as well as property of equal variables and No absolute zero (IQ)

Ratio: properties of order and equal intervals as well as an Absolute Zero

50
Q

A moderator variable affects the what?

A

A moderator variable affects the direction or strength of the relationship between independent and dependent variables. For example, if the results of a research study indicate that EMG biofeedback is more effective for tension headaches than for migraine or cluster headaches, type of headache is a moderator variable - i.e., type of headache moderates the effects of EMG feedback on headache symptoms.

51
Q

The standard error of the mean does what?

A

The standard error of the mean is a value that represents the precision of how well the sample mean estimates the population mean.

52
Q

Least squares principle

A

The linear regression formula was mathematically derived in such a way that it minimizes the sum of the squared differences between predicted Y and actual Y. This aspect of linear regression is called the

53
Q

Sample error refers to?

Is it systematic error or random error?

A

Sampling error refers to the discrepancies between sample values and corresponding population values (parameters) that are due to chance factors in the selection process. Note that, by definition, sampling error is random, not systematic.

54
Q

On the basis of the results of the t-test a psychologist uses to analyze the data she collects, the psychologist concludes that her results are “significant at the .01 level.” This means that:

A

The level of significance (alpha) determines the location of the boundary between the regions of likely and unlikely values in the sampling distribution. When results are significant at the chosen level of significance, this means that the results are in the region of unlikely values and that the null hypothesis should be rejected.

Answer A is correct: Significance at the .01 level means that there is a 1% chance that the obtained value (e.g., the mean or the difference between means) could have occurred by chance alone given the value specified in the null hypothesis. In other words, there is a 1% probability that the null hypothesis will be incorrectly rejected (that a Type I error will be made). ( in rejection region)

55
Q

The multiple correlation coefficient can be squared to obtain a measure of what?

What is shared variability?
When is a correlation NEVER squared?

A

The multiple correlation coefficient indicates the degree of association between three or more variables. Like any other correlation coefficient, it can be squared in order to obtain a measure of shared variability.

Correlation Coefficient that is (SQUARED)

Never square a reliability coefficient- you will only be squaring it with it’s self and is interpreted as a dIRECT Measure of “true score variability”

56
Q

What is the F-Ratio?

The denominator term in the F-ratio is reduced in magnitude by:

A

F-ratio:

It is the ratio of the two variances i.e. variance between the groups to the variance within the groups. F-ratio is calculated when there are more than two means to compare.

The formula for the F-ratio:

F =MSB/ MSW

where:

MSB: Mean square between the groups (measure of variability between treatment groups and serves as an estimate of variability that is due to both error and ther effects of the IV

MSW: Mean square within the groups: pooled variability within each treatment group. (Error)

The mean square within (MSW) is the denominator of the F-ratio and, as its name implies, is a measure of within-group variability
Within-group variability is a measure of error; and decreasing within-group variability decreases error and the magnitude of the denominator of the F-ratio.

57
Q

The Central Limit Theorem predicts that the sampling distribution of means increasingly approaches:

A

The Central Limit Theorem makes three predictions about the sampling distribution of means: (a) if repeated random samples of size N are drawn from the population, as N increases in size, the sampling distribution of means increasingly approaches normal regardless of the shape of the population distribution; (b) the mean of the sampling distribution of means equals the population mean; and (c) the standard deviation of the sampling distribution of means equals the population standard deviation divided by the square root of N.
a normal shape as the sample size increases regardless of the shape of the distribution of scores in the population.

58
Q

The correlation between two variables is equal or close to zero when, in a scatterplot

A

— the range of Y scores at each value of X is equal to the total range of Y scores.

59
Q

The least squares is what?

A

The least squares criterion is used to locate the regression line in a scatterplot so that the amount of error in prediction is minimized when using the regression line or its equation to predict criterion scores.
-at least squares gets to the line__

60
Q

+++ What are the assumptions of the Person r? (correlation)

What is the assumption for both a parametric and non parametric tests

How do they differ?

A

Linear
Unrestricted range
Homoscedasticity- range for Y scores is about the same for all values of X

  • -sample is randomly selected from the population
  • -observations are independent

Differ:

    • Parametric- evaluate hypothesis about population means, variances, or other parameters
  • appropriate forms of measures when the variable of interest is measured on an interval or ratio scale and when these assumptions are met: value of interest is normally distributed in the population (2) is when a study includes more than two groups there is a homoscedasticity ( variances of the populations that the different group represents are equal
  • -if violate –increase the probability of making a tye 1 or type 2 error.
61
Q

A teacher administers a math achievement test to the 25 students in her 6th grade class on the first day of the semester and then again on the last day of the semester to see how much they gained in math ability. To analyze the data she obtains, the teacher should use which of the following statistical tests?

A

The t-test for correlated (dependent) samples is used to compare two related means – e.g., means obtained from the same sample at two different times.

62
Q

When a newly developed test consists of 100 true/false items, the ______________ can be used an alternative to coefficient alpha for evaluating its internal consistency reliability.

A

Kuder-Richardson Formula 20 (KR-20) is an alternative to coefficient alpha when items are scored dichomotously (right or wrong), which would be the case for true/false items.

63
Q

In research, probands are?

A

In research, probands (also known as index cases) are the first individuals who are brought to the attention of the investigator -

64
Q

In a scatterplot, the regression lines for a test for two different groups of examinees differ substantially in terms of slope. This suggests that the test has:

A

Differential validity
The slope of a regression line for a test is directly related to the test’s criterion-related validity: The steeper the slope, the greater the validity.
A test has differential validity when it has different validity coefficients for different groups, which is what is suggested by different regression line slopes in a scatterplot.

65
Q

What are the steps to minimize error?

A

Steps to minimize error
 Increase the sample size.
 Use a one-tailed test when appropriate (versus two tailed test)
 Increase the length or intensity of the treatment
 Use a parametric test (e.g., t-test, ANOVA) instead of
non-parametric test

66
Q

How do we increase power?

A

Increasing alpha: This has two effects
 1. We find a real difference with less evidence (smaller
differences between groups required to reject Ho)
 2. A larger alpha increases the probability of Type I
Error; in other words, we are lowering our standards i.e.
lowering the bar to claim that a difference (treatment
effect) is there, and increasing the odds of falsely
claiming a treatment effect.

67
Q

To determine if there is a statistically significant pattern in the effect of time on memory for a list of nonsense syllables, you have subjects memorize a list of 15 syllables and then test their memory at 15-minute intervals for the next two hours. The best technique for analyzing the data you have collected is:

A

In this situation, you want to determine if there is a pattern (or “trend”) in forgetting the list of nonsense syllables. Of the techniques listed, only trend analysis (a type of analysis of variance that is used when the IV is quantitative) would be useful for determining if there is a pattern or trend in subjects’ memory for the list of nonsense syllables over time.

The correct answer is: trend analysis.

68
Q

During an experiment, the investigator asks participants about their beliefs regarding the study’s purpose and how they were expected to perform. When analyzing the data, she finds that participants’ actual performance is consistent with their beliefs and expectations. This suggests that the study’s results may be confounded by:

A

In this situation, the participants may have acted in ways consistent with their expectations rather than simply in response to the experimental manipulation.

Demand characteristics are unintentional cues in the experimental environment or manipulation that affect participants’ beliefs or expectations and thereby may account for the results of the study.

69
Q

___________ occur in repeated measures designs when the effects of one treatment have an impact on the effects of subsequent treatments.

_____________ is a type of rating error and occurs when a rater’s rating of a person on one dimension of performance affects how the rater rates that person on unrelated dimensions of performance.

___________ occurs when research participants act differently because of the novelty of the situation and the special attention they receive as research participants.

A

Carryover effects occur in repeated measures designs when the effects of one treatment have an impact on the effects of subsequent treatments.

-The halo bias is a type of rating error and occurs when a rater’s rating of a person on one dimension of performance affects how the rater rates that person on unrelated dimensions of performance.

The Hawthorne effect occurs when research participants act differently because of the novelty of the situation and the special attention they receive as research participants.

70
Q

Question ID:27331 To evaluate the effects of a behavioral intervention for reducing the self-injurious behaviors of a 5-year-old boy with Autism Spectrum Disorder in different settings, you would use which of the following single-subject designs

A

n this situation, you want to evaluate the effectiveness of the intervention in multiple settings.

A multiple baseline across settings design would be appropriate for this study. It would allow you to sequentially assess the effects of the intervention in different settings.

71
Q

To determine the degree of association between two variables that are reported in terms of ranks, you would use which of the following correlation coefficients?

A

The Spearman rank-order correlation coefficient is also known as Spearman rho and is used when both variables are ranks.

72
Q

___________ is used when both variables are measured on a nominal scale.

__________ is the appropriate correlation coefficient when one variable is continuous and the other is an artificial dichotomy.

___________ is used when both variables are true dichotomies.

A

The contingency coefficient is used when both variables are measured on a nominal scale.

The biserial coefficient is the appropriate correlation coefficient when one variable is continuous and the other is an artificial dichotomy.

The phi coefficient is used when both variables are true dichotomies.

73
Q

The effect of the interaction of independent variables A and B on the dependent variable

A

As one goes up the other goes up

74
Q

A school psychologist wants to determine if there is a significant difference in reading readiness scores between male and female students in the school’s preschool program. She obtains scores on a standardized reading readiness test for 17 girls and 13 boys. Which statistical test will be most appropriate for determining if there is a significant difference between the scores obtained by boys and girls?

A

The researcher in this study will be comparing the scores (most likely the mean scores) achieved by two independent groups of subjects.

A t-test (a.k.a. Student’s t-test) is used to compare the mean scores obtained by two groups.

75
Q

he best procedure to use if you have identified a potential confound that you want to be evenly distributed across groups is ___________.

A

Matched group assignment

76
Q

When conducting a one-way ANOVA to compare the effects of four different diets on weight loss, an F-ratio of _____ suggests that there may be a significant difference between the types of diets?

A

The F-ratio is calculated by dividing the mean square between by the mean square within. Mean square between is a measure of treatment effects plus error, while mean square within in a measure of error only.

Answer A is correct: A treatment effect is suggested when the numerator of the F ratio (mean square between) is larger than the denominator (mean square within) - i.e., when the F value is greater than +1.0.

The correct answer is: 15.5

77
Q

You want to use several predictors to classify people into one of two groups. The relationships between the predictors and the criterion violate the assumption of linearity. Consequently, the best technique is:

A

In this situation, there are two or more predictors and one dichotomous criterion. In addition, the assumption of linearity has been violated.
Logistic regression is similar to discriminant analysis (response c) but is less restrictive in terms of assumptions.

78
Q

When an assumption for a one-way ANOVA is violated, the best course of action would be to use which of the following to analyze the data

A

The one-way analysis of variance is used to compare three or more independent groups. (Although the one-way ANOVA can be used to compare two groups, the t-test is ordinarily used in this case.)

The Kruskal-Wallis is the nonparametric alternative to the one-way ANOVA. It can be used to compare two or more independent groups and is useful when one or more of the assumptions for the one-way ANOVA have been violated.

79
Q

Your research study involves assessing the effects of two independent variables on three dependent variables. In this situation, you would choose to conduct a MANOVA to analyze the data you collect rather than separate factorial ANOVAs in order to:

A

he multivariate analysis of variance (MANOVA) is used to simultaneously assess the effects of one or more independent variables on two or more dependent variables.

The more statistical comparisons made within a research study, the greater the likelihood of making a Type I error. By using the MANOVA to simultaneously assess the effects of the independent variable(s) on the dependent variables, the fewer the total number of statistical comparisons, and the lower the probability of making a Type I error. (The word “error” in the term experimentwise error rate refers to a Type I error.)

80
Q

Coefficient alpha is associated with

A

Coefficient alpha is a type of reliability coefficient. It yields a coefficient of internal consistency.

coefficient alpha is a measure of internal consistency.

81
Q

In factor analysis, “rotating” the factors has which of the following effects?

A

The purpose of rotation in factor analysis is to facilitate interpretation of the factors.
a. CORRECT Rotation alters the factor loadings for each variable and the eigenvalue for each factor (although the total of the eigenvalues remains the same). Knowing that an eigenvalue indicates the amount of variability accounted for by each factor may have helped you identify the correct answer to this question – i.e., when the factor loadings change, the eigenvalues will also change.

-It changes the factor loadings for the variables and the eigenvalue for each factor.

82
Q

What would most likely maximize the magnitude of a test’s reliability coefficient?

A

increase the length of the test and increase the heterogeneity of the examinees with regard to the attribute measured by the test

Longer tests tend to be more reliable (assuming that the added items are similar in terms of quality and content to the original items). In addition, the reliability coefficient (like all correlation coefficients) is larger when the range of scores is unrestricted, which occurs when examinees are heterogeneous with regard to the attribute(s) being measured by the test.