Stats Flashcards
What happens to statistical significance when there is a large sample size?
Large sample size makes it more likely to find statistical significance. As size grows, significance can be found for even very small differences.
How are the mean and standard deviation affected if a constant is subtracted from every score?
All operations (add, subtract, x, divide) affect mean but only multiplication and division affect standard deviation.
Criterion contamination. Refers to:
- Hi validity coef bc ratings contaminated by knoweledge of predictor.
- Underestimate of validity coef bc criterion rating contaminated by knowledge of predictor
- Carryover effects
- Due to low reliability
- It occurs when the rating on criterion is affected by knowledge of score on predictor.
Ie may inflate grades of hi IQ kids causing hi correlation between iq and grades.
Using an ANOVA a pooled error term is justified when:
- Sample size is unequal
- Variance is equal
- All cells have the same number of subjects
- Homiscedasticity is violated.
- Pooled error term is used when there is homogeneity of variance (equal). Homoscedacity is also equal variance.
When things Re equal they can be pooled. When unequal, treat them separately.
Match definitions below with std deviation, std error of measurement, std error of estimate, std error of the mean
- Ave amt of error in predicting each persons score
- Ave amt of error in calculating each score
- Ave amt of error in grp means iq score
- Ave amt of spread in grp scores.
- Predicting is std error of estimate
- Std error of measurement…ave amt of error in each persons score.
- Std error of the mean…in relation to the population mean
- Std deviation.
What is the most significant problem in using a series of t tests to analyze a data set?
- Experimenter wise error Type I
- Difficulty distinguishing intx and main effects
- Low power beta - 1
- Violating parametric assumptions
1
ANOVA reduces type I error, which is additive.
More tests ya do more chance of error.
How are the mean and std deviation effected if a constant is subtracted from every score?
- Both decrease
- Both increase
- Mean decreases, std dev same
- Mean same, std dev decreases
3
Single subject research design, which is the most significant problem?
- Autocorrelation
- Multicollinearity
- Regression to the mean
- Practice effects
1
The association between two variables, when ea variables association w another variable has been removed is known as: A.analysis of covariance B. partial correlation C. Semi-partial correlation D. Coefficient of determination
B. this is correlation between 2 variables when the association between a third variable and ea of the two original variables has been partialed out.
Cluster sampling involved what kind of clusters?
Naturally occuring groups and then randomly selecting from the clusters. Typically all the subjects within the clusters are sampled.
Standard errors of mean, measurement, and estimate express error in terms of:
- Standard deviation
- Sampling error
- Systematic error
- Testing situation
- All std error express in terms of std deviation.
Sampling error applies to std error of mean only
Systematic…applies to none
Testing situation…source of error for std error of measurement only.
Std error of measurement is significantly influenced by reliability coefficient, the range of std error of measurement is: A. -1 to 1 B. 0 to 1 C. 0 to sdx D. 0 to sdy
C
Range of validity coefficient is -1 to 1
Reliability coefficient is 0 to 1.
D is range of std error of estimate.
Taylor Russell tables evaluate incremental validity: base rate, selection ratio, criterion related validity
A. Construct validity
B. criterion related validity
C. Test retest
D. Internal consistency reliability
Selection ratio…ratio of number of openings to number of apps
Low optimizes incremental validity
Higher criterion validity, better incremental.
- Criterion related validity
For each a score, variability of b scores is equal to total variability of b scores. Conclude: A. Error; unlikely B. moderate positive correlation C. Mod negative correlation D. No correlation
D…for any a, end up w all possible b. scatter plot
Knowing a tells u nothing of b
Which will increase std error of mean?
Increase sd of population and decrease sample size
Decrease sd of population and increase sample siZe
Increase both
Decrease both
- Std error of mean has direct relationship w sd of population Nd indirect relationship w sample size
Std error of mean increases when sd of pop is increased and n is reduced.
Increasing test length: A. Affects reliability only B. reliability more than validity C. Equal effect D. Neither
B. effect on both but validity has a ceiling due to the reliability.
Two way ANOVA find differences. U conclude:
A. Main effects and may:not have intx effects
B. intx effects and may/not have main effects
C. Can be any combo of main effects and interactions
D. Neither main effects or interaction effects because may be due to chance.
C. 3 F ratios so 4 possibilities of significance. Two possible main effects and a possible intx effect. Can be any combo. Can have intx but no main effects etc..if one way ANOVA can’t detect intx effect.
Abab design the concern is: A. History and maturation B. regression and diffusion C. Failure of IV to return to baseline D. Failure of dv to return to baseline
4
Which circumstance would it be problematic to use chi square?
A. When looking for differences between groups
B. ordinal data
C. Repeater observations made
D. More than one iv
C. One of the main assumptions is independence of observations. Can’t use when repeated observations are made, like a pre and post test
Chi square is non parametric test of differences used for nominal or categorical data. Can use with ordinal. Use multiple chi when more than one IV
Shape of a z score distribution is: Normal Skewed Flat Can't be determined
Shape follows the raw score distribution which is not given.
Flat is for percentile ranks.
Single subjects design involve an approach: Ipsitive Idiographic Normative Nomothetic
Idiographic describes single subject approaches
Nomothetic group approach
Normative…data compared with in and between subjects
Ipsitive forced choice format. Only gives strengths and interests within a subject and can’t be used for comparisons.
3 levels of an IV and a continuous dv should be analyzed using what stats? One way ANOVA Factorial ANOVA Chi square Manova
1. One IV w 3 levels; 1 DV continuous or scored numerically One way ANOVA used w 1IV and 1DV Chi is nominal data or categorical Manova had more than one dv Two way ANOVA is 2 IV and one dv Factorial ANOVA more 1 IV and 1 dv
Relationship between education Nd income for clinical psychologists is?
Correlation between education and income in general?
- Zero. Restricted range
2. Broader .3 to .5
Changes in the Variable causes changes in the
Variable.
Independent
Dependent
IV is input and causes changes in
Dv is output
IV is manipulated
Dv is measured
Correlational research variables are not manipulated. Input variable is IV . Called predictor variables.
Outcome variables are dv or criterion variables.
Regardless…what effect does (IV) have on (Dv).
What is a factorial experimental design?
Adv of factorial?
More than one IV where every level of one IV is combo w every other level of IV .
What are the adv ? Statistical in nature.
What is internal validity?
What are the threats to internal validity?
How do you control for threats to internal validity?
Allows the conclusion that there is a causal relationship between the IV and dv.
Or if can conclude that no effect.
Threats: (hims teds) Factors other than IV are responsible for changes in dv: History (external event) Maturation (internal event..fatigue, bored, hunger) Testing (experience w pretest) Instrumentation (change nature of it) Statistical regression (less extreme scores when retested) Selection (preexisting subject characteristics) Differential mortality (diff of drop outs and non drop outs) Experimenter bias (expectation or other bias)
Control?
- Random assignment (equivalent on extraneous factors)
- Matching or grp similar subjects on extraneous and randomly assign
- Blocking or study as if extraneous is another IV
- Hold extraneous variable constant or use only homogeneous subjects
- Ancova…like post hoc matching
What does a confound mean?
Experiment contaminated by an extraneous variable is confounded.
What is a threat to internal validity for a pretest/post test design? This is a one group before and after design.
Testing…when pre and post are similar may show improvement due to experience w the test. Test wise
Instrumentation…raters may have improved by post test
Stat regression
Pygmalion in the classroom:
- Experimental expectancy
- Impact of maturation
- Confounding variable
- Unequal selection of students
Another name for it?
- Correct! Teachers preconceived ideas of a students abilities resulted in the graded and even iq scores moving in the expected direction even though the students hadnt changed.
- No
- It is a confounding variable! Yes
- No!
Also called rosenthal effect. Behavior of subjects changes due to expectancies.
Overcome w double blind study
What is the difference between random assignment and random selection?
Random selection or random sampling is selecting subjects into a study. All members of the population under study have equal chance to be selected to participate. (External validity)
Random assignment is after they have been selected. For subjects already selected the probability of being assigned to ea grp is the same. Great equalizer!
When is matching used?
Controls for effects of a specific extraneous variable. Identify subjects thru a pretest who are similar on an extraneous variable, group and randomly assign.
Good when sample size is small and random assignment cannot be counted on to ensure equivalency.
How is blocking done to control for confounding variables?
Involves studying the effects of an extraneous variable, usually a preexisting characteristic (gender, iq) to determine if and degree acct for scores on dv.
Make extraneous a IV
Ie. divide into blocks…hi and lo iq then randomly assign to IV. Now have iq and tx as 2 IV.
Different from matching bc
Matching ensures equivalence. Number of Ivs stay the same.
Blocking determines the effects of the extraneous variable. Also adding a IV.
Discuss holding the extraneous variable constant as a way to manage internal validity and Ancova.
Holding the extraneous variable constant eliminates the effects of an extraneous variable.
Include only homogeneous subjects
Ie only the high iq peeps
Disadvantage..can’t generalize
Ancova is a post hoc matching after data are obtained. Dv scores adjusted so subjects are equalized
Disadv..like matching..can’t control for what has not been identified or measured.
What is external validity?
What are some threats to external validity?
Some may overlap w internal validity. Understand concept not need to know which classification they go to.
Generalizability of the results to other settings, times, people…
Interaction (some variables have one effect under one set of conditions but a different effect under another set).
Intx between selection and tx:
Given tx not generalize to other members of population (ie use college kids may not go to rest of population).
Intx between hx and tx:
Effects of tx don’t generalize beyond setting or time pd expt done.
Intx between testing and tx:
Pretest sensitization..can’t generalize to sit where pretests not used. Pretest may sensitize to purpose or increase susceptibility to respond to the tx.
Demand characteristics
Cues in research setting that allow to guess research hypothesis. Due to cues, subjects act different than real world (try disprove..)
Hawthorne effect:
Respond different just due to mere fact being studied. Study..workers increased output following any change in the environment.
Order effects or carry over or multiple treatment interference
Problem in repeated measures design or same subjects exposed to more than one tx. Last tx may have greater effect bc it followed previous interventions.
How do you increase external validity?
- Random selection or random sampling. Often use experimentally accessible population and assumption is made that subjects similar in relevant ways to rest of target population.
Stratified random sampling.. sample from several subgroups of total population. To ensure proportionate rep of defined pop
Cluster sampling. Natural occurring group of individuals vs individual. - Naturalistic research
Controls for hawthorn and demand characteristics but will lack internal validity (always a trade off). - Single and double blind
Reduce demand and hawthorn effects - Counterbalancing
Controls for order effects.
Diff subjects or grps receive tx in diff order. Type is Latin square design…order administration of tx so ea appears once and only once in every position.
Experiemental vs quasi experimental vs correlational research.
Exptal..random assignment; manipulate variables
Quasi ..NO random assignment
Pre existing grps. Naturally occurring. Manipulable variable studied (decide which grp gets which tx) so experimenter control. Use w preexisting intact grps like classroom, ward..
Correlational..grps measured only
Not manipulated. No internal validity. Only associations.
Used for prediction.
Variables like age, gender, ses, eye color…
Discuss the differences between the developmental designs. Longitudinal, cross sectional, and cross sequential. Review drawbacks of each.
Longitudinal..study one grp of subjects over a long pd of time.
Disadv Time, money, dropout rate.
Cross sectional..study two or more grps of ppl at one time
Disadv…cohort effects..experience effect vs their age
Cross sequential..combo of the above..look at ppl of diff ages at 2 plus times
Controls for cohort. Cheaper, less expensive, decreases drop out.
What is a time series design?
Interrupted time series design?
Advantage of multiple measurements?
Multiple measurements over time to assess IV
Usually multiple pre and post test measures
Interrupted time series design is series of measurements on dv that is interrupted by administration of tx
One grp interrupted time series design. Threat is an event that occurs at same time as tx. Ie. price of cigs went up when administered. Control history with two grp time series design w a control grp that also look at over time.
Adv? Rule out threats to internal validity like maturation, regression, testing.
What is a single subject design?
Number of subjects is one or if two or more subjects treated as one group.
Dependent variable measured several times during both phases
Lots of variability poses threat to design.
Good for behavior modification study.
Usually baseline (no tx) and then tx phase.
Types: AB, reversal, multiple baseline
Describe the the single subject designs.
AB single baseline, single tx phase
Disadv..history, other confound
Reversal or withdrawal design
Single subject design
Treatment is withdrawn and data collected to det if behavior goes back to original level upon withdrawal. If returns to original level during wdrawal then more sure due to tx vs extraneous factors
ABA
ABAB. Tx reapplied at end. Adv over ABA is additional confirmation tx causes changes. Also don’t leave them wo tx.
Multiple baseline used when reversal can’t be. So can’t reverse or withdraw tx due to ethics. May not be possible to demo tx effect in a reversal. So instead of a withdrawal, this applies the tx sequentially or across different baselines. In other words…single subject design in which IV is sequentially administered across 2 or more subjects, behaviors, or settings (baselines)
What is qualitative research?
Talk about the types, especially surveys and case studies.
Descriptive research
Theory is developed from the data vs being derived beforehand.
Often pilot to help define ho
Types: Participant observation Nonparticipating observation Interviews Surveys..personal, phone, mail Biased selection or sampling is problem Case studies ...case is example of more general class. Can't draw conclusions between variables and may not be generalize able. Protocol analysis ...verbatim reports No traditional quantitative techniques but based on interpretation.