Stats Flashcards

Question

What is a factorial experimental design? Adv of factorial?

Answer 1

More than one IV where every level of one IV is combo w every other level of IV . What are the adv ? Statistical in nature.

Answer 2

Allows the conclusion that there is a causal relationship between the IV and dv. Or if can conclude that no effect. ``` Threats: (hims teds) Factors other than IV are responsible for changes in dv: History (external event) Maturation (internal event..fatigue, bored, hunger) Testing (experience w pretest) Instrumentation (change nature of it) Statistical regression (less extreme scores when retested) Selection (preexisting subject characteristics) Differential mortality (diff of drop outs and non drop outs) Experimenter bias (expectation or other bias) ``` Control? 1. Random assignment (equivalent on extraneous factors) 2. Matching or grp similar subjects on extraneous and randomly assign 3. Blocking or study as if extraneous is another IV 4. Hold extraneous variable constant or use only homogeneous subjects 5. Ancova...like post hoc matching

Answer 3

Experiment contaminated by an extraneous variable is confounded.

Answer 4

Testing...when pre and post are similar may show improvement due to experience w the test. Test wise Instrumentation...raters may have improved by post test Stat regression

Answer 5

1. Correct! Teachers preconceived ideas of a students abilities resulted in the graded and even iq scores moving in the expected direction even though the students hadnt changed. 2. No 3. It is a confounding variable! Yes 4. No! Also called rosenthal effect. Behavior of subjects changes due to expectancies. Overcome w double blind study

Answer 6

Random selection or random sampling is selecting subjects into a study. All members of the population under study have equal chance to be selected to participate. (External validity) Random assignment is after they have been selected. For subjects already selected the probability of being assigned to ea grp is the same. Great equalizer!

Answer 7

Controls for effects of a specific extraneous variable. Identify subjects thru a pretest who are similar on an extraneous variable, group and randomly assign. Good when sample size is small and random assignment cannot be counted on to ensure equivalency.

Answer 8

Involves studying the effects of an extraneous variable, usually a preexisting characteristic (gender, iq) to determine if and degree acct for scores on dv. Make extraneous a IV Ie. divide into blocks...hi and lo iq then randomly assign to IV. Now have iq and tx as 2 IV. Different from matching bc Matching ensures equivalence. Number of Ivs stay the same. Blocking determines the effects of the extraneous variable. Also adding a IV.

Answer 9

Holding the extraneous variable constant eliminates the effects of an extraneous variable. Include only homogeneous subjects Ie only the high iq peeps Disadvantage..can't generalize Ancova is a post hoc matching after data are obtained. Dv scores adjusted so subjects are equalized Disadv..like matching..can't control for what has not been identified or measured.

Answer 10

Generalizability of the results to other settings, times, people... Interaction (some variables have one effect under one set of conditions but a different effect under another set). Intx between selection and tx: Given tx not generalize to other members of population (ie use college kids may not go to rest of population). Intx between hx and tx: Effects of tx don't generalize beyond setting or time pd expt done. Intx between testing and tx: Pretest sensitization..can't generalize to sit where pretests not used. Pretest may sensitize to purpose or increase susceptibility to respond to the tx. Demand characteristics Cues in research setting that allow to guess research hypothesis. Due to cues, subjects act different than real world (try disprove..) Hawthorne effect: Respond different just due to mere fact being studied. Study..workers increased output following any change in the environment. Order effects or carry over or multiple treatment interference Problem in repeated measures design or same subjects exposed to more than one tx. Last tx may have greater effect bc it followed previous interventions.

Answer 11

1. Random selection or random sampling. Often use experimentally accessible population and assumption is made that subjects similar in relevant ways to rest of target population. Stratified random sampling.. sample from several subgroups of total population. To ensure proportionate rep of defined pop Cluster sampling. Natural occurring group of individuals vs individual. 2. Naturalistic research Controls for hawthorn and demand characteristics but will lack internal validity (always a trade off). 3. Single and double blind Reduce demand and hawthorn effects 4. Counterbalancing Controls for order effects. Diff subjects or grps receive tx in diff order. Type is Latin square design...order administration of tx so ea appears once and only once in every position.

Answer 12

Exptal..random assignment; manipulate variables Quasi ..NO random assignment Pre existing grps. Naturally occurring. Manipulable variable studied (decide which grp gets which tx) so experimenter control. Use w preexisting intact grps like classroom, ward.. Correlational..grps measured only Not manipulated. No internal validity. Only associations. Used for prediction. Variables like age, gender, ses, eye color...

Answer 13

Longitudinal..study one grp of subjects over a long pd of time. Disadv Time, money, dropout rate. Cross sectional..study two or more grps of ppl at one time Disadv...cohort effects..experience effect vs their age Cross sequential..combo of the above..look at ppl of diff ages at 2 plus times Controls for cohort. Cheaper, less expensive, decreases drop out.

Answer 14

Multiple measurements over time to assess IV Usually multiple pre and post test measures Interrupted time series design is series of measurements on dv that is interrupted by administration of tx One grp interrupted time series design. Threat is an event that occurs at same time as tx. Ie. price of cigs went up when administered. Control history with two grp time series design w a control grp that also look at over time. Adv? Rule out threats to internal validity like maturation, regression, testing.

Answer 15

Number of subjects is one or if two or more subjects treated as one group. Dependent variable measured several times during both phases Lots of variability poses threat to design. Good for behavior modification study. Usually baseline (no tx) and then tx phase. Types: AB, reversal, multiple baseline

Answer 16

AB single baseline, single tx phase Disadv..history, other confound Reversal or withdrawal design Single subject design Treatment is withdrawn and data collected to det if behavior goes back to original level upon withdrawal. If returns to original level during wdrawal then more sure due to tx vs extraneous factors ABA ABAB. Tx reapplied at end. Adv over ABA is additional confirmation tx causes changes. Also don't leave them wo tx. Multiple baseline used when reversal can't be. So can't reverse or withdraw tx due to ethics. May not be possible to demo tx effect in a reversal. So instead of a withdrawal, this applies the tx sequentially or across different baselines. In other words...single subject design in which IV is sequentially administered across 2 or more subjects, behaviors, or settings (baselines)

Answer 17

Descriptive research Theory is developed from the data vs being derived beforehand. Often pilot to help define ho ``` Types: Participant observation Nonparticipating observation Interviews Surveys..personal, phone, mail Biased selection or sampling is problem Case studies ...case is example of more general class. Can't draw conclusions between variables and may not be generalize able. Protocol analysis ...verbatim reports No traditional quantitative techniques but based on interpretation. ```

Answer 18

Stratified random sampling is a population divided into sub populations and all members of ea sub population have an equal probability of being chosen. Clustering involved grouping subjects who are similar in terms of status on an extraneous variable and then assign to each grp. Naturally occuring grps (vs individual)

Answer 19

B | Not c because other designs use manipulated variables

Answer 20

D. No withdraw of tx

Answer 21

C. Administer multiple pretests and post tests to one grp of subjects before and after tx. Design controls for many threats such as maturation, testing, stat regression . An external threat is the threat.

Answer 22

Adv. A Disadv. C

Answer 23

Nominal...unordered categories Dx, color, sex. Can be labeled w numbers but not ordered. Ordinal....ordered categories Ranks. Attitude scales. Don't know how much more or less Tell how ordered but not amount between the categories Interval....numbers arranged in order and intervals in between are equal No absolute zero pt Can't multiply or divide; can add and subtract (say 50 points higher but can't say twice as smart) Iq, standardized test scores, temps Ratio....numbers arranged in order and intervals between are equal. Absolute zero pt Can add, subtract, multiply, divide Dollar amounts, time, distance, height, wt...

Answer 24

Tale tells the tale...location of the tail determines labeling ``` Negatively skewed Easy test Most scores at high end Tail at lo end (few lo scores) so neg Mode greater than median greater than mean. ``` Positively skewed Hard test Most scores low end (left side) Tail at hi end bc very few hi scores... .so positive Mean is greater than median greater than mode Mean pulled to the tail.

Answer 25

Mean is average. Add all and divide by N. Sensitive to extreme values and misleading when highly skewed. Median..middle value When ordered lowest to highest. Less sensitive to extreme scores Mode...most frequent value Can be bimodal or multimodal.

Answer 26

``` Range Difference between the highest and lowest score Affected by extreme scores Tells nothing of distribution Only a general descriptor. ``` Variance Average of the squared differences from the mean of each score Measure of variability of scores Basically how the scores disperse around the mean. Sample variance is sum (x-M) squared divided by N-1. Probably don't need to know formula. Just that it is variance is the square of the std deviation. Standard deviation Square root of variance Thought of as expected deviation from the mean if a score chosen at random. Higher the std deviation the more that scores in a distribution are likely to deviate from the mean.

Answer 27

A score is a measure of how many std deviations a given raw score is from the mean. When all scores in the distribution are converted to a scores there will be a mean of zero and std deviation of 1. All scores below the mean will have negative scores, all scores above the mean are positive, all scores at mean are 0. Allows comparisons When raw scores are transformed to a scores the shape of the distribution does not change. This is called a linear transformation. Z score is x minus M divided by standard deviation.

Answer 28

T score is a standard score w mean of 50 and std deviation of 10. Stanine divide distribution into 9 intervals. One is lowest ninth... Mean of 5, std dev of about 2. Percentile ranks Percent scoring below attained raw score. Flat or rectangular distribution. So within a given range always same number of scores Changes shape of distribution. Nonlinear transformation. 70 percentile is 70 percent scored below you (Percentage is items answered correctly on a test; percentile is referenced to other scores in the distribution).

Answer 29

True In middle if your score is increased you jump over a lot more people. At hi end of distribution there are only a few scores and will jump over many fewer people.

Answer 30

B Percentile is nonlinear

Answer 31

B. around 16 percent which is z score of 1.

Answer 32

B. more scores in middle of distribution and she will jump over more people.

Answer 33

Inferential stats allow us to make inferences about what is happening in an entire population on the basis of what we observe in the sample. Sample stats only provide estimates of corresponding populations (sample value is called statistic and population value is called parameter). A. Sampling error is the inaccuracy of a sample value (stat) and the population value (parameter). When use a stat some error is inevitable. One type of sampling error is standard error of the mean.

Answer 34

Type of sampling error Difference between a sample mean and a population mean or extent to which a sample mean can be expected to deviate from its corresponding population mean. Also called error of the mean Must know formula!!! SE mean equals std dev divided by square root of N (sample size). N is 25 and sd is 10. Error is 2. So sample obtained can be expected to deviate 2 points either way higher or lower from the population mean. Error is smaller as sample size larger bc get closer to size of the population. INVERSE relationship...as sample size increase, std error of mean decreases!

Answer 35

Null hypothesis..no difference in tx conditions. So in population the IV has no effect on dv. Alternative hypothesis...states the opposite of the null. Usually predicts a relationship between variables One tailed test..stat test used when the alternative ho is directional (one mean is greater than another). Greater than or less than another mean. Two tailed...means are different but we don't know the direction.

Answer 36

1. Type II 2. Power. Probability of not making a type II error. 3. Two tailed test 4. Type I error

Answer 37

Retention region is the white area of the graph. If the value falls there the null ho is retained. Meaning it is kept. If the value falls in the rejection region (defined by the alpha level), the null hypothesis is rejected. This is because the stat test has indicated that the null has only 5 % or less chance of being true or 95% chance that it is not true so we have rejected it. When reject it say reached at the significance level. Say reject the null at the .05 level The significance level is the probability at which we reject the null as being true.

Answer 38

A. True. Higher the alpha easier to reject null B. wrong C. False. One tailed tests more powerful. D. True E. false. Greater diff between pop means more power. Can make the difference between the IV bigger. Drink glass of wine vs bottle of whiskey..

Answer 39

Probability of type I goes up and type II goes down. When set alpha consider real world circumstances. If I is more serious than II error then set alpha low. Do if research counterintuitive and contradict previous research.

Answer 40

``` Parametric. Test interval or ratio data. 3 assumptions: Normal distribution of dv Homogeneity of variance Independence of observations Ie. t test, ANOVA Most tests robust re the first 2 Last is most important ``` ``` Nonparametric Nominal or ordinal data Not based on the assumptions Distribution free tests Generally less powerful that parametric tests Chi squared, Mann Whitney u ``` Both assume samples are representative of population under study. Random selection of subjects

Answer 41

C. Lacks power this means the probability of type II error is hi Or that a false null will be kept Test won't detect true effect Won't yield stat significance

Answer 42

A. Alpha is the probability of making a type I error

Answer 43

A. Power is low unlikely to detect an effect of IV when one is present Likely keep null When keep null doesn't mean u did correctly. Just means test lacked power to correctly reject

Answer 44

Compare stay value to critical value table. Critical value depends on alpha and degrees of freedom. If obtained value exceeds critical value, null rejected. Obtained value is lower than critical, keep null or retain null.

Answer 45

False parametric test Correct. T ratio 3 types: One sample t test...compares one sample mean w known population mean. Is. Sample 35 women lawyers and compare to national lawyer ave. df=N-1. T test for independent samples Compare two means from unrelated samples. Ie random assign subjects into drug and placebo grp and then compare means Df = N-2 T test for correlated samples Related in some way (matched, pre/post). Compare pre and post means. Df = N-1. N equals number of pairs of scores. D. No!! Only use t test w 2 means!

Answer 46

A. True. One way ANOVA one way ANOVA. Usually use t test bc easier. Yields F stat. Tells you there is a difference but not which direction. B. ANOVA C. Factorial ANOVA Main and intx effects D. ANOVA just tells difference in means. Use post hoc tests to identify exactly where the significant difference is

Answer 47

``` 1. K-1 (k is number if groups) W in N -k 2. T test One sample, correlated sample N-1 Independent sample N-2 ```

Answer 48

Index of variability used to derived f ratio. Equal to sum of squares divided by degrees of freedom. F = MSB/MSW MSB= sum of squares between/ k-1 MSW=sum of sq w in/N-k Then compare to critical value. Higher than stat significance. Doesn't tell which means differ significantly just that they do. Post hoc done if significant

Answer 49

Pairwise comparison Between 2 means Complex comparisons Between combined means Protection? Scheffe is most conservative. Best protection against type I error when multiple comparisons are made. However this may increase type II error (and miss it if there is an effect) If doing pair wise comparisons use what? Tukey.

Answer 50

One way...1 IV and more than 2 independent grps Factorial ANOVA 2 plus IV For repeated measures all levels of all IV applied to single grp (or matched grps) Mixed or split plot ANOVA... 2 plus IV; mixed...at least one between subject IV and at least one repeated measures or w in subj variable (Not variance!) Manova. 2 or more Dv;1 or more IV Adv (vs many one way ANOVAs) is decreasing Expter wise error rate or type I error ANOVA repeated measures All subjects get all levels of IV Ancova Stat control over one or more dv to control for effects of extraneous variables. Two way ANOVA is 2 IV factorial

Answer 51

B Main effect...effect of one IV by itself. Find these differences, if any, in the marginal means column (pg 91). Intx effect. Effects of IV at different levels of other Ivs. Look inside boxes or cell means to find . Numbers move in same direction for both then no intx when reading across or down. Go in opposite directions. Also can draw a graph. If intx then lines cross. When interaction effect must interpret main effects w caution. Can't interpret them wout looking at intx

Answer 52

A. Used when given frequency or number of subjects w in ea category (not mean scores). Compares observed frequencies of observations within nominal categories to frequencies expected under null. B. compare two independent grps on dv measured w rank ordered data. Alternative to t test for independent samples. C. Compare two correlated grps on dv w rank ordered data. Alt to t test for correlated samples D. Compare 2 plus independent grps on dv w rank ordered data. Alt to one way ANOVA.

Answer 53

Nominal data. Frequencies of observations w in a category. Test ho that observed frequencies equal those expected of null true. Single sample...from one grp of ppl Df = C - 1. (Categories) Multiple sample...adding another variable in addition to one that gives rise to classification categories Df = (C-1) (R-1). R is rows Caution: 1. All observations must be independent of ea other 2. Ea observation only in one category or cell 3. Percentages of observations w in categories can not be compared.

Answer 54

Single sample Total number of subjects/ number of cells. Coke study had 100 subj and 2 cells 100/2= 50 Multiple sample Fe = (column total)(row total)/N See page 97. Most do for ea cell. Need look example.

Answer 55

C. See ? Pg 100

Answer 56

Scheffe is most conservative Greatest protection vs type I but then increases chance of type II. Tukey. Use for pair wise comparisons. Gives enough protection against type I

Answer 57

All true but g Could mean causally correlated If two variables are causally related, there will be a correlation between them. Correlation is a necessary but sufficient condition of causality. So correlation does not guarantee causality but if there is a causal relationship they also must be correlated.

Answer 58

All except d, e Homoscedasticity ...dispersion of scores is equal thruout scattergram and is an assumption of Pearson r. (Heteroscedacity is more dispersion at some parts of scattergram or not uniform). This lowers the coefficient. Wider range of scores the more accurate the correlation. Increase a correlation by increasing the range of scores.

Answer 59

All correct. Regression line is pic of overall relationship between 2 variables. Higher correlation, closer dots and more accurate at predicting y. Error is diff between predicted and actual criterion scores. Error scores assumed normally distributed w a mean of zero. Assume homoscrdastic. Regression line is least amount of error in predicting y scores from x. Regression can be used as sub for ANOVA.

Answer 60

A. True B. multiple regression C. True D. True. If predictors overlap/hi correlations w ea other combining them yields no significantly new info Significant predictor overlap is multicollinearity. One of predictors past 3 or 4 bound to have hi correlation w one of others. E true F true G true coefficient of multiple determination. Gives proportion of variance in criterion variable accounted for by combo of predictor variables.

Answer 61

Large number of potential predictors but use smaller set. Goal to get smallest set that maximizes predictive power. Those w hi multiple correlation w criterion. Adv Time, money Multicollinearity..adding gives no more power Forward...start w one predictor and add predictors one at a time. W ea you do analysis to determine predictive power of multiple regression is substantially increased. First kept has highest correlation w criterion. Add til no more predictive increase Backward. Start w all potential predictors and remove one at a time. When starts to significantly decrease R stop removing.

Answer 62

``` A. Discriminate function analysis B. trend analysis C. Partial correlation D. Canonical correlation E. multiple cutoff F. Structural equation modeling G. Logistical regression ``` (Vs multiple correlation coefficient which is relationship between two or more predictors and one criterion)

Answer 63

A. so does discriminate analysis however...logistical also used nominal or continuous B. correct. Logistical doesn't have these assumptions C.nope none D. True E. true

Answer 64

General term for variety of techniques that are based on correlations between multiple variables. Assumption...linear relationship Application...testing causal models Steps: Specify model w different variables usually w path diagram Stat analysis between all pairs Interpret results..degree data consistent w model Types: Path analysis. Simpler models w one way causal flows. Observed variables only LiSReL. One or two way Latent and observed variables

Answer 65

Cannonical correlation coefficient is used with multiple criterion and multiple predictors. Multiple correlation is 2 or more predictors and one criterion.

Answer 66

Four time. Square correlation coefficient So .6 is .36 and .3 is .09.

Answer 67

C. Says...range of y at every individual x will be equal to the entire range of y. In other words, any one score on x doesn't provide any info about y. Means correlation is 0. B is heteroscedascity. D is homoscedasticity

Answer 68

A. Population and can be represented in frequency distribution. Every single score in population. Population can be anything..height or cornstalk etc B. set of scores obtained from a sample is sample distribution. Most cases it will have less score variability than pop distribution. Make a frequency distribution. Doesn't include the full range of scores. C. Sampling distribution. Can be done with the mean, t, F. D. Sampling distribution of the means. Ea sample drawn will have a mean close to but not exactly at population mean. Plotting a large series of the means of these samples will yield a distribution of sample means that will approach a normal shape, be very tightly clustered around population mean, and have a mean that is equal to population mean. Use sampling with replacement and all sample sizes are the same.

Answer 69

All true! Basis of statistical inference! A test is robust or the rate of false rejections of the null/type I error rate is not substantially increased when normal distribution and homogeneity of variance are violated. Most parametric tests are robust w. as long as the sample size is adequate, still working w a normal distribution. As N decreases, parametric tests are less robust..and important that the normality of assumption is met. Re: homogeneity of variance...ok if equal number of subjects. Unequal then inflated type I error.

Answer 70

C. Means across measurements will be related and there will be a different magnitude of relationship among the means, depending on lag or how close together in time they were obtained. Correlation a between observations at given lags is autocorrelation.

Answer 71

A, b correct! Formula for conditional probabilities. Revise probabilities based on additional info. C, d is meta analysis. analysis uses the results of each study as if they were separate scores, sums, averages. Ea study used is one subject in meta analysis Stat yielded is effect size..gives magnitude if independent variables effect. Computed for ea dv. Then get average effect size. Adv..allows consider size of effect Disadv..subject to bias by analyzer, such as which studies to include See of 137 for ?, of 140 ?

Answer 72

B. how person does in reference to external criterion. Percentage tells is how did on a test or how much was mastered. Others are norm referenced scores on how did compared to others. Don't say anything about how much of the criterion is mastered.

Answer 73

B | Also c but that applies to others as well...like quasi experiments

Answer 74

D. Both inferential stats do use to draw conclusions about a population on basis of info from the sample. Must be representative from population it is drawn. Best way is to randomly select.