Final Flashcards

Question

___ is best for numeric symmetrically distributed data

Answer 1

*Mean* is best for numeric symmetrically distributed data

Answer 2

*Median* is best for numeric non-symmetrically distributed data

Answer 3

Gender is a *nominal* level of measurement

Answer 4

Time is a *ratio* level of measurement

Answer 5

Age is a *ratio* level of measurement

Answer 6

A range of values that we are confident contains the population parameter

Answer 7

A single value that represents the best estimate of the population value

Answer 8

In a confidence interval, the width concerns the *precision* of the estimate

Answer 9

The point estimate is always in the *middle* of the confidence interval

Answer 10

If we repeated sampling an infinite number of times, 95% of the intervals would overlap the true mean

Answer 11

Not every value in a CI, is equally as *probable*

Answer 12

A more narrow confidence interval means that it is *more* precise

Answer 13

1. Larger sample size 2. Less variance 3. Lower selected level of confidence (90% vs. 95%)

Answer 14

The null hypothesis is *a sampling error*. And it states that *the population means(not sample means) are equal so the difference seen is not real*

Answer 15

The alternative hypothesis states that the difference seen, represents *a real difference*.

Answer 16

When the null hypothesis is true, and we choose to reject it. Symbol: "Alpha"

Answer 17

When the null hypothesis is false, and we do not reject it. (accept it) Symbol: Beta

Answer 18

*Alpha* is the maximum probability of type 1 error that a researcher is willing to accept

Answer 19

Set before running statistics

Answer 20

0.05. (5%)

Answer 21

The probability of type 1 error if the null hypothesis is true

Answer 22

False You can NOT have a probability of type 1 error what the null hypothesis is false

Answer 23

After research

Answer 24

Probability of observing a value more extreme than actual value observed, if the null hypothesis is true

Answer 25

If the p-value is less than or equal to alpha, we *REJECT* the null hypothesis

Answer 26

If the p-value is greater than or equal to alpha, we *ACCEPT* the null hypothesis

Answer 27

If we “fail to reject” (accept) Ho, we attribute any | observed difference to *sampling error* only

Answer 28

• We don’t interpret non-significant differences as *“real”* (maybe not even as “trends”)

Answer 29

We understand that a non-significant difference is | attributable only to *chance.*

Answer 30

Look at the 95% CI of the mean difference, and evaluate whether or not it includes zero

Answer 31

If the confidence interval includes 0, it is *nonsignificant* in hypothesis testing

Answer 32

If the confidence interval excludes 0, it is *significant* in hypothesis testing

Answer 33

CIs give an estimate of effect size

Answer 34

P-values and CIs tells us about *statistical significance not clinical significance*

Answer 35

The probability of finding a statistically significant difference if such a difference exists in the real world

Answer 36

- Alpha - Effect size - Variance - Sample size

Answer 37

Increasing alpha will *increase* power

Answer 38

An effect size is known as the *mean difference*

Answer 39

The mean difference divided by the variance

Answer 40

*Variance* is the spread of scores

Answer 41

Increasing the effect size will *increase* the power

Answer 42

Increasing the sample size will *increase* the power

Answer 43

*Sample size* is the best way to increase statistical power

Answer 44

Increasing variance will *decrease* power

Answer 45

- Decreased alpha - Decreased effect size - Increased variance - Decreased sample size

Answer 46

- Power a priori | - Power post-hoc

Answer 47

A power analysis done before we collect data, to determine if the design is powerful enough

Answer 48

Power analysis done after the research is complete by the consumers to find if there was enough power/ if they failed to reject the null hypothesis

Answer 49

If a difference is found post-hoc/the null hypothesis was accepted/fail to reject, then the power issue is *moot/not a problem*

Answer 50

If a difference not is found post-hoc/the null hypothesis was accepted/fail to reject, then the power issue is *huge* and you have to do a *post-hoc analysis*

Answer 51

A priori is used to figure out how many subjects to use *before a study is started*

Answer 52

1. Compute with traditional cohen approach | 2. Determine with confidence interval analysis of effect size

Answer 53

``` • Continuous scale result: 0.0 – 1.0 ( > 0.8 is default) • Based on: • Sample size • Alpha • Variance (observed) • Effect size (use MCID, not observed) ```

Answer 54

*Determine with confidence interval analysis of effect size* is the better way to determine the post hoc analysis, while with *compute with traditional cohen approach*, the answer will probably be the same as a priori

Answer 55

If the MCID is excluded from the CI, then it is definitively negative and *adequately* powered

Answer 56

If the MCID is included from the CI, then it is not definitive and *inadequately* powered/ underpowered

Answer 57

A two tailed testis testing to see *if your calculated value is either above or below where it is expected to be*

Answer 58

A one tailed test is testing to see if *your calculated value is above where it's expected to be or below where it is expected to be*

Answer 59

*Null hypothesis(H0)* is the assumption you're beginning with and is opposite of what you're testing

Answer 60

*Alternating hypothesis* is the claim you're testing

Answer 61

Statistical method to decide whether an observed difference in sample scores represents a “real” difference in the population…. vs. just sampling error

Answer 62

2 levels of 1 IV

Answer 63

Finds the difference between group means divided by the variability within the groups( standard error of the mean difference)

Answer 64

All sources of variability within a set of data | that cannot be explained by the independent variable.

Answer 65

A within group variability with no variability is known as being *definitely different* ?

Answer 66

A within group variability with little bit of variability is known as *probably different*

Answer 67

A within group variability with larger amounts of variability is known as *maybe not different*

Answer 68

When the variability between groups are not necessarily the same, it is called *a differing variance*

Answer 69

A branch of statistics which assumes that sample data comes from a population that follows a probability distribution based on a fixed set of parameters.

Answer 70

* Samples are randomly drawn from populations * Population is normally distributed * Homogeneity of variance (roughly) * Data from ratio or interval (i.e. continuous) scales

Answer 71

Generalization

Answer 72

- Statistically - Graphically - Common sense

Answer 73

With unequal group sizes

Answer 74

Statistically

Answer 75

Levene's test

Answer 76

- The two population means are equal - The hypothesis can be in a nondirectional format (not equal) - Directional format (one is greater than the other)

Answer 77

A two-tailed test uses a *nondirectional* hypothesis

Answer 78

A one-tailed test uses a *directional* hypothesis

Answer 79

A two tailed test has *less* statistical power compared to the one tailed test

Answer 80

- Independent/unpaired t-test | - Paired t-test

Answer 81

Testing to see if there is a difference between 2 groups

Answer 82

- Pretest-posttest design (compare change scores) | - Posttest only design

Answer 83

Testing to see if there is a difference between conditions in the same person

Answer 84

- Difference scores or pretest-posttest | - Repeated measures design

Answer 85

A repeated measures factor is an example of a *within-subjects factor*

Answer 86

A non-repeated measures factor is an example of a *between-subjects* factor

Answer 87

Statistical method to decide whether an observed difference in sample scores represents a “real” difference in the population…. vs. just sampling error, but with 3 or more groups/levels of 1 IV and or 2 or more IVs

Answer 88

Are observed differences in whole set of means greater than would be expected by chance alone?

Answer 89

An f- statistic

Answer 90

The between group variability divided by the within group variability

Answer 91

All of the population means are even

Answer 92

At least one pair of samples is significantly different, but we don't know which one

Answer 93

* Samples are randomly drawn from populations * Population is normally distributed * Homogeneity of variance (roughly) * Data from ratio or interval (i.e. continuous) scales

Answer 94

Generalization

Answer 95

- Statistically - Graphically - Common sense

Answer 96

When there is an unequal group size

Answer 97

Statistically

Answer 98

- Whether they are one way (1 IV) or multiple ways | - Whether the IV are between subjects(independent groups) or within subjects (repeated measure) or a mixed model

Answer 99

Where there is 1 IV that is between subject and 1 IV that is within subjects

Answer 100

- One way ANOVA: independent samples - Two way ANOVA: independent samples - One way ANOVA: Repeated measures samples - Two way ANOVA: Repeated measures samples

Answer 101

1 IV with 3 or more levels

Answer 102

Whether or not there is a difference overall, but not where the difference is

Answer 103

2 or more IV

Answer 104

- Main effect of IV A - Main effect of IV B - Main effect of IV A & B (interaction effect)

Answer 105

Saying that the scores across one of the IV depends on the levels of the other IV

Answer 106

It is really helpful to look at *graphs* when talking about interaction effects

Answer 107

There is no interaction

Answer 108

There is an interaction

Answer 109

When the lines cross and significant main effects cannot be interpreted

Answer 110

When the lines don't cross and significant main effects can be interpreted

Answer 111

The one way ANOVA: Repeated measures samples is more powerful that the independent ANOVA because *it has less error variance*

Answer 112

Sphericity

Answer 113

The homogeneity of variance of differences

Answer 114

Test with Mauchly’s Test of Sphericity

Answer 115

No difference in variance

Answer 116

Use correction/adjusted p-value

Answer 117

To determine where the difference is

Answer 118

The multiple comparison test is also called the *pairwise comparisons*

Answer 119

1. Post-hoc | 2. Planned comparison

Answer 120

Performed after ANOVA

Answer 121

*Post-hoc* multiple comparison strategy is the most common

Answer 122

The post hoc test *every difference* and therefore are exploratory

Answer 123

Performed instead of ANOVA (a priori)

Answer 124

Focused only on specific comparisons

Answer 125

Add up all the alpha values

Answer 126

A Bonferroni Correction can be done

Answer 127

Divide alpha by the number of statistical tests to be performed and use that for each post hoc test

Answer 128

Because it has less power and a higher chance of a type 1 error, must balance risk of Type 1 and Type 2 error

Answer 129

- Fisher's least significant difference - Duncan multiple range test - Newman-Keuls method - Tukey's honestly significance difference - Bonferroni t-test - Scheffe's comparison

Answer 130

- Fisher's least significant difference - Tukey's honestly significance difference - Bonferroni t-test

Answer 131

Essentially and unadjusted t-test (LSD)

Answer 132

“Middle of the road” in terms of risk and most commonly used

Answer 133

Simply divides α by # of | comparisons

Answer 134

When an independent groups type test is being performed

Answer 135

- LSD - SIdak - Bonferoni correction

Answer 136

LSD is an *unadjusted paired t-test*

Answer 137

Sidak is *adjusted, but good balance of type 1 & type 2 error protection*

Answer 138

The LSD test has a high risk of *high*, type 1 error meaning it is less conservative

Answer 139

The bonferoni correction test has a high risk of *type 2* error and is more conservative

Answer 140

(Analysis of covariance) is a statistical technique that is used when you cannot control a variable through research design and sampling

Answer 141

It statistically adjust the dependent variable based on the covariate

Answer 142

ANCOVA produces *adjusted means*

Answer 143

ANCOVA is a combination of *ANOVA and linear regression*

Answer 144

- Usual parametric assumptions - Linear relationship between CoV and DV (with r>.6) - Homogeneity of slopes

Answer 145

You can also use ANCOVA to adjust for *baseline* scores

Answer 146

When the basic assumptions for a parametric test are not met

Answer 147

* Comparisons of ranks of scores | * Comparisons of counts(yes/no) or “signs” of score

Answer 148

Non- parametric statistics are *less powerful* compared to parametric statistics

Answer 149

Unpaired t-test

Answer 150

Paired t-test

Answer 151

One-way analysis of variance (ANOVA) (F)

Answer 152

One-way repeated measures analysis of variance (MANOVA)

Answer 153

Mann-Whitney U test

Answer 154

- Sign test | - Wilcoxon signed ranks test (T)

Answer 155

- Kruskal-Wallis analysis of variance by ranks (H or x^2)

Answer 156

Friedman two way analysis of variance by ranks

Answer 157

FALSE Unable to perform on more complex designs (e.g. 2x3)

Answer 158

Is the difference in ranks larger than would be expected by chance alone?

Answer 159

Is the difference in sign frequencies larger than would be expected by chance alone?

Answer 160

Chi- Square

Answer 161

Are observed frequencies different than expected frequencies

Answer 162

* Goodness of fit | * Tests of independence (association)

Answer 163

• Compare observed frequencies of 1 variable to uniform frequencies of another

Answer 164

• Eg: flip coin 50 times. Get 15 heads & 35 tails. Is this difference due to chance or a “real” bias?

Answer 165

Tests of independence (association)

Answer 166

Compare observed frequencies from 1 variable to observed frequencies of another variable

Answer 167

Eg: Is owning a mac laptop related to gender?

Answer 168

Requirement of chi-square is that variable levels must be independent (e.g. can’t be “healed” and “unhealed”)

Answer 169

McNemar test* is the form of a chi square test that is used for 2x2 with correlated sample

Answer 170

A correlation coefficient for 2 nominal variables/ degrees of association for 2x2

Answer 171

The phi coefficient is based off the *chi-square test*

Answer 172

Continuous

Answer 173

Continuous

Answer 174

Difference between means?

Answer 175

Difference between means?

Answer 176

Ranks different?

Answer 177

Continuous

Answer 178

Continuous

Answer 179

Continuous

Answer 180

Continuous

Answer 181

Strength of association?

Answer 182

Strength of prediction?

Answer 183

A pair of scores and how much they co-vary

Answer 184

Directly or inversely proportional. When one is high, so is the other and vice versa

Answer 185

* Do they vary together (covary)? * How strong is their linear relationship? * What is the nature of the relationship?

Answer 186

A correlation has to be *linear*

Answer 187

A number that quantifies the strength of a linear relationship that can range from -1 to 1

Answer 188

Closer to |1.00|, higher strength of relationship

Answer 189

The direction

Answer 190

The tighter the grouping of the linear relationship, the *higher* the correlation coefficient

Answer 191

Little or no relationship

Answer 192

Fair relationship

Answer 193

Moderate to good

Answer 194

Good to excellent

Answer 195

The square of the correlation coefficient

Answer 196

The percent of variance in one variable that is explained (or accounted for) by the other variable

Answer 197

To test the null hypothesis

Answer 198

The correlation between variable x and variable y is not significantly different from zero.

Answer 199

Coefficient correlation is very sensitive to * sample size*

Answer 200

Pearson Product-Moment Correlation Coefficient (r)

Answer 201

When both variables continuous (Interval or Ratio scale)

Answer 202

Non-parametric analog of Pearson r

Answer 203

When 1 continuous, 1 ordinal variable or 2 ordinal variables

Answer 204

When one variable is dichotomous, and the other variable continuous (interval or ratio)

Answer 205

dichotomous nominal (e.g Age & Race)

Answer 206

Computationally, a Point Biserial Correlation (rpb) is the same as a *Pearson’s r*

Answer 207

The results of a Point Biserial Correlation (rpb) is the same as *a t-test*

Answer 208

When one variable is dichotomous (nominal), and the other variable is ordinal

Answer 209

A Rank Biserial Correlation (rrb) is computationally about the same as *Spearman Rank*

Answer 210

When both variables dichotomous

Answer 211

A Phi coefficient (Φ) is computationally same as *Pearson’s r* (special case)

Answer 212

A scatterplot is *worthless* with a Phi coefficient (Φ)

Answer 213

A Phi coefficient (Φ) is similar to a *chi square test*, but unlike it, a Phi coefficient (Φ) gives gives strength of relationship, while the *chi-square test* only gives statistical significance

Answer 214

Does NOT assess differences or agreement

Answer 215

Can create inflated correlation with only a few extreme data points

Answer 216

Can’t generalize beyond range of scores in sample

Answer 217

Low correlation may be due to limited range

Answer 218

Extent to which a measurement is consistent and free from error

Answer 219

A reliable measure can be expected to repeat the same score on two different occasions provided that the characteristic of interest does not change

Answer 220

Reliability is closely tied to the concept of *measurement error*

Answer 221

* Pearson correlation (r) | * Intraclass correlation coefficient (ICC) (best)

Answer 222

* Percent agreement | * Kappa (best)

Answer 223

1. Assesses relationship, not agreement | 2. Only two raters or occasions could be compared

Answer 224

Both ICCs and kappa give single indicators of reliability that capture strength of relationship plus agreement in a single value

Answer 225

*Reliability coefficients* is stated in terms of variance

Answer 226

Range 0-1 0 = no reliability, 1 = perfect reliability

Answer 227

The more error variability you have, the *lower* your reliability coefficient will be

Answer 228

Reliability coefficient will be bigger, when *true variance* is larger

Answer 229

True score variability divided by true score variability plus error variability

Answer 230

It will reduce it

Answer 231

It will reduce it

Answer 232

It will be bigger

Answer 233

Measures degree of relationship (association) and | agreement simultaneously

Answer 234

ICCs give *standardized* estimate of reliability

Answer 235

ICC is often reported in conjunction with * Standard error of the measurement (SEM)*

Answer 236

ICC is designed for *interval/ ratio* data but can be used with *ordinal* data

Answer 237

If intervals “assumed” to be equivalent

Answer 238

SEM gives “unstandardized” estimate of reliability (i.e. in units of measurement)

Answer 239

* Purpose of study * Design of study * Type of measurements taken

Answer 240

ICC type defined by *two numbers in parentheses*

Answer 241

The first number is the model and the second number is the form. (2, 6) 2 = model, 6 = form

Answer 242

* Each subject measured by a different set of raters; raters “randomly” chosen * Rarely used in clinical research

Answer 243

Each subject measured by same raters; raters “randomly” chosen & representative of rater population; results generalizable

Answer 244

Most common for inter-rater reliability or test-retest reliability

Answer 245

Each subject measured by same rater(s); raters are only ones of interest; results not generalizable

Answer 246

Most common for intra-rater reliability

Answer 247

- Model 1 (most conservative, lowest number) - Model 2 (neutral) - Model 3 (least conservative, highest number)

Answer 248

Can be for inter-rater reliability if study raters only ones of interest

Answer 249

Second number in parentheses represents number of observations used to obtain reliability estimate

Answer 250

If only one observation per subject per rater (or rating)

Answer 251

If multiple observations averaged to get single number for analysis, form = number of observations averaged

Answer 252

ICC > 0.90

Answer 253

ICC > 0.75

Answer 254

ICC < 0.75

Answer 255

The interpretation of an ICC depends on *intended use*

Answer 256

ICC estimate based on *average measures* will always be substantially higher than estimate based on *single measure*

Answer 257

* Based on frequency table * Agreements on on diagonal * Disagreements are all others

Answer 258

How often the raters agree

Answer 259

Divide number of agreements by total of all possible agreements

Answer 260

* Does not account for agreement due to chance | * Tends to overestimate reliability

Answer 261

Proportion of agreement | between raters after chance agreement has been removed

Answer 262

Can be used on both nominal and ordinal data

Answer 263

Can choose to make “penalty” worse for larger disagreements

Answer 264

Weights can be arbitrary, and | symmetric or asymmetric

Answer 265

Best for ordinal data

Answer 266

The kappa interpretation depends on *the weights used*

Answer 267

Poor to Fair agreement beyond chance

Answer 268

Moderate agreement beyond chance

Answer 269

Substantial agreement beyond chance

Answer 270

Excellent agreement beyond chance

Answer 271

Often used to construct and evaluate scale / questionnaires

Answer 272

Estimate how well the items that reflect the same construct yield similar results. So, do different questions measure same concept or indicator?

Answer 273

Represents correlation among items and correlation of each individual item with the total score

Answer 274

Recommended that cronbach’s alpha be between 0.70 to 0.90

Answer 275

Cronbach's alpha can have *dichotomous or multiple-choice responses* on test/questionnaire

Answer 276

Can help eliminate items from test/questionnaire that are not homogenous to the set or are not contributing unique information

Answer 277

A way to quantify stability of repeated measures over time

Answer 278

Response stability is basically the same as *test-retest reliability*

Answer 279

* SEM: standard error of the measurement * MDC: minimal detectable difference/change * CV: coefficient of variation

Answer 280

Standard error of measurement is a *absolute* measure of reliability, while ICC and kappa is a *relative* measure of reliability

Answer 281

SEM is in units of *measurement as variable*

Answer 282

Standard deviation of the distribution of theoretical multiple measurements

Answer 283

An SEM can be used to create a *95% CI around a measurement*

Answer 284

Amount of change in a variable that must be achieved to reflect a true change/difference

Answer 285

*MDC* is a mathematical multiple of SEM

Answer 286

A standardized way to measure variability. (SD divided by the mean times 100)

Answer 287

Unit-less, so is helpful comparing variability between two distributions on different scales

Answer 288

Comparing different methods of testing same phenomenon with different instruments (goniometer vs inclinometer)

Answer 289

- Limit of agreement | - Bland- altman analysis

Answer 290

When you plot the mean of two measures on the x- axis and the difference between the 2 measures on the y- axis, and the center of the plots is a bias

Answer 291

There is more agreement between the two measures

Answer 292

When the line of bias is at 0

Answer 293

When the points on the plot are on one side of the bias line

Answer 294

When the points are split between the two sides of the bias line

Answer 295

A study aimed at studying determinants of disease, injury or dysfunction in populations

Answer 296

Epidemiology is another way of saying *risk*

Answer 297

• Experiencing an adverse outcome • Patients not improving with treatment • Requiring more invasive or expensive subsequent interventions in spite of treatment

Answer 298

Epidemiology generally uses observational designs with *dichotomous* variables

Answer 299

Case-Control & Cohort Studies

Answer 300

Case-Control & Cohort Studies looks at the *association (“cause”)* between disease & exposure

Answer 301

Dichotomous

Answer 302

In case-control & cohort studies, there is *less* strength in thinking something is causal of the other

Answer 303

Subjects selected based on | exposure or not

Answer 304

Usually prospective, but | can be prospective or retrospective

Answer 305

Doesn’t work well for very | rare conditions

Answer 306

Examine if there is a different | incidence of disease

Answer 307

Subjects selected based on whether or not they have disorder

Answer 308

Controls should be selected | from same population as Cases

Answer 309

Examine if exposure is different between cases and control

Answer 310

Works especially well for very | rare conditions

Answer 311

* Relative Risk (RR) | * Odds Ratios (OR)

Answer 312

Both quantify strength of association between “exposure” and “disease”

Answer 313

* RR in Cohort studies | * OR in Case-control studies

Answer 314

* = “null value” | * No association between an exposure and a disease

Answer 315

* A positive association between an exposure and a disease | * The exposure is considered to be harmful

Answer 316

* A negative association between an exposure and a disease | * The exposure is protective

Answer 317

Incidence of disease among exposed individuals compared to Incidence of disease among unexposed individuals

Answer 318

Since OR is selected based on whether they have disease or not, so can’t determine rate of “incidence”

Answer 319

Odds of exposure among cases (with disease) compared to Odds of exposure among controls (w/o disease)

Answer 320

The computation of OR is kinda like *kappa*

Answer 321

*Regression* uses relationships (correlation) as a basis for prediction

Answer 322

``` X and Y are correlated • X = independent variable (= predictor variable) • Y = dependent (or criterion) variable • We use X to predict Y • The value of Y depends on X • (Thats why Y is called the dependent variable) ```

Answer 323

The distance between each data point and the line of best fit

Answer 324

Residuals are squared to eliminate *sign and penalize for worse errors*

Answer 325

Line with least squared errors

Answer 326

Parametric

Answer 327

1. Linear relationship = approximation of true line in population 2. For every X there is a normal distribution of Y • Sample data include random samplings from these distributions on Y 3. Homogeneity of variance

Answer 328

Analysis of residuals by: Plot Residuals on Y-axis, vs predicted values on x-axis

Answer 329

Homogeneity of variance

Answer 330

Looking for the residual's distance between the predictive value and the actual value be symmetric and consistent throughout

Answer 331

- The graph starts to get wider the further it goes(data is further away from the line, the higher you go) - Data is not symmetric

Answer 332

Use a non linear regression

Answer 333

• Due to peculiar circumstances? • Can discard if error identified • Generally not justified on statistical grounds alone

Answer 334

* Measurement error * Recording error * Equipment malfunction * Miscalculation * Aberrant subject (should have been excluded)

Answer 335

• Correlation coefficient (R) Coefficient of determination (R2) • ANOVA of Regression

Answer 336

* Rough indicator of goodness of fit for regression line | * Same as correlation coefficient (r)

Answer 337

Proportion of variance in Y scores that can be explained by X scores

Answer 338

Tests hypothesis that predictive relationship occurred by chance (Ho: b = 0)

Answer 339

If b (slope) = 0, line is horizontal = no relationship

Answer 340

If p < than alpha, reject the null and conclude the predictive relationship is significant

Answer 341

There is only 1 predictor in a simple model and there are multiple predictors in a multiple linear regression model

Answer 342

1. Linear relationship = approximation of true line in population 2. For every X there is a normal distribution of Y • Sample data include random samplings from these distributions on Y 3. Homogeneity of variance 4. DV = continuous measure

Answer 343

Coefficient of determination is the square of *correlation coefficient*

Answer 344

Chance corrected R2, get punished for having more predictor variables

Answer 345

The more you can predict with fewer variables, the better

Answer 346

* The value/slope in the linear equation | * The rate of change in Y for each unit change of X

Answer 347

Helpful to know relative contribution of each predictor | variable

Answer 348

The R square will always be higher than or equal to the adjusted R square

Answer 349

When the Xs in the model are substantially correlated with each other

Answer 350

Creates problems with interpretations of b weights

Answer 351

* Risk of multicolinearity (correlation between predictors) * Risk of retaining non-contributing predictors * Risk of more predictors than justified by sample size

Answer 352

Criteria set to retain or reject predictors

Answer 353

Predictor with highest partial correlation entered first

Answer 354

Should result in model with greatest parsimony and | least multicolinearity

Answer 355

A model that is the most predictive, with the least amount of variables

Answer 356

The overlap between 2 variables

Answer 357

The unique correlation between 2 variables

Answer 358

A method that starts with no predictors, then adds them, starting with the strongest

Answer 359

A method that starts with all predictors, then removes them, starting with the weakest

Answer 360

A method that starts with no predictors, then add, | but can also remove

Answer 361

* Most predictors are continuous scales * Can also use dichotomous or ordinal scale predictors * But not multicategory nominal (e.g. race)

Answer 362

A large number of predictors in a regression requires *a very large sample size*

Answer 363

At least 10-15 subjects per predictor in model

Answer 364

Become susceptible to “model overfit” (chance associations, i.e. type 1 error).

Answer 365

When you are trying to predict a dichotomous variable

Answer 366

Dichotomous

Answer 367

Continuous, ordinal, or dichotomous

Answer 368

• MANOVA gets around multiplicity problem (familywise alpha: increased Type I error risk) • MANOVA can be more powerful if DVs related

Answer 369

• “Combo DV” is not directly interpretable • If statistically significant, then must follow up with post-hoc ANOVAs

Answer 370

Method of simplifying & organizing large sets of variable into fewer abstract components

Answer 371

Visual modeling of both direct & indirect relationships

Answer 372

Path analysis is an extension of *multiple regression*

Answer 373

Compared to a multiple regression, a path analysis is more *flexible and comprehensive*

Answer 374

Can analyze both direct and indirect relationships between 1 or more exogenous variables (IVs) and 1 or more endogenous variables (DVs)

Answer 375

* Multilevel linear modeling | * Linear mixed modeling

Answer 376

The type of analysis where you have some variables nested within other variables (students nested in a classroom when studying schools)

Answer 377

A hierarchical linear modeling, has far *fewer assumption and highly flexible*

Answer 378

How many patients you have to provide treatment to in order to prevent one bad outcome

Answer 379

Percent of patients in control group with bad outcome

Answer 380

Percent of patients in experimental group with bad outcome

Final Flashcards

(414 cards)