Statistics Flashcards

1
Q

dichotomous

A

refers to nominal data that only contains two categories

2
Q

continuous

A

refers to interval/ratio data

3
Q

nominal data

A
  • categories
  • no order or direction

example: male/female, democrat/republican, ethnicity

4
Q

ordinal data

A
  • categories
  • ordered, ranking, scaled, etc.

example: low income/medium income/high income, agree/somewhat agree/disagree

5
Q

interval data

A
  • meaningful differences between measurements, but no true zero (scores can fall below zero)

example: temperature

6
Q

ratio data

A
  • meaningful differences between measurements, and a true zero does exist (scores cannot fall below zero)

example: height, weight, income

7
Q

mean

A
  • average
  • calculated by adding all the scores together and dividing by the number of scores
  • typically the best measure of central tendency
8
Q

median

A
  • the score at which half the people score below and half the people score above
  • considered the best measure of central tendency when the data is skewed or includes extreme scores
9
Q

mode

A
  • the score that occurs the most
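The three central-tendency cards above can be sketched with Python's standard library; the scores list here is invented purely for illustration.

```python
# Sketch of the three measures of central tendency (sample data are made up).
import statistics

scores = [2, 3, 3, 5, 7, 7, 7, 9]

mean_score = statistics.mean(scores)      # sum of scores / number of scores
median_score = statistics.median(scores)  # middle score (average of the middle two here)
mode_score = statistics.mode(scores)      # the score that occurs the most

print(mean_score, median_score, mode_score)  # 5.375 6.0 7
```

Note how an extreme score (replacing 9 with 90) would pull the mean upward while leaving the median and mode unchanged, which is why the median is preferred for skewed data.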
10
Q

variance

A

the standard deviation squared

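The variance card can be checked numerically: the population variance equals the population standard deviation squared. The data below are invented.

```python
# Sketch: variance is the standard deviation squared (population versions shown;
# the data are hypothetical).
import statistics

scores = [4, 8, 6, 5, 3, 7]

sd = statistics.pstdev(scores)     # population standard deviation
var = statistics.pvariance(scores) # population variance

print(abs(var - sd ** 2) < 1e-9)   # True: variance == SD squared
```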
11
Q

range

A

the difference between the highest and lowest value obtained

12
Q

the x-axis of a graph represents the…

A

categories (nominal or ordinal data)

13
Q

the y-axis of a graph represents the…

A

frequency

14
Q

skewed distribution

A
  • the data is not equally distributed above and below the mean
15
Q

positive skew

A
  • there is a higher proportion of scores in the lower range of values
  • graph is high on the left side and slants downward toward the right side
  • mean is higher, mode is lower
    *think: the mode comes 1st along the x-axis, so it sits at the lowest value
16
Q

negative skew

A
  • there is a higher proportion of scores in the higher range of values
  • graph is high on the right side and slants downwards on the left side
  • moving along the x-axis, the mean comes first, then the median, then the mode - so the mean is lowest and the mode is highest

*in a normal distribution, the mean, median, and mode are all equal

17
Q

raw scores

A
  • an individual’s score on a test
  • typically a percentage score
  • provides little information (don’t know whether it is good, bad, mediocre, etc.)
  • a percentage score is considered a criterion referenced score
18
Q

criterion referenced vs. norm referenced

A

criterion referenced - how well you know the material

norm referenced - how well you know the material compared to others in the group

19
Q

percentile scores

A
  • the percentage of scores in the group that are less than that score
  • the higher your percentile rank, the better you did in comparison to others

example:
98th percentile = you score better than 98% of the group
5th percentile = you only score better than 5% of the group

20
Q

z scores

A
  • mean of 0, standard deviation of 1
  • shape of a z-score distribution is always identical to the shape of the raw score distribution

example:
- a score of +2 = 2 standard deviations above the mean

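A z score is just a raw score re-expressed in standard-deviation units from the mean; the numbers below are invented.

```python
# Sketch: converting a raw score to a z score (hypothetical values).
def z_score(raw, mean, sd):
    return (raw - mean) / sd

print(z_score(130, 100, 15))  # 2.0 -> two standard deviations above the mean
```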
21
Q

t scores

A
  • mean of 50, standard deviation of 10
22
Q

IQ scores

A
  • mean of 100, standard deviation of 15
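The three standard-score cards (z, t, IQ) are linked by simple rescalings: any z score becomes a t score via 50 + 10z, or an IQ-style score via 100 + 15z. A minimal sketch:

```python
# Sketch: rescaling a z score onto the t and IQ scales from the cards above.
def to_t(z):
    return 50 + 10 * z   # t scale: mean 50, SD 10

def to_iq(z):
    return 100 + 15 * z  # IQ scale: mean 100, SD 15

z = 1.0                  # one SD above the mean
print(to_t(z), to_iq(z)) # 60.0 115.0
```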
23
Q

null hypothesis

A
  • there are no differences between the groups
  • the independent variable has NOT had an effect on the dependent variable
  • the goal is to be able to REJECT the null hypothesis (in other words - conclude that there ARE differences between the groups)
24
Q

alternative hypothesis

A
  • there ARE differences between the groups
25
Q

type 1 error

A
  • if the null hypothesis is rejected (i.e. the researcher declares that there ARE differences and the independent variable DID have an effect), but it later turns out that this was a mistake, this is a type 1 error
  • in other words - differences are found, but they do not actually exist
26
Q

alpha

A
  • the size of the rejection region

example:
- if the alpha is 0.05, the rejection region is 5%
- if the alpha is 0.01, the rejection region is 1%

*the size of the alpha is directly related to how likely it is you will make a type 1 error
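The alpha decision rule can be sketched in a couple of lines: reject the null when the p value falls inside the rejection region. The p value of 0.03 is invented for illustration.

```python
# Sketch of the alpha decision rule: a larger alpha means a larger rejection
# region, and therefore more Type 1 errors in the long run.
def reject_null(p_value, alpha=0.05):
    return p_value < alpha

print(reject_null(0.03, alpha=0.05))  # True  (falls inside the 5% rejection region)
print(reject_null(0.03, alpha=0.01))  # False (the 1% region is stricter)
```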

27
Q

type 2 error

A
  • when the null hypothesis is accepted (i.e. the researcher declares that there is NOT a difference between groups and the independent variable DID NOT have an effect), but then it later turns out that this was a mistake, this is a type 2 error
  • in other words - differences are not found, but they actually do exist
28
Q

Statistical power

A
  • the ability to reject the null
  • if the null hypothesis is rejected (i.e. there are differences between the groups, the independent variable does have an effect), the decision is referred to as power (or significance)
  • affected by several factors, including homogeneous populations (less variability facilitates effect detection)
29
Q

power is increased when…

A
  • Think of the “Four Cs” to easily remember factors that increase statistical power:
  1. Effect Size (C1): A larger effect size increases power. If the difference or relationship you’re looking for is substantial, it’s easier to detect.
  2. Sample Size (C2): A larger sample size enhances power. A bigger sample provides more information and reduces the impact of variability.
  3. Significance Level (C3): A higher significance level (e.g., using 0.05 instead of 0.01) increases power, but it also increases the risk of a Type I error.
  4. Consistency (C4): A reduction in variability or noise in the data increases power. This involves controlling extraneous factors that might introduce randomness. Homogenous populations also do this.

*if alpha increases, beta decreases (and power increases)

30
Q

3 commonly asked questions in research

A
  • question of difference
  • question of relationship/prediction
  • question of structure or fit
31
Q

how to select the appropriate test when the research is testing for difference

A

ONE DEPENDENT VARIABLE ONLY
nominal or ordinal data > nonparametric test (Chi-Square)
interval or ratio data > parametric test (t-test, ANOVA)

MORE THAN ONE DEPENDENT VARIABLE
interval or ratio data > MANOVA

**how to ID between NOIR variables (Nominal, Ordinal, Interval, Ratio)

  1. Nominal Variables: These are categories with no specific order. Think of them as labels or names. Examples: Colors, Types of fruits, Gender.
  2. Ordinal Variables: You can rank them/order them, but you can’t say how much one is “more” than another. Examples: Education levels (High school, Bachelor’s, Master’s), Customer satisfaction ratings (1-star, 2-star, 3-star).
  3. Interval Variables: They have a specific order, and the differences between values are meaningful, but there is no true zero point. Examples: Temperature in Celsius (0°C doesn’t mean no temperature), IQ scores.
  4. Ratio Variables: These have a specific order, meaningful differences, and a true zero point. Examples: Age, Height, Weight.
32
Q

examples of levels of independent variables

A

Hint: ask yourself what groups are being compared?

gender = 1 independent variable, 2 levels (male/female)
treatment = 1 independent variable, 3 levels (CBT, psychodynamic, no treatment)

33
Q

independent vs. correlated groups

A

independent = if people are randomly assigned or the group is based on a pre-existing characteristic (ex - gender)

correlated = group members are measured at more than one point, group members are matched prior to their assignment to groups (ex - IQ, income), or there is an inherent relationship (ex - twins, siblings)

34
Q

one-way ANOVA

A
  • statistic of choice when more than two groups are being compared on only one independent variable
  • preferable to a t-test in this situation, because running multiple t-tests increases the likelihood of a type 1 error
35
Q

F ratio

A
  • F ratio = mean square between groups (MS between) divided by mean square within groups (MS within)
  • compares the variances (variability) of two or more groups to determine whether the differences between them are statistically significant - that is, whether the variation between groups reflects real differences or just random chance
  • if the F ratio equals or approximates 1.0, there is no significance
  • if the F ratio is above 2.0, it is considered significant
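The F ratio card can be made concrete by computing MS between / MS within from scratch on three invented groups:

```python
# Sketch of the F ratio from its definition, F = MS_between / MS_within
# (the three groups are made up for illustration).
def f_ratio(groups):
    k = len(groups)                              # number of groups
    n = sum(len(g) for g in groups)              # total number of scores
    grand_mean = sum(sum(g) for g in groups) / n
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - sum(g) / len(g)) ** 2 for g in groups for x in g)
    ms_between = ss_between / (k - 1)            # df between = k - 1
    ms_within = ss_within / (n - k)              # df within = n - k
    return ms_between / ms_within

groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
print(f_ratio(groups))  # well above 1.0: between-group variation dominates
```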
36
Q

two-way ANOVAs

A
  • when groups are being compared on TWO independent variables, you can either run two separate one-way ANOVAs or a single two-way ANOVA
37
Q

correlations

A

statistics that depict relationships between variables

38
Q

regression analyses

A

statistics that are used to predict

  • in a multiple regression analysis, a negative regression coefficient means the predictor has an inverse relationship with the criterion
39
Q

correlation coefficients

A
  • describe the relationship between X (the predictor) and Y (the criterion) in terms of strength and direction (positive or negative)
  • range in value from -1.0 to +1.0
  • on a graph, the closer the data points are clustered, the stronger the correlation (and vice versa)
40
Q

coefficient of determination

A
  • calculated by squaring the correlation coefficient
  • represents the amount of variability in Y that is shared/explained/accounted for by X

example: 25% of variability in income (Y) is explained by education (X), which leaves 75% of the variability in income to be accounted for by other factors
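The example above follows from squaring the correlation coefficient; an r of 0.5 (invented here) between education and income yields exactly those percentages:

```python
# Sketch: the coefficient of determination is the squared correlation
# coefficient (the r value is hypothetical).
r = 0.5                  # correlation between education (X) and income (Y)
r_squared = r ** 2

print(r_squared)         # 0.25 -> 25% of variability in Y explained by X
print(1 - r_squared)     # 0.75 -> 75% left to other factors
```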

41
Q

random selection increases ____________________
random assignment increases ____________________

A

external validity

internal validity

42
Q

Cohen d

A

Effect size that indicates how far apart the means of two groups are in standard deviation (SD) units.

  • d = 0.60 indicates a medium effect (by convention: 0.2 = small, 0.5 = medium, 0.8 = large)
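Cohen's d can be sketched as the difference between two group means divided by the pooled standard deviation; the sample values below are made up.

```python
# Sketch of Cohen's d using the pooled standard deviation (invented data).
import statistics

def cohens_d(a, b):
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * statistics.variance(a)
                  + (nb - 1) * statistics.variance(b)) / (na + nb - 2)
    return (statistics.mean(a) - statistics.mean(b)) / pooled_var ** 0.5

group_a = [10, 12, 14, 16]
group_b = [8, 10, 12, 14]
print(round(cohens_d(group_a, group_b), 2))  # ~0.77: a medium-to-large effect
```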
43
Q

Multicollinearity

A

Occurs when scores on one or more explanatory variables are highly correlated with scores on one or more other explanatory variables

44
Q

Point biserial

A

Appropriate correlation coefficient when one variable is a true dichotomy (yes/no) and the other is measured on a continuous scale (interval or ratio)

45
Q

Changing alpha from .01 to .05 has what effects on Type 1 & Type 2 error

A

TYPE 1: increases (when null is true, but is rejected)

TYPE 2: decreases (when null is false but is NOT rejected/is maintained)

46
Q

Orthogonal vs oblique factors (factor analysis)

A

Orthogonal = uncorrelated (think: don't care about specialty)
Oblique = correlated (but I do care about obliques)

47
Q

Internal validity of research is threatened by statistical regression when

A

Participants are chosen for inclusion because of their extreme scores on a pretest (low/high)

48
Q

The incremental validity of a new selection test is calculated by subtracting what?

A

Base rate from the positive hit rate

The base rate is the proportion of people hired without the new predictor who obtained high scores on the measure of job performance (criterion). The positive hit rate is the proportion who were hired using the new selection test and who obtained high scores on job performance.

49
Q

Which scales of measurement allow you to conclude that the difference between scores 50 and 51 is equal to the difference between scores 90 and 91 on a test?

A

Interval and ratio (both have equal intervals between adjacent points in a scale)

50
Q

Post hoc test is helpful when?

A

When there are 3 or more levels to the independent variable and we have a statistically significant finding and want to figure out which groups differ.

51
Q

Graph types for nominal, interval and ratio data

A

Nominal: bar graph (gender, eye colour)
Interval/ratio: histograms, line graphs (frequency polygons)

52
Q

Cohen’s kappa coefficient

A

Assesses Inter-rater reliability.

Assesses consistency of ratings assigned by two raters when ratings represent a nominal scale (discrete; yes/no)
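Kappa corrects the raters' observed agreement for the agreement expected by chance alone; a minimal sketch with invented yes/no ratings:

```python
# Sketch of Cohen's kappa for two raters on a nominal (yes/no) scale
# (the ratings are hypothetical).
def cohens_kappa(r1, r2):
    n = len(r1)
    observed = sum(a == b for a, b in zip(r1, r2)) / n
    # Chance agreement, from each rater's marginal proportions per category.
    cats = set(r1) | set(r2)
    expected = sum((r1.count(c) / n) * (r2.count(c) / n) for c in cats)
    return (observed - expected) / (1 - expected)

rater1 = ["yes", "yes", "no", "yes", "no", "no"]
rater2 = ["yes", "no", "no", "yes", "no", "yes"]
print(round(cohens_kappa(rater1, rater2), 2))  # modest agreement beyond chance
```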

53
Q

Probability sampling (simple random, stratified and systematic)

A

Simple: drawing names from a hat, all have equal chance
Stratified: sorting candles by colour and then picking from each group (strata)
Systematic: like choosing every 5th person from a list (there is a system to choosing)

54
Q

Non probability sampling (purposive, convenience and snowball)

A

Purposive (judgemental) sampling: like choosing friends to help bc you know their strengths
Convenience: ask people nearby bc it’s easy not random
Snowball: friends bring more friends into a study growing like a snowball

55
Q

Reliable change index (RCI)

A

Determines whether a change in a client's scores on an outcome measure administered before and after treatment is reliable (real) rather than attributable to measurement error

56
Q

What to alter to maximize the magnitude of a test's reliability coefficient?

A

1) make them longer
2) have an unrestricted range of scores and a group that is heterogeneous with regard to the attribute being measured by the test

57
Q

Wilcoxon signed rank test

A

Nonparametric test used to compare two related (paired) data sets - or one sample against a hypothesized value - when the data are ranked

58
Q

Eigenvalue

A
  • from principal components analysis
  • total variability explained by an orthogonal component
59
Q

Downside to non parametric test

A

Less precise with the data and less powerful - LESS LIKELY to detect (reject) a false null than a parametric test.

60
Q

Mediator vs moderator vs latent vs suppressor variable

A

Mediator= accounts for/is responsible for relationship between IV and DV (Ellis proposes that beliefs mediate (are responsible for) the impact of an event on our emotional/behavioural responses.)

Moderator: variable that affects the strength between two variables (if the size of the correlation between a predictor and criterion differs for OA vs YA)

Latent: theoretical variable that is believed to underlie a measured or observed variable

Suppressor: reduces or conceals the relationship between 2 variables. Removing the effects of this variable increases correlation between two variables