exam 1 Flashcards

1
Q

Systematic empiricism

A

planning, making, recording and analyzing observations in the world
Collecting and analyzing data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Parameter

A

descriptive, true value in a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Statistic

A

descriptive value in a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Inferential statistics

A

methods for using sample data to make general conclusions (inferences) about populations
Parameter– difference between groups in a population
Statistic– difference between groups in a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Sampling error

A

discrepancy (error) between the sample statistic and its population parameter
Chance variation in ever random sample we pull from the population could describe group differences between statistic and parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Goal of inferential statistics

A

estimating how large the sampling error actually is
Sample should be diverse and big enough to be representative to avoid sampling error
Random sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Covariates

A

other variables we know of that may correlate with our independent/dependent variables
Never manipulated by the researcher
We can statistically control for covariate and use them to ask more complex questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Demographic variables

A

descriptive variables about our sample
Age, race, gender, educational attainment, income, marital status, etc
always need at least age, race, and gender
Shows how well you are representing the population
Also important for replication purposes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Construct

A

variables that cannot be directly observed, but are useful for describing and explaining behaviors
ex– happiness, stress levels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Operation definition

A

the way a construct is measured in an empirical study
We operationally define constructs with measures
Survey measures
Psychological measures
Behavior measures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Reliable

A

consistency across time, items, and raters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Valid

A

accuracy, measuring what they are supposed to measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Indivisible categories

A

nothing exists between them
Example – attachment styles
Secure, avoidant, resistant, disorganized

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Continuous variable

A

infinitely divisible at the discretion of the researcher
Example– time
Can have an infinite number of categories
Between any two points on a truly continuous variable its alway possible to find a third point between them
150 - 150.5 - 151 - 151.5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Nominal scales

A

unordered set of categories identified only by name
Only able to tell us whether individuals are the same/different
Categories, no math value to them
Example– college major

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Real limits

A

boundaries located exactly halfway between adjacent categories and define the range of each category
Real upper limit– top boundary
Real lower limit– lower boundary
Example
4.425– 4.43
4.424– 4.42

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Ordinal scales

A

ordered set of categories
Tells us the direction of differences between two categories, but not the absolute distance
Not equal intervals between the categories
Examples– restaurant drink sizes, rankings in race (first, second, third)
If you rate a single statement on a scale from strongly disagree (1) to strongly agree (5), etc, it counts as an ordinal scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Interval scales

A

ordered set of categories with equally distances intervals with an arbitrary 0 point
Arbitrary– just another category, does not mean absence of the construct
Example– temperature
0 degrees does not mean no temperature

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Ratio scales

A

interval scale with an absolute zero point
Key– zero means there is none of the construct
Can also perform mathematical operations
Examples– height, weight, running distance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

helpful in determining appropriate scale

A

Test one– ask yourself if zero means total absence of the quantity
If it does then its a ratio scale
Test two– ask yourself if you can have less than 0 of something
If you can’t, its likely a ratio scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Positively/ negatively skewed

A

pos– scores pile up on left
neg– scores pile up on right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

N

A

total scores in a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

n

A

total scores in a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

M

A

sample mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Central tendency

A

describe the central point or typical value of a distribution
Goal– allows researchers to summarize/condense a large data set into a single value
Allows us to quickly compare two data sets that have collected similar information
Ex– exam one scores in this class versus other psy 350 class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Variability

A

how spread out the scores are around the centrail point

Together, these measure describe distributions of scores

Usually reported together
Descriptive statistics– both central tendency and variability measures
Both define the shape of distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

central tendency/ mean

A

most commonly used measure of central tendency
Only works if measure is numerically coded

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Rescaling

A

moving the distribution on a number line
Adding/multiplying a constant value to every single score in sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

rescaling– multiplying

A

Mean: Is multiplied by the constant value.
Standard Deviation: also multiplied by the constant value (spread changes). - Distribution: Stretches or compresses depending on whether the constant is greater than or less than 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

rescaling– adding

A

Mean– Increases by the constant value.
Standard Deviation– stays the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Nominal data

A

It is always inappropriate to compute a mean for nominal data
No numerical value assigned to categories

32
Q

ordinal data

A

Usually inappropriate to compute mean for ordinal data
If you have a lot of categories its might be appropriate
8+ categories
Easier to just give percentages that fall into each category
Wiggle room with ordinal data, but never with nominal

33
Q

Median

A

midpoint of a list of scores that are sorted in order from smallest to largest
Median is less affected by extreme scores and skew
Stays in the center of a distribution
Should be used over the mean in these situations
Median does a better job at representing the distribution over mean
Also does a better job even if skew is present

34
Q

skew cut off points

A

perfect– 0

-2 or lower– problematic negative skew

2 or higher– problematic positive skew

35
Q

Mode

A

most frequently occurring category or score in a distribution

Only one to report for nominal data

36
Q

Central tendency and distribution shape

A

Normal distribution– means, median and mode will be equal to each other and be the same value
Skewed
Mode will be at the peak
Mean will be toward the tail
Median usually is between mean and the mode
Order (from left to right)– mode, median, mean

37
Q

variability purposes

A

Describe the spread in a distribution
Important component of most inferential statistical tests

38
Q

Three variability measures

A

Range
Variance
Standard deviation

39
Q

Variance

A

measures the average squared distance between a score and the mean

40
Q

Standard deviation

A

standard average distance between a score and the mean
Most commonly reported measure of variability
Both tell us how much spread is around the mean

41
Q

Population variance

A

mean squared distance from the mean

42
Q

Population standard deviation

A

standard, mean distance from the mean

43
Q

unbiased estimates

A

A sample statistic is unbiased if the average value of the statistics is systematically equal to the population parameter
On average sample statistic will be that parameter after repeating it over and over again

44
Q

biased estimates

A

A sample that is biased if the average value of the statistic is systematically different from the population parameter
Statistic samples won’t be equal after repeating it over and over again

45
Q

degrees of freedom

A

number of scores in the sample that are free to vary from the mean
Example
If M=6
Frist score– 8
Second score– 3
Third score has to be 7 to result in a mean of 6
First two are free to vary, where third is fixed

46
Q

Properties of standard deviation

A

if a constant is added to every score in the distribution, the standard deviation does not change
Adding a constant to every score just shifts distribution
If its multiplied by a constant it does change
Increased distance = increases standard deviations

47
Q

Z scores

A

standard score
We convert X into a z score to tell us the exact location of any given score

48
Q

interpreting z scores

A

Positive– z score is higher than sample mean
Negative– z score is lower than sample mean
If z score is 2.5– 2.5 standard deviations above the sample mean
When converting raw scores into z score– mean becomes 0, standard deviation becomes 1

49
Q

Standardized distributions

A

A distribution where all the original scores have been changed so that they now have a set (or specific) mean and standard deviation

50
Q

When we transform all scores to z-scores

A

the form of the distributions does not change
Mean now is 0
Standard deviation is now 1
Advantages– puts everything on the same scale so we can make comparisons between different distributions

51
Q

Interpreting z scores

A

Must pay attention to the sign (positive or negative) and absolution value (how large is it)
Sing tell us whether the score is above or below the mean
Negative– below
Positive– above
Absolute value– tells us how many standard deviations the score is from the mean
Z score of 0 means it is the mean

52
Q

Probabilistic reasoning

A

We typically deal with likelihood instead of certainty

53
Q

Independent random sample

A

means that every item has an equal chance of being chosen, and the chances don’t change from one pick to the next, even if you’re picking more than one item

54
Q

If body is the left and tail is right

A

z score is positive
Vice versa for negative
Usually more interested in the tail
Rarest z-score will always be the one furthest from the mean

55
Q

Distribution of sample mean

A

ollection of sample means for all possible random samples of a given size from a population
This is a specific type of sampling distribution, or a distribution of statistics obtained from all possible samples
The more samples you draw the more normal and symmetrical your distributions of samples will look

56
Q

Three distributions we have talked about so far

A

Distribution of individual scores in the population
Original set of scores in the population we want to study
Example– 333 million americans

Distribution of individual scores in a sample
Actual set of scores from our sample we are able to study
Example– 20,000 american you pulled for your sample

Disruption of sample means
Theoretical set of mean from all possible random samples
Used to draw conclusions

57
Q

Central limit theorem

A

The average of our distribution of sample mean will always be equal to the population mean
Expected value of M– most likely value for the mean in any sample
The standard deviation for the distribution of sample means is smaller than the population standard deviation

58
Q

Law of large numbers

A

the larger our sample sizes (n) are, the more probable it is that a sample mean will be close to the population mean
As our sample size gets larger, our standard error gets smaller

59
Q

Standard error vs sampling error

A

Sampling error is the diff between one sample mean and the population mean

Standard error is the average sampling error across all possible samples

60
Q

How large is large

A

30
If you are taking samples of at least n = 30, you are guaranteed a normal distribution of sample means
If it’s not 30, but the population distribution is normal, you are still guaranteed a normal distribution of sample means

61
Q

Z test

A

Purpose– test for a difference between the true population parameter and the observed sample statistic

Decide between two explanations
Difference is som small, there does not appear to be a treatment effect

Difference is so big that there appears to be a treatment effect
Can it reasonably be explained by sampling error?

62
Q

Known population

A

this is known mean from the original untreated population you want to compare your sample to
What you are drawing your sample from
Have to know what the mean and sd are

63
Q

Unknown population

A

unknown mean among the population that receives the treatment you are trying to estimate with your sample
After treatment if it is different enough from sample mean it now belongs to unknown treated population
Main goal– does the random sample still represent the treated population or now represent an unknown treated population

64
Q

Null hypothesis

A

state that there is no difference in the population as a result of some treatment

Assumptions there is no effect in the study
Assumes our “treated” sample comes from the same untreated population

In the example, average stress would still be 50

The null hypothesis is what we are testing so we assume thats its true and after applying treatment it will be the same

Trying to find enough evidence enough the null so we can reject it

Is diff large enough we can conclude null is not true?

65
Q

Alternative hypothesis

A

states that there is a difference in the general population as a result of some treatment

Assumes any change we see is beyond what we would expect because of sampling error

Assuming its not plausible that our treated sample comes from the original untreated population

We don’t make any specific prediction about the alternative hypothesis, just that it is ot the same as the null

Null– H0 = 50
Alt– H1(equal sign with a slash) 50

66
Q

Set the criteria for a decision

A

Choose between the null and alternative hypotheses based on how likely it is that we would observe our test statistic if the null hypothesis were true
If it’s likely, the the null is probably correct and we would prefer it
Vice versa for alternative
Have to decide what rare is going to be
Have to make this choice in advance– before we ever look at the data

67
Q

Alpha level

A

probability value used to define what is likely and what is unlikely
If you think 15% is an unlikely score, alpha level would be .15
In psychology 5% (.05) is the typical standard
5% of all possible sample means can be classified as unlikely to occur

68
Q

After selecting alpha level, find z score associated with 5% chance in the tails

A

Can’t do it with a skewed distribution– has to be at least 30 people in our sample
We want to pick a z score with both its positive and negative tails values to create 5%
(+/- 1.96)– gives you 5% of all the possible scores

69
Q

Critical region

A

defined by critical z score (+/- 1.96) that encompass 5% of all possible sample means
Values you would deem super unlikely to occur if null true
In theory, any score in this region is almost impossible if the treatment has no real effect

70
Q

Collect data and compute sample statistics

A

Rescutir sample, apply treatment, compute mean score
Convert the sample statistics to a test statistic, which is just a z-score in the case of a z test

Assuming the treated sample belongs to unknown treated population, you want to know how probable it would be to find a z score of that magnitude
Probable– accept null hypothesis
Unprobale– accept alternative hypothesis

71
Q

Make a decision

A

Use unit normal table
If it is in the critical region, we conclude that the difference is significant
In this case we reject the null
If it’s in the body, we would conclude we did not find enough evidence to reject the null
In this case we fail to reject the null

72
Q

Rejecting vs. accepting

A

We ever accept the alternative hypothesis because that not what we are actually testing (always testing the null), but you can support it
For the null we either reject (p<.05) or fail to reject the null (p > .05)

73
Q

Type one error

A

false error
Rejecting null when you shouldn’t have
A sample statistic falls in the critical region even though the treatment has no real effect in the population

74
Q

type two error

A

false negative
Treatment effect does exist, we just failed to detect it
More likely to make a type two error
Lack of statistical power
Power– probability the test will reject the null when the treatment does have a real effect
Depend on
Size of treatment effect
Size of sample– larger sample size, more power
Experimental validity

75
Q

Increase power of a hypothesis test

A

Increase effect size
Conduct an experiment instead of an observational study
Use more precise measurement

Increase sample size– what we have more control over
Make sample more representative of the population
Reduces standard error of the distribution of sample means– makes it skinnier

Decrease population standard deviation
Limit target population
Reducing population so people are more similar
“20 year old, female, right handed, college student”

Increase alpha level
Ex– making it .10 instead of .05