Final Exam Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Theory

A

to create explanations for what we see

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Hypothesis

A

create statements derived form theory which is testable and observable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Evidence

A

test hypotheses with data, cases, and experiments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Causal theory (applicable to political scientists)

A
  • World is in terms of variables and causal explanations, explore this relationship
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Independent variable

A

Presumed cause

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Dependent variable

A

presumed effect/outcome

- Value of dv depends on value of IV

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Causal theory breakdown

A
  • Theory
  • IV
  • DV
  • hypothesize
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

operationalization

A

process by which abstract concepts are turned into ‘real world’ observations
- you can measure/collect data on unobservable phenomenon

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Hypothesis

A

theory based statement about relationsips that is observed– statement from theory that is testable and observable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Bivariate theory

A

most causal theories look at relationship between two variables (does X cause Y; does independet variable cause dependent)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

most real world problems are multivariate, so how to account for bivariate

A

control for other factors (z) and other possible causes of Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Spurious relationships

A

two or more events or variables are associated but not causally related, due to either coincidence or the presence of a certain third, unseen factor.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

how to control for spurious relationship

A

when z is introduced, relationship bw X and y disappears, if Z is helf constant, then relationship disappears

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Additive relationships

A
Third variable (Z) → DV ← IV (X)
X causes Y, but Z also causes Y, both are independent to each other (X does not affect Z, and Z does not influence relationship bw X and Y)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Interactive Relationships

A

Ex/ Relationship between education and abortion rights depends on gender
- Relationships bw X and Y depends on value of Z

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How to control for interactive relationships

A
  1. Correlation bw X and Y disappears in both groups= spurious relationship
  2. Observe correlation still exists and does not differ bw females and males = additive relationship
  3. Observe correlation still exists in females but not in males (education does impact opinions on abortion rights, but depends/is conditional on gender) = interactive relationship
  4. Observe correlation still exists in both groups but stronger in females = depends on gender, but greater impact of opinion with females
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Types of variables

A

Categorical
Ordinal
Continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Label

A

description of variable

label: “gender of survey respondent”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

values

A

denominations in which variable occurs

Values: male, female, other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

categorical variables

A
  • Limited categories
  • Cannot make universally-held ranking distinctions
  • Impossible to rank order the categories from least to greatest
  • Label: religious ID ;
  • values: christian, jewish, muslim
  • EX/ race, gender, age, sex
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

ordinal variables

A
  • Universal ranking distinctions
  • Rank order the categories from least to greatest
  • Assign scores 1,2,3,4,5 to categories; 5 is assigned for best situation
  • Size of difference between categories is inconsistent
    EX/ socioeconomic status: low income, middle income, high income
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

continuous variables

A
  • Have equal unit differences
  • Universal ranking distinction
  • Size of difference between categories is consistent
  • If you can measure it, it is continuous; any value within a range
  • Discrete and continuous variables are numeric/scale variables as opposed to categorical variables
  • Use term continuous variables for discrete variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

measures of central tendency (typical or average of a variable)

A

Median
Mode
Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Research design

A
  • quantitiative

- qualitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Quantitative research (3 forms of research)

A
  • uses numerical data for statistical analysis
    1. surveys
    2. observational studies (to determine if an existing condition is related to a characteristic of interest ie. smoking causes lung cancer)
    3. experiments (condition is created by imposing a treatment on the sample) –> researcher both controls and randomly assigns values of iv to subjects
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

qualitative research

A
  • does not use numerical data for sa
  • collects/analyzes non-numerical data (text, video, or audio) to understand concepts, opinions, experiences
  • EX/ how does social media shape the image of female politicians?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

observational studies

A

taking the world as it already is no controlled setting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

why is randomization important

A
  • if x is determined by pure randomness then it should not be correlated with any variable including Z
  • takes systematic differences out of play
  • subjects will not be systematically different from one another
  • helps show that the observed Y is caused only bc X
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

To control for possible causes of Y to overcome spurious relationship between X and Y in observational studies

A

use statistical controls

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

every causal relationship is….

A

potentially spurious; bc we can never know for sure until all possible third variables are tested

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

the most common approaches in political science is

A

observational designs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

investigator can make sure to control for factors he or she has not thought of in:

A

Experimental Designs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Which approach is stronger, experimental or observational

design?

A

Experimental Designs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Investigator controls for the possible effects of all rival

explanations in:

A

observational designs

35
Q

internal validity

A

degree of confidence in causal relationship bw X and Y and ability to draw these conclusions

36
Q

external validity

A

extent to results being applied to and across different environments/outside world
- generalizable data?

37
Q

data sets (2)

A
  • population

- sample

38
Q

Population

A

data for every possible relevant case

39
Q

sample

A

a subset of cases that is drawn from underlying population

40
Q

types of samples

A
  • random sample

- non-random

41
Q

random sample

A

each member of population has an equal probability of being selected in the same

42
Q

non-random

A

sample of convenience

43
Q

populations vs. samples

A

Not interested in properties of same per se

Interested in underlying population as a whole

44
Q

Statistical inference

A

Use what we know to be true about one thing (sample) to infer what is likely to be true about another thing (population)
Process of making probabilistic statements about population characteristics based on our knowledge of sample characteristics
How can we learn about population from sample? Use stat inference

45
Q

How to infer

A

Things are known w certainty - like the mean of some variable in our sample

  • central limit theorem
46
Q

central limit theorem

A

used if only a sample of data and wanting to know anything about vast majority of individuals for whom we don’t have data; invokes kind of distribution called normal distribution

47
Q

Probability Theory

A
  • Outcome: result of a random observation
  • Independent outcomes: two more outcomes can be said to be independent if the realization of one of the outcomes does not affect realization of other outcomes
48
Q

Properties of probability

A
  1. All outcomes have probability ranging rom 0 to 1
    - If we roll a die, what if probability of outcome 2?: ⅙
  2. Sum of probabilities of all possible outcomes must be exactly 1: all possible outcomes: 1,2,3,4,5,6 → probability of each outcome is ⅙, add ⅙ 6 times you get 1
  3. IF two outcomes are independent, then probability of those events both occurring is equal to the product of them individually
49
Q

Normal distribution

A
  • is most important probability distribution in statistics
  • Called a bell-shaped distribution with some unique features
  • Distribution is symmetrical around the mean
  • certain % of cases fall within the a distance of mean
    68% of values are within 1 SD
    95% of values are within 2 SD
    99.7% of values are within 3 SD
50
Q

frequency distribution

A
  • distribution of scores

- not normally shaped

51
Q

statistical inference

A

use what we know to be true about one thing (sample) to infer what is likely to be true about another thing (population)
- process of making probabilistic statements about a population characteristics based on knowledge of sample characteristics

52
Q

How to know about vast majority of data with only a sample of data?

A

Central limit theorem

53
Q

Central limit theorem

A
  1. take the mean of each sample
  2. plot those means
  3. the hypothetical distribution of sample means- a sampling distribution- will be normal
    * *no matter the shape of frequency distribution, sampling distribution will always be normal
54
Q

mean of the sampling distribution=

A

true population mean

55
Q

mean of the sampling distribution=

A

true population mean

56
Q

Standard deviation of the sampling distribution=

A

standard error of the mean

57
Q

if we know the standard deviation of distribution, we can…

A

estimate the confidence interval for the population mean

58
Q

which is most common standard of confidence interval to use

A

95%= 95% of all possible sample estimates will fall within the interval by chance

59
Q

“sample mean -(2x standard error)

A

at lower end

60
Q

sample mean + (2x standard error)

A

at the upper end

61
Q

As sample size increases, distribution of the sample means tends toward…

A

the normal distribution

62
Q

sample mean can be considered approximately normal distributed if sample size is at least?

A

30

63
Q

central limit theorem only applies to samples that are selected…

A

randomly

*does NOT apply to sample of convenience

64
Q

what do “surveys” says about the population as a whole in central limit theorem?

A

BC samples are clearly not random samples of underlying population, the answer is “nothing”

65
Q

the smaller the standard errors

A

the tighter our resulting confidence intervals will be

66
Q

if interested in estimating population values, based on samples, with as much precision as possible, then we want

A

tighter confidence intervals

67
Q

probability distribution around mean is…

A

symmetrical

68
Q

if population is normally distributed, then sample mean…

A

is also normally distributed

69
Q

When sample size is small, how do you make inferences about population mean?

A

use t-distribution

70
Q

As n grows, what happens to t-distribution

A

begins to resembles a normal curve

71
Q

t-distribution for a small sample curve

A
  • fatter tails than normal
  • wider boundaries on SE
  • less confidence in accuracy of sample static
  • boundaries of 95% CI are not fixed
72
Q

Forms of Hypothesis Testing

A
  • confidence interval

- p-value approach

73
Q

what is included in the p-value approach?

A
  1. z-score
  2. standard normal distribution
  3. P-value
74
Q

Test-hypothesis

A
  • method from sample data to decide whether null hypothesis should be rejected
75
Q

null hypothesis

A

H0: hypothesis of no change from current opinion/rejection of current theory

76
Q

two conclusions from hypothesis-testing analysis

A
  1. reject H0 (in favor of the alternative hypothesis)

2. Fail to reject H0 (continue to believe null hypothesis)

77
Q

p-value

A

number describing how likely it is that data would have occurred by random chance (that null hypothesis is true)
- level of confidence with which we can reject the null hypothesis

78
Q

common p-value mistakes

A
  • it is not the probability that null hypothesis is correct (that can’t be proven)
  • it is not the probability of making an error
79
Q

Z-score

A
  • number of standard deviations a specified data point lies from the mean
    estimate of how far difference in values is from pop mean, relative to the spread of our data set
80
Q

standard normal distribution

A

mean=0; SD=1

  • total area under curve is 1
  • 95% of cases fall within distance of 1.96 of the mean
81
Q

What does p-value mean? (3)

A
  • probability of an observed relationship given that the null hypothesis (no relationship bw X and Y) is true
  • prob. that we see relationship bw two variables if there were truly no relationship bw them in unobserved pop.
  • prob. that we see relationship that we are finding bc of a random chance
82
Q

high p-value

A

high prob. that relationship likely happens by random chance, so it does not support theory

83
Q

low p-value

A

low prob. that relationship unlikely occurs by random chance, so relationship likely does exist

  • the more contradictory the data is to H0
  • more confidence that there is a systematic relationship
  • relationship is statistically significant
84
Q

limitations of p-value

A
  • has nothing to do with strength of relationship only talks about confidence that there is a relationship bw two variables in society
  • does not say whether relationship is causal