measurement terms Flashcards

1
Q

convergent validity

A

does it at least partially measure the concept?

2
Q

face validity:

A

does it seem plausible

3
Q

discriminant validity

A

does the measure distinguish what you’re measuring from something else that is related but different? (e.g. government production speed vs. effectiveness)

4
Q

consensual validity

A

is it broadly accepted (consensus)

5
Q

correlational validity:

A

(good for a new measure, if you don’t have consensus) does it track with other accepted measures? (think convergent, but on a larger scale and slightly different)
can statistically compare it to other measures of the same concept, with somewhat similar results and a theoretical reason why

6
Q

predictive validity:

A

can we use it to guess things we should be able to? (the measure says this; does it actually happen? are predictions accurate?) if not, it’s a problem.
ex. of failure: polls and Hillary Clinton

7
Q

2 major threats to measurement reliability:

A

subjectivity: Chile example (instructions and 2 different people; the person deciding = subjectivity). Desk effect. Tests for intercoder reliability.
lack of precision: samples are imprecise; build a measure of the sample’s imperfection into the analysis. Limits ability to predict, but necessary.

8
Q

types of measures

A

objective, subjective

9
Q

levels:

A

binary (0 or 1): dummy variables
interval: counts or continuous (the #s most familiar with)
ordinal: 1st, 2nd (ranking, ex. warmest; don’t know the size of the difference between levels, just their position relative to others)
nominal: can’t do math on: colors, names; categories that are stored and are distinct from one another

10
Q

limitations of data

A

social desirability bias (racism, truth/lie spinner)

11
Q

measurement

A

assigning #s to phenomena for the purpose of analysis

theories, validity, reliability, types

12
Q

measurement theories:

A

need to operationalise concepts (often need multiple measures)
almost always contentious (in political analysis)
usually assumed to contain error.
-more is always better in statistics (multiple indicators)

13
Q

polity:

A

how democratic a society is, various measures (minority, contestability of elections)
- people have different opinions on what democracy is, how to measure it, etc.; highly contentious

14
Q

measurement reliability

A

do we get the same measure every time (unless the thing itself has actually changed)?
ex. of unreliability: either the people or the way it is being measured differs, so measuring the same thing the same way gives 2 different answers.

15
Q

objective

A

something you can point to that no one can disagree with (ex. how many people like Trump, measured by who voted for him; it is an actual number, even if imperfect) vs. subjective (one where someone sits with people and evaluates the discussion to determine whether they like him or not; very subjective)

16
Q

social desirability bias

A

(people don’t want to admit they’re racist) so questions may have to lead people to admit it
“the spinner”: give everyone a spinner; where it lands (lie, or tell the truth, which is the larger part) is how they answer. Compare the known statistical probability of the spinner to the responses.

17
Q

how to make frequency distributions

A

tally observations
define classes
consolidate and display
(use software) (see ex. on chalkboard) (represent with plots (“polygons”), histograms, and cumulative frequency polygons)
(plot classes on bottom and # of observations / frequency on side)
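The three steps above can be sketched in a few lines of Python. This is only an illustration: the observations, the class width of 10, and the helper name `class_label` are all made up for the example, and a text bar stands in for a real histogram.

```python
from collections import Counter

# Hypothetical observations (e.g. ages of respondents); not from the notes.
observations = [18, 22, 25, 31, 34, 35, 41, 47, 52, 58, 63, 67]

class_width = 10  # assumed class width: 10-19, 20-29, ...

def class_label(value, width=class_width):
    """Return the class a value falls into, e.g. 25 -> '20-29'."""
    lower = (value // width) * width
    return f"{lower}-{lower + width - 1}"

# Define classes and tally each observation into its class.
tally = Counter(class_label(v) for v in observations)

# Consolidate and display, keeping a running (cumulative) frequency.
classes = sorted(tally, key=lambda label: int(label.split("-")[0]))
cumulative = 0
rows = []
for label in classes:
    cumulative += tally[label]
    rows.append((label, tally[label], cumulative))
    print(f"{label:>7} | {'#' * tally[label]:<5} | cumulative {cumulative}")
```

The per-class counts are what a histogram or frequency polygon plots; the running total is what a cumulative frequency polygon plots.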

18
Q

Measures of Central Tendency

A

values indicating where in the range the data tends to be
mean = average value
median= middle value
mode=most frequent value
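A minimal sketch of the three measures using Python's standard `statistics` module. The salary figures are hypothetical, with one deliberate outlier to show how differently the mean and median react.

```python
import statistics

# Hypothetical salaries in thousands; the last value is a deliberate outlier.
salaries = [30, 35, 35, 40, 45, 50, 900]

mean_value = statistics.mean(salaries)      # sum of observations / number of observations
median_value = statistics.median(salaries)  # middle value once sorted
mode_value = statistics.mode(salaries)      # most frequent value

print(f"mean={mean_value:.1f}, median={median_value}, mode={mode_value}")
```

Here the single outlier drags the mean far above the median, which is why the median is often preferred for skewed data such as incomes.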

19
Q

Mean= (sum of all observations)/(# of observations)

A

can take unrealistic values (1.7 kids)
skewed BY OUTLIERS (CEO salaries)
not appropriate for some variables

20
Q

median (middle value)

A

-value of whatever the middle observation in the range is
-if the number of observations is even, take mean of the 2 middle observations
-unlike mean, it is NOT WARPED BY OUTLIERS
-usually doesn’t take on unrealistic values
-may not mean much if data is multimodal

21
Q

Mode: occurs most frequently

A

can have more than one mode

  • often helpful to relax specificity of “most frequent” when discussing multimodal data and/or data with a wide range of values
  • not very useful if data is evenly distributed
22
Q

kurtosis

A

(tailedness): how much of the data resides within the tails

can calculate these with software

23
Q

Standard Deviation*

A

measure of average difference from the mean (mu is the mean; lower-case sigma is the symbol for s.d.)

24
Q

-what can we infer from distance from mean? (interpret data)

A

if it’s small, data is tightly clustered around mean (mean explains the data well)
if it’s large, data is spread out more (mean explains less of the data)
if it’s larger than the mean, data is so far from mean that the mean is not a good measure of centrality
always report standard dev. with the mean
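To see the interpretation above in numbers, here is a small sketch with two made-up data sets that share the same mean but differ in spread. `statistics.pstdev` computes the population s.d. (sigma); for a sample, `statistics.stdev` would be the usual choice.

```python
import statistics

tight = [48, 49, 50, 51, 52]    # hypothetical: clustered around 50
spread = [0, 10, 50, 90, 100]   # hypothetical: same mean, widely spread

mu_tight = statistics.mean(tight)
mu_spread = statistics.mean(spread)
sigma_tight = statistics.pstdev(tight)    # population standard deviation
sigma_spread = statistics.pstdev(spread)

print(f"tight:  mean={mu_tight}, s.d.={sigma_tight:.2f}")
print(f"spread: mean={mu_spread}, s.d.={sigma_spread:.2f}")
```

Both sets report a mean of 50, so only the s.d. reveals that the mean describes the first set well and the second set poorly.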

25
Q

basic law of probability:

A

given that all poss. outcomes of a given event are equally likely the probability of any specified outcome is the ratio of number of ways that the outcome can be achieved to the total number of ways all poss. outcomes can be achieved.

26
Q

a priori:

A

probabilities calculated with full awareness of all possible outcomes (ex. dice, cards etc)

27
Q

posterior:

A

probabilities estimated after the fact with limited knowledge of possible outcomes

28
Q

EXPECTED VALUE:

A

-expected payout over many outcomes
for one outcome: the value x the prob. of it occurring
for the entire process: sum of all expected values of outcomes
(e.g. game: costs 75 cents, expected value is 50 cents, not worth it)
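A quick sketch of the calculation. The notes give only the cost (75 cents) and the expected value (50 cents); the payout table below is a hypothetical game chosen to produce that expected value.

```python
# Hypothetical game: (payout in cents, probability). Probabilities sum to 1.
outcomes = [
    (200, 0.25),  # win 200 cents a quarter of the time
    (0, 0.75),    # win nothing otherwise
]
cost_to_play = 75  # cents, as in the notes

# Expected value of each outcome = value x probability; sum them for the whole process.
expected_value = sum(value * prob for value, prob in outcomes)
worth_playing = expected_value > cost_to_play

print(f"expected value = {expected_value} cents, worth playing: {worth_playing}")
```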

29
Q

either occurring: P(A ∪ B), unions

A

A or B (or both) occurring

ex. odds that it will rain or snow today

30
Q

median voter theory:

A

ideology is normally distributed (left v. right)

31
Q

Finding individual probabilities: Z scores

A

number of standard deviations a value lies from the mean
to calculate: subtract mean from value, divide by standard deviation
to figure out what percentage of data lies between the mean and this value, consult a table!
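The calculation and the table lookup can both be sketched in Python. `statistics.NormalDist` (standard library, Python 3.8+) stands in for the printed table; the mean, s.d. and value are hypothetical.

```python
from statistics import NormalDist

mean, sd = 100, 15   # hypothetical population mean and standard deviation
value = 130

# Z score: number of standard deviations the value lies from the mean.
z = (value - mean) / sd

# A z table typically lists the share of data between the mean and z.
standard_normal = NormalDist()  # mean 0, s.d. 1
share_between_mean_and_value = standard_normal.cdf(abs(z)) - 0.5

print(f"z = {z}, share between mean and value = {share_between_mean_and_value:.4f}")
```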

32
Q

normal curve

A

68.26% are within 1 s.d.
95.44% are within 2 s.d.’s
99.74% are within 3 s.d.’s
based on continuous variable
centered on mean outcome
half greater, half smaller than mean
most values are close to the mean

33
Q

Bernoulli process:

A

Two mutually exclusive, jointly exhaustive outcomes

Independent trials: P(r successes in n trials) = C(n, r) * p^r * (1-p)^(n-r)
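Assuming the formula on this card is the standard binomial formula, C(n, r) * p^r * (1-p)^(n-r), a minimal sketch looks like this. The function name and the coin-flip example are illustrative only.

```python
from math import comb

def binomial_probability(n, r, p):
    """P(exactly r successes in n independent trials, each with success probability p)."""
    return comb(n, r) * p**r * (1 - p) ** (n - r)

# Hypothetical example: exactly 3 heads in 5 fair coin flips.
prob_three_heads = binomial_probability(5, 3, 0.5)
print(prob_three_heads)

# The probabilities over every possible number of successes should sum to 1.
total = sum(binomial_probability(5, r, 0.3) for r in range(6))
```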

34
Q

we can use the normal distribution

A

When np > 10 and n(1-p) > 10
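A rough check of why the rule works: when both counts exceed 10, the exact binomial probability and the normal approximation land close together. The example (100 fair coin flips, at most 55 heads) is hypothetical, and the 0.5 continuity correction is a common refinement that is not part of the notes.

```python
from math import comb, sqrt
from statistics import NormalDist

n, p = 100, 0.5   # hypothetical: 100 fair coin flips

rule_satisfied = n * p > 10 and n * (1 - p) > 10

# Exact binomial probability of at most 55 successes.
exact = sum(comb(n, r) * p**r * (1 - p) ** (n - r) for r in range(56))

# Normal approximation with the same mean and s.d., with a continuity correction of 0.5.
approx = NormalDist(mu=n * p, sigma=sqrt(n * p * (1 - p))).cdf(55.5)

print(f"rule satisfied: {rule_satisfied}, exact={exact:.4f}, normal approx={approx:.4f}")
```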

35
Q

what is a hypothesis

A

questions regarding some aspect of world we intend to investigate
framed as a FALSIFIABLE statement (can be proven wrong with means available to us)

36
Q

Null Hypothesis :

A

contradiction to any research hypothesis

How to test: must have parameters for the entire pop. that you can hypothesise about

37
Q

TYPE 1

A

FALSE REJECTION OF NULL (ERROR)

38
Q

TYPE 2

A

FALSE ACCEPTANCE OF NULL (FAILING TO REJECT A NULL THAT IS FALSE)

39
Q

Steps for a hypothesis test:

A
-find mean, s.d. and s.e.
-calculate t score using null as mean
-use t score (or z if n>30) and find probability
(how likely is it relative to alpha)
-describe findings
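The steps above, sketched for a one-sample test. The sample and the null mean are made up; the critical value 2.262 is the standard t-table entry for a two-tailed test at alpha = 0.05 with 9 degrees of freedom, used here in place of computing an exact probability.

```python
import math
import statistics

sample = [52, 55, 49, 58, 61, 54, 57, 53, 56, 55]  # hypothetical observations
null_mean = 50                                      # value claimed by the null hypothesis

# Step 1: mean, s.d. and s.e.
n = len(sample)
mean = statistics.mean(sample)
sd = statistics.stdev(sample)     # sample s.d. (divides by n - 1)
se = sd / math.sqrt(n)

# Step 2: t score using the null as the mean.
t_score = (mean - null_mean) / se

# Step 3: compare with the table value for alpha (two-tailed 0.05, d.f. = n - 1 = 9).
critical_t = 2.262
reject_null = abs(t_score) > critical_t

# Step 4: describe findings.
print(f"mean={mean}, s.e.={se:.3f}, t={t_score:.3f}, reject null: {reject_null}")
```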
40
Q

independent samples

A

are samples where selection into one sample does not affect the odds of selection into another sample

41
Q

dependent samples:

A

where the selection into 1 sample affects the odds of selection into another sample
ex. pair of surveys taken before and after election.

42
Q

Difference in Variance:

A

variance is the average of the squared differences from the mean (for a sample: sum of squared differences divided by n-1)
- if the variances aren’t the same, it affects how likely the sample means are to be close to one another even if they are from the same population. Variance becomes a particular concern when samples are different sizes or one is very small (<30)

43
Q

s.e. = s.d. * sqrt(1/n1 + 1/n2)

A

pooled difference of means

44
Q

equal variances

A

use pooled: pooled s.d. = sqrt{ [(n1-1)*s1^2 + (n2-1)*s2^2] / (n1+n2-2) }

pooled s.e. = pooled s.d. * sqrt(1/n1 + 1/n2)

t = (mean 1 - mean 2) / pooled s.e., with d.f. = n1+n2-2
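A worked sketch of the pooled calculation, reading the formulas on this card as: pooled s.d. = sqrt of the weighted average of the two sample variances, pooled s.e. = pooled s.d. * sqrt(1/n1 + 1/n2), and d.f. = n1 + n2 - 2. Both groups are hypothetical data.

```python
import math
import statistics

group_1 = [12, 15, 14, 10, 13]   # hypothetical
group_2 = [9, 11, 8, 12, 10]     # hypothetical

n1, n2 = len(group_1), len(group_2)
mean_1, mean_2 = statistics.mean(group_1), statistics.mean(group_2)
var_1, var_2 = statistics.variance(group_1), statistics.variance(group_2)  # s^2 for each group

# Pooled s.d.: weighted average of the two variances, then square root.
pooled_sd = math.sqrt(((n1 - 1) * var_1 + (n2 - 1) * var_2) / (n1 + n2 - 2))

# Pooled s.e. and t score for the difference of means.
pooled_se = pooled_sd * math.sqrt(1 / n1 + 1 / n2)
t_score = (mean_1 - mean_2) / pooled_se
degrees_of_freedom = n1 + n2 - 2

print(f"pooled s.d.={pooled_sd:.3f}, s.e.={pooled_se:.3f}, t={t_score:.3f}, d.f.={degrees_of_freedom}")
```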

45
Q

t score

A

needs 1 standard error (overall)

46
Q

if t score generated is less than t score w/ alpha in table

A

cannot reject null, cannot accept research hypothesis (result is within what the null allows)

47
Q

for proportions, null is opposite

A

(i.e. if the research hypothesis is >, the null is < or =)

48
Q

t score 2+

A

digression

49
Q

n=

A

(z * s.d. / max error allowed)^2 or (t * s.d. / max error allowed)^2

50
Q

s.d. of proportion is greatest when

A

proportion = 0.5, so s.d. = sqrt(0.5 * 0.5) = 0.5 (use this to est. sample size with unknown proportion); if proportion is farther from 0.5, don’t need as large a sample
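The sample-size formula n = (z * s.d. / max error allowed)^2 and the worst-case proportion can be sketched together. The confidence level (z = 1.96 for 95%), the s.d. of 15 and the error margins are assumptions picked for illustration; the function name is hypothetical.

```python
import math

def required_sample_size(critical_value, sd, max_error):
    """n = (critical value * s.d. / max error allowed)^2, rounded up to a whole person."""
    return math.ceil((critical_value * sd / max_error) ** 2)

# Hypothetical mean: 95% confidence (z = 1.96), s.d. of 15, allowed error of 2.
n_for_mean = required_sample_size(1.96, 15, 2)

# Proportion with unknown p: assume the worst case p = 0.5, so s.d. = sqrt(0.5 * 0.5) = 0.5.
worst_case_sd = math.sqrt(0.5 * (1 - 0.5))
n_for_proportion = required_sample_size(1.96, worst_case_sd, 0.03)

# A proportion farther from 0.5 has a smaller s.d., so it needs a smaller sample.
n_for_rare_proportion = required_sample_size(1.96, math.sqrt(0.1 * 0.9), 0.03)

print(n_for_mean, n_for_proportion, n_for_rare_proportion)
```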

51
Q

difference between 2 groups:

A

testing whether the two samples came from the same pop. or not, i.e. failure to reject null means we cannot conclude the pop. means are different (it can still be that they are different). t for these is the difference of the two sample means divided by the standard error

52
Q

independent samples t test difference w/ unequal variances

A

most conservative (least likely to reject null); makes type 1 less likely (exact d.f. very complicated)

  • mean, s.d., s.e. for each group
  • overall s.e. = sqrt(s.e.1^2 + s.e.2^2)
  • t score for diff of means (d.f. = n1+n2-2 as a shortcut; the exact d.f. is no higher than this and is harder to compute)
  • statistically significant at less than…
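A sketch of the unequal-variances steps with hypothetical data. The last lines compute the Welch-Satterthwaite degrees of freedom, which is the standard "complicated" d.f. for this test; it goes beyond what the card spells out and is included only for comparison with n1 + n2 - 2.

```python
import math
import statistics

group_1 = [20, 22, 19, 24, 25]               # hypothetical
group_2 = [28, 35, 15, 40, 22, 30, 18, 26]   # hypothetical, more variable
n1, n2 = len(group_1), len(group_2)

def mean_sd_se(data):
    """Return the mean, sample s.d. and standard error of one group."""
    mean = statistics.mean(data)
    sd = statistics.stdev(data)
    return mean, sd, sd / math.sqrt(len(data))

mean_1, sd_1, se_1 = mean_sd_se(group_1)
mean_2, sd_2, se_2 = mean_sd_se(group_2)

# Overall s.e.: square root of the sum of the squared group s.e.'s.
overall_se = math.sqrt(se_1**2 + se_2**2)
t_score = (mean_1 - mean_2) / overall_se

# Welch-Satterthwaite d.f. (always between min(n1, n2) - 1 and n1 + n2 - 2).
welch_df = (se_1**2 + se_2**2) ** 2 / (se_1**4 / (n1 - 1) + se_2**4 / (n2 - 1))

print(f"overall s.e.={overall_se:.3f}, t={t_score:.3f}, Welch d.f.={welch_df:.2f}")
```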
53
Q

t test independent equal variances

A

smaller s.e. and larger t scores (more type 1 error if variances are actually unequal); check with Levene test
-mean and s.d. before and after procedures being tested
-calc overall s.d.: sqrt{ [(n1-1)*s1^2 + (n2-1)*s2^2] / (n1+n2-2) }
(this is weighted avg of 2 s.d.)
-s.e. overall: s * sqrt(1/n1 + 1/n2)
-calc t: (mean 1 - mean 2) / s.e.
-if in between t table amounts, give explanation of value in t table (probability)

54
Q

t test dependent samples

A
  • pairwise subtractions
  • mean, s.d. s.e. of those differences
  • t score (2nd value is 0, to see if there is a difference)
  • statistically significant at less than value above: level of significance
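The dependent-samples steps, sketched with hypothetical before/after scores for the same six respondents. The critical value 2.571 is the standard t-table entry for a two-tailed test at alpha = 0.05 with 5 degrees of freedom.

```python
import math
import statistics

before = [50, 55, 60, 45, 52, 58]   # hypothetical scores before the election
after = [54, 57, 63, 47, 55, 60]    # same respondents afterwards

# Pairwise subtractions.
differences = [a - b for a, b in zip(after, before)]

# Mean, s.d. and s.e. of those differences.
mean_diff = statistics.mean(differences)
sd_diff = statistics.stdev(differences)
se_diff = sd_diff / math.sqrt(len(differences))

# t score against 0 (the null: no difference).
t_score = (mean_diff - 0) / se_diff

critical_t = 2.571  # two-tailed, alpha = 0.05, d.f. = 6 - 1 = 5
significant = abs(t_score) > critical_t

print(f"mean diff={mean_diff:.3f}, s.e.={se_diff:.3f}, t={t_score:.3f}, significant: {significant}")
```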
55
Q

proportions

A
  • means, s.d. {s = sqrt[p(1-p)]} of experimental and control groups
  • s.e. of each
  • overall s.e.
  • t of difference between experimental and control
  • if t score does not exceed value, the level of significance is used as probability the 2 samples could be from same pop.
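The same steps for proportions, sketched with made-up counts. It follows the card literally (s.e. of each group, then an overall s.e. from the two); some textbooks instead pool the two proportions under the null, which this sketch does not do.

```python
import math

# Hypothetical counts: group size and number of "successes" in each group.
n_exp, successes_exp = 200, 120   # experimental group
n_ctl, successes_ctl = 200, 100   # control group

# Means (proportions) and s.d. = sqrt(p * (1 - p)) for each group.
p_exp = successes_exp / n_exp
p_ctl = successes_ctl / n_ctl
sd_exp = math.sqrt(p_exp * (1 - p_exp))
sd_ctl = math.sqrt(p_ctl * (1 - p_ctl))

# s.e. of each group, then the overall s.e.
se_exp = sd_exp / math.sqrt(n_exp)
se_ctl = sd_ctl / math.sqrt(n_ctl)
overall_se = math.sqrt(se_exp**2 + se_ctl**2)

# t of the difference between experimental and control.
t_score = (p_exp - p_ctl) / overall_se

print(f"p_exp={p_exp}, p_ctl={p_ctl}, overall s.e.={overall_se:.4f}, t={t_score:.3f}")
```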
56
Q

nominal, ordinal, interval

A

which measures of central tendency can be used: mode with all three; median with ordinal and interval; mean only with interval

57
Q

statistical controls

A

3 or more variables’ relationships can be examined