Lectures Flashcards

1
Q

What are the 3 issues raised when it comes to suitable methods of understanding

A

How accurate are the observations we use? How good is the date? (Example: GDP/capita- which does not count unpaid work for instance, does not represent corruption and illegal activity, not precise to determine the productivity of the workforce …other example is Crime rates - because of unreported crimes, since data is based on criminal justice system but survey data is not perfect either) - Standard methods needs to be interpreted carefully
Representativeness
Causation [How do we know two variables are causally connected ? (in general we want to establish causal relationships. Example : stocks and fertility - associations are abundant but establishing connection is really hard- how do we know they are causally connected ?) ]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the qualifications of an hypothesis

A

In order to formulate these questions, we create hypothesis (turning question into testable statement), hypothesis are testable.
Qualification of hypothesis:
Hypothesis moves us closer to measurement. Have to consider the different dimensions of dependent vs independent variables (example : simply taking death penalty yes or no is not relevant because countries use it different)
Hypothesis leaves profound measurement issues unresolved (the problem with the independent variable [social class], what do we mean by that? )
Implying causal relationship between dependant and independent variables might be damaging because need to take into account other/exogenous variables

What is a hypothesis ? Research question restated in a testable form

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Depict the variables of an hypothesis

A

Need for variables :

  • must be observable
  • variability of the elements
  • possibility for falsification
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Depict deductive theorizing

A

Deductive
Distributive justice - sets of premises.
Axioms
Perception of justice involves comparison
Comparison are between what one person has and what others have
The salience of comparison varies with wealth (the gain or loss of 100$ matters more to a low income person than to a high income person)
Measurement matter ( comparison can be more or less precise, they can be formulate in terms of dollars (precise) or status (not precise)
If we think these axioms true- what do they imply ? If we change the axioms we can offer different things

Examples:

A person will prefer to steal from a fellow group member rather than from an outsider.
Why ? Because you can compare yourself to the other members of the group ( I am better off and made someone worse off)

The tendency to steal will be higher in a low income group than in a high income group.
Why? Because (axiom #3) the salience of comparison varies with wealth

Axiomatic theory (assert principles and deduce consequences from them) - deductive. 
The next step is to go out and test them- which might result in their falsification
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Depict inductive theorizing

A

Inductive
Example:
In the 1970s Jewish out marriage increased from east to west
In the 1970s catholic out marriage increased from east to west
Rates of suicide, drug use, social pathologies increased from east to west

This same pattern might suggest ineffectiveness of norms

*anomie might cause ineffectiveness of norms drug, suicide, out-marriage (because norms are less effective)
-Sets of observation and patterns that could be explained by anomie and migration
What do we do? We test it and try to falsify it - how ? Look at records, compare with other countries, develop a measurement of anomie
So Inductive = Observe something in the world, what might explain it ? Drawing conclusion from observations not going from specific principles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Some useful research does not involve _______

A

hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is theory

A

Theory = “systemic sets of interrelated statements intended to explain some aspect of social life”
Theory often implies causation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is functional theory

A

Functional (institutions/behaviour exist in order to serve some function)
Example:
Inequality —> effort —> growth —> income stability = functional hypothesis
How would you test that ? Compare US and France and look at measures of social stability (conventional employment, crime rates , employee satisfaction, youth employment) - once we start talking about measurement, we have to start re-thinking our concept. What it is we want to measure ?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is conflict theory

A

Group resources —> inequality = conflict theory hypothesis
*inequality rose in the 1970s- which we need that into account and it poses a problem to conflict theory
-Income of top decile goes up 200%
-Income of the medium goes up by 50%
What does that mean for the theory ? The rise of income of the bottom, means that they are getting better off too- therefore it would be consistent with functional theory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

*Research is _____ , the_______ are ______ , those _______ have ______ *

A
guided by theory
theories 
broad in terms of frameworks
broad framework 
implication for what you look at and how you measure it
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the 3 purpose of research

A

-Exploration:
What?when a researcher wants to become familiar with a topic.
Why?to find something about it in a quick way, to test the feasibility of a more extensive research, to better inform more systematic research
-exploration makes possible the specification of methods
-major limitations= the extend to which it can answer questions is limited because : one goes into it not knowing much about it and the findings are difficult to generalize
-Example: Lavoie, Robitaille, Hébert in a journal called “violence against women” interviewed 24 Quebec teenagers about their experience with dating violence. Findings: 1. Teens reported a range of form of violence including : sexual abuse, social control , threats, harassment…. 2.Teens brought forwards : drugs, alcohol, pornography … - they tend to explain the violence in terms of the exterior factors. 3.Teens attributed some responsibility to the behaviour of the victim. These findings showed that teens used a very broad definition of violence so should the researchers keep that broadness or impose a fixed definition. So subsequent research should be better because learning from that first study.

-Description:
Example : Venkatesh “Gang leader for a day” studying gang members in a Chicago house facility over the course of 8 years. Findings: the gang to some degree provided social control to discourage certain behaviours. They supplied drugs but we also drawn into politic processes. “Why does drug dealers live with their mothers” - because they provide different needs for probably free. It reflected their income SO the question is why would a drug dealer accept such a risky occupation with such low pay ? Because they is a possibility to move up the drug dealing hierarchy and because they have limited career opportunity.
A lot of findings from the study provided a lot of data and information about drug dealing

-Explanation:
Causation [Independent variables and dependent variables (exogenous and endogenous)]
The approach to causation in research is related to the purpose of the research (ideographic vs nomothetic)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Depict the two approach to causation

A

Ideographic: limited amount of cases and a complete explanation for it. Usually History
Example: Attempting to explain the French Revolution [war and taxes, the enlightenment, intellectual ideas and values rising, a bourgeoisie was emerging and receptive to enlightenment ideas, aggressive taxing, the French economy was not very productive so taxing capacity was limited so France had lost a couple of wars against Britain, France government had large financial need they could not fulfill] the revolution sprouted from this collection of factors

Nomothetic: explain a class of situation rather than a single one , so the purpose is to explain as much as possible with the smallest number of variables. Partials, non-exaustive results. Usually sociology 
Example: In 1995, there was a referendum on independence. Research could try to explain people’s vote. [Political party, age, language, urban area] Nomothetic looks at which of this list of factors provide a major explanation.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the two different type of causes

A

Necessary : has to be there for it to happen
Sufficient : can be there for it to happen
Nomothetic usually don’t identity neither necessary or sufficient explanations (example: language in referendum)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the three elements for inferring causation

A
  1. Association : do the patterns of occurrence of a variable associate with the pattern of occurrence of another variable - correlation (+1,0,-1), in social sciences correlation tend to be less then perfect (0.3 rather than 0.9 or 1)
    2.Time order : cause has to precede the effect
    Example: the effect of parental education and child occupation- meets the time order, it is consistent with causation (parental precedes child) but it doesn’t not necessarily infer causality
    Issues in time order :
    -Cross sectional data (example census)
    -Expectation (example: marriage improving health, complex time order if the expectation of marriage is the source of causation)
    3.Non-spuriousness : you have two variables that are associated because of another present variable. x is associated with y because they are both associated with z.
    Example: stork and fertility and rural/urban
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Depict the 2 types of fallacy

A

Ecological fallacy :involves inferring observations about individuals with aggregate level data
Example:
Durkheim compared suicide rates across societies. He found that primarily Protestant societies had higher suicide rates than primarily Catholic societies.
From this observation he constructed his idea about egoistic suicide: i) egoistic suicide is a product of excessive individualism, of a weakened sense of mutual obligation; ii) Protestantism is more strongly associated with individual conscience than Catholicism , which weakens the sense of mutualobligations; iii) consequently, Protestants are more likely to commit suicide than Catholics.
But: i) most European countries contain both Protestants and Catholics; ii) suicide rates are relatively low (Durkheim reported these rates per million inhabitants: 190 in Protestant countries and 58 in Catholic countries); iii) consequently, it is possible that the people committing suicide in Protestant societies were Catholics

Individualistic Fallacy : Inferring the properties of systems with the properties of individual
Example: Suppose one has evidence that country A has a larger percentage of people who favour democracy than country B: it would not be reasonable to infer from this that country A is more democratic than country B. Democracy is a property of a system rather than of individual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the two types of research design and time

A
  • Cross sectional studies = observation of population, event of phenomenon at one point in time [the census for example, which has new results every-time]
  • This causes the problem of establishing causal sequence

-Longitudinal =where we study the same phenomenon at 2 different point in time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what are the types of longitudinal studios (3)

A

Trend studies : consecutive independent samples and we compare results across these samples.

Cohort studies: groups who experience some shared event. [ ex: being born in 2000, marrying during the great depression, university graduates during the 70s ]
example: university professor earnings, looking at gender cohorts. Narrower difference overtime
Usually sample doesn’t tell us what the trend is overtime in the experiences of people in that sample - BUT cohort studies allows us to say something about the trend overtime

Panel studies: sample a set of units and gather information about those units overtime - we can look at what happens consequentially to the unit in their lifetime.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

explain the link between results and sample composition

A

-the results you get are influenced by sample composition , what you find is related to the sample selection decision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is operationalization

A

Operationalization = the process of translating abstract concepts into variables that indicate the concept

to turn general idea/concepts into something something we can observe. The development of specific research procedure or operations that will result in empirical observations representing those concepts in the world

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

depict the example of gender and earnings in terms of operationalization issues

A

Gender —> Earnings
Operational Issues:
Should illegal earnings be included ?
Hourly or annual earnings ? (it does make difference with the association between gender and earnings- it would be stronger using annual earnings, the difference is larger because women on average report fewer hours than men)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How do we define an appropriate schema - pragmatically

A

Which class scheme provides us with the best way of differentiating life chances? For example, which class scheme best predicts lifetime income?Which class scheme minimizes the within-category variance in relevant class attributes? For example, income, voting preferences.Which class ranking deals best with the fact that people live in households so that the analysis needs to come to grips with both male and female occupations? For example, what do you with a professional married to an unskilled manual worker.With which class scheme is political participation most associated - the decision to vote, who to vote for, which party to join, participation in protests, revolutionary activity ….Which class scheme works best for the purposes of comparative research that is, comparisons across countries? There may be comparable data available for some operationalizations of class and not for others.We decide by going backwards and forwards between research and theori

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

what are the 2 types of definition

A
Nominal definitions : for example, defining class as occupational category 
VS 
Operational definitions : the scheme  (original idea turned into something observable)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

what are the 4 levels of measurements

A

4 levels of measurements -
Nominal : categories which imply no ordering (ex: political parties, religion)
Ordinal : introduces ranking but uncalibrated ranking (we cant say by how much one category is superior to another - example : “strongly agree, agree, disagree, strongly disagree” ordinal scale)
Interval : ranking and precise statement of differences, distinguished by the fact they have meaningful units - example : temperature measured by Celsius or Fahrenheit
Ratio : interval scales with zeros (example : income, kelvin temperature)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

why would one aggregate scales into categories

A

-Sometimes people aggregate scales into categories ? It is a way of identifying non-linearities.
It may be that in the association between X and Y, X has no effect on Y for scores of X between zero and 50, X has a positive impact on Y for scores above 50 = non linearity, the association changes at a certain degree, different effects at different levels of X. (example: export intensity and productivity; only at export intensities of 60% or above was there an association, why? If you export more, you can reach scale economy and you can compete effectively).
Some associations vary with different levels so we may choose to aggregate interval or ratio data into categories in order to explore non-linearity.
We may start out with interval or ratio date but we may choose to aggregate it into categories

25
Q

depict the link between data and levels of measurements

A

what you can do with data is determine by to some degree the level of measurement

  • They have indications for the statistical procedures to be used
  • When you are looking at the difference between two variables, they could be measured at different levels of measurements (example : looking at association between nominal scale [religion] and ratio scale data [income]- how do you do that, you take the average.) what is the problem in comparing means across group : need to take into account which one is largest one**
26
Q

What are the different criteria of validity

A
  • Face Validity (does it make sense? Example: research on health, self rated health )
  • Construct Validity ( what behaviour would I expect to be associated with this concept?Example : addiction severity. Try to find what would be associated with addiction severity (health problems…) so I look at my measure of addiction severity and outcomes likely to be associated with it)
  • Criterion Validity (does the measure predict things that we think it should predict? Example: How religious are you? I check to see if its related to church attendance. Expecting to know whether or not they go to church or any behaviour. Why don’t we go directly to church attendance? I often cannot go to that because the data set I have does not contain that information)
  • Content Validity ( does the measure include all the relevant component of the concept ? example : feminism which is an multidimensional concept (equal treatment across sexes [equal pay, opportunities..] )
27
Q

Depict reliability

A

Does the measuring instrument produce the same result across different contexts ?
Example : Child play- looking at them all day. Do you get changes in the measurement if the person measuring is getting tired?
Does the measuring instrument yield the same result across different populations ?
Example : Same measurement generate different results between men and women
Do you get the same result across indicators ?
Example: we got a measure that is multi-dimensional, do the component provide similar results. For instance, authoritarianism. “do you dislike minorities” “adherence to certain norms “ if they are not associated, they don’t measure the same thing. If they are very associate the second component doesn’t bring important insight.

Across time … When I apply a measurement procedure at different times, do I get the same results? Example 1: standing on a set of scales (assuming this is uninterrupted by eating or changing clothes). Example 2: recording behaviour over a full day – does the accuracy change as the observer tires?
Across sub populations … Do young people systematically overstate their age and older people understate it? Do men and women differ in their propensity to report amount of pre marital sexual activity?
Across indicators? … Do the same people get classified as authoritarian using measures of conformity with norms and dislike of minorities? (If not, this raises questions about the validity of measure

28
Q

what are the ways of testing for reliability

A

Test-pre test (ask the same question more than once. Do you get the same answer in an initial or later interview. But it quite possible that being asked once will influence your second answer, there is also possibility for behaviour change)

Split Half Measure (suppose you have 10 items of a single context, take those 10 times and randomly assign them to one of two groups of 5 items. Then see the extend to which these two groups are associated (take the two groups and see if the results are the same)- they should associated if measuring the same thing)

29
Q

what is an example of an index

A

An example of an index :
“For each answer say yes, each yes is worth 1 ”

  • The sum of the yes answers is the score on the index of authoritarianism (maximum of 3 )
  • Each answer is given equal weight
  • What is the level of measurement implied by this particular index?
  • are the answer likely to be associated with the concept ** (face validity)
  • do they measure the same thing
  • do scores of the item vary (if 95% of participant have the same score)
  • to what degree are they associated
  • issue to deal with: missing values
  • Drop cases if the number of missing values is small. Problem: is the resulting sample unrepresentative? Solution, check representativeness by choosing some items for which there are no missing values (gender, age) and compare results for the sample after exclusions with before exclusions.
  • Make plausible judgments: if items are either checked ‘yes’ or left blank the blank may reasonably be interpreted to mean ‘no’.
  • Think about whether the missing values may be interpretable. B&R report that a study of religious beliefs found that those responding ‘don’t know’ on some items tended to be non believers on other items. ‘Don’t know’ on somereligious issues may indicate non belief
  • Impute the missing value: by assigning the mean, or midpoint score, by random assignment, or by modelling the missing item. That is, use your data set to i) establish predictors for the item missing for a particular case and then ii) use the model you’ve estimated to predict the value for the case for which the value is missing. This will become clearer when we discuss multiple regression towards the end of the cours
30
Q

What is one thing you can do when dealing with a concept that multidimensional, what do to with its different items

A

We can attempt to aggregate the items (ex: score and creating an index)

31
Q

-what do you do if some items are plausibly more important than other items

A
  • what to do ? weighting (example of college quality measurement are not all worth the same- where do they come from ? Someone’s judgement but they are other methods)
  • The unweighted formula generates the same score across all university
  • The weighted formula changes the number, shows that weight has an effect
  • These are plausible weights, when using weights, results will probably change
  • To weight or not to weigh ?
32
Q

Depict scales

A

They tend to incorporate items that reflect varying strengths or intensities of the concept of interest. In so far, as they succeed in doing so, we will be generated an ordinal scale. We can accomplish this with question design

33
Q

Depict the Bogardus social distance scale

A

Example: The Bogardus Social Distance Scale
-Proximity increases from 1 to 5 so we can probably conclude that 1 represents greater aversion than 5 - allows us to taps into different intensities of fear

34
Q

Thurstone scale in practice

A

we use judges to asses intensity, we incorporated the intensity in the question and then we have a measure that taps underlying intensity of the concept

** we use those judge determined score to calculate answer score (51:00 in recording)

35
Q

Depict Likert Scale

A

There is an ordinal question format. (agree, disagree…) Standard scale, we can assign numerical values and calculate an average leaving us with an ordinal measurement.

36
Q

Depict Guttman scale

A
  • used to see if question items conform to the requirement to be considered a scale
  • different indicators reflect different strength of sentiments
  • the result is that all of those people have positive attitudes to abortion
  • Some responses are consistent with the scale, patterns consistent with measures that tap different intensities throughout these issues
  • Some responses are not consistent- what would have to happen to make it consistent ? Minus and
  • 2 types of response - consistent vs non-consistent with the scale (Mixed type)
  • the cumulativeness confirm that this is a guttman scale because a lot of response fall under the scale type
37
Q

What do we do with mixed types

A

What do we do with the mix types ? 1.drop them 2. Adjust the scores In a way that minimizes the number of error produced by that score assignment that we make in doing it
How many errors do we make when adjusting ? (number of different answer X the number of cases)

38
Q

Depict Semantic Differentials

A
  • Polar descriptive at either end of the scales

- Design questions to generate scale responses

39
Q

depict the first type of sampling

A

-Non-probability :
-common in social sciences
Advantages : usually cheaper that probability sample, sometimes the only feasible option, it can yield information that may not be available in probability sample (standards probably sample is sometimes not enough to study [jews for example])
Disadvantages: non-representative of anything identifiable
1. Available population : (psych research using students because cheap, convenient- but not very representative) *Research methods are very practical, cost matters.
2.Purposes sampling : For theoretical reasons you want to find people involved/relevant to the subject studied
3.Deviant case sampling : some kind of behaviour is more common amongst people with specific backgrounds (drop out) so we have a sense of the overall pattern so you want to find out “why would someone from a privileged background would drop out”, deviant cases may extremely informative, choosing people because it has some defining characteristics that are of theoretical interest
4.Snowball sample : Constituted of a series of referral (sex work for example)
5.Quota sample : established in the population a distribution of age, gender, … so you go out and construct your sample to included designated sample proportion that coincide with the distributions. Matches the proportions identified.

40
Q

Depict the second type of sampling

A

-Probability:
EPS : simple random sampling without replacement. Using a table of random numbers in order to select cases (district from SRS[simple random sampling] because there is no replacement, not putting the card back)
Systematic : Take a list and look at every 10th case. Taking a list and designated number.
Stratified : Within designated groups (over-represented and others under-represented) Dealing with the problem of too few cases of people with characteristics of interests. Target groups in the population, over sample some and under sample others. *it could bias the results in terms of aggregates so need to correct
Cluster : Sample areas, within those areas sample smaller areas and within those you select cases (city, neighbourhood, households) coming to terms with difficulty associated with travel costs for interviews

41
Q

What issues do we have to address when determining how representatives are the means likely to be of the population from which the sample is done?

A

Issues we need to address:

  • Central tendency and dispersal
  • The normal distribution
  • Sampling error
  • The sampling distribution
  • Using the distribution and the properties of the normal curve to under population parameters - inference
42
Q

Depict central tendency

A

Central tendency
-mean (“average” , median (value with the same number of cases on either side of it, midpoint in a range of scores) , mode (most frequently observed value)
Mean: 7.7
Median : 5.5
Mode: 0
Mean is higher than the median : between there are outliers. Mean is sensitive to extremes value.
Median is not sensitive to extremes
Comparing the mean and the median gives us a first indication of dispersal
If we rely only on the central tendency, we don’t get significant representation

43
Q

Depict dispersal

A

Dispersal
What is the range ? 45
Standard deviation (kind of average of range)
Variance (standard deviation before the square root)
Standard deviation as a way to describe average

*If you only use the mean to characterized the size on this variable, it would be misleading. Dispersals vary greatly.

44
Q

Depict normal distribution

A

-family of probability distribution
-Bell shaped in form
-symmetrical, same proportion of cases on both side of the mean
-the standard deviation determines the family (yellow curve has a larger SD than blue curve)
-tails are asymptotic to the x axes (smaller and smaller but never touch it)
-Standard normal distribution : mean of zero and standard deviation of 1
proportion of cases that fall different numbers of SD from the mean are known

45
Q

depict notation

A

3 differents sets of number
-the sample generates statistics (x is the mean, s is the standard deviation)
the population has parameters (u is the mean, a is the standard deviation)
The sampling distribution also has a mean and a standard deviation (u is the mean, sigma sub x bar is the standard deviation, called the standard error)
its mean is the same as the mean of the population from which the sampling distribution was constructed
*Standard deviation of the sampling distribution is called the standard error

46
Q

-is it likely that mean income in my sample is going to be identical to be mean of the population from which the sample is drawn?

A

No. When I take a sample, it is likely that there will be sampling error. Sampling error means that sample statistic will diverge from the population parameter.

-We use sampling distribution to define sampling errors
Sampling distribution: we take a population and we draw a sample repeatedly from the population of interest, we draw a frequency distribution. That distribution of sample means is a sampling distribution. Sampling distribution has an extremely interesting property.

47
Q

How can we draw a conclusion about a population properly based on sample data?

A
  • Sample will be different from the population
  • Unlikely events can happen
  • Samples have distribution of results
  • Standard Deviation allows to observe the dispersal
48
Q

What my produce this outcome ? Why does taking samples and calculating their means squishes the distribution and tends to normality?

A

They are going to offset the extremes values as these are being submerge into a large sample
Extremes values are being reduced in incidents because they are aggregates with other values
Extremes values are offset within a sampling distribution, the large de sample the more that is the case

49
Q

The larger the sample size…

A

The larger the sample size, the closer to sampling distribution to normality. As sample size increases, the sample distribution tends to normality. Distribution narrows and tends to normality as sample size increases.

50
Q

what are the 4 steps to find the range of possibilities

A
  1. Take a sample. Calculate means (x bar) and standard deviation (s) for the specified N
  2. Calculate the standard deviation of the sampling distribution (sigma[substitute for s]/square root of N)- which gives us the standard error
  3. Determine acceptable level of risk or error
  4. Calculate the confidence interval
51
Q

How can we narrow that band with the same probability of error ?

A

Increasing the sample size (because when we do, we reduce the standard error because divided by square root of N)
We can reduce the confidence intervals by increasing sample size
SO larger samples are always better than smaller sample because larger samples deliver narrower confidence intervals

52
Q

When would larger samples not be better than smaller sample ?

A

In a non-probability sample (adding another 1000 cases doesn’t do much to improve inference)

53
Q

General points about sample and inference

A
  1. In our research we are interest about population parameters
  2. Sample data implies sampling error
  3. Inferential statistics are a way of coming to terms with sampling error
  4. Confidence interval are one way of doing so
  5. The same procedure can be used to examine a whole bunch of data of interest
  6. In the case that we looked at, we talked about Z scores. There are lots of other distribution, we usually use the T distribution (which we wont see in this course)
  7. How the range of distribution, .. ?
54
Q

explain the Z score

A

We are dealing here with a Z score. There is a table of Z score appendix C in the textbook.
Which provide areas below the curve in more detail
47.5 will fall between mean and 1.96
If we use 1.96 as a confidence interval, we have a 5% probability of error. Which will increase the with of our confidence interval (because we have to multiply the standard error with 1.96)
If we want a 1% probability of error, the confidence interval will be wider
3x 1.96 = 5.88
105 +- 5.88 so 99.12 to 110.88

We have created a broader confidence internal but we’ve reduced our probability of error
We can now determine within what range does that pop parameter falls and with what specific probability of error

55
Q

What are the purposes of exploration research - what are its limitations

A
  • satisfy curiosity and desire for better understanding;
  • test the feasibility of undertaking a more extensive study – thus, before embarking on a project on intellectual property in China that involves interviews it makes sense to go there first to assess the likely access to respondents;
  • develop the methods to be used in subsequent studies – for example, in a study of intellectual property in China, to inform choices of industries on which to focus

Seldom provides satisfactory answers to research questions.
Subjects often unrepresentative of the larger population of interest
So the findings are difficult to generalize

56
Q

What are the two kinds of variables

A

Independent variable → Dependent variables
Or, Exogenous variable → Endogenous variable
The independent variable is the cause (has an impact).
The dependent variable is the effect (is impacted).
By convention, the first independent variable is represented by X, the dependent variable by

57
Q

How to improve past cross-sectional studies

A

retrospective questions

58
Q

give definition for index and scale

A

index: a type of composite measure that summarizes and rank-orders several specific observations and represents some more general dimension
scale: A type of composite measure composed of several items that have a logical or empirical structure among them.