Econ 2740 Flashcards

1
Q

Population

A

: The group of all items of interest to a statistics practitioner(Everyone we are interested learning about). Frequently very large or can be infinitely large.

a. Examples Include: All tv viewers, Canadians, students, and all human beings
b. Typically we do not observe the population because it is too difficult

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Parameter

A

A descriptive measure of the population. In most applications of inferential statistics, the parameter represents the information we need.
a. Ex. Percent of Canadian voters who plan to vote for NDP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Sample:

A

A set of data drawn from the studied population (Smaller number of the population)
a. Reason for using a sample is because it is cheaper and easier to collect data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Statistic

A

A descriptive measure of a sample. Statistics are used to make inferences about parameters
a. Ex. Percent of the 1,200 voters polled who plan to vote for NDP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Confidence Level:

A

The proportion of times than an estimating procedure will lead to correct conclusions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Significance Level

A

Measures how frequently the conclusion will lead to false conclusions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Descriptive statistics

A

Just describe the sample, without worrying about the population. Includes graphical and numerical methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Interval Data:

A

Also known as quantitative or numeric data. They are numbers that have meaning. Example, age, years of schooling, wage GDP, foul shot percentage and exchange rate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Ordinal Data:

A

Numbers denote ordered categories and only the order matter. Ex. Highest degree completed 1 (none), 2 (elementary), 3 (high school), 4 (university).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Nominal Data:

A

Also known as Categorical or Qualitative. Numeric values just denote a name or category. They have no meaning as a number. Example, sex 0 (male), 1 (female) or postal code

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Frequency

A

: Number of observations falling into a group or category

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Relative Frequency

A

: Proportion of observations falling into a group or category

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Cumulative Relative Frequency:

A

: Proportion of observations falling into a group and all previous groups;
• Applies only to ordered groups
• Applies to Ordinal, but not nominal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Histograms

A

A graphical display of data using bars of different heights

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Reverse Causality

A

When changes in the dependent variable (Y-Variable) cause changes in the independent variable (X-Variable)
• Put another way, the causation goes in the opposite direction as expected
• You see a relationship in the scatter diagram, but the interpretation is opposite to what you would think.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Bottom Line:

A

Due to indirect and reverse causation care is needed when interpreting relations between variables

17
Q

Direction Causation

A

When Changes in the independent variable (X-Variable) cause changes in the dependent variable (Y-Variable).

18
Q

Indirect Causation:

A

When changes in the X and Y-Variables are both caused by a third variable ( Say Z).
• You will observe a relationship in the scatter diagram even if X does not impact Y and Y does not impact X.
• Be careful interpreting such a relation ship

19
Q

Modality

A

A unimodal histogram with a single peak, while a bimodal histogram is one with two peaks:

20
Q

Measures of Variability

A

: Measures of central location fail to tell the whole story about distribution. How much are the observations spread out around the mean value.

21
Q

Sample Standard Deviation:

A
  • To obtain sample variance we squared distance of each observation from the mean
  • Now we undue the squaring by taking the square root
  • The standard deviation is simply the square root of the variance, thus:
22
Q

Covariance (Generally Speaking):

A
  • When two variables tend to move in the same direction (both increase or both decrease), the covariance will be a large positive number
  • It is extremely rare that two variables always move in the same direction
  • When two variables tend to move in opposite directions, the covariance is a large negative number
  • When there is no particular pattern, the covariance is a small number
  • However, it is often difficult to determine whether a particular covariance is large or small
23
Q

Sample of Coefficient of correlation:

A

The coefficient of correlation is defined as the covariance divided by the standard deviations of the variables

24
Q

Sampling Errors

A

Differences between population and sample that occur because of the observations that happened to be picked from our sample

25
Q

Nonsampling Errors

A

Differences between population and sample that occur due to a flaw in the sampling method

26
Q

Stratified Random Sampling

A
  1. Divide the population into two (or more) mutually exclusive groups (stratas).
  2. Randomly sample from each strata
27
Q

Cluster Sampling

A

Random sample of groups or clusters of observations.

• E.g. Draw townships, postal codes, or city blocks at random then survey the residents

28
Q
  1. Classical Approach
A

Based on equally likely events

29
Q
  1. Relative Frequency
A

Based on experimentation or historical data

30
Q
  1. Subjective Approach
A

Based on (subjective) judgment

31
Q
  1. Bayesian Approach
A

Based on combination of subjective assessment with relative frequency

32
Q

Marginal Probabilities

A

Computed by adding across rows and down columns; that is they are calculated in the margins of the table

33
Q

Conditional Probability

A

Used to determine how two events are related. That is, we can determine of one event given the occurrence of another related event.
Written as P(A|B)

34
Q

Independence

A

• One of the objectives of calculating conditional probability is to determine whether two events are related
• In particular, we would like to know whether they are independent, that is, if the probability of one event is not affected by the occurrence of the other event.
-There are independent if:
P(A|B)=P(A) or P(A|B)=P(B)

35
Q

Multiplication Rule

A

Used to calculate the joint probability of two events. It is based on the formula for conditional probability earlier defined

P(A|B)= [P(A and B)] / P(B)