Week 1 Flashcards

1
Q

Descriptive statistics

A

summarize data using measures (e.g. of location and dispersion) that describe a distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Inferential statistics:

A

use probability theory to analyze phenomena subject to randomness (e.g. hypothesis testing).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Parameter definition

A

A measurable characteristic of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Statistic definition

A

A measurable characteristic of a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Objective of Statistics

A

Estimate parameters using the data. Given a sample we want to infer the parameters that characterize the population, or the law of motion of a process (in general the DGP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Note about INPUT and OUTPUT

A

Bear always in mind that our INPUT is the observed data and the OUTPUT is estimates for the unknown parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Law of Large Numbers

A

A theorem that describes the result of performing the same experiment a large number of times. According to the law, the average of the results obtained from a large number of trials should be close to the expected value, and will tend to become closer as more trials are performed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Central Limit Theorem

A

establishes that, in most situations, when independent random variables are added, their properly normalized sum tends toward a normal distribution (a bell curve) even if the original variables themselves are not normally distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Absolute frequency

A

The absolute frequency is simply the total number of observations or trials within a given range.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Relative frequency

A

How often something happens divided by all outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Frequency distribution

A

The association of modalities and their frequencies. Frequency is how often something occurs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Histogram definition

A

A diagram consisting of rectangles whose area is proportional to the frequency of a variable and whose width is equal to the class interval.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Calculating the density of the modality

A

Use relative frequencies and divide each modality by the width of the modality.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Sample Statistics: Measures of Location

A
  • Arithmetic mean
  • Median
  • Percentile
  • Quartile
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Arithmetic mean

A

the average of a set of numerical values, as calculated by adding them together and dividing by the number of terms in the set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Median

A

The middle number (in a sorted list of numbers).

17
Q

Percentile

A

Values such that the distribution is divided in 100 equal parts. Computation:

1) Arrange the sample in ascending order
2) Observe the index i of the position of the pth percentile

18
Q

Sample Statistics: Measures of Variability

A
  • Variance
  • Standard deviation
19
Q

Variance

A

The average of the squared differences from the Mean.

  1. First step is to find the Mean
  2. Now we calculate each dog’s hight difference from the Mean
  3. To calculate the Variance, take each difference, square it, and then average the result
    1. In Population divide by N
    2. In Sample divide by N-1
20
Q

Standard Deviation

A

And the Standard Deviation is just the square root of Variance

21
Q

Shape of distributions

A
  1. Skewness: measure of asymmetry (third moment of the data)
  2. Kurtosis: measure of “tailedness” (fourth moment of the data)
22
Q

Skewness

A

Data can be “skewed”, meaning it tends to have a long tail on one side or the other.

23
Q

Kurtosis

A