Maths 2 Flashcards

1
Q

Types of variables

A

Categorical/Qualitative: observation belongs to one of a set of categories (numeric or non-numeric)
Quantitative: observation takes numeric value representing magnitudes (how much or how many)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Discreet vs. Continuous

A

Discreet: possible values are operate numbers, finite, countable
Continuous: possible values form an interval, infinite, measurable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Level of measurement

A

Non-metric
Nominal: categorical, can be distinguished (e.g.: first name)
Ordinal: qualitative, can be ordered, distinguish and compare (e.g.: grades)

Metric:
Interval: quantitative, zero point is arbituary, distinguish, compare and meaningful difference (e.g.: temperature)
Ratio: zero point is absolute, distinguish, compare, meaningful differences and ratios (e.g.: income)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Measures of central tendency and dispersion

A

Central tendency: mean, mode, median, percentiles

Dispersion: deviation, variance, standard deviation, Range and IQR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Interpret frequency distributions

A
  1. Shape (peaks, skewed), centre (central tendency), spread (dispersion)
  2. Look for outliers (striking deviations from the pattern)
  3. Make statements
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Analysis single variables

A

Nominal: frequency tab, bar chart, mode
Ordinal: frequency tab, bar chart, mode, median, range, IQR
Metric: frequency tab, bar chart, median, mean, Std. Dev., range, IQR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Crosstables

A

Describe joint distribution of two variables
For all level of measurements
Illustrate absolut and relative frequency (also conditional frequencies)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Scatterplots

A

Quantitative variables
plotted symbols illustrate combinations of values (of the two variables)
Correlation easily visible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Covariance and Correlation

A

Covariance
Positive –> positive correlation between variables
Negative –> negative correlation

Correlation
[0.1;0.6) –> weak positive
[0.6;1] –> strong positive

Pearson: two metric variables
Spearman: one or both ordinal variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

OLS

A

Ordinary least squares –> linear regression
Residual: difference between observed value and fitted (predicted value)
Best line minimises sum of the squared distance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Goodness of Fit

A

How good the regression line fits the data

R^2 falls between 0 and 1 –> the higher, the better the fit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Definitions: probability, random experiment, random variable, probability distribution

A

Probability: relative possibility that an event occurs
Random experiment: process leading to occurrence of one of all possible outcomes
Random variable: variable whose value is a numerical outcome of a random phenomenon
Probability distribution: function that relates each value of random variable to its probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Binomial Distribution

A

Fixed number n of observations
Two possible outcomes
Probability remains the same for each observation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Normal distribution

A

n approaches infinity –> probability of a specific event turns zero
Area under the curve corresponding to an interval is probability that variable assumes a value within this interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Population, Sample, Sampling methods, Finite vs infinite

A

Population: set of all elements of interest of a study
Sample: subset of the population

Simple random sampling, stratified random sampling, cluster sampling, systematic sampling, convenience sampling, judgemental sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Sampling distribution and Central limit theorem

A

Each sample has different statistics –> the distribution of the different samples’ means is called sampling distribution

Central limit theorem: sampling distribution can be approximated with a normal probability distributions the sample size becomes larger

17
Q

Hypothesis Test

A

Idea: test whether a claim about a population is correct or incorrect

  1. Assumption
  2. Hypothesis
  3. Significance level
  4. Test statistics
  5. P-value
  6. Conclusion
18
Q

Kind of tests

A

One sample T-Test: claim - population mean is equal to particular value (H0: u = u0)
Independent samples T-Test: claim - no difference between two population means (two samples, H0: u1 = u2)
Dependent samples T-Test: claim - no difference between two population means (one sample, H0: ud = 0)
Chi-square: claim - variables are independent
Pearson/Spearman: claim - no correlation between two variables in the population (H0: p1 = 0)