Applied Quantitative Methods Flashcards
Variable
Usually denoted by capital letters such as X or Y, is a
characteristic or measurement that can be determined for each
member of the population.
Numerical variable
Take on numerical values.
Continuous variable
We measure it. Distance, height, GDP in kr., value of cars sold in kr.
Categorical variable
Known as qualitative data where the data is categorised (smoking vs non-smoking, vote (yes/no)) - the numbers in this type of data are purely for identification purposes (Ronaldo no.7, Christian Eriksen no. 10.)
Population
Collection of persons, things or objects under study.
Sampling
Select a subset (or portion) of the population, to gain information about population data.
Sample
Resulting data from sampling a population.
Statistic
Number that represents a property of the sample (e.g., sample mean, sample variance, etc.)
Parameter
Numerical characteristic of the whole population (e.g.
population mean, population variance, etc.)
Simple Random Sample
Chosen by a process that selects a sample of n objects from a population (N) in such a way that each member of the
population has the same probability of being selected.
Sampling Distributions
The population parameter (e.g., mean µ or variance ‡2), is a fixed (but
unknown) number.
But each sample from a population, has a different value of the mean and
variance. If you pick many samples and calculate the mean (and variance) of each sample, then the sample means (and variances) become a variable, which
can be treated as a random variable with a probability distribution.
Law of large numbers
States that given a random sample of size n from a population
N, the sample mean X¯ will approach the population mean µx as the sample size n
becomes large
Central Limit Theorem
States that the mean of a random sample, drawn from a population with any probability distribution, will be approximately: normally distributed given a large-enough sample size
Acceptance Interval
Is an interval where the sample mean has a high probability of occurring (given that we know the population mean and variance) If the sample mean falls within that specified interval, then we can accept
the conclusion that the random sample came from the population with the known mean and variance.
Distribution of sample proportion
Assume, we are dealing with a qualitative or categorical variable
For example, we investigate a characteristic (e.g. smoker/non-smoker) and note 1 if an individual has this characteristic and 0 otherwise. The (unknown) proportion of ones in the population is denoted P. We have a sample of 0 and 1 values.