Hypothesis Testing (3) Flashcards
discrete random variables
Discrete random variables: can take only whole-number (countable) values
Ex: households can’t be ½ or ⅓ of a household – they must be whole numbers
- represented by bars in a graph
- described by a probability mass function
continuous random variables
Continuous random variables: don’t have to be whole numbers; they can take any value in a range
- represented by a continuous curve (eg, the bell curve)
- described by a probability density function
Normal (Gaussian) Distribution
1. consider two normal random variables – through standardization we can make them share the same basic μ and σ statistics in order to represent them together on a single standard normal distribution
2. z = (x - μ)/σ; eg, z = 1.6
3. use the normal distribution table with the appropriate tail to find the probability for a z score of 1.6
STANDARDIZATION IS THE MOST IMPORTANT STEP – IT MAKES THINGS EASY
-the standard normal distribution has a mean = 0 and standard deviation = 1, and is
symmetrical around the mean – the probability of something less than z = -2 is the
same as the probability of something greater than z = 2
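A minimal sketch of this standardization step in Python, assuming scipy is available; the observation, mean, and standard deviation below are invented to reproduce the card’s z = 1.6:

from scipy import stats

x, mean, sd = 68.0, 60.0, 5.0   # hypothetical observation, population mean, std dev
z = (x - mean) / sd             # z = (x - mean) / s  ->  1.6
p_upper = stats.norm.sf(z)      # upper-tail probability P(Z > 1.6), ~0.0548
print(z, p_upper)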
central limit theorem
- states that the distribution of sample means approaches a normal distribution as the sample size grows, regardless of the shape of the population distribution
(2) assumptions
• it assumes that the individuals in the population are independent, meaning that
one individual does not influence any other
• it assumes that any samples are random and identically distributed – ie, have no
internal structure
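A quick simulation sketch of the theorem itself (not from the card; the exponential population and all sizes are arbitrary choices):

import numpy as np

rng = np.random.default_rng(42)
population = rng.exponential(scale=2.0, size=100_000)   # a deliberately skewed population
# means of many independent random samples are approximately normal
sample_means = [rng.choice(population, size=50).mean() for _ in range(2_000)]
print(np.mean(sample_means), np.std(sample_means))      # ~2.0 and ~2/sqrt(50)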
confidence intervals
• for a normal distribution, 95% of all data points fall within 1.96 standard deviations
of the mean
• this tells us that any sample of the population we take
should fall within ±1.96 standard deviations of the mean 95%
of the time
we are 95% confident that the true mean height of students in this class is between 163.2
and 175.6 cm
• we can do this with other confidence levels, eg, 90% (z = 1.645), 99% (z = 2.58),
although 95% is most conventional
• if we increase n, then the range of values will decrease
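A sketch of the 95% interval calculation (the heights below are invented, not the card’s class data; strictly, with n this small the t value from the next card would be more appropriate than 1.96):

import numpy as np

heights = np.array([165.1, 172.3, 158.9, 170.4, 168.2, 175.0, 163.7, 169.8])
mean = heights.mean()
se = heights.std(ddof=1) / np.sqrt(len(heights))    # standard error of the mean
lower, upper = mean - 1.96 * se, mean + 1.96 * se   # mean ± 1.96 standard errors
print(f"95% CI: {lower:.1f} to {upper:.1f} cm")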
Large and Small Samples and normal distribution
Large (>30 individuals): use the normal distribution curve
Small (<30 individuals):
• when the sample is “small”, the distribution of points
follows the t distribution, not the normal distribution
(although these look similar, there is a subtle difference in
the shapes)
• the t distribution uses n - 1 degrees of freedom instead
of the sample size n
• instead of z0.95 = 1.96, we use t0.95 = 2.045 (for n = 30, ie 29 degrees of freedom)
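The card’s two critical values can be confirmed with scipy (a sketch, assuming n = 30 for the t value):

from scipy import stats

print(stats.norm.ppf(0.975))       # ~1.96, the large-sample z value
print(stats.t.ppf(0.975, df=29))   # ~2.045, the small-sample t value with n - 1 = 29 df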
How to determine a specific sample size based on wanting to achieve a specific confidence level (3 steps)
1. which population parameter is of interest
• most of the time we are interested in μ
2. how close do we wish the estimate made from our sample to be to the true value of the population parameter
• this is a question of precision; obviously, if we don’t need to be too precise, we can get away with fewer individuals
3. how confident do we wish to be that our estimate is within the tolerance specified in step 2
• this is the α value; a good default is α = 0.05 (95% confidence), but we could tighten this to α = 0.01 (99% confidence) if this were a really important study
n = ((z score (95%) × estimated standard deviation) / allowable error)²
n = ((1.96 × 0.7) / 0.2)²
n = 47.06 → round up to 48 (see the sketch after this card)
Simplified =
- what population parameter we are interested in
- how close do we wish the estimate to be to the population parameter (a question of precision)
- how confident do we want to be?
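A sketch of the worked example above in Python (the values 1.96, 0.7, and 0.2 come from the card):

import math

z, s, error = 1.96, 0.7, 0.2    # z score, estimated std dev, allowable error
n = (z * s / error) ** 2        # n = (z * s / E)^2
print(n, math.ceil(n))          # 47.06 -> round up to 48 individuals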
95% confidence interval represents…between ____ and ____
95% is the most conventional level and corresponds to z = -1.96 to z = +1.96
Hypothesis Testing (6 steps)
• each hypothesis test is essentially asking the same question: is our sample the same
as the population, or is it different?
1. formulate a hypothesis
• two-sided:
- H0: the sample statistic is the same as the population parameter
- HA: the sample statistic is not the same as the population parameter
• one-sided: instead of “not the same”, HA is either “greater than” or “less than”
2. specification of the sample statistic and its sampling distribution
- this is generally prescribed by the research question: are we basing our test on the mean, variance, proportion, etc.?
3. selection of a level of significance
- unfortunately, it is almost always impossible to simultaneously minimize the probability of both types of error – we can limit α or β, but not both
• by convention, we control only for α, setting a typical value of 0.10, 0.05, or 0.01
- if we set α to a small value, and we end up rejecting H0, we do so with only a small probability of error
• at the same time, we cannot be confident that a decision to accept H0 is correct, since β is uncontrolled
• this means that H0 should always be something we want to reject (innocent until proven guilty)
4. construction of a decision rule
- the rejection region is always in the extreme limbs of the distribution
• the boundary, or critical limit, between the reject and do-not-reject regions is derived from probability tables, and can often take on standard values (providing the sampling distribution remains the same)
5. compute the value of the test statistic
6. decision
• the decision simply compares the test statistic from step 5 against the decision rule from step 4
• when the test statistic falls beyond the critical limits defined by the decision rule, we reject H0, and vice versa
• ex: if we reject, the interpretation is that the sample and the population are not the same – eg, Peterborough County does not receive the same amount of precipitation as Ontario
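A sketch of the 6 steps as a one-sample z test in Python; the precipitation values and the Ontario mean below are invented placeholders, not the course’s data:

import numpy as np
from scipy import stats

mu0 = 880.0                                    # step 1: H0 says county mean = Ontario mean (hypothetical)
sample = np.array([910., 935., 902., 948., 921., 899., 940., 915.])
# step 2: base the test on the mean, with a normal sampling distribution
alpha = 0.05                                   # step 3: level of significance
z_crit = stats.norm.ppf(1 - alpha / 2)         # step 4: two-sided critical limit, ~1.96
z = (sample.mean() - mu0) / (sample.std(ddof=1) / np.sqrt(len(sample)))  # step 5
print(z, "reject H0" if abs(z) > z_crit else "do not reject H0")         # step 6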
Why use a one-sided test rather than a two-sided test, or the other way around?
- decide based on the research question: one-sided if a direction is specified (bigger or smaller), two-sided if testing “equal or not”
- if there is no direction (“is it different?”), it is a two-sided test
Type 1 and 2 errors
Type 1: reject H0 when H0 is actually true
Type 2: accept H0 when H0 is actually false
The HIGHER the confidence (and the LOWER the α value), the lower the chance of a Type 1 error
The LOWER the confidence (and the HIGHER the α value), the lower the chance of a Type 2 error
Assumptions in hypothesis testing (4)
• the data are normally distributed, or at least near-normally distributed
• the observations are independent
• in spatial data sets, this assumption is usually violated – nearby rainfall
stations will behave similarly simply because they are influenced by the
same weather patterns
• the correction usually appears as a reduction in sample size – since some of the
observations behave the same, we can group them together as a single
observation
- left uncorrected, the inflated n has the potential to increase the probability of a Type I error
rule-of-thumb: if you reject H0 at two-tails, then you
will always _____ H0 at one-tail, all other things being
equal
reject
classical method of hypothesis testing is less used in practice because..
While the classical method of hypothesis testing is useful to understand, it is less
used in practice than the alternative “prob-value” method
• one of the drawbacks of the classical method is that we need to specify α – often
there is no rational way of deciding what significance level to use – we know it
should be small, but how small?
• if 2 researchers choose different α levels, they might reach different
conclusions about a dataset – convention rather than theory dictates the choice
of α
• also, the classical method only determines whether we reject or accept H0 – this
simple decision leaves out important information, such as how confident we are that
we can reject H0
• if z = 1.96 and we get test statistics of 2.01 and 5.36 for two datasets, we reject
H0 for both, but clearly one dataset is more different than the other
Prob-value hypothesis testing method is better because
• the prob-value method avoids both of these problems by directly computing the
probability of making a Type I error
• if we reject the null hypothesis, the prob-value tells us how likely it is that we
are wrong
• assume that we calculate a test statistic of z = 1.5
• if we take that value to a z table, we would find a prob-value of 0.0668
• but note that this is for a one-tail test – if we are doing 2 tails we must double
the prob-value to 0.1336
• the prob-value is the lowest value at which we could set the significance of the
test and still reject H0
• likewise, we find that the likelihood of making a Type I error is 13.36%
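The card’s arithmetic, reproduced with scipy as a sketch:

from scipy import stats

z = 1.5
one_tail = stats.norm.sf(z)   # ~0.0668, the one-tail prob-value
two_tail = 2 * one_tail       # ~0.1336, the Type I error risk for a two-tail test
print(one_tail, two_tail)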
What is the likelihood of making a Type 1 error (for z = 1.5, two-tailed)?
13.36% – the one-tail prob-value of 0.0668, doubled
P Value:
- found from the z statistic (which is computed using the standard deviation) and compared against the chosen significance level
in a one-sample test output table what do the significance(2-tailed) and interval of differences tell you
sig=0.000
interval:
-341.0441 to -176.5871
The question (here, about rivers) asks: is the difference 0? The confidence interval shows us that 0 is not within the range of -341.0441 to -176.5871, and sig = 0.000 is below 0.05, so we reject H0 – the sample differs from the test value.
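A sketch of a comparable one-sample t test in scipy (the river flows and the test value below are invented, so the numbers will not match the output table described above):

import numpy as np
from scipy import stats

flows = np.array([412., 388., 455., 430., 401., 469., 444., 390.])
t_stat, sig_2tailed = stats.ttest_1samp(flows, 700.0)   # 700.0 is a made-up test value
diff = flows.mean() - 700.0
t_crit = stats.t.ppf(0.975, df=len(flows) - 1)
se = stats.sem(flows)
print(sig_2tailed)                                 # significance (2-tailed)
print(diff - t_crit * se, diff + t_crit * se)      # 95% interval of the difference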
What kind of test would you use for these?
• a cartographer tests the time taken by students to perform a set of tasks involving
the extraction of map information. At the end of the course, students are given
the same test again. Have the students learned how to use maps more effectively?
• the mean household size in a certain city is 3.2 persons, with a standard deviation
of 1.6. A business interested in estimating weekly household expenditures on food
takes a random sample of 100 households and finds a mean household size of 3.6
persons. Is the sample representative of the whole city?
• a geographer interested in comparing the shopping habits of men and women
interviewed 95 men and 115 women and determined the distances travelled by
each respondent to the store at which their last major clothing purchase was
made. The average distance travelled by men was 6.2 km and by women 14.8 km.
The standard deviations were 17.5 and 23.2 km, respectively. Is it true that, on
average, women drive 2 km further than men for shopping?
Scenario 1 = paired-sample test (testing once and then testing the same people again later on)
Scenario 2 = one-sample test
Scenario 3 = two-sample test
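A sketch of the matching scipy calls, one per scenario; all data below are invented placeholders with roughly the stated means and standard deviations:

import numpy as np
from scipy import stats

before = np.array([62., 55., 71., 48., 66.])   # map-task times, first attempt
after  = np.array([50., 47., 60., 44., 58.])   # same students retested
print(stats.ttest_rel(before, after))          # scenario 1: paired-sample test

households = np.random.default_rng(1).normal(3.6, 1.6, 100)
print(stats.ttest_1samp(households, 3.2))      # scenario 2: one-sample test vs city mean

men   = np.random.default_rng(2).normal(6.2, 17.5, 95)
women = np.random.default_rng(3).normal(14.8, 23.2, 115)
print(stats.ttest_ind(men, women))             # scenario 3: two-sample test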
Levene’s test for equality of variances tells you..
whether the sample variances are equal
Levene’s Test for Equality of Variances compares the variance of the two samples; if the p-value is above 0.05, the two variances are not significantly different and equal variances can be assumed.
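A sketch of Levene’s test in scipy (invented samples; the point is only the p > 0.05 reading):

import numpy as np
from scipy import stats

a = np.random.default_rng(0).normal(10, 2, 40)
b = np.random.default_rng(1).normal(10, 2, 40)
stat, p = stats.levene(a, b)
print(p, "equal variances plausible" if p > 0.05 else "variances differ")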
Tobler’s First Law of Geography
“everything is related to everything else, but near things are more related than distant things” – eg, nearby rainfall stations behave similarly because they are relatively close together and affected by similar weather patterns
Type 1 error is easier in larger data sets because
rejection of the null hypothesis is easier in larger datasets – we
can find significant differences in datasets when n is larger
• since spatial dependence raises n without providing useful data,
we are at risk of making Type-I errors – rejecting the null
hypothesis when it is actually true
sampling method 3 important parts
the sampling method defines how we select individuals from a
population to be part of a sample
• the population includes all members of the group of interest
• the sampling frame includes a list of members of the population
which can be selected for a sample – the sampling frame defines
what is being studied and what is not
• the sample is a subset of the members of the sampling frame
sampling bias
sampling bias, whether purposeful or not, means that some members of the
sampling frame are less likely to be included in the sample than others –
every entity should have an equal opportunity to be included, or not
included
define
- non probability sampling
- convenience sampling
- probability sampling
- simple random sampling
- systematic random sampling
- cluster sampling
1• non-probability sampling: the researcher cannot say what the chances are, or
likelihood, of an entity being selected for the sample
2• convenience sampling: the researcher includes all individuals that are
readily accessible until he/she is happy with the sample size
3• probability sampling: the chance or likelihood of each individual in the sampling
frame being selected for the sample is known
4• simple random sampling: randomly choose n
entities from the sampling frame
- always use a random number generator to
pick out individuals from the sampling
frame – never think of a random number
- note: random numbers are not necessarily
evenly spaced; if clusters arise from a
random process, it is still random
5• systematic random sampling: the first selected entity is chosen randomly, but
then every nth entity is chosen from the sampling frame
6.cluster sampling: a small number of subgroups are chosen, then
clusters of entities are chosen from those subgroups
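A sketch contrasting simple random and systematic random sampling on a toy sampling frame (everything below is illustrative):

import numpy as np

frame = np.arange(1000)                  # sampling frame: 1000 entity IDs
rng = np.random.default_rng(7)           # always use a random number generator

simple = rng.choice(frame, size=20, replace=False)   # simple random sample

step = len(frame) // 20                  # systematic: random start, then every nth entity
start = rng.integers(step)
systematic = frame[start::step][:20]

print(sorted(simple))
print(systematic)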
______ sampling is the worst for accuracy and bias
Cluster
• this is a mix of convenience and random sampling, and can
introduce severe bias in highly heterogeneous regions
• often used in political surveys, where certain neighbourhoods
are targeted to assess how they may vote in an upcoming
election
objects vs fields
• objects: discrete entities (eg, people, families, cities, schools, countries,
etc.)
• fields: continuous entities (eg, elevation, temperature, energy, salinity,
etc.)
• measuring continuous fields is very challenging, and the researcher usually
reverts to using discrete sampling methods to sample the continuous field
• eg, precipitation (a continuous field) is measured using a precipitation
gauge (a discrete entity)
representativeness
representativeness: the degree to which the smaller set resembles the larger set
• if our sampling method leads to non-representative samples, then we have
introduced bias
• the sampling frame generalizes the population, while the sampling method
generalizes the sampling frame
• non-probability sampling means the researcher cannot say with much
certainty how well the sample represents the sampling frame, which
means the link between the sample and population is less certain
- redundancy means that you have wasted resources like time and money by continuing data collection for too long
- not collecting enough data can result in missing important population characteristics
the entities that provide our data make up the _____, while the entities to
which we ultimately want to generalize make up the _____
sample
population
2 situations where Non parametric methods can be applied
there are two situations where nonparametric methods can be applied:
1. when the data of the random variable is nominal or ordinal, parametric
methods cannot be used – it is impossible for a nominal or ordinal variable
to be normally distributed
2. when the nature of the underlying population distribution is unknown or
unspecified – we don’t know that the random variable is normally
distributed
T OR F
it is possible for a nominal or ordinal variable
to be normally distributed
F
IT IS IMPOSSIBLE
Parametric vs Non-Parametric
non-parametric is not as quantitative; it can use ordinal or nominal data
- seen as worse than parametric, but in some situations there is no choice and it works well
- less powerful than parametric, creating more chance of a Type II error (failing to reject a false H0)
- parametric tests are very sensitive to normality; non-parametric tests are more robust and less affected by non-normality
- non-parametric can handle all types of data (nominal, ordinal, ratio, interval) with fewer restrictions
- non-parametric is easier to understand as it uses simple mathematics
- MORE reliable for SMALL samples, as normality is often violated
homoscedasticity
The assumption of homoscedasticity (meaning “same variance”) is central to linear regression models. Homoscedasticity describes a situation in which the error term (that is, the “noise” or random disturbance in the relationship between the independent variables and the dependent variable) is the same across all values of the independent variables.
goodness of fit tests
goodness-of-fit tests are a special class of nonparametric test which can be used to determine if the distribution of a random variable fits a prescribed probability distribution, eg, the normal distribution
• the simplest test for normality is the _____-______,
• the simplest test for normality is the quantile-quantile, or Q-Q, plot
• when the points on a Q-Q plot are a straight line, then we can assume
normality, otherwise it may be some other non-normal distribution
- ex: the Q-Q plot gives a plotted reference line along which your points should run
- unfortunately, the Q-Q plot approach is rather subjective – how close to a
straight line is needed to confirm normality?
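A sketch of drawing a Q-Q plot with scipy and matplotlib (invented normal data; points hugging the reference line suggest normality):

import numpy as np
from scipy import stats
import matplotlib.pyplot as plt

data = np.random.default_rng(0).normal(170, 6, 100)   # made-up heights
stats.probplot(data, dist="norm", plot=plt)           # points plus the reference line
plt.show()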
Chi-square test is used to test..
the non-parametric chi-square test is used to confirm the independence of two samples
• when comparing two datasets (eg, two-sample difference of means), the
assumption is that the 2 series are independent – obviously, if 2 or more individuals
are acting together they will inflate n and make it easier to make a Type I error
• H0: the samples are independent
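A sketch of a chi-square independence test in scipy, on an invented 2×2 contingency table:

import numpy as np
from scipy import stats

table = np.array([[30, 10],    # made-up counts: rows = group A/B,
                  [20, 25]])   # columns = yes/no
chi2, p, dof, expected = stats.chi2_contingency(table)
print(chi2, p)                 # a small p-value would reject independence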
the Shapiro-Wilk test is another, more probabilistic approach
• H0 is always that the random variable is normally distributed
• this prob-value approach tells us that there is a 25.4% chance of a Type I error if
we reject H0, therefore we must conclude that the data are normally distributed
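A sketch of the Shapiro-Wilk test in scipy (invented data; a p-value above the significance level, like the card’s 0.254, means we do not reject normality):

import numpy as np
from scipy import stats

data = np.random.default_rng(5).normal(0, 1, 50)
stat, p = stats.shapiro(data)
print(p, "treat as normal" if p > 0.05 else "not normal")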