STATS END TERM EXAM Flashcards
is selected from the population and the data gathered from it will represent the data that can be gathered from the entire population.
sample
is concerned with the selection of a subset of population that will be used to estimate the characteristics of the entire population.
Sampling technique
is a statistical error that occurs when an analyst does not select a sample that represents the entire population of data. As a result, the results found in the sample do not represent the results that would be obtained from the entire population
Sampling Errors
TYPES OF SAMPLING ERRORS
Population-Specific Error
Selection Error
Sample Frame Error
Non-response Error
occurs when a researcher understands who to survey.
Population-Specific Error
occurs when the survey is self directed, or when only those participants who are interested in the survey respond to the questions. Researchers can attempt to overcome selection error by finding ways to encourage participation.
Selection Error
occurs when a sample is selected from the wrong population data
Sample Frame Error
occurs when a useful response is not obtained from the surveys because researchers were unable to contact potential respondents (or potential respondents refused to respond).
Non-response Error
SLOVINβs FORMULA
π = π / 1 + ππ 2
TYPES OF SAMPLING TECHNIQUES
Probability Sampling
Non-Probability Sampling
it is a sampling procedure where every element of a population is given an equal chance of being selected as a member of a sample.
Probability Sampling
This is a sampling procedure in which an element of the population is not given an equal chance of selected sample.
Non-Probability Sampling
TYPES OF PROBABILITY SAMPLING
Simple Random Sampling
Systematic Sampling
Stratified Sampling
Cluster Sampling
- This is the most basic sampling technique
- It is a sampling technique in which every element of the population has the same probability of being selected for inclusion in the sample.
Simple Random Sampling
- Is another type of probability sampling which is also known as interval sampling.
Systematic Sampling
- This method considers an interval in selecting a sample from a given population.
Systematic Sampling
- Is a random sampling technique in which a list of elements of the population is used as a sampling frame and the elements to be included in the desired sample are selected by skipping through the list at regular intervals.
Systematic Sampling
- Is a random sampling method that divides a population into different homogeneous subgroups called strata. Random samples will be selected from each stratum so that the population will be well presented. We use stratified random sampling when we consider subgroups like year level of students, gender and age, among others.
Stratified Sampling
- Is a random sampling technique in which the population is first divided into strata and then the samples are randomly selected separately from stratum.
Stratified Sampling
is the subset of strata.
Stratum
This type of random sampling is also called area sampling because it is usually used on a geographical basis.
Cluster Sampling
requires a complete list of clusters that represent the sampling frame. Choose a few clusters randomly as a source of primary data and the data that can be collected from each cluster to represent the characteristics of the whole population.
Cluster Sampling
TYPES OF NON-PROBABILITY SAMPLING
Convenience Sampling
Purposive Sampling
Quota Sampling
Snowball Sampling
- Selecting a participant because they are often readily and easily available.
- Tends to be a favored sampling technique among students as it is inexpensive and an easy option compared to other sampling techniques.
- This often helps to overcome many of the limitations associated with research.
- Also known as accidental, opportunity or grab sampling.
Convenience Sampling
or judgmental sampling is a strategy in which particular settings, persons, or events are selected deliberately in order to provide information that cannot be obtained from other choices.
Purposive Sampling
Samples are chosen based on the goals of study. They may be chosen based on their knowledge of their being conducted or if they satisfy the traits and conditions set by the researcher.
Purposive Sampling
It requires careful planning and justification to ensure your sample is representative of the population youβre interested in studying.
Purposive Sampling
participants are chosen on the basis of predetermined characteristics so that the total sample will have the same distribution of characteristics as the wider population
Quota Sampling
If the desired quota is reached, the drawing of samples is terminated.
Quota Sampling
uses a few cases to help encourage other cases to take part in the study, thereby increasing sample size.
Snowball Sampling
This approach is most applicable in small populations that are difficult for access due to their closed nature (like secret and inaccessible professions).
Snowball Sampling
Participants in the study were asked to recruit other members for the study.
Snowball Sampling
is a number describing a whole population.
parameter
is a number describing a sample.
statistic
is an area of statistical inference wherein we can evaluate a conjecture about some of the characteristics of a population based on the data gathered from the sample.
Hypothesis testing
is an educated guess that can be tested.
Hypothesis
This type of error rejects the null hypothesis when in fact it is true.- error is also known as alpha (πΌ) error.
Type I Error
This type of error fails to reject the null hypothesis when in fact it is false. - is also known as beta (π½) error.
Type II Error
depends on the statistician or researcher who is willing to commit type I error.
level of significance
is used when the alternative hypothesis is directional. It means that the value of the measures is either greater than or less than the other measure.
one-tailed test
is a hypothesis where the rejection region lies at only one tail of the distribution.
one-tailed test
is used when the alternative hypothesis is non-directional, which means that the values of two
two-tailed test
- is a hypothesis test where the rejection region lies on both end tails of the distribution, one on the left and one on the right.
two-tailed test
This is used as a basis for deciding whether the null hypothesis should be subjected.
Test statistics
This is the set of values of the test statistic that leads to rejection of null hypothesis.
Rejection region
This is the set of values of the test statistic that leads to acceptance of the null hypothesis.
Non-rejection region
This is the set of values of the test statistic that separates the rejection and non-rejections
Critical Value
is a procedure in making decisions based on sample evidence or probability theory used to determine whether the null hypothesis is accepted or rejected. If the statement is found reasonable, then the hypothesis will be accepted, otherwise it will be rejected.
Hypothesis testing
is the variable that may affect the dependent variable to change.
Independent variable
is the variable that is influenced or affected by the independent variable.
Dependent variable
The data collected in this type of study involves two variables called
bivariate data.
are diagrams that used to show the degree and pattern of relationship between two sets of data. They are constructed on the xy coordinate plane. Each data point on a scatter plot represents two values (x,y).
Scatter Plots
The of the point is the value of the independent variable (x)
abscissa
the is the value of the dependent variable (y).
ordinate
If the dots are on a straight line pointing upward to the right.
Perfect positive correlation
If the dots are on the straight line pointing downward to the right
Perfect negative correlation
If the dots are concentrated around a straight line pointing upward to the right.
Strong positive correlation
If the dots are concentrated around the straight line pointing downward to the right.
Strong negative correlation
If the dots are not close but are not too far from the straight line that they seem to follow
Moderately positive or negative correlation
If the dots in the scatter plot are widely spread.
Weak positive or negative correlation
If the dots are neither following a straight line pointing upward or downward to right nor have a pattern.
NO CORRELATION
denoted by r measures the strength of the linear relationship. To find r, the following formula is used.
Pearson Product Moment Correlation Coefficient