Exam Revision Flashcards
nominal
variables that have no numerical value and instead are categories (e.g. catholic, christian etc.).
ordinal
no numerical value as such but something that can somewhat be calculated (e.g, low, medium, high).
equal interval
can be numerical, must have an equal distance (e.g. 5000, 10000, 15000).
frequency tables
arranged to that there are three columns.
- number
- frequency
- percentage
grouped frequency tables
the same as normal frequencies except numbers are grouped into equal intervals in order to reduce the number of rows.
histograms
arranged so that the x variable, the horizonal axis is the number and that the y axis is the frequency.
frequency polygon
developed using a histogram. each maximum value of each number is taken and then plotted into a graph.
unimodal
has one maximum peak.
has one mode.
bimodal
two peaks and therefore two values that have a greater frequency than the others.
multimodal
multiple peaks, occurs when there are multiple values with a greater frequency than the others. i.e. multiple modes.
negative skew
The left tail is longer; the mass of the distribution is concentrated on the right of the figure. The distribution is said to be left-skewed, left-tailed, or skewed to the left, despite the fact that the curve itself appears to be skewed or leaning to the right; left instead refers to the left tail being drawn out and, often, the mean being skewed to the left of a typical center of the data.
positive skew
The right tail is longer; the mass of the distribution is concentrated on the left of the figure. The distribution is said to be right-skewed, right-tailed, or skewed to the right, despite the fact that the curve itself appears to be skewed or leaning to the left; right instead refers to the right tail being drawn out and, often, the mean being skewed to the right of a typical center of the data.
leptokurtic
thinner than the normal distribution. there is a sharper peak around the mean.
platykurtic
distribution is broad and flat.
characteristics of the Z distribution
Mean =0
SD=1
criterion variable
dependent variable
predictor variable
independent variable
positive correlation
A positive correlation is a relationship between two variables where if one variable increases, the other one also increases.
negative correlation
Negative correlation is a relationship between two variables in which one variable increases as the other decreases, and vice versa.
Difference between strong and weak correlation
The more closer the value of r is to its endpoints, the stronger is the correlation. If the value of r is close to 0 then we conclude that the correlation is weak and hence there is no linear relationship between the variables.
descriptive statistics
summarise data
inferential statistics
generalise data from a sample into a population.
characteristics of the normal distribution
mean = median = mode
- bell curved
- symmetrical
- unimodal
proportion, percentage and probability
are all interchangeable terms.
one tailed tests
used when the hypothesis is directional (i.e. hypothesised to either increase or decrease).
two tailed tests
used when the hypothesis is non-directional. (i.e. there will be a ‘change’).
cut off points for the Z distribution
one tailed (0.01)= +/- 2.33 two tailed (0.01)= +- 2.58
one tailed (0.05)= +/- 1.64 two tailed (0.05)= +- 1.96
Type I error
when you reject the null hypothesis but it is true.
= alpha = cut off
type II error
when you retain the null hypothesis but it is false.
= a -B = power
Power
probability that the study will give a significant result if the research hypothesis is true. i.e. rejecting the null when it is false.
is the area of the H1 distribution which is beyond the critical value on the Ho distribution.
sampling error
the mean of one sample is not likely to be the mean of the population.
minimising error
easiest way to minimise error is to use a larger sample.
increasing sample size will also increase power.
what happens when a is reduced?
probability of type I error decrease, while type II increases.
characteristics of the distribution of means
- unimodal
- symmetrical
this is as average scores become more likely in a distribution of means.
more scores in a sample, the more it will resemble a normal distribution.
z test vs t test
z test is conducted when the SD and M is known.
t test is when the SD or variance is unknown.
t test single sample vs t test dependent
single sample is a test in which a sample mean is being compared to a known population mean and an unknown population variance.
t test dependent is when there are two scores for each person and the population variance is unknown. Here, the population mean is assumed to be 0 and all calculations are doe on difference scores.
t test dependent vs t test independent
t dependent is difference scores taken from one group of people.
t independent is scores from two different groups of people.
t test assumptions
- The assumption for a t-test is that the scale of measurement applied to the data collected follows a continuous or ordinal scale
- The second assumption made is that of a simple random sample, that the data is collected from a representative, randomly selected portion of the total population.
- The third assumption is that the data, when plotted, results in a normal distribution, bell-shaped distribution curve.
- The fourth assumption is that a reasonably large sample size is used. A larger sample size means that the distribution of results should approach a normal bell-shaped curve.
- The final assumption is homogeneity of variance. Homogeneous, or equal, variance exists when the standard deviations of samples are approximately equal.
t test independent assumptions
- scores are independent
- populations are assumed to be normally distributes.
- population distributions are assumed to have the same variance (homogeneity of variance).
t test dependent assumptions
The dependent variable must be continuous (interval/ratio).
• The observations are independent of one another.
• The dependent variable should be approximately normally distributed.
• The dependent variable should not contain any outliers.
assumptions of the chi squared test
that the observations are indpendent of each other (unrelated).
thus, there is no repeated measures and each person can only supply one observation.
characteristics of the chi squared distribution
Chi-square is non-symmetric.
There are many different chi-square distributions, one for each degree of freedom.
The degrees of freedom when working with a single population variance is n-1.
characteristics of the t distribution
gradually approaches a normal distribution as sample size increases.
determining whether to retain or reject the null hypothesis
if the score is greater than the critical score then reject.
if it is smaller, retain.
t critical vs z critical
the cut off value is more extreme (higher) for the t test.
methods of sampling
- random sampling
- non random sampling which may include;
; haphazard selection (person next to you).
; convenience sample (friends and family).
; snowball sample (friends pass word onto their friends).
statistical significance
if the probability is less than the cut off (5% or 1%), then it is unlikely to be due to chance.
the comparison distribution represents…
the situation where H0 is true .
using 1% cut off rather than 5% results in..
higher chance of a type II error.
qualitative research
- suggests that we can never get rid of our subjectivity. Therefore, we cannot properly see the world through the eyes of another.
- Qualitative research is a means for exploring and understanding the meaning individuals or groups ascribe to a social or human problem.
- Qualitative research is a human focused approach to research design, which aims o delve into people’s experiences, perceptions, behaviour and beliefs.
- Qualitative research explores the processes at play on society, examines the meanings that individuals make of particular events, and provides a window into understanding why people do what they do and think what they think.
paradigms in social research
- Philosophy ○ How the world functions? - Ontology ○ What exists and what is real? - Epistemology ○ What constitutes valuable knowledge? ○ How is knowledge acquired? - Methodology ○ What constitutes data? ○ How should data be collected?
positivist paradigm
- Philosophy ○ Scientific materialism - Ontology ○ Laws of nature - Epistemology ○ Measurable and observable 'proof'. - Methodology Experimental design, survey research.
social constructionist paradigm
- Philosophy
○ Person centred reality as a social construct, pluralisation of life worlds, contextual verities.- Ontology
○ Relations and processes are real, not states and characteristics. - Epistemology
○ Self verified evidence, grounded theory, recorded testimony. - Methodology
○ Ethnography, case study research, action research.
- Ontology
deductive research
-bottom up approach
inductive research
bottom up approach