Stats Flashcards
What does qualitative data mean
non-numerical data e.g. hair colour
What does quantitative data mean
numerical data e.g. number of children
discrete data meaning
data that can be counted e.g. number of children
continuous data meaning
data that can be measured e.g. height
how do you find the class width of a set of data that uses short hand e.g.
length | 1-20 | 21-30 |…
put it into ‘5 < x< 9’ form
e.g. 1 - 20 -> 0.5<x<20.5
then minus 20.5 by 0.5
what is descriptive stats?
stats that are collected and organised
what is inferential stats
stats where the data is inferred and analysed for conclusion to be made
what is a population
a whole set of items that are in interest
what is a sample
a selection of items taken as a subset from the population
what is a parameter + example
numerical characterists of a population i.e. a mean
what is a stat?
a numerical characteristic of a sample that can help to estimate a parameter
what tool can you use to remember what a stat and a parameter are used for
stat begins with ‘s’ -> sample
parameter starts with ‘p’ -> Pop
what is a census
data that observes and measures every item within the population
what is an adv and a dis.adv of a census
adv -
representative of the whole pop
d.adv-
expensive, time-consuming, impossible
what can the size of a sample affect
the validity of any conclusions made. The more varied the sample, the more accurate the results
what is a sampling frame?
a list with all items of the population individually named
what is a sampling unit
an individual unit of the population
how do you carry out a simple random sample
- form a sampling name
- allocate each item a specific number
- generate a random number, e.g. using a calculator, as many times as needed for your sample size (if u need a sample size of 30, generate 30 random numbers)
how do you carry out systematic sampling
- form a sampling frame
- allocate each item a unique number
- using a calc, generate a random number within your population size (this is your starting unit)
- calculate the integer component (population size/ sample size = x)
- select every xth item after the first to be included in the sample
how do you carry out stratified sampling
- divide the data into groups (i.e. year groups, age sex)
- calculate sample size for strata (xi) -> xi = (sample size/population size) x strata size
- make sure the sum of all the xi’s equal the sample size (you may have to round is appropriate)
- conduct a simple random sample for each strata
adv and dis adv of
- simple random
adv:
- everyone has an easy chance of being selected - removes bias
- easy to conduct
d.adv:
- time consuming
-
adv and dis adv of
- systematic
adv:
- covers a wider study area
- less likely to introduce bias because the starting point is randomly generated
d.adv:
- need a randomly generated starting point
- need to know the number of total pop for it to work
adv and dis adv of
- stratified
adv:
- each group receives representation within the sample, as it is proprtional to the group size = increased accuracy
d.adv:
- not all members of the pop may belong to a specific group
what are the two non-random techniques
quota sampling
convenience/opportunity sampling
what is quota sampling
- the population is split into groups (year, age, race)
- individuals are chosen who best fit the requirements
what is convenience sampling
- a sample is taken from people who are availble for the study (i.e. the first 20 people i see)
adv and dis adv of
quota sampling
adv:
- representative
d. adv
- non random sampling may introduce bias
adv and dis adv of
convenience/ opportunity sampling
adv:
- quick -> no need of a sampling frame
d.adv:
- not representative -> the first people seen may not effectively represent the whole pop
which is affected by extreme values and which arent: mean, media, and mode
mode is not
median is not
mean is
how do you use your GDC to find mean, median, mode, etc
stats -> enter data ->F2 (calc) -> F6 (SET) -> make sure ‘List1’ is the first line and ‘List2’ is the second ->EXIT -> F1 (1-VAR)
x = mean
n = sample size
Med = median
Mod = Mode