4 statistics and probability Flashcards
discrete data
something you can count
discrete data
something you can count
continuous data
something you measure
a hypothesis
a statement you test to see if it is true or false
raw data
data before it has been analsyed or processed
primary data
data you collect yourself
secondary data
data you use which someone else has collected
categorical data
data is words not numbers
numerical data
data given as numbers
types of numerical data
continuous or discrete
ordinal data
data that is ordered in some way
adv of secondary data
available
cheaper
easy
adv of primary data
reliable
aware of bias
ways of collecting data
measurement or experiment
survey or questionaire
modelling or simulation
mistakes to avoid when doing surverys or questionaires
asking the wrong people or a biased sample asking leading questions asking confusing questions asking personal questions asking too open ended questions
random
every member of the popuation ahs the same probability of being included
the members of a genuinely random sample have to be selected independently.
ways of collecting a sample
convience
systematic
genuinely random
convienence sample
asking whoever is easiest to get hold of
systematic sample
asking every 3rd person
genuinely random sample
picking out of a hat or using a random number generator
quota sampling
Choosing a sample that is only comprised of members of the population that fit certain characteristics.
stratified sampling
Choosing a random sample in a way that the proportion of certain characteristics matches the proportion of those characteristics in the population.
continuous data
something you measure
hypothesis
a statement you test to see if its true or false
raw data
date before analysis or processing
primary data
data you collect yourself
secondary data
data you use which someone else has collected
categorical data
word data not numbers
numerical data
number data
ordinal data
ordered in some way
adv of secondary data
available
cheaper
easy
adv of primary data
reliability
aware of bias
ways of collecting primary data
measurement or experiment
survey or questionaire
modelling or simulation
random sample
every member of population has the same probability of being included. selected indepently.
what is the opposite of a census
a random sample
convience sampling
asking friends or those easy to ask
systematic sammpling
e.g. asking every 3rd person
genuinely random sampling
pick out of hat or use random number generator on calculator.
quota sampling
the populalation is divided into groups. a given number is surveyed forme ach grouo.
cluster sampling
the population is divided into groups or clusters. a random sample of clusters is chosen and every item in it is surveyed. a large number of small clusters minimises the chances of being unrepresentative.
opinion polls
large scale opinion polls often use a combination of cluster and quota sampling. large sample size based on small proportion of population. (geographical area, age). but opinions change over time
what is a uniform distribution
flat/even
what is a normal distribution
peaked in the middle
mean, median, middle, mode in the same place
gaussian distribution
what is negatively skewed
leading up to the right
what is the positively skewed
leading up to the left or decreasing
box plot left skewed
box on the right with the median line towards the right
box plot right skewed
box on the left with the median line towards the left
normal distribution and standard deviatiosn
the standard deviations (outliers) next to the highlighted (70%) will be30% total, 15% each
box plot name
box and whisker diagram
the ends of the box in a box plot are the
interquartile range
outlier definition
a term of data that is
at least 2 standard deviations away from the mean (histogram)
OR
at least 1.5 x IQR beyond the nearer quartile
(box and whisker)
benefits of a curve in a cumulative frequency diagram
they use the data to show a gradient, so if the frequency decreases slightly then the gradient will show it by flattening a little. straight lines only show the data and not the link between them
datum
singular piece of data