class10: Analyze the Data Flashcards
what is the soar analytics model?
- Specify the question
- obtain the data
- analyze the data
- report the results
a group with something in common
population
characteristic of a population/characteristic of a sample
parameter / statistic
what is the sample used for?
sample is used to make inferences, conclusion about the characteristics of a population
measures that describe a population/sample
descriptive statistics
measures calculated only using a sample
inferential statistics
explain 3 sampling methods
- simple random sampling
- stratified random sampling
- divide members into similar groups before sampling
- cluster sampling
- divide the population into groups => select few groups
process of reducing the size of the data set to a more manageable and suitable size for a business analysis projects
data reduction
4 common methods of data reduction
- filtering
- deduplication
- aggregation
- compression
4 types of bias in business analytics
- nonresponse
- selection
- confirmation
- outlier
Shows all possible values for a variable and how often they (could) occur
data distribution
a statistical function that describes the possible values in a population and the chance that any given observation can take a given range or value
probability distribution
explain 2 types of numerical data
- continuous data
- any numerical value, infinite
ex, height, weight, currency - discrete data
- whole number, finite
ex, customers
3 basic understanding of the data
: the starting point of analyzing
- structure
- dispersion
- frequency
the distribution shape
kurtosis