Descriptive stats Flashcards
What’s the difference between descriptive and inferential statistics?
Descriptive –> describe sample data based on sample statistics
Inferential –> use sample statistics to learn on population parameters
What is micro data?
Data collected on individuals
What is macro data?
Data collected on a group of units
What is a population?
The set of all statistical units object of interest
What is the sample?
A subset drawn from the population
What is non probability and probability sampling?
Non –> units are drawn from the population according to the judgement of the researcher
Probability –> units are drawn at random from the population, and every unit has the same probability to be drawn
What is the inferential process?
It consists in drawing conclusions that concern the entire population from the information provided by a sample
What are the two broad variable categories?
Numerical and categorical
What are the subsets of numerical variables?
Discrete and continuous
What are the subsets of categorical variables?
Ordinal and nominal
What are the columns of a frequency distribution table?
Classes/groups; absolute frequencies and relative frequencies
How is a histogram composed?
Horizontal axis –> intervals
Bars –> have an area equal to its relative frequency
Vertical axis –> interval density = relative frequency/interval width
How can we calculate an interval density?
Relative frequency/interval width
How does the number of intervals relate to the accuracy?
The higher the # of intervals, the higher the detail of the description.
What are the three measures of central tendency?
Mode, median and mean
What is the mode?
The level/value of a variable that is observed with the highest frequency
What is the unique measure of central tendency for nominal variables?
Mode
What is the median?
It is the central value. If odd–> (n+1)/2, if even it’s the median of the two central values
What is the mean?
The arithmetic average of the values. (x1+x2+….+xn)/n
What is the deviation?
It’s the difference of an observed value and the mean