Basic Statistics Flashcards
What is an observation ?
The units on which we measure data, such as persons, cars, animals… are called observations.
What is a population ?
A collection of all units
What is a sample ?
A selection of n observations. A sample is always a subset of the population
What is a qualitative variable ?
Variables which take value that cannot be ordered in a logical or natural way.
What is a quantitative variable ?
Variables that represent measurable quantities. The values which these variables can take can be ordered in a logical and natural way.
What is a graphic ?
It represents the relationship between two or more variables
It is an alternative way to summarize a variable’s information
It provides clues that words and equations do not
It is great tool to form hypotheses and draw conclusions
What is a disadvantage of graphs
They can be inaccurately interpreted, resulting in incorrect answers or conclusions
What is the pie chart used for ?
Used to visualize the absolute and relative fréquences of nominal (categorical) and ordinal variables
What is the bar chart used for ?
Used to visualize the absolute and relative frequencies of observed values of a variable. Can be used for nominal and ordinal variables.
What is the histogram used for ?
Used to visualize the distribution of values of continuous variables.
What are the differences between bar charts and histograms ?
Histograms shows the distribution of variables whereas bar charts compare variables
Histograms show quantitative data whereas bar charts show categorical data
The bars in an histogram cannot be reordered
What is line graph used for ?
Used to visualize quantitative data collected over a specific topic and a pecific time interval.
Data points are connected by a line, and they represent the observation.
What are box plots used for ?
Used to visualize the distribution of data based on a five number summary : minimum, first quartile, median, third quartile, maximum.
What is Q2 ?
The middle value of the data = the median
What is Q1 ?
The lower quartile, the middle number between the smallest and the median
What is Q3 ?
The upper quartile, the middle value between the median and the highest value
What is the interquartile range ?
From Q1 to Q3
How to determine the lower extreme in a blox plot graph ?
Lower extreme = Q1-1,5*IQR
Where IQR = Q3-Q1
How to determine the upper extreme in a box plot chart ?
Upper extreme = Q3+1,5*IQR
Where IQR=Q3-Q1
What are scatter plots used for ?
Used to visualize the relationship between two quantitative variables measured on the same individuals.
It is useful to visually detect outliers
It shows the type of relationship between two variables
What are tables useful for ?
Used to present results from research, e.g., within or between-group comparisons.
What is an outlier ?
An outlier represents a value distant from the rest, due to variability or error.
Outliers are value more than 1,5 time the IQR
How to detect an outlier ?
- visually inspect data using a scatter plot or box plot
- use Tukey rule to detect outliers :
Q1-1,5IQR
Q3+1,5IQR