Descriptive Statistics: Representation Flashcards
Data distributions
graphical methods of organizing and displaying useful information; used often in statistics
Dot plots
show numerical values plotted on a scale
frequency
rate of occurrence
Histograms
display data in ranges, with each bar representing a range of numeric values
useful for showing frequencies of a large set of data
a graphical representation of the distribution of data
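A minimal sketch of how a histogram groups values into ranges and counts the frequency in each one. The values and the bin width of 10 are made up for illustration; they are not from the flashcards.

```python
from collections import Counter

# Made-up values, for illustration only
values = [3, 7, 12, 15, 18, 21, 22, 29, 31, 38]
bin_width = 10  # assumed width of each range

# Map each value to the lower edge of its range, then count per range
frequencies = Counter((v // bin_width) * bin_width for v in values)

# Print one text "bar" per range; the bar length is the frequency
for lower in sorted(frequencies):
    upper = lower + bin_width - 1
    print(f"{lower:>2}-{upper:<2} {'#' * frequencies[lower]}")
```

Each bar stands for a range of values rather than a single value, which is what separates a histogram from a dot plot.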
Box plots
rectangular representations of data. They do not give the frequency of values
graphical representation of the distribution in a data set using quartiles, minimum and maximum values on a number line
box-and-whisker plots
Another name for box plots
tally chart
consists of a table with tally marks that show frequency of occurrences in each category
symmetrical
where the shape created is mirrored nearly perfectly across a line
skewed
where the shape of a graph peaks to the left or the right of the center
gap
is an area in the data set where no observations have been made
bimodal
where the data set has two different modes
Skewness in statistics
represents an imbalance and asymmetry from the mean of a data distribution
positive skew
means that the extreme data results are larger; the longer tail of the distribution extends to the right
negative skew
means that the extreme data results are smaller; the longer tail of the distribution extends to the left
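One common way to put a number on skew is the moment coefficient of skewness, sketched below. This is only one of several definitions in use, the function name `skewness` is a hypothetical helper, and the data sets are made up for illustration.

```python
def skewness(data):
    """Moment coefficient of skewness: third central moment
    divided by the second central moment raised to the power 1.5."""
    n = len(data)
    mean = sum(data) / n
    m2 = sum((x - mean) ** 2 for x in data) / n  # population variance
    m3 = sum((x - mean) ** 3 for x in data) / n  # third central moment
    return m3 / m2 ** 1.5

# Made-up data: one extreme large value pulls the tail to the right
right_tailed = [1, 2, 2, 3, 3, 3, 4, 20]
# Mirror image of the same data: the tail now points left
left_tailed = [-x for x in right_tailed]

print(skewness(right_tailed) > 0)  # positive skew
print(skewness(left_tailed) < 0)   # negative skew
```

A result near zero suggests a roughly symmetrical distribution.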
Standard deviation
tells you how varied your data set is. Standard deviation shows how far, on average, your numbers spread out from the mean
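A short sketch of the population standard deviation, computed step by step and checked against Python's `statistics.pstdev`. The data set is made up for illustration. For a sample rather than a whole population, the usual convention divides by n - 1 instead (`statistics.stdev`).

```python
import math
import statistics

# Made-up data set, for illustration only
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = sum(data) / len(data)

# Average of the squared distances from the mean (population variance)
variance = sum((x - mean) ** 2 for x in data) / len(data)

# Standard deviation is the square root of the variance
std_dev = math.sqrt(variance)

print(mean, std_dev)
print(statistics.pstdev(data))  # library result for comparison
```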
dot plot
a visual representation of data using intervals or categories of variables; each dot represents one observation in the data
Categorical data
is data that is grouped by type or object, not by numbers
Numerical data
is information that is measurable. It is always collected in number form.
quartile
is one of the three values that divide an ordered data set into quarters, or four groups of equal size
interquartile range
is a value that is the difference between the upper quartile value and the lower quartile value
median
is the midpoint value of a data set, where the values are arranged in ascending or descending order.
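The quartile, interquartile range, and median cards can be illustrated together. The sketch below uses `statistics.quantiles` with its default method; textbooks use several slightly different quartile conventions, so other tools may give different values for the same data. The data set is made up for illustration.

```python
import statistics

# Made-up data set, already in ascending order
data = [2, 4, 4, 5, 7, 8, 9, 10, 12, 13, 15]

# n=4 returns the three cut points that split the data into quarters
q1, q2, q3 = statistics.quantiles(data, n=4)

iqr = q3 - q1                      # upper quartile minus lower quartile
median = statistics.median(data)   # midpoint of the ordered data

print(q1, q2, q3)
print(iqr, median)

# Five values a box plot is drawn from
five_number_summary = (min(data), q1, median, q3, max(data))
print(five_number_summary)
```

The second quartile is the median, so `q2` and `median` agree here.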
scatterplot
is a graph of ordered pairs showing a relationship between two sets of data
bivariate data
data made up of paired values of two variables, which are compared to find relationships
correlation
the relationship between two variables or sets of data
independent variable
is a condition or piece of data in an experiment that can be controlled or changed
dependent variable
is a condition or piece of data in an experiment that is controlled or influenced by an outside factor, most often the independent variable
positive correlation
where the dependent variable and the independent variable in a data set increase together
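Correlation is often measured with the Pearson correlation coefficient r, which ranges from -1 to 1. The sketch below computes it by hand. The helper name `pearson_r` and the hours and scores are hypothetical, chosen only to illustrate a positive correlation.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient for paired data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # How the two variables vary together
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # How each variable varies on its own
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Made-up bivariate data: hours studied (independent), test score (dependent)
hours = [1, 2, 3, 4, 5]
scores = [52, 60, 61, 70, 78]

r = pearson_r(hours, scores)
print(round(r, 3))
```

A value of r close to 1 indicates a strong positive correlation, a value close to -1 a strong negative one, and a value near 0 little or no linear relationship.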