[L2] Descriptive Statistics Flashcards
Used with ordinal data or with non-normal distributions.
INTER-QUARTILE-RANGE
A ___ is drawn for each category, where the height of the
bars represent the frequency or number of members of
that category.
bar
indicates the
percentage of scores that fall below the upper limit of
each interval
Cumulative Percentage Distribution
How to compute the Median
- Scores should be placed in ascending order of size, from
the smallest to the largest score. - When there is an odd number of scores in the
distribution, halve the number and take the next whole
number up. - If there are an even number of scores, the median is the mean of the two middle scores.
Instead of using bars, a ____ is plotted over the midpoint
of each interval at a height corresponding to the
frequency of the interval. Points are joined by a ___-
point; straight
line.
To find the range we find the ____ value (2) and the
____ value (17).
lowest; highest
The computer defines that point as an ____.
outlier
- Called the ___ mean.
- Calculated by adding up all the scores and dividing by the
number of individual scores.
arithmetic; MEAN
if outliers can not be eliminated and you are convinced
that you have a genuine measurement, then you have a
___.
dilemma
When the distribution is ____ (i.e. has one mode) and
____, then the mode, median, and mean will have
very similar values.
unimodal, symmetrical
It is the distance between the highest score and the lowest
score.
range
_____ sometimes occur most commonly when we
are trying to ask questions to measure the range of some
variable, and the questions are all too easy, or too low
down the scale
Ceiling Effect
(e.g., histograms, bar chart etc) – is used to present
the pattern in the data.
Charts
small sample size, not normally distributed
nonparametric test
Loosely known as the average. In statistical description,
though, we have to be more precise about just ___ we mean.
what sort
of average
____should not overwhelm the reader who is
trying to see what is going on.
Data presentation
But as long as the assumptions are __ to any
great extent, we will be OK.
not violated
The downfall of the mean is that it is affected by ___
skew and
outliers.
The median is ____than the mean to extreme
scores.
less sensitive
___ occurs when only few of the subjects are
strong enough to get off the floor.
Floor Effect –
when data are measured on an ____, it is tricky
and difficult to decide whether to use the mean, median,
or even the mode
ordinal scale
The ___ then extend from the box to the highest
and lowest points – unless this would mean that the
length of the whisker would be more than 1.5 times the
length of the box.
whiskers
There are ____that can be used to
draw a normal distribution. These equations can be used
in statistical tests.
mathematical equations
Small number of data points that lie outside the
distribution when the distribution is approximately
normal.
OUTLIERS
The first thing to describe is the ___, to
show the kinds of numbers that we have.
distribution of the data
When the distribution is skewed, the ___ have
the effect of pulling the mean away from the true value.
skewed values
The Normal Distribution
* Also known as the ___
Gaussian Distribution
The central tendency does not mean a lot without a
____
measure of dispersion or spread.
It s very hard to interpret a measure of central tendency
without also having a ___
measure of dispersion
indicates the
number of scores that fall below the upper limit of each
interval.
Cumulative Frequency Distribution –
If median is used as a measure of central tendency, the
IQR is probably used as a ___
measure of dispersion.
Some statisticians would argue that things like ___ scales can only be considered to be
ordinal data.
personality
measures and attitude
It is the distance between the upper and lower quartiles.
INTER-QUARTILE-RANGE
___ has some serious implications for some types of
data analysis.
Skewness
When deciding which to use, take into account the
___
distribution of the scores.
__- Effects are common in many measures in
Psychology.
Floor
As long as the distribution is ____distribution, it will not matter too much.
close to a normal
frequency distributions of Nominal or
Ordinal Data are customarily plotted using a bar graph.
Bar Graph –
– when there are too many people, too
far away, in the tails of the distribution.
Negative Kurtosis
S - Greek letter called “____” or “summation of” or
“add up” or “take the sum of.
Sigma
Under most circumstances, of the measures used for
central tendency, the mean is ___
least subject to sampling
variation.
- A non-symmetrical distribution is said to be skewed.
SKEW
indicates the
proportion of the total number of scores in each interval.
Relative Frequency Distribution –
The shape is a pattern that forms when a histogram is
plotted and is known as the ___.
distribution
In a skewed distribution the mean, median, and mode are
____
not the same.
sample standard deviation
population standard deviation
- s; σ
First, because it is not symmetrical – this is called __
SKEW
____helps a researcher
understand the data that he has, while ____help him explain to other people what is
happening to his data.
Exploratory data analysis (EDA); descriptive
statistics
In which case it extends to the furthest point which
means it does not exceed ____
1.5 times the length of the box.
The range suffers from one huge problem, in that it is
massively ___ that occur
affected by any outliers
In some way refers to the most central value of a data set
with different interpretations of the sense of “___”.
central
___ differ between statisticians. There is a very fuzzy line between what could definitely
be called ____
Opinions; ordinal and interval.
Mean is pronounced as ___.
x-bar