Descriptive Statistics Flashcards

Question 1

Q

describe/summarize the data a researcher has

Answer

A

descriptive statistics

Question 2

Q

helps a researcher understand the data that he has, while descriptive statistics help him explain to other people what is happening to his data

Answer

A

Exploratory data analysis (EDA)

Question 3

Q

The first thing to describe is the distribution of the data,
to show the kinds of numbers that we have.

Answer

A

describing data

Question 4

Q

Different ways of Describing the Distribution
is used to
present the pattern in the data.

Answer

A

Frequency Table
Charts (e.g., histograms, bar chart etc)

Question 5

Q

frequency distributions of nominal or ordinal data are customarily plotted using a ______

Answer

A

bar graph

Question 6

Q

____ drawn for each category, where the height of the
bars represent the frequency or number of members of
that category.

Question 7

Q

used to represent frequency distributions
composed of interval or ratio data. Bar is drawn for each
class interval.

Class intervals are plotted on the horizontal axis such
that each class bar begins and terminates at the real
limits of the interval.

Answer

A

histogram

Question 8

Q

also used to represent interval or
ratio data.

Instead of using bars, a point is plotted over the midpoint
of each interval at a height corresponding to the
frequency of the interval. Points are joined by a straight
line.

Answer

A

frequency polygon

Question 9

Q

Don’t draw a bar chart for ___

Answer

A

Continuous measures

Question 10

Q

presents the score values and
their frequency of occurrence.

When presented in a table, the score values are listed in
rank order, with the lowest score value usually at the
bottom of the table.

Answer

A

Frequency distribution

Question 11

Q

in grouping data

Answer

A

how wide should interval be?

Question 12

Q

When data are grouped

Answer

A

some information is lost

Question 13

Q

The wider the interval,

Answer

A

the more information is lost.

Question 14

Q

Constructing a frequency distribution of grouped scores

Answer

A

Find the range of the scores.
Determine the width of each class interval (i).
List the limits of each class interval, placing the interval
containing the lowest score value at the bottom.
Tally the raw scores into the appropriate class intervals.
Add the tallies for each interval to obtain the interval
frequency.

Question 15

Q

indicates the
proportion of the total number of scores in each interval.

Answer

A

Relative Frequency Distribution

Question 16

Q

indicates the
number of scores that fall below the upper limit of each
interval.

Answer

A

Cumulative Frequency Distribution

Question 17

Q

–indicates the
percentage of scores that fall below the upper limit of
each interval.

Answer

A

Cumulative Percentage Distribution

Question 18

Q

what is this symbol?

f/N

Answer

A

Relative Frequency

Question 19

Q

frequency of interval + frequencies of all class intervals below it.

Answer

A

Cumulative Frequency

Question 20

Q

what is this formula?

cum f / N x 100

Answer

A

cumulative percentage

Question 21

Q

_____are very important in data analysis, because
they allow us to examine the shape of the distribution of
a variable.

The shape is a pattern that forms when a _____ is
plotted and is known as the distribution.

Answer

A

histogram

Question 22

Q

the normal distribution also known as the

Answer

A

Gaussian Distribution

Question 23

Q

_____ symmetrical and bell shaped. It
curves outwards at the top and then inwards nearer the
bottom, the tails getting thinner and thinner.

Answer

A

normal distribution

Question 24

Q

is the data form a perfect normal distribution?

Answer

A

never but as long as the distribution is close to a normal
distribution, it will not matter too much.

Question 25

Q

A very ___ of naturally occurring variables are
normally distributed.

A _____ of statistical tests make the assumption
that the data form a normal distribution.

Answer

A

large number

Question 26

Q

don’t refer to the Normal Distribution as either of the
following;

Answer

A

usual, regular, standard, or even distribution.

Question 27

Q

Wrong Shape

Distributions can be of wrong shape for two reasons.

First, because it is not symmetrical –

Second, because it is not the characteristic bell shape

Answer

A

SKEW
KURTOSIS

Question 28

Q

A non-symmetrical distribution is said to be _____.

Question 29

Q

the curve rises rapidly and then drops off
slowly.

Answer

A

positive skew

Question 30

Q

the curve rises slowly and then
decreases rapidly.

Answer

A

negative skew

Question 31

Q

Skewness has some serious implications for some types
of data analysis.

Skew often happens because of ____ or _____

Answer

A

floor effect or ceiling effect

Question 32

Q

occurs when only few of the subjects are
strong enough to get off the floor.

Answer

A

floor effect

Question 33

Q

causes negative skew and are much less
common in Psychology.

sometimes occur most commonly when we
are trying to ask questions to measure the range of some
variable, and the questions are all too easy, or too low
down the scale.

Answer

A

ceiling effect

Question 34

Q

Much trickier than Skew but is usually less of a problem.

Occurs when there are either too many people at the
extremes of the scale, or not enough people at the
extremes.

Question 35

Q

when there are insufficient people in
the tail (ends) of the scores to make the distribution
normal.

Answer

A

positive kurtosis

Question 36

Q

when there are too many people,
too far away, in the tails of the distribution.

Answer

A

negative kurtosis

Question 37

Q

_____ is just a “posh” way of saying average.

In some way refers to the most central value of a data
set with different interpretations of the sense of
“central”.

Loosely known as the average. In statistical description,
though, we have to be more precise about just what sort
of average we mean.

Answer

A

central tendency

Question 38

Q

Small number of data points that lie outside the
distribution when the distribution is approximately
normal.

Usually easily spotted in histograms.

______ are easy to spot but deciding what to do with
them can be much trickier.

Question 39

Q

The mean is very sensitive to _____

Answer

A

extreme scores

Question 40

Q

Called the arithmetic mean.

Calculated by adding up all the scores and dividing by the
number of individual scores.

Equation: (?) = ∑x / N

Question 41

Q

Under most circumstances, of the measures used for
central tendency, the mean is least subject to ______

Answer

A

sampling variation

Question 42

Q

For statistics to be correct, we need to make some _____

Answer

A

assumptions

Question 43

Q

The sum of the squared deviations of all the scores
about their mean is a ______

Question 44

Q

the _____ is equal to the sum of the mean of each
group times the number of scores in the group, divided
by the sum of the number of scores in each group.

Answer

A

overall mean

Question 45

Q

Second most common measure of central tendency.

It is the middle score in a set of scores.

Used when the mean is not valid, which might be
because the data are not symmetrically or normally
distributed, or because the data are measured in an
ordinal level.

Question 46

Q

The median is _____ than the mean to extreme
scores.

Answer

A

less sensitive

Question 47

Q

The most frequent score in the distribution or the most
common observation among a group of scores.

Best measure of central tendency for CATEGORICAL data
(although it is not even very useful for that)

Rarely used in research.

Question 48

Q

In a frequency distribution it is very easy to see because
it is the _______ of the distribution.

The problem with it is it does not tell us very much.

Answer

A

highest point

Question 49

Q

The _____ is the simplest measure of dispersion.

It is the distance between the highest score and the
lowest score.

It can be expressed as a single number, or sometimes it is
expressed as the highest and lowest scores.

Question 50

Q

To find the range we find the lowest value (2) and the
highest value (17). Sometimes the range is expressed as
a single figure, calculated as:

Answer

A

Range = Highest Value – Lowest Value

Question 51

Q

Used with ordinal data or with non-normal distributions.

If median is used as a measure of central tendency, the ___ is probably used as a measure of dispersion.

It is the distance between the upper and lower quartiles.

Answer

A

inter-quartile-range

Question 52

Q

There are ____ quartiles in a variable – they are the____ values that divide the variable into four groups.

Question 53

Q

The ____ quartile happens one quarter of the way up the
data, which is also the 25th centile.

Answer

A

1st quartile

Question 54

Q

The _____ quartile is the half-way point, which is the
median, and is also the 50th centile.

Answer

A

2nd quartile

Question 55

Q

The ____ quartile is the three-quarter-way point or the 75th
centile.

Answer

A

third quartile

Question 56

Q

symbol

s

Answer

A

sample standard deviation

Question 57

Q

______ is like the mean, in that it
takes all of the values in the dataset into account when
it is calculated.

It is also like the mean in that it needs to make some
assumptions about the shape of the distribution.

To calculate the _____, we must assume that we have a
normal distribution.

Answer

A

Standard Deviation

Question 58

Q

symbol

σ

Answer

A

population standard deviation

Question 59

Q

the _____ of a set of scores is just the square of the standard deviation

Question 60

Q

the variance is not used much in descriptive statistics because it gives us squared units of measurement. however, it is used quite frequently in ___________

Answer

A

inferential statistics

Question 61

Q

the SD gives us a measure of dispersion relative to the mean
the SD is sensitive to each score in the distribution
like the mean, the SD is stable with regard to sampling fluctuations

Answer

A

properties of the standard deviation

Question 62

Q

population standard deviation

Answer

A

boxplot or box and whisker plot