descriptive statistics lec Flashcards

1
Q

Observationofonevariablemay be shown visually by putting the variable’s on one axis and putting the frequency on the other.

A

Visual Presentation of Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

A bar graph wherein the number of units observed is on the y-axis (_____) while the measurement levels are on the _____

A

frequency; x-axis; histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

in histogram the bars are..

A

The bars are visually proportional to each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A figure that is shorthanded
presents a histogram.

A

Frequency Polygon

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

A ___ is placed at the center of the top of the bars and connected to form a polygon. This better ennuncuates the data shape.

A

dot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Basic graphs that can illustrate one or more data sets in one graph.

A

Line Graph

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

two types of line graph and its difference

A

-Arithmeticlinegraphs
○ Have both x and y-axes on
an arithmetic scale.
○ Both values are numerical.

● Semi-logarithmic line graph ○ Has the y-axis as a
logarithmic axis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Parameters of a Frequency Distribution

A

central tendency and dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Frequencydistributionsfrom continuous data are defined by types of descriptors, known as _____.

A

parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

● Defined as the value used to represent the center or the middle (average) of a set of data values.
● Locates observations on a measurement scale.

A

Central Tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

● Describes the spread of values in a given data set.
● Suggests how widely spread out the observations are.

A

Dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

dispersion prefers…

A

Prefers low values, low variance, low standard deviation = not spread out data, results are not far from each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Measures of Central Tendency

A

mean, median, mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Average value or the sum (Σ) of all
the observed values (𝑥𝑖) divided

A

mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

has the most mathematical
properties and most representative of the dataset if not for our outliers.

A

mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The middle observation data when
data has been arranged from ______. When the dataset is an even number (hence no natural middle point), the two middling variables are averaged to find a median.

A

highest to lowest, median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Rarely used to make inferential conclusions from, but is used frequently in-healthcare and economics.

A

median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Most commonly observed value
(the value most frequently
observed).

A

mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

The downside to using the mode

A

a set of data may have no mode, or it may have more than one mode.

20
Q

Measures of Dispersion

A

Variance and sd, mean deviation

21
Q

A statistical measurement of the
spread between numbers in a data
set.
● It measures how far each number
in the set is from the mean (average), and thus from every other number in the set.

A

Variance

22
Q

formula for variance

A

lamo ne yen

23
Q

sample, degree
offreedom

A

N-1

24
Q

Average amount of variability in
your dataset.
● It tells you, on average, how far
each value lies from the mean.

A

Standard Deviation

25
Q

A high standard deviation means

A

that values are generally far from the mean,

26
Q

low standard deviation means

A

values are clustered close to the mean.

27
Q

di erence between the observed value of a data point and the expected value is known as deviation in statistics.

A

Mean Deviation

28
Q

the average deviation of a data point from the mean, median, or mode of the data set.

A

mean deviation or mean absolute deviation

29
Q

Values that split sorted data or a probability distribution into equal parts.

A

Quantiles

30
Q

A statistical term that
describes a division of observations into four
defined intervals based on the values of the data and how they compare to the entire set of observations.

A

Quartiles

31
Q

A type of quantiles, obtained by adopting a subdivision into 100 groups.

A

Percentiles

32
Q

Calculated by dividing an ordered set of data into 100 equal parts.

A

percentiles

33
Q

Di erence between the highest and lowest values.
○ Size of the narrowest interval which contains all the data.

A

range

34
Q

○ Di erence between the
third and the first quartile.
○ Size of the narrowest
interval which contains all the data.

A

InterquartileRange

35
Q

A measure of the
asymmetry of a
distribution.

A

skewness

36
Q

A distribution is
asymmetrical when

A

its left and right side are not mirror images.

37
Q

T/F: A distribution can have right (or positive), left (or negative), or zero skewness.

A

true

38
Q

other term for skewness

A

horizontal imbalance

39
Q

A descriptive statistic used to help
measure how data disperse between a distribution’s center and tails, with larger values indicating a data distribution may have “heavy” tails that are thickly concentrated with observations or that are long with extreme observations.

A

Kurtosis

40
Q

other term for Kurtosis

A

vertical imbalance

41
Q

Xi means

A

FOr each individual observation

42
Q

Xi means

A

Or each individual observation

43
Q

The di erence between the observed value of a data point and the expected value is known as

A

Deviation

44
Q

Uses boxes and lines to depict the distributions of one or more groups of numeric data.

A

Box plot

45
Q

indicate the range of the central 50% of the data, with a central line marking the median value.

A

Box plot