[LEC3] Descriptive Statistics Flashcards

1
Q

Observation on one variable may be shown visually by putting the variable’s on one axis and putting the frequencies on the other.

A

Visual Presentation of Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

___________ are best used to interpret the frequerncy distribution visually.

A

Figures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

A bar graph wherein the number of units observed is on the y-axis while the measurement levels on the x-axis. The bars are visually proportional to each other.

A

Histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A figure that shorthanded presents a histogram. A dot is placed at the center of the top of the bars and connected to form a polygon. This better ennucuates the data shape.

A

Frequency Polygon

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Basic graphs that can illustrate one or more data sets in one graph.

A

Line Graph

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Have both x and y-axes on an arithmetic scale.

A

Arithmetics Line Graphs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

has the y-axis as alogarithmic axes

A

Semilogarithmic Line Graph

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Frequency distributions from continuos data are defined by types of descriptors, known as ___________

A

Parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The two measures of __________ and _______

A

Central Tendecy and Dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q
  • Defined as the value used to represent the center or the middle of a set of data values.
  • Locates observations on a measurement scale.
A

Central Tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q
  • Descibes the spread of values in a given data set.
  • Suggests how widely spread out the observation are.
A

Dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q
  • The average value or the sum (Σ -sigma) of all the observed values (xi) divded by the total numver of observation (N).
  • has the most mathematical properties and most representatives of dataset if not for outliers.
  • used arithmetics, mathematical techniques
A

Mean (x̅)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The middle observation data when data has been arranged from highest to lowest. When the dataset is an even number (hence no natural middle point), the two middling variables are average to find a __________.

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Rarely used to make inferential conclusion from but is used frequently in healthcare and economics.

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
  • The most commonly observed value.
  • Has some clinical interest, but seldomly used in statistics. if two or more values appear with the same frequency, each is a mode. The downside to using the mode as a measure of central tendecy is that a set of data may have no mode, or it may have more than one mode.
A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

-bX or Xi means for “for each individual observation”
- Simple to calculate

A

Arithmetic Mean

17
Q

A type of mean that is calculated by multiplying the weight (or probability) associated with a particular event or outcome with its.

A type of mean that gives differing importance to the values in a datsets.

A

Weighted Means

18
Q

A statistical measurement of the spread between numbers in a data set. It measures how far each number in the set is from the mean (average), and thus from every other number in the set.

A

Variance

19
Q

The average amount of variability in your dataset. It tells you, on average, how far each value lies from the mean. A high standard deviation means that values are generally far from the mean, while a low standard deviation indicates that values are clustered close to the mean.

A

Standard Deviation

20
Q

The difference between the observed value of a data point and the expected value is known as ____________ in statistics.

A

Deviation

21
Q
  • also called as mean absolute deivation
  • the average deviation of a data point from the mean, median or mode of the data set.
A

mean Deviation

22
Q

Degree of Freedom

A

N-1

23
Q

Values that split sorted data or a probability distribution into equal parts. In general terms, a q-quantile divides sorted data into q parts.

A

Quantiles

24
Q

A statistical term that describes a divisions of observations into four defined intervals based on the values of the data and how they compare to the entire set of observation.

A

Quartiles

25
Q

Quartiles are are organized into:

A
  • lower quartiles
  • median quartiles
  • upper quartiles
26
Q

Values that split sorted data or a probability distribution into equal parts.

A

Quantiles

27
Q
  • Type of quantiles, obtained adopting a subdivison into 100 groups.
  • A number denoting the position of a data point within a numeric dataset by indicating the percentage of the dataset with a lesser value.
A

Percentiles

28
Q

Percentiles are calculated by dividing an ordered set of data into ______________

A

100 equal parts

29
Q

Statistics for a given data set is the difference between the highest and lowest values.

A

Range

30
Q

The size of the narrowest interval which contains all the data,

A

Range

31
Q

Range formul

A

Range = Max(X) - Min (X)

32
Q

Defined as the difference between the third and the first quartile.

A

Interquartile Range

33
Q

The range of a set data is size of the narrowest interval which contains all the data.

A

Interquartile Range

34
Q

A measure of the asymmetry of a distribution

A

Skewness

35
Q

A distribution is a symmetrical when its left and right side are not mirror images. A distribution can have righ (or positive), left (or negative) or zero skewness.

A

Skewness (Horizontal Imbalance)

36
Q

A descriptive statistics used to help measure how data disperse between a distributions center and tails, with larger values indicatin a data distribution may have “heavy” tails that thickly concentrated with observations or that are long with extreme observations.

A

Kurtosis (Vertical Imbalance)

37
Q
A