Numerical Descriptive Techniques and Graphical Methods Flashcards

1
Q

What is the “distribution” of the variable?

A
  • The data we collect when measuring a variable have different values.
  • For example, the variable ”weight of adult birds” probably will have different values each island: 670, 345, 563, 1050, 982 grams etc. . .
  • This group of different values is collectively referred to as the distribution of the variable.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What measurements can be used to describe a variables distribution?

A

Central tendency
Dispersion
Skewness
Kurtosis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the central tendency?

A

A central or typical value for a probability distribution. (Where is the middle)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the Dispersion?

A

How spread out is the data?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the skewness

A

A measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive or negative, or undefined.(Is the data ‘skewed’ to left or right?)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Kurtosis?

A

A measure of the combined sizes of the two tails. It measures the amount of probability in the tails. (How peaked is the centre?)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the measures of central location?

A
  • Mean
  • Median
  • Mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the measures of Variability?

A
  • Range
  • Standard Deviation
  • Variance
  • Coefficient of Variation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the measures of Relative Standing?

A
  • Percentiles

- Quartiles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the measures of Linear Relationship?

A
  • Covariance
  • Correlation
  • Determination
  • Least Squares Lines
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the median and the mode?

A
  • Median = Midpoint of ranked values
  • Mode = Most frequently observed value
  • Both are unaffected by outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does the symbol “Sigma” mean?

A
  • To sum or add up
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the Arithmetic Mean?

A
  • The most common measure of central tendency (Aka - the mean)
  • Commonly called the average
  • Calculated as the sum of values divided by the number of values
  • Affected by extreme values (outliers)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How can you find the median?

A
  • If the number of values in the data set is odd, the median is the middle ranked value
  • If the number of values in the data set is even, the median is the mean (average) of the two middle ranked values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the different types of skewness?

A
  • Symmetrical Distribution
  • Negatively Skewed Distribution (Leaning to the right)(Aka Left Skewed)
  • Positively Skewed Distribution (Leaning to the left) (Aka Right skewed)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why are Measures of Variation important?

A

Measures of variation give information on the spread or variability of the data values.

17
Q

If two lines where drawn on a graph with the same mean and one came to an immediate peak where as one had a gradual curve, which would have more variability?

A
  • The line with the gradual curve. This is because an immediate curve would imply that all the observations that were recorded were very close or similar to the mean and so are all in one area, where as, the gradual curve implies that the data was more spread out.
18
Q

What is the Range of a data set?

A
  • The range is the simplest measure of variability, calculated as:
  • Range = Largest observation – Smallest observation
  • However, It is influenced by outliers
19
Q

Why are Variance and Standard Deviation important in Statistics?

A

They are used to measure variability, they also play a vital role in almost all statistical inference procedures.

20
Q

What is the population variance denoted by?

A

Lower case Greek letter “sigma”, squared.

21
Q

What is the sample variance denoted by?

A

Lower case “s” squared

22
Q

How can you find the standard deviation from the Variance?

A
  • The standard deviation is simply the square root of the variance, thus:
  • Population standard deviation: (Lower case Greek letter for “Sigma” squared, in the square root sign)
  • Sample standard deviation:(lower case “s” squared in the square root sign)
23
Q

How can we interpret the Standard Deviation?

A
  • The standard deviation can be used to compare the variability of several distributions and make a statement about the general shape of a distribution. If the histogram is bell shaped, we can use the Empirical Rule.
24
Q

What is the Empirical Rule?

A
  • Approximately 68% of all observations fall within one standard deviation of the mean.
  • Approximately 95% of all observations fall within two standard deviations of the mean.
  • Approximately 99.7% of all observations fall within three standard deviations of the mean.
25
Q

What is the Coefficient of Variation?

A
  • The coefficient of variation of a set of observations is the standard deviation of the observations divided by their mean
  • This coefficient provides a
    proportionate measure of variation
26
Q

What are Quartiles?

A
  • They are the 25th, 50th and 75th percentiles of data
  • The first or lower quartile is labeled Q1 = 25th percentile.
  • The second quartile, Q2 = 50th percentile (which is also the median).
  • The third or upper quartile, Q3 = 75th percentile.
27
Q

Where are the Quartiles used?

A
  • In a Box-and-Whisker Plot: A graphical display of data using a %-number summary:
  • Min, Q1, Median, Q3, Max
  • A Box-and-Whisker Plot can be both horizontal or Vertical
28
Q

What is the Interquartile Range?

A
  • Interquartile Range = Q3 – Q1
  • The interquartile range measures the spread of the middle 50% of the observations.
  • Large values of this statistic mean that the 1st and 3rd quartiles are far apart indicating a high level of variability.
29
Q

What are the different graphics commonly used in statistics?

A
  • Bar Chart
  • Clustered Bar Chart
  • Pie Chart
  • Box and whisker plot
  • Q-Q Plot
  • Scatter Plot