week 5 data visual Flashcards

1
Q

What is a random variable?

A

A quantity with values not known with certainty.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define variation in the context of data.

A

The difference in a variable measured over observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does a frequency distribution describe?

A

The values of a variable and how often they appear in the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a categorical variable?

A

Data consisting of labels or names for which arithmetical manipulation is impossible.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a quantitative variable?

A

Data consisting of numerical values for which arithmetical manipulation is possible.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a sample in statistics?

A

A subset of the population that makes data collection feasible.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the relative frequency of a bin?

A

The proportion of items belonging to a class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How is percent frequency calculated?

A

Relative frequency multiplied by 100.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What characterizes a probability distribution?

A

It characterizes the variability of a random variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a histogram?

A

A column chart with no spaces between the columns, used for quantitative data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the recommended number of bins for a histogram?

A

Between 5-20 depending on the number of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What should the width of bins in a histogram be?

A

The same for all bins.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the first bin in a histogram supposed to include?

A

The smallest value in the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a frequency polygon?

A

A visualisation tool useful for comparing distributions using lines instead of columns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a trellis display?

A

A vertical or horizontal arrangement of individual charts that differ only by the data they display.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a strip chart used for?

A

Displaying individual values.

17
Q

What is the mean in statistics?

A

The average value of a dataset.

18
Q

What does the median represent?

A

The middle value in a dataset when ordered.

19
Q

What is the mode?

A

The value that appears most frequently in a dataset.

20
Q

What is the range in a dataset?

A

The largest value minus the smallest value.

21
Q

How is standard deviation defined?

A

Based on the average deviation from the mean.

22
Q

What does the Pth percentile indicate?

A

A value that exceeds p% of the observations in the set.

23
Q

What is the interquartile range (IQR)?

A

Q3 minus Q1, representing the middle 50% of a dataset.

24
Q

What is a confidence interval?

A

A parameter estimate such as the mean or the proportion of a population of interest.

25
What factors influence the margin of error for a confidence interval on a mean?
* Confidence level * Variability of sample values (standard deviation) * Sample size
26
What is time series data?
A sequence of observations on a variable measured at successive points in time.
27
What is the purpose of a time series chart?
To display the time unit on the horizontal axis and the values of the variable on the vertical axis.