Class 2 Spring 🌷 Flashcards

1
Q

What are the types of variables considered when buying a second-hand bicycle?

A
  • Make (brand)
  • Type (roadbike, hybrid, mountain bike, etc.)
  • Number of gears
  • Size of frame
  • Color
  • Age
  • Condition (excellent/good/poor)
  • Price
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two main classifications of variables?

A
  • Categorical
  • Numerical
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a frequency table used for?

A

To sort and summarize data to make sense of them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why is it important to convert raw counts to percentages in categorical data?

A

To make the big picture clearer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a grouped frequency distribution?

A

An arrangement that clarifies the pattern of data while sacrificing some detail.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a histogram?

A

A graphical representation made from a grouped frequency distribution that shows patterns clearly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the primary purpose of bar charts?

A

To display distributions of categorical variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What differentiates histograms from bar charts?

A
  • Histograms use bins for numerical data
  • Bar charts display categorical data without binning.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the measures of central tendency for numerical data?

A
  • Mean
  • Median
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the mode, and when is it primarily useful?

A

The value with the most occurrences; primarily useful for categorical data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the measures of dispersion?

A
  • Range
  • Standard deviation
  • Interquartile Range (IQR)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is the mean calculated in continuous distributions?

A

By taking each value times the probability that x takes that value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does central tendency describe in a dataset?

A

Where the β€˜middle’ of the dataset is.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is dispersion in the context of data analysis?

A

How spread out or β€˜wide’ the dataset is.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the key questions to answer when describing a dataset?

A
  • Central Tendency: where is the β€˜middle’?
  • Dispersion: how spread out is the data?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Fill in the blank: A _______ is made up of groups of data called bins.

A

[histogram]

17
Q

True or False: Bar charts can display numerical variables.

18
Q

What does a time series graph represent?

A

Historical data plotted over time.

19
Q

What is the significance of the x-axis in a histogram?

A

It is a number line, and the order of the bars cannot be changed.

20
Q

What is the relationship between higher bars in bar charts and histograms?

A

Higher bars indicate higher counts or greater probability of occurrence.

21
Q

What is the trade-off when choosing between showing detail and overall patterns in statistics?

A

Some detail is sacrificed to clarify the overall pattern.

22
Q

What is the function of R in data visualization?

A

To create visual representations of data easily.