1 Data Exploration and Summary Statistics Flashcards

1
Q

What is central tendency?

A

A measure of the ‘center’ or ‘typical’ value of a dataset.

2
Q

What is the formula for calculating the mean?

A

Mean = \( \frac{\text{Sum of all values}}{\text{Total number of values}} \)

3
Q

What is the mean of the ages (39, 34, 37, 35, 33)?

A

Mean = 35.6
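
The arithmetic behind this answer can be checked with a short Python sketch. The variable names are illustrative only; the data is the age list from the question.

```python
from statistics import mean

ages = [39, 34, 37, 35, 33]

# Apply the formula directly: sum of all values / total number of values.
manual_mean = sum(ages) / len(ages)   # 178 / 5

# The standard library gives the same result.
library_mean = mean(ages)

print(manual_mean)  # 35.6
```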

4
Q

What Excel function is used to calculate the mean?

A

=AVERAGE(range)

5
Q

What is the median?

A

Middle value when data is arranged in ascending order.

6
Q

What is the median of the ages (39, 34, 37, 35, 33)?

A

Median = 35
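
A minimal Python sketch of the same calculation, sorting the ages and picking the middle position. The variable names are illustrative, not part of the deck.

```python
from statistics import median

ages = [39, 34, 37, 35, 33]

ordered = sorted(ages)             # [33, 34, 35, 37, 39]
middle_index = len(ordered) // 2   # index 2, valid because the count is odd
manual_median = ordered[middle_index]

# The standard library agrees with the manual lookup.
library_median = median(ages)

print(manual_median, library_median)  # 35 35
```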

7
Q

How is the median calculated for an even number of values?

A

Average of the two middle values.
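
As a rough illustration, the even-count case might look like this in Python. The four values are hypothetical and were chosen only so that the two middle values differ.

```python
from statistics import median

values = [37, 33, 35, 34]  # hypothetical data with an even count

ordered = sorted(values)      # [33, 34, 35, 37]
upper = len(ordered) // 2     # index 2
lower = upper - 1             # index 1

# Average the two middle values.
manual_median = (ordered[lower] + ordered[upper]) / 2

print(manual_median)  # 34.5
```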

8
Q

What Excel function is used to calculate the median?

A

=MEDIAN(range)

9
Q

What is the mode?

A

Most frequent value in the dataset.

10
Q

What is the mode of the ages (25, 30, 35, 35, 40, 45)?

A

Mode = 35
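
One way to confirm this in Python is to count how often each age occurs. This is a sketch using the standard library; the variable names are illustrative.

```python
from collections import Counter
from statistics import mode

ages = [25, 30, 35, 35, 40, 45]

# Count occurrences and take the most frequent value.
counts = Counter(ages)
value, frequency = counts.most_common(1)[0]

# statistics.mode returns the same value.
library_mode = mode(ages)

print(value, frequency)  # 35 2
```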

11
Q

What Excel function is used to calculate the mode?

A

=MODE(range) (newer Excel versions also provide =MODE.SNGL(range))

12
Q

What do summary statistics provide?

A

An overview of the dataset’s main characteristics.

13
Q

What does standard deviation measure?

A

How dispersed values are around the mean.

14
Q

What is the first step in calculating standard deviation?

A

Find the mean.
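
The remaining steps are not listed on this card. A sketch of the usual sequence, assuming the population form (divide by n) and reusing the ages from the earlier cards, could be:

```python
import math

ages = [39, 34, 37, 35, 33]

# Step 1: find the mean.
mean = sum(ages) / len(ages)

# Step 2: square each value's deviation from the mean.
squared_deviations = [(x - mean) ** 2 for x in ages]

# Step 3: average the squared deviations (population form, divides by n).
population_variance = sum(squared_deviations) / len(ages)

# Step 4: take the square root to get back to the original units.
population_sd = math.sqrt(population_variance)

print(round(population_sd, 3))  # 2.154
```

For a sample rather than a full population, step 3 would divide by n - 1 instead.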

15
Q

What does a smaller standard deviation indicate?

A

Data points are closer to the mean.
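
A quick comparison makes this concrete. Both lists below are hypothetical and share the same mean of 35; only their spread differs.

```python
from statistics import pstdev

tight = [34, 35, 36]    # hypothetical values close to the mean
spread = [25, 35, 45]   # hypothetical values far from the same mean

tight_sd = pstdev(tight)
spread_sd = pstdev(spread)

# The tightly clustered list has the smaller standard deviation.
print(tight_sd < spread_sd)  # True
```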

16
Q

What Excel function is used to calculate standard deviation?

A

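=STDEV(range), which returns the sample standard deviation. Newer Excel versions also provide =STDEV.S(range) for a sample and =STDEV.P(range) for a population.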

17
Q

What is the definition of range?

A

Difference between maximum and minimum values.

18
Q

What is the formula for calculating range?

A

Range = Maximum - Minimum
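
Applied to the ages from the earlier cards, the formula can be sketched in Python as:

```python
ages = [39, 34, 37, 35, 33]

# Range = Maximum - Minimum
value_range = max(ages) - min(ages)

print(value_range)  # 6
```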

19
Q

What is variance related to?

A

Standard deviation. Variance is the average of the squared deviations from the mean, and the standard deviation is its square root.

20
Q

What is the formula for variance?

A

Variance = average of the squared deviations from the mean (divide by n for a population, by n - 1 for a sample).
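
Whether the average divides by n or by n - 1 depends on whether the data is a whole population or a sample. A sketch with Python's standard library, reusing the ages from the earlier cards, shows both:

```python
from statistics import pvariance, variance

ages = [39, 34, 37, 35, 33]

population_var = pvariance(ages)  # sum of squared deviations / n
sample_var = variance(ages)       # sum of squared deviations / (n - 1)

print(round(population_var, 2), round(sample_var, 2))  # 4.64 5.8
```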

21
Q

What Excel function is used to calculate variance?

A

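=VAR(range), which returns the sample variance. Newer Excel versions also provide =VAR.S(range) for a sample and =VAR.P(range) for a population.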

22
Q

What are outliers?

A

Extreme values significantly different from other data points.

23
Q

What methods can be used to detect outliers?

A
  • IQR (Interquartile Range)
  • Z-scores
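
Both methods can be sketched in a few lines of Python. Everything here is an assumption made for illustration: the data is hypothetical (90 is a planted outlier), the 1.5 x IQR fences follow the common convention, and the z-score cutoff of 2.0 is used because the sample is small (3.0 is more usual for larger datasets).

```python
from statistics import mean, pstdev, quantiles

ages = [33, 34, 35, 37, 39, 90]  # hypothetical data; 90 is the planted outlier

# IQR method: flag values more than 1.5 * IQR outside the quartiles.
q1, _, q3 = quantiles(ages, n=4)
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
iqr_outliers = [x for x in ages if x < lower_fence or x > upper_fence]

# Z-score method: flag values far from the mean in standard-deviation units.
mu = mean(ages)
sigma = pstdev(ages)
threshold = 2.0  # assumed cutoff for this small example
z_outliers = [x for x in ages if abs(x - mu) / sigma > threshold]

print(iqr_outliers, z_outliers)  # [90] [90]
```
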
24
Q

What chart can be used to visualize a data distribution?

A

Histograms
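
A histogram groups values into bins and counts how many fall in each. A text-only sketch in Python, reusing the ages from the mode card and assuming a bin width of 5, might look like this:

```python
from collections import Counter

ages = [25, 30, 35, 35, 40, 45]
bin_width = 5  # assumed bin width for this example

# Map each value to the lower edge of its bin, then count per bin.
bins = Counter((age // bin_width) * bin_width for age in ages)

# Print one row of '#' marks per bin, lowest bin first.
for start in sorted(bins):
    print(f"{start}-{start + bin_width - 1}: {'#' * bins[start]}")
```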

25
Q

What can box plots help identify?

A

Outliers and the spread of the data.
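
A box plot is drawn from the five-number summary (minimum, first quartile, median, third quartile, maximum). A sketch of that summary in Python follows, using the ages from the earlier cards. Quartile conventions differ between tools, so other software may report slightly different Q1 and Q3 values.

```python
from statistics import quantiles

ages = [39, 34, 37, 35, 33]

# quantiles(..., n=4) returns the three quartile cut points.
q1, q2, q3 = quantiles(ages, n=4)
five_number_summary = (min(ages), q1, q2, q3, max(ages))

print(five_number_summary)  # (33, 33.5, 35.0, 38.0, 39)
```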

26
Q

What are the common summary statistics?

A
  • Mean
  • Median
  • Standard Deviation
  • Minimum
  • Maximum
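
These five statistics can be gathered in one pass with Python's standard library. The helper name `summarize` is hypothetical, and the sample standard deviation is assumed here.

```python
from statistics import mean, median, stdev

def summarize(values):
    """Return the five common summary statistics as a dictionary."""
    return {
        "mean": mean(values),
        "median": median(values),
        "stdev": stdev(values),  # sample standard deviation (n - 1)
        "min": min(values),
        "max": max(values),
    }

summary = summarize([39, 34, 37, 35, 33])
for name, value in summary.items():
    print(f"{name}: {value}")
```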