Midterm 1 Flashcards

1
Q

Variability

A

How spread out the data is.

2
Q

CV

A

stdev/mean

3
Q

Individual

A

The objects that are the focus of the study: who or what the data is about.

4
Q

Variable

A

The characteristic of the individual that is being recorded or measured

5
Q

Categorical

A

Variables that function as names/labels for objects. Essentially this is non-numerical data

6
Q

Quantitative

A

Variables that are measurements. Essentially this is numerical data.

7
Q

How do you calculate Percent Frequency?

A

Percent Frequency (PERCENT) = (Frequency / Size of the dataset) * 100

8
Q

How do you calculate Relative Frequency?

A

Relative Frequency (DECIMAL) = Frequency / Size of the dataset

9
Q

What is the Frequency Distribution?

A

A table listing each unique category in a dataset next to that category's frequency.
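
As a rough illustration of the three frequency cards above, here is a minimal Python sketch that builds a frequency distribution together with relative and percent frequencies. The `colors` list and the `frequency_table` helper are made up for this example and are not part of the course material.

```python
from collections import Counter

# Hypothetical categorical data (made up for illustration).
colors = ["red", "blue", "red", "green", "blue", "red", "red", "green"]

def frequency_table(values):
    """Map each category to (frequency, relative frequency, percent frequency)."""
    n = len(values)
    counts = Counter(values)
    return {
        category: (count, count / n, count / n * 100)
        for category, count in counts.items()
    }

table = frequency_table(colors)
for category, (freq, rel, pct) in table.items():
    print(f"{category:<6} {freq:>3} {rel:.3f} {pct:.1f}%")
```

The relative frequencies add up to 1 and the percent frequencies add up to 100, which is a quick check on any table like this.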

10
Q

How do you calculate the frequency of the number 10 in column A in Excel?

A

=COUNTIF(A:A,10)

11
Q

How do you count the values in the number class 300-400, using the numbers in column B?

A

=COUNTIFS($B$1:$B$57,">=300",$B$1:$B$57,"<=400")
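
For comparison, the same inclusive class count could be sketched in Python roughly as follows. The `values` list stands in for column B and is invented for illustration, as is the `count_in_class` helper.

```python
# Hypothetical values standing in for column B (made up for illustration).
values = [250, 300, 310, 399, 400, 401, 520]

def count_in_class(data, low, high):
    """Count values with low <= value <= high (both ends inclusive)."""
    return sum(1 for v in data if low <= v <= high)

class_count = count_in_class(values, 300, 400)
print(class_count)  # 4 (300, 310, 399 and 400 fall in the class)
```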

12
Q

Cumulative Frequency

A

The cumulative frequency of a class (number range) is the frequency of that class, summed together with the frequencies of each class that came before it.
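
A small Python sketch of that running total, assuming the classes are already listed in order. The class labels and frequencies are invented for the example.

```python
from itertools import accumulate

# Hypothetical classes and their frequencies, in class order (made-up numbers).
classes = ["100-199", "200-299", "300-399", "400-499"]
frequencies = [3, 7, 5, 2]

# Each entry is the class's own frequency plus all frequencies before it.
cumulative = list(accumulate(frequencies))

for label, freq, cum in zip(classes, frequencies, cumulative):
    print(f"{label}: frequency={freq}, cumulative={cum}")
```

The last cumulative frequency always equals the size of the dataset (here 17).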

13
Q

What’s a histogram and how is it different from a bar graph?

A

A histogram is like a bar graph, but for quantitative data. The huge visual difference between a histogram and a bar graph is that a histogram has NO gaps between the bars.

14
Q

Positive Correlation

A

Relatively large X -> Relatively large Y

15
Q

Negative Correlation

A

Relatively large X -> Relatively small Y

16
Q

What is the Mode?

A

It is the value with the largest frequency.

17
Q

How do you calculate the mode in Excel with values in column C?

A

=MODE.MULT(C:C)

18
Q

What is the median?

A

The literal center of the dataset. When the values are lined up in order, it is the value such that half of the values are at or below it and half are at or above it.

19
Q

How do you calculate the median in Excel?

A

=MEDIAN(C70:K70)

20
Q

What is the mean?

A

The arithmetic average of a set of numbers. It essentially considers the size of each number as its mass, and tries to find a sort of “center of gravity” for that mass.

21
Q

How do you calculate the mean in Excel?

A

=AVERAGE(C85:K85)
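
The mode, median, and mean cards can be tried side by side with Python's standard `statistics` module. This is only a sketch on an invented dataset. `statistics.multimode` is used because, like MODE.MULT, it can report more than one mode.

```python
import statistics

# Hypothetical dataset (made up for illustration).
data = [2, 3, 3, 5, 7, 7, 8]

modes = statistics.multimode(data)  # every value tied for the largest frequency
median = statistics.median(data)    # middle value once the data is sorted
mean = statistics.mean(data)        # sum of the values divided by their count

print(modes, median, mean)  # [3, 7] 5 5
```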

22
Q

How do you calculate percentiles of a dataset in Excel?

A

=PERCENTILE.INC(array, k), where k is the percentile written as a decimal between 0 and 1

23
Q

How do you calculate the first quartile?

A

=QUARTILE.INC(dataset, 1)

24
Q

How do you calculate the second quartile?

A

=MEDIAN(dataset)

25
Q

How do you calculate the third quartile?

A

=QUARTILE.INC(dataset, 3)

26
Q

How do you calculate the fourth quartile?

A

=MAX(dataset)
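
The four quartile cards above can be sketched in Python with `statistics.quantiles`. The assumption here is that `method="inclusive"` interpolates the same way QUARTILE.INC does. The dataset is invented for the example.

```python
import statistics

# Hypothetical dataset (made up for illustration).
data = [1, 2, 3, 4, 5, 6, 7, 8, 9]

# n=4 splits the data into quarters and returns the three cut points Q1, Q2, Q3.
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")
q4 = max(data)  # the "fourth quartile" is simply the maximum

print(q1, q2, q3, q4)  # 3.0 5.0 7.0 9
```

Note that Q2 comes out equal to the median, matching the second-quartile card.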

27
Q

What is the range?

A

The difference between the maximum and the minimum of a dataset.

28
Q

How do you calculate the range in Excel for data that’s in row 8, columns D through H?

A

=MAX(D8:H8)-MIN(D8:H8)

29
Q

What is the Interquartile Range (IQR)?

A

It is the difference between the third quartile (Q3) and the first quartile (Q1). This represents the spread of the “middle 50%” of the dataset.

30
Q

How do you calculate the Interquartile Range (IQR) for data in row 40, columns D through H?

A

=QUARTILE.INC(D40:H40,3)-QUARTILE.INC(D40:H40,1)

31
Q

What is the standard deviation?

A

Roughly the average deviation between the values in the dataset and the mean of the dataset. More precisely, it is the square root of the variance.

32
Q

What is the long way to compute the stdev? (5 Steps)

A

1) Compute the mean of the dataset
2) Take the difference between every value in the dataset and the mean
3) Square each difference found in step 2
4) “Average” the squared differences computed in step 3 by summing them and dividing by (number of values - 1). This gives us the variance
5) Take the square root of the variance to get the standard deviation
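
The five steps above can be written out in Python as a sketch. The dataset is invented, and the result is compared against `statistics.stdev`, which also computes the sample standard deviation (dividing by n - 1, like STDEV.S).

```python
import math
import statistics

# Hypothetical dataset (made up for illustration).
data = [2, 4, 4, 4, 5, 5, 7, 9]

# Step 1: compute the mean of the dataset.
mean = sum(data) / len(data)

# Step 2: difference between every value and the mean.
differences = [x - mean for x in data]

# Step 3: square each difference.
squared = [d ** 2 for d in differences]

# Step 4: sum the squares and divide by (number of values - 1): the variance.
variance = sum(squared) / (len(data) - 1)

# Step 5: the square root of the variance is the standard deviation.
stdev = math.sqrt(variance)

print(mean, variance, stdev)
print(math.isclose(stdev, statistics.stdev(data)))  # True
```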

33
Q

What is the command for standard deviation?

A

=STDEV.S(A:A)

34
Q

What is the variance?

A

the square of the standard deviation.

35
Q

How do you get the variance in Excel?

A

=VAR.S(A:A)

36
Q

What is the Coefficient of Variation (CV)?

A

It represents what percentage of the mean the stdev makes up.
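
A minimal Python sketch of the CV as a percentage, assuming the sample standard deviation (the STDEV.S convention) and an invented dataset:

```python
import statistics

# Hypothetical dataset (made up for illustration).
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = statistics.mean(data)
stdev = statistics.stdev(data)  # sample standard deviation (divides by n - 1)

# CV = stdev / mean, multiplied by 100 to express it as a percentage.
cv_percent = stdev / mean * 100

print(f"CV = {cv_percent:.1f}% of the mean")
```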

37
Q

Standard Value (z-score)

A

The z-score of a value in a dataset represents how far that value is from the mean of the dataset, in terms of standard deviations.

37
Q

How do you calculate the Z-score?

A

z-score = (Value - Mean) / Stdev
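
The formula translates to Python along these lines. The `z_score` helper and the dataset are hypothetical, and the sketch assumes the sample standard deviation is the one intended.

```python
import statistics

# Hypothetical dataset (made up for illustration).
data = [2, 4, 4, 4, 5, 5, 7, 9]

def z_score(value, values):
    """Return (value - mean) / stdev, using the sample standard deviation."""
    mean = statistics.mean(values)
    stdev = statistics.stdev(values)
    return (value - mean) / stdev

z_of_mean = z_score(5, data)  # a value equal to the mean has z-score 0
z_of_max = z_score(9, data)   # positive, because 9 is above the mean

print(z_of_mean, round(z_of_max, 2))
```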

37
Q

What do we use to determine if a number is an outlier in Excel?

A

We use the IQR rule: lower and upper fence formulas.

38
Q

What are the formulas for the lower and upper fence in Excel?

A

Lower Fence = Q1 - 1.5*IQR
Upper Fence = Q3 + 1.5*IQR
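
As a sketch of the IQR rule in Python: the quartiles are computed with `method="inclusive"`, on the assumption that this matches QUARTILE.INC, and the dataset is invented so that one value falls outside the fences.

```python
import statistics

# Hypothetical dataset with one suspiciously large value (made up).
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 30]

q1, _, q3 = statistics.quantiles(data, n=4, method="inclusive")
iqr = q3 - q1

lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# Anything below the lower fence or above the upper fence is flagged.
outliers = [x for x in data if x < lower_fence or x > upper_fence]

print(lower_fence, upper_fence, outliers)  # -3.5 14.5 [30]
```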

39
Q

What is included in a 5 number summary?

A

Min, Q1, median, Q3, and max.
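
A short Python sketch that collects the five numbers in order from smallest to largest. The `five_number_summary` helper and the dataset are hypothetical, and the quartiles again assume the inclusive (QUARTILE.INC-style) method.

```python
import statistics

def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using inclusive quartiles."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return (min(values), q1, median, q3, max(values))

# Hypothetical unsorted dataset (made up for illustration).
scores = [7, 1, 9, 3, 5, 2, 8, 4, 6]

summary = five_number_summary(scores)
print(summary)  # (1, 3.0, 5.0, 7.0, 9)
```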

40
Q

(TRUE or FALSE) In general, if a value in a dataset has a z-score of 5, we would consider that an extremely large value in comparison to the rest of the dataset

A

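True. A z-score of 5 means the value is 5 standard deviations above the mean. Z-scores beyond about -3 or 3 are already usually considered extreme.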

41
Q

What has to be true about a dataset for there to be a standard deviation equal to 0?

A

Every number in the dataset needs to be the same.
