Exam #1 - Chapters 1 - 4 and Chapter 6 Flashcards

1
Q

What is a “Qualitative Variable”?

A

categories & groups
a. gender, nationality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is statistics?

A

Study of how to collect, organize, analyze, and interpret numerical information from data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an “Individual”?

A

Group of objects being studied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a “Variable”?

A

features/characteristics of the individual being studied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is “Population Data”?

A

data from entire group of sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is “Sample Data”?

A

generates sample statistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a “Population Parameter”?

A

average of results of entire population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a “Sample Statistic”?

A

average of results from sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a “Quantitative Variable”?

A

values & numerical measurements
a. example: height, age, weight

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a “Qualitative Variable”?

A

categories & groups
a. gender, nationality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a “Nominal Variable”?

A

names, labels, and categories that CANNOT be ordered
a. gender, nationality, political party, student names

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is an “Ordinal Variable”?

A

data that can be ordered but the difference between data values CANNOT be defined
a. letter grades (A, B, C, D, etc.) and opinion categories (agree, neutral, disagree)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is an “Interval Variable”?

A

data has meaningful differences that can be ordered, but there is NO true zero
a. specific time, temperature, dates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a “Ratio Variable”?

A

data has meaningful differences that can be ordered, and HAS a true zero
a. amount of time, weight, age, height

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is “Simple Random Sampling”?

A
  • Definition: randomly selecting participants from a specific population
  • Steps:
    a. number sample population sequentially
    b. generate random numbers
    c. pull those numbered participants to be apart of the sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is “Stratified Sampling”?

A
  • Definition: separating into strata, or groups, and choosing participants in each strata
  • Steps:
    a. divide into strata
    b. conduct simple random sampling with each strata
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is “Cluster Sampling”?

A
  • Definition: population is divided by some demographic
  • Steps:
    a. divide the demographic areas into individual sections
    b. randomly select section/cluster
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is “Systematic Sampling”?

A
  • Definition: samples arranged in a natural order
  • Steps:
    a. arrange in some natural order
    b. pick a random place within the order to start
    c. select every n^th element
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is a “Nonsampling Error”?

A

A MISTAKE!!
- error in results
a. possible causes: poor sample, sloppy data collection, bias, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is a “Sampling Error”?

A

NOT A MISTAKE!!
- difference between measurements from an experiment and the actual respective measurements
a. basically percent error, without the percent
b. caused because each sample does not perfectly represent the entire population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is a “Sample”?

A

measurements of observations from part of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is a “Simulation”?

A

an exact data of the real-world

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is an “Observational Study”?

A

individuals are observed for specific outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is an “Experimental Study”?

A
  • Definition: intervention with responses and studying the effects
  • Two Types of Experimental Studies
    a. completely randomized experiment: looking for different responses to different treatments
    b. randomized block experiment: separated by a factor that may effect results (age, gender, etc.)
25
Q

What are the steps to make a Frequency Table?

A
  1. Choose the desired number of classes
  2. Calculate the class width
  3. Determine the data range for each class
  4. Find the frequency for each class
  5. Find the class mark of each class
  6. Determine the class boundaries
  7. Calculated the final relative frequency of each class
26
Q

For a Frequency Table, how do you determine how many classes you should have?

A

Less than 5 classes is too little & more than 15 is too many
- somewhere between 6-14 classes

27
Q

For a Frequency Table, how do you determine the class width?

A

[(Largest Value) - (Smallest Value)] / # of classes
- make sure to increase the calculated class width to the nearest whole number

28
Q

For a Frequency Table, how do you determine each classes range?

A
  1. Take the smallest given value (lowest limit) and add the class width to it
    - this gives the lower limit of the next class
  2. The number data of each class will contain the same number of values as the class width

make sure the first class starts at the lowest given value

29
Q

For a Frequency Table, how do you determine the frequency for each class?

A

count the number of data values that fall within each class

30
Q

For a Frequency Table, how do you determine the class mark for each class?

A

[(lower class limit) + (upper class limit)] / 2

31
Q

For a Frequency Table, how do you determine the class boundaries for each class?

A

Take the average of the upper limit of class #1 and the lower limit of class #2
- for the next class you would find the average of the upper limit of class #2 and the lower limit of class #3

make sure to determine both the upper and the lower boundaries

boundaries will be lower than the smallest given value, but will not exceed the largest class limit value

32
Q

For a Frequency Table, how do you determine the final relative frequency?

A

(class frequency) / (total value of frequencies)

33
Q

How to you graph overall Relative Frequency?

A
  1. Class boundaries go on the x-axis
  2. Frequency goes on the y-axis
  3. Graph like a bar chart, but with the bars touching each other
34
Q

What are the 5 types of Histograms?

A
  • Mound-Shaped
    a. symmetrical, highest frequencies in the middle
  • Uniform
    a. equal frequencies
  • Skewed
    a. right: higher frequency on the right, tail to the left
    b. left: higher frequency on the left, tail to the right
  • Bimodal
    a. two or more populations separated by at least one bar
    b. has two or more higher bars with the rest being shorter
  • Outlier
    a. multiple empty spaces between a majority of the data and the outlier
35
Q

What are the axis’ labels for a Bar Graph?

A

X - Axis: categories
Y - Axis: frequency

36
Q

What are the characteristics of a Pie/Circle Chart?

A
  • Only use when less than 10 classes are present
  • Must label each piece with the proper class
  • Relative frequency must be present
37
Q

What are the axis’ labels for a Time-Series Graph?

A

X - Axis: time
Y - Axis: variable of interest

38
Q

What are the characteristics of a Stem-and-Leaf Graph?

A
  • Always have a key to indicate what the graph is showing
  • “Stem” is the tens/hundreds values
  • “Leaves” are the ones values
  • “Stem” and “Leaf” portions are always in numerical order and contain zeros *
39
Q

What is “Range”?

A
  • Difference between the largest and smallest values
40
Q

What is “Sample Variance” and how do you calculate it?

A
  • Definition: measure of the spread of data around a particular value, usually the mean
  • Calculation Steps:
    a. calculate the mean of the sample
    b. find how far each data value is from the mean
    c. square the values from step b
    d. (addition of squared values from step b) / (“number of data values” - 1)
41
Q

What is “Sample Standard Deviation” and how do you calculate it?

A
  • Definition: measure of the spread of data around a particular value, usually the mean
  • Calculation Steps:
    a. calculate the mean of the sample
    b. find how far each data value is from the mean
    c. square the values from step b
    d. (addition of squared values from step b) / (“number of data values” - 1)
    e. square root the value found in step d
42
Q

What does Chebyshev’s Theorem state?

A

“For any set of data and for any constant, k, that is greater than 1, the proportion of the data must lie within k standard deviation on either side of the mean.”

  • All data lies within μ - kσ and μ + kσ
    a. example: when k = 2, 75% of the data values are within μ - 2σ and μ + 2σ
  • Calculation steps
    a. 1 - ( 1 / k^2 )
43
Q

How do you determine percentile?

A
  • Using the median
    a. 50% of the values will be greater or equal to the median
    b. 50% of the values will be less than the median
  • pth percentile
    a. p% of the values fall at or below it
44
Q

What is the included in the five number summary?

A
  1. Lowest value from the data set
  2. Q1: average of the lowest data value and the median
  3. Q2: using the median, but can be the measure of center
  4. Q3: average of the highest data value and the median
  5. Highest value from the data set

Q = quartile

45
Q

How do you calculate the Interquartile Range?

A

Definition: measure the spread of the middle half
IQR = Q3 - Q1

46
Q

What is “Probability”?

A

Definition: measure between 0 and 1 that describes the likelihood an event will occur
a. example: weather forecast

47
Q

How do you show the probability of an event?

A

A: event of interest
P(A): probability of A occurring
P(A) = 1: event A is certain to occur
P(A) = 0: event A will never occur

48
Q

When determining probability, what is the “Sample Space”?

A

Definition: set of all possible outcomes resulting from an experiment

Example: Flip a coin
Ω = {heads or tails}
a. flip a coin 2 times
b. sample space: Ω = {HH, HT, TH, TT}

49
Q

How do you determine frequency of occurrence in a sample?

A

P(A) = ƒ/n
a. can be using to assign probabilities

50
Q

What is the “Law of Large Numbers”?

A

In the long run, the more we do the actual experiment, the more the experimental values will get closer and closer to the theoretical value.

51
Q

What are “Independent Events”?

A

Two events whose occurrence or nonoccurence of one event does not influence the occurrence or nonoccurence of the other event

52
Q

How do you find the probability of two or more events if they are all independent of each other?

A

Multiply the probabilities of each event occurring

53
Q

What is the total area under a normal distribution curve?

A

Area = 1

54
Q

How do you determine the sample variance when looking at a normal distribution curve?

A

When the horizontal component is longer (has a greater range in values), the standard deviation and variance is larger.

55
Q

What is the difference between a positive and negative Z-Score?

A

Definition: number of standard deviations between the raw score and the mean
a. positive: values to the right of the mean
b. negative: values to the left of the mean

56
Q

What is the calculation for a Z-Score and a Raw Score?

A

Z-Score:
z = (𝒳 - μ) / σ

Raw Score:
𝒳 = (z ⋅ σ) + μ

57
Q

As Z values increase, what happens to the area to the left of Z?

A

Increases

58
Q

If the Z value is negative, what is the area to the left of Z?

A

Less than 0.5

59
Q

If the Z value is positive, what is the area to the right of Z?

A

Less than 0.5