data management ch 1 Flashcards

1
Q

population

A

the entire group being studied

2
Q

sample

A

a selection of some individuals from the population

3
Q

cross-sectional data

A

data from an observational study made at a specific point in time

4
Q

longitudinal data

A

data from a study that measures the variables repeatedly over a long period of time

5
Q

raw data

A

unprocessed information collected for a study

6
Q

qualitative variable

A

cannot be measured numerically

7
Q

quantitative variable

A

can be measured numerically

8
Q

discrete data

only quantitative

A

counted in separate values, usually whole numbers

9
Q

continuous data

only quantitative

A

measured values that can be any number within a given range

10
Q

ordinal variable

A

can be put into relative order

11
Q

nominal variable

A

categories that cannot be ordered

12
Q

what are the column titles for the circle graph table

the table that holds the information for the graph

A
  1. title
  2. percent of what people like
  3. angle
13
Q

how to find the angle for a circle graph

A

(the percent people like, written as a decimal) * 360
e.g. 25% -> 0.25 * 360 = 90 degrees
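
As a rough sketch of the circle graph table, the snippet below turns category counts into a percent and an angle for each sector. The function name `sector_angles` and the fruit survey numbers are made up for illustration.

```python
def sector_angles(counts):
    """Return {category: (percent, angle in degrees)} for a circle graph."""
    total = sum(counts.values())
    table = {}
    for category, count in counts.items():
        fraction = count / total                 # share of the whole, from 0 to 1
        table[category] = (fraction * 100, fraction * 360)
    return table


# Hypothetical survey results, invented for this example
favourite_fruit = {"apple": 10, "banana": 5, "cherry": 5}
for category, (percent, angle) in sector_angles(favourite_fruit).items():
    print(f"{category}: {percent:.0f}% -> {angle:.0f} degrees")
```

The angles should always add up to 360, which is a quick way to check the table.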

14
Q

how does a stem and leaf plot key work

give an example

A

pick any number from the data and show how it would look in the stem and leaf plot

1|3 means 13
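
One possible way to build the plot itself, assuming the data are non-negative whole numbers and the last digit is the leaf. The function name `stem_and_leaf` and the sample numbers are invented for illustration.

```python
from collections import defaultdict


def stem_and_leaf(data):
    """Group whole numbers by stem (all digits but the last) and leaf (last digit)."""
    plot = defaultdict(list)
    for x in sorted(data):
        stem, leaf = divmod(x, 10)   # 13 -> stem 1, leaf 3
        plot[stem].append(leaf)
    return dict(plot)


# Hypothetical data
for stem, leaves in stem_and_leaf([13, 21, 17, 25, 30]).items():
    print(f"{stem}|{' '.join(str(leaf) for leaf in leaves)}")
```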

15
Q

how does a pictograph key work

A

each (picture) represents a set number of (title)

16
Q

what is a frequency table

A

a table that shows how many times each value (or interval) occurs

17
Q

what are the titles for the frequency table chart

A
  1. # of __ / interval
  2. tally
  3. frequency
  4. midpoint (if you’re using intervals)
  5. cumulative frequency (if question asks)
  6. relative frequency (if question asks)
18
Q

how to calculate cumulative frequency

A

adding up the frequency one row at a time

19
Q

how to calculate relative frequency

A

frequency / total frequency
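
A small sketch of the frequency, cumulative frequency and relative frequency columns computed together. The function name `frequency_table` and the sibling counts are assumptions made for this example.

```python
from collections import Counter
from itertools import accumulate


def frequency_table(data):
    """Return rows of (value, frequency, cumulative frequency, relative frequency)."""
    counts = Counter(data)
    values = sorted(counts)
    freqs = [counts[v] for v in values]
    total = sum(freqs)
    cumulative = list(accumulate(freqs))     # running total, one row at a time
    relative = [f / total for f in freqs]    # frequency / total frequency
    return list(zip(values, freqs, cumulative, relative))


# Hypothetical data: number of siblings reported by 8 students
for row in frequency_table([0, 1, 1, 2, 2, 2, 3, 1]):
    print(row)
```

The last cumulative frequency should equal the total frequency, and the relative frequencies should add up to 1.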

20
Q

what is the point of a bar graph and what is it used for

A

used for categorical or discrete data
bars not touching indicates separation between groups

21
Q

what is a frequency polygon

A

same information as a histogram, but drawn as points at the interval midpoints joined by straight lines, so it is simpler to look at

22
Q

what is a cumulative frequency graph (ogive)

A

a graph of the running total of the frequencies, from lowest to highest value

23
Q

what is a histogram

A

a bar graph for continuous data where the bars are touching
!! the x-axis is labelled with the boundary values, not the intervals; each bar sits in between two boundary values !!

24
Q

how to use the brackets for intervals

A

"[" or "]" means the endpoint itself is included (≤)
"(" or ")" means everything up to but not including the endpoint (<)
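
A minimal sketch of how the brackets translate into comparisons. The helper name `in_interval` and its default of `[low, high)` are assumptions chosen for this example.

```python
def in_interval(x, low, high, left="[", right=")"):
    """Check whether x falls in an interval written with [ ] or ( ) brackets."""
    above = x >= low if left == "[" else x > low      # "[" includes low, "(" excludes it
    below = x <= high if right == "]" else x < high   # "]" includes high, ")" excludes it
    return above and below


print(in_interval(10, 10, 20))             # [10, 20): 10 is included
print(in_interval(20, 10, 20))             # [10, 20): 20 is excluded
print(in_interval(20, 10, 20, right="]"))  # [10, 20]: 20 is included
```

With intervals like [10, 20) and [20, 30), a value of exactly 20 is counted once, in the second interval.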

25
relative frequency polygon y values
go up from 0 to 1 | e.g. 0.1, 0.2, 0.3
26
median
the middle value when the data is put in order
27
mean
sum of all the values / number of values
28
mode
the most frequently occurring number
29
simple mean equation | with sigma
x̄ = Σx / n
x̄ = sample mean
n = total # of values in sample
Σ = "the sum of"
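
A quick sketch of the mean worked out by hand, next to the median and mode from Python's built-in `statistics` module. The function name `simple_mean` and the data list are made up for illustration.

```python
import statistics


def simple_mean(values):
    """Sample mean: the sum of the values divided by how many there are."""
    return sum(values) / len(values)


# Hypothetical data set
data = [2, 3, 3, 5, 7]
print("mean:", simple_mean(data))            # (2 + 3 + 3 + 5 + 7) / 5
print("median:", statistics.median(data))    # middle value of the sorted data
print("mode:", statistics.mode(data))        # most frequently occurring value
```
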
30
weighted mean sample equation
x̄w = Σxw / Σw
x = value, w = weight of that value
footnote: when the weights are percents, the denominator will always equal 100%
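
A minimal sketch of a weighted mean, dividing the sum of (value * weight) by the sum of the weights. The function name `weighted_mean` and the marks and weights are invented for this example.

```python
def weighted_mean(values, weights):
    """Sum of value * weight, divided by the sum of the weights."""
    return sum(x * w for x, w in zip(values, weights)) / sum(weights)


# Hypothetical course marks with weights given as percents (they add to 100)
marks = [80, 70, 90]
weights = [50, 30, 20]
print(weighted_mean(marks, weights))   # (4000 + 2100 + 1800) / 100
```
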
31
what is the interquartile range (IQR)
the range of the middle half of the data
32
how to find the IQR
Q3 - Q1 = IQR
33
how to find the upper threshold | for modified box and whisker plot
  1. IQR * 1.5
  2. Q3 + (IQR * 1.5)
34
how to find the lower threshold | for modified box and whisker plot
  1. IQR * 1.5
  2. Q1 - (IQR * 1.5)
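
A rough sketch of both thresholds, using 1.5 * IQR below Q1 and above Q3. It assumes Q1 and Q3 have already been found by whatever quartile method the course uses. The function names and the numbers are made up for illustration.

```python
def outlier_thresholds(q1, q3):
    """Return (lower threshold, upper threshold) for a modified box and whisker plot."""
    iqr = q3 - q1
    step = 1.5 * iqr
    return q1 - step, q3 + step


def find_outliers(data, q1, q3):
    """List the values that fall outside the two thresholds."""
    lower, upper = outlier_thresholds(q1, q3)
    return [x for x in data if x < lower or x > upper]


# Hypothetical data with Q1 = 10 and Q3 = 20 assumed to be known already
print(outlier_thresholds(10, 20))
print(find_outliers([1, 12, 15, 18, 40], 10, 20))
```
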
35
how to find the semi-interquartile range (SIQR)
IQR / 2, which is the same as (Q3 - Q1) / 2
36
population deviation formula
deviation = x - μ
μ = population mean
x = point of data
37
sample deviation formula
deviation = x - x̄
x = point of data
x̄ = sample mean
38
population variance formula
σ^2 = Σ(x-μ)^2 / N
σ^2 = population variance
Σ = "the sum of"
μ = population mean
N = # of elements in population
39
population standard deviation formula
σ = √[Σ(x-μ)^2 / N]
σ = population standard deviation
Σ = "the sum of"
μ = population mean
N = # of elements in population
40
sample variance formula
s^2 = Σ(x-x̄)^2 / (n - 1)
s^2 = sample variance
Σ = "the sum of"
x̄ = sample mean
n = # of elements in sample
41
sample standard deviation formula
s = √[Σ(x-x̄)^2 / (n - 1)]
s = sample standard deviation
Σ = "the sum of"
x̄ = sample mean
n = # of elements in sample
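
A small sketch comparing the population and sample versions: the only difference is dividing by N or by (n - 1). The function names and the data set are assumptions for this example.

```python
import math


def population_variance(data):
    """Sum of squared deviations from the mean, divided by N."""
    mu = sum(data) / len(data)
    return sum((x - mu) ** 2 for x in data) / len(data)


def sample_variance(data):
    """Sum of squared deviations from the mean, divided by (n - 1)."""
    x_bar = sum(data) / len(data)
    return sum((x - x_bar) ** 2 for x in data) / (len(data) - 1)


# Hypothetical data set: mean is 5, squared deviations add up to 32
data = [2, 4, 4, 4, 5, 5, 7, 9]
print("population variance:", population_variance(data))
print("population standard deviation:", math.sqrt(population_variance(data)))
print("sample variance:", sample_variance(data))
print("sample standard deviation:", math.sqrt(sample_variance(data)))
```

The standard deviation is the square root of the variance in both cases.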
42
population standard deviation for grouped data formula
σ = √[Σf * (x-μ)^2 / N]
f = frequency of each value
μ = population mean
N = Σf = total frequency
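
A minimal sketch of the grouped (frequency-weighted) population standard deviation, assuming each value x comes with a frequency f and N is the total of the frequencies. The function name and the numbers are invented for illustration.

```python
import math


def grouped_population_sd(values, freqs):
    """Population standard deviation when each value has a frequency."""
    n = sum(freqs)                                            # N = total frequency
    mu = sum(f * x for x, f in zip(values, freqs)) / n        # grouped mean
    squared = sum(f * (x - mu) ** 2 for x, f in zip(values, freqs))
    return math.sqrt(squared / n)


# Hypothetical table: value 4 occurs 3 times, value 5 occurs twice, and so on
values = [2, 4, 5, 7, 9]
freqs = [1, 3, 2, 1, 1]
print(grouped_population_sd(values, freqs))
```
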
43
when finding the standard deviation we use a table to stay organized. what are the titles for the table
  1. x
  2. x - x̄
  3. (x - x̄)^2
change x̄ to μ for population instead of sample if applicable
44
when finding the weighted standard deviation we use a table to stay organized. what are the titles for the table | there are 6 titles
  1. x
  2. frequency (f)
  3. cumulative frequency
  4. x - x̄
  5. (x - x̄)^2
  6. f * (x - x̄)^2
change x̄ to μ for population instead of sample if applicable
45
how to get the deviation graph threshold
(mean ± standard deviation)
μ - σ = threshold -1
μ = threshold 0
μ + σ = threshold 1
μ + 2σ = threshold 2
46
sample z-score formula
z = (x - x̄) / s
(deviation / standard deviation)
47
population z-score formula
z = (x - μ) / σ
(deviation / standard deviation)
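
A one-function sketch of the z-score, which works the same way for a sample (x̄, s) or a population (μ, σ). The function name `z_score` and the test marks are made up for this example.

```python
def z_score(x, mean, sd):
    """Number of standard deviations that x sits above (+) or below (-) the mean."""
    return (x - mean) / sd


# Hypothetical test marks: mean 70, standard deviation 10
print(z_score(85, 70, 10))   # above the mean
print(z_score(60, 70, 10))   # below the mean
```
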
48
what is an index
relates the value of a variable (or group of variables) to its value on a particular base date
49
how to find the slope
m = (y2 - y1) / (x2 - x1)
50
growth / fall factor formula | in a stock graph, for example
(new number / old number) * 100 = growth / fall factor as a percent
51
percent change formula
  1. find the growth / fall factor as a percent: (new number / old number) * 100
  2. subtract 100 from the answer in 1
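
A short sketch of the two steps: first express the new value as a percent of the old value, then subtract 100 to get the percent change. The function names and the stock prices are invented for illustration.

```python
def percent_of_original(old, new):
    """New value as a percent of the old value."""
    return new / old * 100


def percent_change(old, new):
    """Positive means growth, negative means a fall."""
    return percent_of_original(old, new) - 100


# Hypothetical stock prices
print(percent_change(40, 50))   # price rose from 40 to 50
print(percent_change(80, 60))   # price fell from 80 to 60
```
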
52
how to find how much money your stock will go up depending on how much you put in | formula
multiply the money invested by the rate of change
(money invested) * (y2 - y1) / (x2 - x1) = $$
53
random sampling
anything that is 100% random
  * random number generator
  * pick out of a hat
54
systematic random sampling | what is it, what formula do you have to use and why
  * every nth person
  * have to pick a random starting point
  * n = (population size) / (sample size)
  * n is the number of people you jump
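
A rough sketch of systematic sampling, assuming the population is an ordered list and the sample size is no bigger than the population. The function name `systematic_sample` and the student numbers are made up for this example.

```python
import random


def systematic_sample(population, sample_size, rng=None):
    """Pick a random start, then take every nth member of the list."""
    if rng is None:
        rng = random.Random()
    step = len(population) // sample_size   # n = population size / sample size
    start = rng.randrange(step)             # random starting point in the first n members
    return population[start::step][:sample_size]


# Hypothetical population: students numbered 1 to 100, sample of 10
students = list(range(1, 101))
print(systematic_sample(students, 10))
```
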
55
stratified random sampling
population is divided into subgroups based on qualities
  1. 'relative frequency' is -> # of students in the subgroup / total # of students
  2. '# of surveyed in sample' is -> relative frequency * sample size
sample size = the chosen % of the total amount of students
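
A minimal sketch of working out how many people to survey from each subgroup, so that each subgroup's share of the sample matches its share of the population. The function name `stratified_counts` and the grade sizes are assumptions for this example.

```python
def stratified_counts(group_sizes, sample_size):
    """Return how many to survey from each subgroup: relative frequency * sample size."""
    total = sum(group_sizes.values())
    counts = {}
    for group, size in group_sizes.items():
        relative_frequency = size / total
        # Rounded to whole people, so the counts can be off by one from the target total
        counts[group] = round(relative_frequency * sample_size)
    return counts


# Hypothetical school of 1000 students, surveying 10% of them (100 students)
grades = {"gr 9": 300, "gr 10": 250, "gr 11": 250, "gr 12": 200}
print(stratified_counts(grades, 100))
```
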
56
cluster random sampling
  * divide population into groups
  * randomly select a few of the many groups
  * survey everyone in the selected groups
unreliable if the clusters do not represent the whole population
57
multistage random sampling
multiple levels of random sampling
bias can come from areas that are not diverse
  1. randomly choose city
  2. randomly choose block
  3. randomly choose houses within the block
58
convenience sampling - non random
  * asking people who are easy to reach
  * bias from unrepresentative data, e.g. only asking friends
59
voluntary sampling - non random
  * people who willingly take the survey
  * bias from super strong opinions of hate / love
  * people who don't care don't respond
60
what is a bias
occurs when a sample is not representative of the population
61
sampling bias
the sample does not accurately represent the population
  * e.g. at a football game, asking whether to buy football or band equipment
62
household bias
different groups are not polled proportionally to their size
  * e.g. 10 students sampled from each grade, but there are more gr 9 students
63
measurement bias
the way the data was collected influences the results
also happens when something is unnatural or unclear
  * e.g. a sign says slow down, but you're trying to find how many people speed
64
leading question bias
pushes people to answer in a certain way
  * e.g. asking "what are your fav songs?" but only giving 3 options
65
loaded question bias
uses certain words that imply a certain response
  * e.g. "do you really intend..."
66
non-response bias
people choose not to participate
also non-participation of certain groups
  * e.g. only one group of students responds to a survey about school activities
67
response bias
people feel embarrassed to give honest answers
also caused by poorly written questions
  * e.g. "do you do illegal things?" (not anonymous)