Module 3 Flashcards

1
Q

What is a variable?

A

any measureable characteristic of an observation unit (ex. the number of facebook posts that a person might read each day). can vary among sampling units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the 3 pieces of information a variable contains?

A
  • what the variable repersents
  • the measurement unit
  • description of observation unit
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the datum/data?

A

the value of a variable that you measure from an observation unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Difference between numerical and categorical variables?

A

numerical: the data is numeric (have measurement units that indicate the scale)

categorial: the data is a qualitative description (have no measurement units)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a continuous numerical variable?

A
  • can take on continuous numers (any number, even fractional numbers)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a discrete numerical variable?

A

can only take on whole numbers (integers)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is an ordinal categorical variable?

A

can take on qualitative values but where the values are from a ranked scale (ex. using a scale to describe how you’re feeling)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a nominal caregorical variable?

A

can take on qualitative values but where the values do not have a particular order (ex. food)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is categorical data characterized

A
  • counts: the number of sampling units in each category
  • proportions: the share of the total sampling units in each category
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Which is easier to undersand when doing descriptive statistics: counts or proportions

A

proportions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do counts and proportions indciate?

A

the central tendency of categorical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is range? and what is it used to indicate?

A
  • range is used to indicate dispersion
  • describes the variation in the esponse variable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the two approachs used to determine descriptive statistics for numerical variables?

A
  • means
  • quartiles
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the mean characterize?

A

the central tendency of a numeric variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is variance and how is it calculated?

A
  • measure of amount of variation in sample
  • calculated as average squared distance of each data point from sample mean (all summed) then divided by number of data points
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is standard deviation?

A
  • square root of variance
    *
17
Q

What are quartiles?

A
  • are specific values of the variable that divide your data into ranked groups
    *
18
Q

What is the central tendency given by?

A

the second quartiles, is also called the median (50% above and 50% below)

19
Q

What is the interquartile range?

A

uses quartiles to describe dispersion in a numerical variable. It is the difference between the 3rd and 1st quartiles and gives the range of the innermost 50% of a numerical sample.

subtract 1st quartile from 3rd

20
Q

What are the pros of using quartiles?

A
  • median and interquartile range are relatively robust to extreme values (not as affected by the extremes)
21
Q

cons of using quartiles?

A

the median and interquartile range can become variable for samples with small number of observations (more sensitive)

22
Q

Pros of using means?

A

mean and standard deviation are more robust when small number of observations

23
Q

cons of using means?

A

downside of mean and standard deviation is that they are sensitive to extreme values

24
Q

When is it better to use quartiles?

A

when characterizing numerical values, as long as the number of observations is not too small

25
What is effect size?
the change in mean value of the response variable among groups. allows us to evaluate whether the change in the response variable is meaningful for a particular study
26
What is effect size calculated among?
between treatment levels (one possibility)
27