Module 3 Flashcards

1
Q

What is a variable?

A

any measureable characteristic of an observation unit (ex. the number of facebook posts that a person might read each day). can vary among sampling units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the 3 pieces of information a variable contains?

A
  • what the variable repersents
  • the measurement unit
  • description of observation unit
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the datum/data?

A

the value of a variable that you measure from an observation unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Difference between numerical and categorical variables?

A

numerical: the data is numeric (have measurement units that indicate the scale)

categorial: the data is a qualitative description (have no measurement units)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a continuous numerical variable?

A
  • can take on continuous numers (any number, even fractional numbers)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a discrete numerical variable?

A

can only take on whole numbers (integers)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is an ordinal categorical variable?

A

can take on qualitative values but where the values are from a ranked scale (ex. using a scale to describe how you’re feeling)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a nominal caregorical variable?

A

can take on qualitative values but where the values do not have a particular order (ex. food)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is categorical data characterized

A
  • counts: the number of sampling units in each category
  • proportions: the share of the total sampling units in each category
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Which is easier to undersand when doing descriptive statistics: counts or proportions

A

proportions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do counts and proportions indciate?

A

the central tendency of categorical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is range? and what is it used to indicate?

A
  • range is used to indicate dispersion
  • describes the variation in the esponse variable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the two approachs used to determine descriptive statistics for numerical variables?

A
  • means
  • quartiles
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the mean characterize?

A

the central tendency of a numeric variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is variance and how is it calculated?

A
  • measure of amount of variation in sample
  • calculated as average squared distance of each data point from sample mean (all summed) then divided by number of data points
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is standard deviation?

A
  • square root of variance
    *
17
Q

What are quartiles?

A
  • are specific values of the variable that divide your data into ranked groups
    *
18
Q

What is the central tendency given by?

A

the second quartiles, is also called the median (50% above and 50% below)

19
Q

What is the interquartile range?

A

uses quartiles to describe dispersion in a numerical variable. It is the difference between the 3rd and 1st quartiles and gives the range of the innermost 50% of a numerical sample.

subtract 1st quartile from 3rd

20
Q

What are the pros of using quartiles?

A
  • median and interquartile range are relatively robust to extreme values (not as affected by the extremes)
21
Q

cons of using quartiles?

A

the median and interquartile range can become variable for samples with small number of observations (more sensitive)

22
Q

Pros of using means?

A

mean and standard deviation are more robust when small number of observations

23
Q

cons of using means?

A

downside of mean and standard deviation is that they are sensitive to extreme values

24
Q

When is it better to use quartiles?

A

when characterizing numerical values, as long as the number of observations is not too small

25
Q

What is effect size?

A

the change in mean value of the response variable among groups. allows us to evaluate whether the change in the response variable is meaningful for a particular study

26
Q

What is effect size calculated among?

A

between treatment levels (one possibility)

27
Q
A