Defining the Data Flashcards
What is a population?
The collection of all the individuals of interest
What is a sample?
The subset of the population that is selected as the result of sampling
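A minimal sketch of drawing a sample from a population. It assumes simple random sampling, which is only one of several possible sampling methods, and the population of 1,000 ID numbers is hypothetical.

```python
import random

# Hypothetical population: ID numbers for 1,000 individuals of interest.
population = list(range(1, 1001))

# Fixed seed (an arbitrary choice) so the draw can be repeated.
rng = random.Random(42)

# Simple random sample: 50 distinct individuals, each equally likely to be chosen.
sample = rng.sample(population, k=50)

print(len(sample), "of", len(population), "individuals selected")
```

Because every individual has the same chance of selection, a sample drawn this way tends to be representative of the population.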
What is a biased sample?
Study participants are not representative of the target population
What is an unbiased sample?
Study participants are representative of the target population
What is validity?
The extent to which the instruments that are used in the study measure exactly what they should be measuring
What is reliability?
The extent to which the results of the study are consistent when the study is repeated under the same conditions
What is a variable?
Something whose value can change or vary
What is data?
The values we obtain when we measure a variable
What are the two types of variables?
1. Categorical “attributes”
2. Quantitative “numbers”
What are the two types of categorical variables, and what do they mean?
Nominal: Values are “names” that are unordered categories
Ordinal: Values are “names” that are ordered categories
What are the two types of quantitative variables, and what do they mean?
Discrete: Values are integers (0, 1, 2, …) on a proper numeric scale
Continuous: Values are a measured number of units, including possible decimal values
What are the two types of “continuous” quantitative variables, and what do they mean?
Interval: An interval-scale variable has no true zero on the scale
Ratio: A ratio-scale variable has a true zero on the scale (0 means the absence of the quantity being measured)
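A short sketch of why the true zero matters. Temperature in degrees Celsius is a common example of an interval scale and temperature in kelvin of a ratio scale; the two readings below are hypothetical.

```python
def celsius_to_kelvin(c):
    """Convert degrees Celsius (interval scale) to kelvin (ratio scale)."""
    return c + 273.15

warm_c, cool_c = 20.0, 10.0  # hypothetical readings

# Differences are meaningful on both scales and agree with each other.
difference_c = warm_c - cool_c
difference_k = celsius_to_kelvin(warm_c) - celsius_to_kelvin(cool_c)

# Ratios are only meaningful where zero means "none of the quantity".
naive_ratio = warm_c / cool_c  # 2.0, but 20 °C is not "twice as hot" as 10 °C
kelvin_ratio = celsius_to_kelvin(warm_c) / celsius_to_kelvin(cool_c)

print(difference_c, round(difference_k, 2))
print(naive_ratio, round(kelvin_ratio, 3))
```

The difference is the same on both scales, while the ratio changes once the zero point is a true zero.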
What are derived variables?
Variables that you create by calculating or categorising variables that already exist in your data set
What are the two different types of derived variables?
Calculated
Categorized
What are threshold variables?
Variables obtained by splitting the values of another variable into categories based on well-known thresholds
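As an illustration of both kinds of derived variable, the sketch below calculates body mass index (BMI) from weight and height, then categorizes it using the widely used adult cut-offs of 18.5, 25, and 30. The function names and the example person are hypothetical.

```python
def bmi(weight_kg, height_m):
    """Calculated variable: BMI derived from two existing variables."""
    return weight_kg / height_m ** 2

def bmi_category(value):
    """Threshold variable: BMI split into categories at 18.5, 25, and 30."""
    if value < 18.5:
        return "underweight"
    if value < 25:
        return "normal"
    if value < 30:
        return "overweight"
    return "obese"

# Hypothetical individual: 70 kg and 1.75 m tall.
person_bmi = bmi(70, 1.75)
person_category = bmi_category(person_bmi)
print(round(person_bmi, 1), person_category)
```

Here a quantitative variable (BMI) becomes an ordinal categorical variable once the thresholds are applied.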
What is a transformed variable?
A variable obtained by applying a mathematical function to another variable, which puts it on a different measurement scale (e.g., taking the square root, squaring…)
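A brief sketch of transforming a variable, using made-up values. The square root is the transformation named on the card; the base-10 logarithm is added here as another commonly used transformation and is an assumption, not part of the original card.

```python
import math

# Hypothetical raw measurements.
values = [1, 4, 9, 100]

# Square-root transformation: compresses large values more than small ones.
sqrt_values = [math.sqrt(v) for v in values]

# Base-10 log transformation (assumed extra example; requires positive values).
log_values = [math.log10(v) for v in values]

print(sqrt_values)
print([round(v, 3) for v in log_values])
```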
What is an exposure variable?
A variable thought to predict an outcome variable
What is an outcome variable?
A variable thought to change as a function of changes in an exposure variable
What is the center of a data set?
A representative or average value that indicates where the middle of the data set is located
What is variation in data?
A measure of how much the values vary among themselves and around the average value
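A small sketch of common measures of center and variation, computed with Python's standard `statistics` module on a made-up data set. The choice of mean, median, range, and standard deviation is an assumption; the cards do not name specific measures.

```python
import statistics

# Hypothetical data set of eight measurements.
data = [2, 4, 4, 4, 5, 5, 7, 9]

# Measures of center.
center_mean = statistics.mean(data)
center_median = statistics.median(data)

# Measures of variation.
spread_range = max(data) - min(data)
spread_pop_sd = statistics.pstdev(data)    # treats data as the whole population
spread_sample_sd = statistics.stdev(data)  # treats data as a sample (divides by n - 1)

print(center_mean, center_median)
print(spread_range, spread_pop_sd, round(spread_sample_sd, 3))
```

The sample standard deviation comes out slightly larger than the population version because it divides by n - 1 rather than n.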