Summer Work Vocabulary Flashcards
Individuals
Objects described by a set of data. Individuals may be people, animals, or things.
Variable
Any characteristic of an individual. A variable can take different values for different individuals.
Categorical variable
Places an individual into one of several groups or categories.
Quantitative variable
Takes numerical values for which it makes sense to find an average.
Distribution
The distribution of a variable tells us what values the variable takes and how often it takes these values.
Marginal distribution
The marginal distribution of one of the categorical variables in a two-way table of counts is the distribution of values of that variable among all individuals described by the table.
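As a rough illustration of the definition, the sketch below computes a marginal distribution from a small two-way table. The table, its category names, and the helper `marginal_distribution_rows` are all hypothetical and invented for this example.

```python
# Hypothetical two-way table of counts: rows are class year, columns are
# how the student gets to school. All numbers are invented for illustration.
table = {
    "Freshman": {"Car": 10, "Bus": 30},
    "Senior": {"Car": 40, "Bus": 20},
}

def marginal_distribution_rows(table):
    """Return the proportion of all individuals that fall in each row category."""
    # Total count for each row, ignoring the column variable entirely.
    row_totals = {row: sum(cells.values()) for row, cells in table.items()}
    grand_total = sum(row_totals.values())
    return {row: total / grand_total for row, total in row_totals.items()}

marginal = marginal_distribution_rows(table)
print(marginal)  # Freshman: 40 of 100, Senior: 60 of 100
```

The column variable is summed away, so the result describes class year among all 100 individuals in the table.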
Conditional distribution
A conditional distribution of a variable describes the values of that variable among individuals who have a specific value of another variable. There is a separate conditional distribution for each value of the other variable.
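A minimal sketch of this idea, assuming a made-up two-way table of counts: the hypothetical helper `conditional_distribution` looks only at the individuals in one row and reports the proportion in each column.

```python
# Hypothetical two-way table of counts (invented data): rows are class year,
# columns are how the student gets to school.
table = {
    "Freshman": {"Car": 10, "Bus": 30},
    "Senior": {"Car": 40, "Bus": 20},
}

def conditional_distribution(table, given_row):
    """Distribution of the column variable among individuals in one row."""
    cells = table[given_row]
    row_total = sum(cells.values())
    return {column: count / row_total for column, count in cells.items()}

# One conditional distribution for each value of the row variable.
freshman = conditional_distribution(table, "Freshman")
senior = conditional_distribution(table, "Senior")
print(freshman)  # Car: 10 of 40, Bus: 30 of 40
print(senior)    # Car: 40 of 60, Bus: 20 of 60
```

Each conditional distribution sums to 1 because it is computed within a single row rather than over the whole table.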
Association
We say that there is an association between two variables if knowing the value of one variable helps predict the value of the other. If knowing the value of one variable does not help you predict the value of the other, then there is no association between the variables.
Symmetric
A distribution is roughly symmetric if the right and left sides of the graph are approximately mirror images of each other.
Skewed right
A distribution is skewed to the right if the right side of the graph (containing the half of the observations with larger values) is much longer than the left side.
Skewed left
A distribution is skewed to the left if the left side of the graph is much longer than the right side.
Median
The median is the midpoint of a distribution, the number such that about half the observations are smaller and about half are larger.
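One common way to find the median can be sketched as follows: sort the observations, then take the middle value, or the average of the two middle values when the count is even. The function name `median` and the sample lists are illustrative only.

```python
def median(values):
    """Middle value of the sorted data; average of the two middle values if even."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd count: a single middle observation exists.
        return ordered[mid]
    # Even count: average the two observations around the midpoint.
    return (ordered[mid - 1] + ordered[mid]) / 2

print(median([5, 1, 3]))     # sorted: 1, 3, 5
print(median([4, 1, 3, 2]))  # sorted: 1, 2, 3, 4
```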
Five number summary
The five-number summary of a distribution consists of the smallest observation, the first quartile, the median, the third quartile, and the largest observation, written in order from smallest to largest.
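The sketch below computes a five-number summary under one common textbook convention, which is an assumption here: the first and third quartiles are taken as the medians of the lower and upper halves of the sorted data, leaving out the overall median when the count is odd. Other conventions (and many software packages) can give slightly different quartiles. The helper names are hypothetical.

```python
def median(values):
    """Middle value of the sorted data; average of the two middle values if even."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum). Needs at least two values."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]
    # Assumed convention: skip the overall median itself when n is odd.
    upper = ordered[half + 1:] if n % 2 == 1 else ordered[half:]
    return (ordered[0], median(lower), median(ordered), median(upper), ordered[-1])

print(five_number_summary([7, 1, 4, 2, 6, 3, 5]))
```

For the seven values shown, the lower half is 1, 2, 3 and the upper half is 5, 6, 7, so the summary is 1, 2, 4, 6, 7.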
Standard deviation
The standard deviation measures the typical distance of the values in a distribution from the mean. It is calculated by finding an average of the squared deviations and then taking the square root.
Variance
The variance is the average of the squared deviations from the mean; its square root is the standard deviation.
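The calculation behind the variance and the standard deviation can be sketched as below. One detail is an assumption: the cards say only "an average" of the squared deviations, and many textbooks divide by n - 1 for sample data rather than by n. The hypothetical `sample` flag lets you pick either divisor, and the data list is invented.

```python
import math

def variance(values, sample=True):
    """Average squared deviation from the mean.

    Divides by n - 1 when sample is True (a common textbook choice),
    or by n when sample is False.
    """
    n = len(values)
    mean = sum(values) / n
    squared_deviations = [(x - mean) ** 2 for x in values]
    divisor = n - 1 if sample else n
    return sum(squared_deviations) / divisor

def standard_deviation(values, sample=True):
    """Square root of the variance."""
    return math.sqrt(variance(values, sample))

data = [2, 4, 4, 4, 5, 5, 7, 9]  # mean is 5, squared deviations sum to 32
print(variance(data, sample=False))            # 32 / 8
print(standard_deviation(data, sample=False))  # square root of 4
print(variance(data))                          # 32 / 7
```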
Discrete variable
If a variable can take on any value between its minimum value and its maximum value, it is called a continuous variable; otherwise, it is called a discrete variable.
Continuous variable
If a variable can take on any value between its minimum value and its maximum value, it is called a continuous variable; otherwise, it is called a discrete variable.
Univariate data
When we conduct a study that looks at only one variable, we say that we are working with univariate data. Suppose, for example, that we conducted a survey to estimate the average weight of high school students. Since we are only working with one variable (weight), we would be working with univariate data.
Bivariate data
When we conduct a study that examines the relationship between two variables, we are working with bivariate data. Suppose we conducted a study to see if there were a relationship between the height and weight of high school students. Since we are working with two variables, we would be working with bivariate data.