Chap I Flashcards
Individuals
The objects described by a set of data - can be people, animals, or things
Variable
Any characteristic of an individual - can take different values for different individuals
Categorical Variable
places in individual into one of several groups or categories - values are names or labels
Quantitative Variable
takes numerical values for which it makes sense to find an average - represent a measurable quality
Discrete Variables
A variable that cannot take on any value between its minimum and maximum value - for example, when flipping a coin, the number of heads can be any integer value between 0 and plus infinity, but could not be any value because you could not get 2.5 heads.
Continuous Variable
A variable that can take on any value between its minimum and maximum value - for example, the weight of a firefighter between 150-250 pounds, because the firefighter’s weight could be any value between 150-250 pounds.
Univariate Data
A study that looks at only one variable - e.g. a study that looks at the weight of high school students
Bivariate Data
A study that examines the relationship between two variables - e.g. a study looking at the relationship between the height and weight of high school students.
Population
The total set of observations that can be made
Sample
A set of observations drawn from a population
Census
A study that obtains data from every member of a population - often no practical because of time/cost involved.
Distribution
Tell us what values the variable takes and how often it takes those values
Inference
Drawing conclusions that go beyond the data at hand, though it depends on how the data is produced
Frequency Table
Displays counts (frequencies) of x variable in each category
Relative Frequency Table
Displays percentages (relative frequencies) of x variable in each category
Interquartile Range (IQR)
Measures of the range of the middle 50% of the data - measure of variability, equal to Q3 - Q1.
Five-Number Summary
Consists of the smallest observation, the first quartile, the median, the third quartile, and the largest observation & divides each distribution roughly into quarters.
Boxplot
A type of graph used to display patterns of quantitative data & splits the data into quartiles, consisting of a box the size of the Q1 & Q3, with a line in the middle representing the median and lines, or whiskers, extending from the box to the largest and smallest observations that aren’t outliers.
Standard Deviation
A numerical value used to indicate how widely individuals in a group vary - measures the deviation from the mean and differs based upon population or a sample. Standard deviation for a population is found using σ = sqrt [ Σ ( Xi - X )2 / N ] and standard deviation for a sample is found using s = sqrt [ Σ ( xi - x )2 / ( n - 1 ) ]
Variance
A numerical value used to indicate how widely objects in a group vary and is equal to the square of standard deviation. Variance of a population is found using σ2 = Σ ( Xi - X )2 / N & variance of a sample is found using s2 = Σ ( xi - x )2 / ( n - 1 )