15 - Data Summarization and Visualization Flashcards
What does descriptive statistics refer to?
Methods for summarizing and organizing the information in a data set.
What are elements in a data set?
Entities for which information is collected.
What is a variable?
A characteristic of an element, which takes on different values for different elements.
What are observations in a data set?
Each observation is the set of values of all the variables for one particular element.
What are qualitative variables?
Variables that enable the elements to be classified or categorized according to some characteristic.
What are quantitative variables?
Variables that take numeric values and allow arithmetic to be meaningfully performed on them.
What are the four levels of measurement for data?
Nominal, ordinal, interval, and ratio.
What is nominal data?
Data that refer to names, labels, or categories without natural ordering.
What is ordinal data?
Data that can be arranged in a meaningful order but on which arithmetic cannot be meaningfully performed.
What is interval data?
Quantitative data measured on a scale with no natural zero; addition and subtraction are meaningful, but multiplication and division are not.
What is ratio data?
Quantitative data for which all arithmetic operations may be performed and a natural zero exists.
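The difference between interval and ratio data can be checked with a small calculation. The sketch below uses temperature as an illustration (an example chosen here, not taken from the cards): Celsius has no natural zero, so it is interval data, while Kelvin has one, so it is ratio data. The function name and the readings are hypothetical.

```python
import math

def celsius_to_kelvin(celsius):
    """Convert a Celsius reading to Kelvin."""
    return celsius + 273.15

# Two hypothetical temperature readings, in degrees Celsius.
warm_c, cool_c = 20.0, 10.0
warm_k, cool_k = celsius_to_kelvin(warm_c), celsius_to_kelvin(cool_c)

# Differences agree on both scales, so subtraction is meaningful for interval data.
diff_c = warm_c - cool_c
diff_k = warm_k - cool_k

# Ratios disagree: "twice as hot" in Celsius is an artifact of the arbitrary zero.
ratio_c = warm_c / cool_c
ratio_k = warm_k / cool_k

print("differences agree:", math.isclose(diff_c, diff_k))
print("ratios agree:", math.isclose(ratio_c, ratio_k))
```

Only the Kelvin ratio describes a physical proportion, which is why division is reserved for ratio data.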
What is a discrete variable?
A numerical variable that can take either a finite or a countably infinite number of values.
What is a continuous variable?
A numerical variable that can take infinitely many values, forming an interval on the number line.
What is a population in statistics?
The set of all elements of interest for a particular problem.
What is a parameter?
A characteristic of a population, usually unknown but constant.
What is a sample?
A subset of the population.
What is a statistic?
A characteristic of a sample.
What is a census?
The collection of information from every element in the population.
What does statistical inference refer to?
Methods for estimating or drawing conclusions about population characteristics based on a sample.
What is a random sample?
A sample for which each element has an equal chance of being selected.
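The population, parameter, sample, and statistic cards can be tied together in a short sketch. It assumes a made-up population of 10,000 simulated values and uses the mean as the characteristic of interest; all names and numbers are illustrative only.

```python
import random
import statistics

rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Hypothetical population: every element of interest.
population = [rng.gauss(50, 10) for _ in range(10_000)]

# Parameter: a characteristic of the population (constant, usually unknown in practice).
parameter = statistics.mean(population)

# Random sample: each element has an equal chance of being selected.
sample = rng.sample(population, 100)

# Statistic: the same characteristic computed on the sample.
statistic = statistics.mean(sample)

print(f"population mean (parameter): {parameter:.2f}")
print(f"sample mean (statistic):     {statistic:.2f}")
```

Using the statistic to estimate the parameter is a simple case of statistical inference. A census would compute the parameter directly from every element.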
What is a predictor variable?
A variable whose value is used to help predict the value of the response variable.
What is a response variable?
A variable of interest whose value is presumably determined by the predictor variables.
What does frequency refer to in categorical data?
The number of data values in each category.
What is a relative frequency?
The frequency of a category divided by the total number of cases.
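Frequency and relative frequency can be computed as in this minimal sketch. The `frequency_table` helper and the color data are hypothetical, introduced only for illustration.

```python
from collections import Counter

def frequency_table(values):
    """Map each category to (frequency, relative frequency)."""
    counts = Counter(values)   # frequency: number of data values per category
    n = len(values)            # total number of cases
    return {category: (count, count / n) for category, count in counts.items()}

# Hypothetical categorical data.
colors = ["red", "blue", "red", "green", "blue", "red", "red", "blue"]
table = frequency_table(colors)

for category, (freq, rel_freq) in table.items():
    print(f"{category}: frequency={freq}, relative frequency={rel_freq:.3f}")
```

The relative frequencies always sum to 1, which is a quick check on a hand-built table.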