Week 1 Flashcards
What is statistics?
Statistics is the science of collecting, organizing, interpreting and learning from data.
What are the three aspects of statistics?
Design: Planning how to obtain data to answer the question of interest.
Description: Summarizing the data that are obtained.
Inference: Using sample data to learn about the population.
Population
The population is a collection of units of interest, such as all adults in the United States, alligators in the everglades, iPads from a factory.
Subject
Subjects are the individual units of a population, such as an adult, an alligator, an iPad.
Sample
A sample is a subset of the units of a population.
What makes a good sample?
A sample should be representative of the population. This can be obtained by selecting sample subjects randomly.
Where do statistical methods come in?
- Use DESIGN to obtain an appropriate sample from the population. 2. DESCRIBE the sample data with graphical and numerical summaries. 3. Perform STATISTICAL INFERENCE.
Statistical Inference
The procedure of using a sample to learn about a population is called statistical inference.
Parameter
A parameter is a number that describes a population. It is usually unknown.
Statistic
A statistic is a number that describes a sample. It can be computer from data, therefore, it is known.
We use a ____ to estimate a ______.
We use a sample statistic to estimate a population parameter.
Variable
A variable is any characteristic of a subject in a population.
Categorical (Qualitative) Variable
Classifies subjects as belonging to a certain group/category. For example, gender, race, political party, issue positions, etc.
Quantitative Variable
Takes on numerical values that represent different magnitudes. For example, height, weight, age, IQ, income, temperature, etc.
A quantitative variable can either be _____ or _____.
A quantitative variable can either be discrete or continuous.
Discrete
The possible values of a discrete quantitative variable form a set of separate numbers that can be listed or counted. For example, age in years, number of tattoos, etc.
Continuous
The possible values of a continuous quantitative variable form an interval. That is, there is an infinite continuum of possible values. For example, height, weight, income, time, etc.
Graphical Summaries for Categorical Variables
Graphical summaries of categorical variables help us visualize the distribution of the data among the separate categories. Before constructing the graphical summary, we first organize the categorical data into a frequency table.