AP Statistics Flashcards
Individuals
Individuals are any data set that contains information about a group. The group that is being studied or experimented.
Variable
A variable is an attribute that describes a person, place, thing, or idea.
The value of the variable can “vary” from one entity to another.
For example, suppose we let the variable x represent the color of a person’s hair. The variable x could have the value of “blond” for one person, and “brunette” for another.
Categorical Variable
Categorical. Categorical variables take on values that are names or labels. The color of a ball (e.g., red, green, blue) or the breed of a dog (e.g., collie, shepherd, terrier) would be examples of categorical variables.
Quantitative Variable
Quantitative. Quantitative variables are numerical. They represent a measurable quantity. For example, when we speak of the population of a city, we are talking about the number of people in the city - a measurable attribute of the city. Therefore, population would be a quantitative variable.
Discrete Variables
Suppose we flip a coin and count the number of heads. The number of heads could be any integer value between 0 and plus infinity. However, it could not be any number between 0 and plus infinity. We could not, for example, get 2.5 heads. Therefore, the number of heads must be a discrete variable.
Continuous
Suppose the fire department mandates that all fire fighters must weigh between 150 and 250 pounds. The weight of a fire fighter would be an example of a continuous variable; since a fire fighter’s weight could take on any value between 150 and 250 pounds.
Univariate Data
Univariate data. When we conduct a study that looks at only one variable, we say that we are working with univariate data. Suppose, for example, that we conducted a survey to estimate the average weight of high school students. Since we are only working with one variable (weight), we would be working with univariate data.
Bivariate Data
Bivariate data. When we conduct a study that examines the relationship between two variables, we are working with bivariate data. Suppose we conducted a study to see if there were a relationship between the height and weight of high school students. Since we are working with two variables (height and weight), we would be working with bivariate data
Population
In statistics, population refers to the total set of observations that can be made.
For example, if we are studying the weight of adult women, the population is the set of weights of all the women in the world. If we are studying the grade point average (GPA) of students at Harvard, the population is the set of GPA’s of all the students at Harvard.
Sample
In statistics, a sample refers to a set of observations drawn from a population.
Often, it is necessary to use samples for research, because it is impractical to study the whole population. For example, suppose we wanted to know the average height of 12-year-old American boys. We could not measure all of the 12-year-old boys in America, but we could measure a sample of boys.
Census
A census is a study that obtains data from every member of a population. In most studies, a census is not practical, because of the cost and/or time required.
Distribution
The distribution of a statistical data set (or a population) is a listing or function showing all the possible values (or intervals) of the data and how often they occur. When a distribution of categorical data is organized, you see the number or percentage of individuals in each group.
Inference
The process of using data analysis to deduce properties of an underlying distribution of probability. Inferential statistical analysis infers properties of a population, for example by testing hypotheses and deriving estimates.
Frequency Table
When a table shows frequency counts for a categorical variable, it is called a frequency table Below, the bar chart and the frequency table display the same data
Relative Frequency
A frequency count is a measure of the number of times that an event occurs.
To compute relative frequency, one obtains a frequency count for the total population and a frequency count for a subgroup of the population. The relative frequency for the subgroup is:
Relative frequency = Subgroup count / Total count
The above equation expresses relative frequency as a proportion. It is also often expressed as a percentage. Thus, a relative frequency of 0.50 is equivalent to a percentage of 50%.
Table
The values of the cumulative distribution functions, probability functions, or probability density functions of certain common distributions presented as reference tables for different values of their parameters.
Pie Chart
A pie chart (or a circle chart) is a circular statistical graphic, which is divided into slices to illustrate numerical proportion. In a pie chart, the arc length of each slice (and consequently its central angle and area), is proportional to the quantity it represents.
Bar Graph
A bar chart is a graphical representation of the categories as bars. … A bar chart can be plotted vertically or horizontally. Usually it is drawn vertically where x-axis represents the categories and y-axis represents the values for these categories.
Two-way Table
A two-way table (also called a contingency table) is a useful tool for examining relationships between categorical variables. The entries in the cells of a two-way table can be frequency counts or relative frequencies (just like a one-way table ).
Marginal Distribution
Entries in the “Total” row and “Total” column are called marginal frequencies or the marginal distribution. Entries in the body of the table are called joint frequencies.