Ch. 1: Statistics Flashcards
Data
Facts, especially numerical facts, collected together for reference or information.
Statistics
- Aggregated data, summed up into one or a few numbers or an image, are statistics
- A way to turn data into useful information
- A tool for creating new understanding from a set of numbers. Ideally, the data will help us tell new stories.
Information
Knowledge communicated concerning some particular fact.
Population
- The group of all items of interest to a statistics practitioner.
- Frequently large sometimes infinite
Parameter
A descriptive measure of a population
Sample
- A set of data drawn from a population
- Potentially very large, but less than the population.
- If we could afford it, we’d directly look at the population, but populations tend to be big.
- A statistic is a descriptive measure of a sample
Cross-sectional
A survey done in many places at the same time.
Ex: Tracking average temperatures this July in ten places on the US East Coast including Long Island, Baltimore, and Ocean County NJ
Time-series
Done in the same place more than once
Ex: Counting the tons of tuna captured each year over a 20 year span
Panel data
Is both time-series and cross-sectional. If I interview all of you now and every 5 years for the next 50 years, that would make a panel data set.
Descriptive statistics
- Are methods of organizing, summarizing, and presenting data in a convenient and informative way.
- Describe the data set that’s being analyzed, but don’t allow us to draw any conclusions or make any inferences about the full set of data or the population as a whole.
Inferential statistics
- A set of methods, but it is used to draw conclusions or inferences about characteristics of populations based on data from a sample.
- Statistics that are useful not just to describe a sample but to draw conclusions about a larger population
Statistical inference
Is the process of making an estimate, prediction, or decision about a population based on a sample.
Observational study
- Doesn’t involve messing with things: it’s just watching to see what’s already going on.
- Hard to identify cause & effect because confounding variables make it look there’s a relationship when there’s not
Experiments
Involve researchers manipulating “explanatory” variables to check for effects on “outcome” variables.
- Ex: Stanley Milgram’s work on obedience
Simple Random Sample
- Is chosen using a method that ensures that each different possible sample of the desired size has an equal chance of being chosen.
- Example: make a list of all possible choices and choose from them using random.org.
- Note that it is the selection process, and not the final sample, which determines whether the sample is a simple random sample.