1. Statistics and the Scientific Method Flashcards
Scientific Method
- Formulate research goal: research hypotheses, models
- Design study: sample size, variables, experimental units, sampling mechanism
- Collect data: data management
- Draw inferences: graphs, estimation, hypothesis testing, model assessment
- Make decisions: written conclusions, oral presentations
- Formulate new research goals: new models, new hypotheses -> go back to step 2 (Design study)
The Four-Step Process (in learning from data)
- Defining the problem
- Collecting the data
- Summarizing the data
- Analyzing the data, interpreting the analyses, and communicating the results
Population
the set of all measurements of interest to the sample collector.
Sample
any subset of measurements selected from the population
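A minimal sketch of the population/sample distinction, assuming a small hypothetical population of measurements. The values and the names `population` and `sample` are illustrative and do not come from the flashcards. It draws a simple random sample without replacement, which is only one of many possible sampling mechanisms, and compares the sample mean with the population mean.

```python
import random
import statistics

# Hypothetical population: every measurement of interest (illustrative values).
population = [52, 47, 61, 58, 49, 55, 63, 50, 46, 59]

# A sample is any subset of the population; here, a simple random sample
# of 4 measurements drawn without replacement. The seed is fixed only so
# that the sketch is repeatable.
rng = random.Random(0)
sample = rng.sample(population, k=4)

population_mean = statistics.mean(population)
sample_mean = statistics.mean(sample)

print("sample:", sample)
print("population mean:", population_mean)
print("sample mean:", sample_mean)
```

The sample mean will generally differ from the population mean; that gap is what estimation and hypothesis testing are meant to account for.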
Data Mining Models
the patterns and trends discovered in the analysis of data (data mining)
Can be applied to many situations, for example:
- Forecasting (estimating future sales, predicting demands on a power grid, or estimating server downtime)
- Assessing risk (choosing the rates for insurance premiums, selecting the best customers for a new sales campaign, determining which medical therapy is most appropriate given the physiological characteristics of the patient)
- Identifying sequences (determining customer preferences in online purchases, predicting weather events)
- Grouping (placing customers or events into clusters of related items, analyzing and predicting relationships between demographic characteristics and purchasing patterns, identifying fraud in credit card purchases)
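As one illustration of the forecasting use above, the sketch below fits a straight-line trend to hypothetical monthly sales and extrapolates one month ahead. The data, the helper `fit_line`, and the choice of ordinary least squares are assumptions made for illustration; real data mining models are usually more elaborate.

```python
# Hypothetical monthly sales figures (illustrative data only).
months = [1, 2, 3, 4, 5, 6]
sales = [100.0, 110.0, 120.0, 130.0, 140.0, 150.0]


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of cross-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


intercept, slope = fit_line(months, sales)

# Extrapolate the fitted trend to the next (unobserved) month.
forecast_month_7 = intercept + slope * 7
print(f"forecast for month 7: {forecast_month_7:.1f}")
```

The pattern discovered here is the fitted trend (intercept and slope); applying it to a new month is the forecast.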
Data Mining
a process by which useful information is obtained from large sets of data
- uses statistical techniques