Topic 1 Flashcards
Introduction to Statistics and Data Analysis
Why study Statistics?
- To critically evaluate and interpret numerical data presented in reports, articles, and everyday life.
- To conduct and understand statistical analyses in your field of work, helping you make data driven decisions.
- To draw meaningful conclusions about a population based on insights gathered from a sample, improving the accuracy of predictions and decision-making processes.
A field of study concerned with the collection, analysis, and interpretation of data to make
decisions, solve problems, and design products and processes in the presence of variability or
uncertainty
Statistics
Simply the science of data or the art of learning from data
Statistics
Information, either numerical or categorical, collected through observation or experimentation for the purpose of analysis.
Data
Classification of Data
- Quantitative (Numerical)
- Qualitative (Categorical)
2 Types of Quantitative
Discrete - or counted
Continuous - or measured
has distinct interval or meaningful difference between values but there is no true zero
Interval
has distinct interval or meaningful difference between values, and there is true zero
Ratio
It refers to a point where there is no presence or amount of the variable being
measured, meaning the quantity is completely absent.
True zero
labels, names, or classification and does not imply order or ranking
Nominal
labels, names, or classification and does imply order or ranking
Ordinal
The basic idea behind statistical methods of data analysis
make inferences about a
population by studying a relatively small sample
Engineering/Scientific Method
- Develop a clear description of the problem.
- Identify the important factors affecting the problem or may play a role in its solution.
- Propose or refine a model using engineering knowledge of the phenomenon being studied.
- Conduct an appropriate experiment to confirm that the proposed solution to the problem is both effective and efficient.
- Refine the model on the basis of the observed data.
- Manipulate the model to assist in developing a solution to the problem.
- Confirm/Validate the solution by conducting an appropriate experiment
- Conclusions and recommendations
Role of Statistics in Engineering
Statistics provides tools and methods to analyze data, make informed decisions, and improve processes
An inherent characteristic of data
Variability
Repeated observations of a system or phenomenon do not yield identical results
Variability
developed to address and manage variability in data. It provides a framework for describing this variability and for learning about which potential sources of variability are the most important or which have the greatest impact.
Statistical analysis tools
What if there is no variability?
Simplified Analysis: statistical analysis would be straightforward, relying only on simple descriptive statistics like the mean
A single observation would tell us everything about the entire population
Statistics would be reduced to basic arithmetic
Types of Statistics
- Descriptive Statistics
- Inferential Statistics
Summarizes and Presents Data: Focuses on organizing and displaying data in a
meaningful way, providing insights into the central tendency, variability, and overall distribution of observations within the sample
Visual Representation: utilizes graphs like histograms, stem and-leaf plots, scatter plots, dot plots, and box plots to visually capture the characteristics and “footprint” of the sample
Key Measures: includes essential calculations such as means, medians, and standard
deviations to describe the data’s central location and spread
Descriptive Statistics
Draws Conclusion: utilizes techniques that enable us to make inferences about a larger
population based on the analysis of a smaller sample
Prediction: leverages sample data to predict outcomes for the broader population.
Key Methods: includes hypothesis testing and confidence intervals to assess claims and estimate population parameters
Inferential Statistics
The first step in statistical analysis is collecting data. To make accurate inferences about a population, the sample must represent the population
Collecting Data
a collection of all elements that possess a characteristic of interest
Population
a portion of a population selected for study
Sample