EDAAAAA Flashcards
✓ science that deals with techniques for collecting, presenting, analyzing and drawing conclusions from data
✓ science of data
Statistics (Singular)
✓ numerical descriptions by which we enhance understanding of data
✓ summary measures used to describe a sample
statistics (Plural)
What is statistics?
Ott and Longnecker (2016)
- statistics is the science of learning from data
✓defining the problem
✓collecting the data
✓summarizing the data
✓analyzing the data
✓interpreting the results
✓communicating the results
Why study statistics?
✓statistics provides a more formal way of organizing information
than relying on anecdotes and personal experience
✓more and more things are now measured numerically
✓results of experiments must be expressed quantitatively
✓statistics provides an objective way of measuring variation and uncertainty
Role of statistics in engineering
✓engineering practice involve collecting, working with, and using data in the
solution of a problem, so knowledge of statistics is just as important to the
engineer as knowledge of any of the other engineering sciences
✓statistical methods are a powerful tool in model verification, designing new
products and systems, improving existing designs, and designing,
developing, and improving production operations
✓statistical methods are used to help us describe and understand variability
✓statistical thinking can give us a useful way to incorporate this variability
into our decision-making processes
is the entire collection of objects or
outcomes about which data are collected. According to
Johnson and Bhattacharyya (2011), it is the complete
collection of units about which information is sought
➢A statistical population is the set of measurements
corresponding to the ______ of interest.
Population
Statistical Inquiry
Almeda et al. (2010)
✓Identify the problem
✓Plan the study
✓Collect the data
✓Explore the data
✓Analyze the data and interpret the results
✓Present the results
size has the same chance of being
selected from the population
random sample
is a subset of the population containing the
observed objects or the outcomes and the resulting data.
sample
Methods of selecting a sample
simple random sampling,
systematic sampling,
cluster sampling, and
stratified sampling
is a characteristic or attribute which varies from one entity to another entity, e.g. age, tensile strength, no. of buildings
variable
is one which quantifies an element of a
sample or population, e.g. strength of concrete, height of building
quantitative variable
A quantitative variable that can assume only a finite or countably
infinite number of possible values (usually integers) is called
discrete variable
is one which classifies/identifies/describes
an element of a sample or population, e.g. sex of an engineer, type of construction materials
qualitative variable
A quantitative variable that can theoretically assume any value in a specified interval (i.e., continuum) is called a
continuous variable
➢ data categories are mutually exclusive
➢data categories have no logical order
➢ex. Religion, civil status
Nominal or Classificatory Scale
process of assigning numbers to characteristics
according to a defined rule
Measurement
➢possesses the characteristics of ordinal scale
➢equal unit scale, no true zero point
➢ratios of magnitude are not meaningful
➢ex. gpa, achievement test score
Interval Scale
Also called as scales of measurement
Levels of Measurement
➢data categories are mutually exclusive
➢data categories have some logical order
➢defined to order or rank or data categories are scaled according to the amount of the particular characteristics they possess
➢ex. grading system, t-shirt size
Ordinal or Ranking Scale
➢highest level
➢possesses the characteristics of ordinal scale
➢equal unit scale, true zero point
➢ex. height, weight
Ratio Scale
▪ uses either all or a sample of the historical process data from some
period of time.
▪ quickest and easiest way to collect engineering process data, often
provide limited useful information for controlling and analyzing a
process
▪ Using historical data always involves the risk that, for whatever reason,
some of the important data were not collected or were lost or were
inaccurately transcribed or recorded.
▪ Consequently, historical data often suffer from problems with data
quality. These errors also make historical data prone to outliers
Retrospective study
the engineer observes the process or
population, disturbing it as little as possible, and records the
quantities of interest.
▪ These studies are usually conducted for a relatively short time
period, sometimes variables that are not routinely measured can
be included
Observational study
▪ the engineer makes deliberate or purposeful changes in controllable
variables (called factors) of the system, observes the resulting system
output, and then makes a decision or an inference about which
variables are responsible for the changes that he or she observes in the
output performance
Designed experiments