Chapter 1: Overview of Statistics Flashcards
Statistics (plural)
The science of collecting, organizing, interpreting, and presenting data
Data science
A trilogy of taste involving data modeling, analysis and decision making. Some experts prefer to call statistics data science
A Statistic (singular)
A single measure, reported as a number, used to summarize a data set
Descriptive statistics
Refers to the collection, organization, presentation, and summary of data (either using charts and graphs or using a numerical summary)
Inferential statistics
Refers to generalizing from a sample to a population, estimating unknown population parameters, drawing conclusions and making decisions
Business analytics
Uses statistics, mathematics, and computational tools to extract information from data.
Three categories-
Descriptive
Predictive
Prescriptive
Descriptive analytics
Tools to analyze historical data and help them identify trends and patterns (what happened?)
Predictive analytics
Tools to predict probabilities of future events and help them forecast future behavior (what is likely to happen next?)
Prescriptive analytics
Tools to help them make decisions on how to achieve objectives within real world constraints (what actions do we need to achieve our goals?)
Machine learning
(ML) Refers to using observed data and algorithms to train computers to classify events and predict outcomes in a useful way without task specific rules
Artificial intelligence
(AI) Refers to an area of computer science that seeks to create intelligent machines that can think and behave like humans to solve problems and act autonomously
Emulations of human capabilities
Artificial neural networks
(ANN) Or neural nets are a key component of ML and AI. These are “black boxes” whose internal connections mimic the human brain, learning to process raw inputs and produce outputs or conclusions based on examples that are provided
Critical thinking
Being able to evaluate evidence, to tell fact from opinion, to see holes in an argument, to tell whether cause and effect has been established and to spot illogic
Empirical data
Data collected through observations and experiments
Post hoc fallacy
The mistaken conclusion that of A precedes B, then A is the cause of B
Assuming causation anytime there is a statistical association between events