Module 1 Flashcards
Descriptive Analytics
Past Data; Inform/Explantory
Descriptive Analytics deals with analyzing past data.
Predictive Analytics
Past Data to Predict Future
Predictive Analytics uses past data to predict future outcomes.
Prescriptive Analytics
Past Data to Predict future + Optimizing (changing)
Prescriptive Analytics uses past data to predict future outcomes and optimize by making changes.
Systematic Error
Error that repeats itself; Will not fix itself, you have to fix it.
Systematic Error requires intervention to be resolved.
Random Error
Unpredictable Error; Will fix itself, goes away with you fixing it.
Random Error disappears once the issue is addressed.
Outlier
Is a number in the dataset that are different from others.
An outlier is a data point that significantly differs from the rest.
Out of Range Error
When you made a mistake.
Out of Range Error occurs when an error is made in data input.
Omission
Missing important information, causing inaccurate results.
Omission refers to leaving out crucial data that impacts accuracy.
Reliable Data
Consistent & Repeatable
Reliable Data is data that is consistent and can be replicated.
Valid Data
Measures what is intended to be measured
Valid Data accurately measures the intended aspect.
Measurement Bias
Non-Representative Sample; Non-Random Sample. Sample has to be 30 or more.
Measurement Bias can be reduced by ensuring a representative sample of at least 30.
Information Bias
Ignoring the purpose of the information collected; non-truthful answers.
Information Bias occurs when the purpose of data collection is disregarded.
Big Data
Both structured & unstructured data that is too large to process using traditional database & software.
Big Data refers to large volumes of data that require specialized tools for processing.
Data Mining
Process of discovering patterns in large data sets.
Data Mining involves extracting patterns from extensive datasets.
Structured Data
Organized, easily searchable (e.g., rows, colums).
Structured Data is organized and searchable, typically found in databases and spreadsheets.