Misfit Questions Flashcards
Naïve Bayes Analysis
Used to solve classification problems; common applications include sentiment analysis, spam filtering, and recommendation systems
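A minimal sketch of how a Naïve Bayes classifier could handle the spam-filtering case, using only the standard library. The function names, the toy training messages, and the choice of add-one (Laplace) smoothing are assumptions made for illustration, not part of the flashcard.

```python
import math
from collections import Counter, defaultdict

def train_naive_bayes(docs):
    """Count labels and per-label word frequencies from (text, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in docs:
        label_counts[label] += 1
        for word in text.lower().split():
            word_counts[label][word] += 1
            vocab.add(word)
    return label_counts, word_counts, vocab

def classify(text, model):
    """Return the label with the highest log-probability for the text."""
    label_counts, word_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_docs in label_counts.items():
        # Start from the log prior P(label).
        score = math.log(n_docs / total_docs)
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            # Add-one smoothing keeps unseen words from zeroing the product.
            likelihood = (word_counts[label][word] + 1) / (total_words + len(vocab))
            score += math.log(likelihood)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Hypothetical toy training data.
training = [
    ("win free prize now", "spam"),
    ("free money claim now", "spam"),
    ("meeting agenda for monday", "ham"),
    ("lunch on monday", "ham"),
]
model = train_naive_bayes(training)
print(classify("free prize", model))
print(classify("monday meeting", model))
```

The "naïve" part is the assumption that words are independent given the label, which is why the per-word log-likelihoods can simply be added.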
Imputation
Preserves all cases by replacing missing data with an estimated value based on other available information
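As one possible illustration, mean imputation fills each missing entry with the average of the observed values. The function name `impute_mean` and the sample data are hypothetical, and the mean is only one of several estimates that could be used.

```python
from statistics import mean

def impute_mean(values):
    """Replace None entries with the mean of the observed entries."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]

# Hypothetical column with two missing values.
ages = [20, None, 30, 40, None]
imputed = impute_mean(ages)
print(imputed)
```

Every case is kept, so the list has the same length before and after; only the gaps change.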
Hadoop
Open-source, Java-based framework that manages the storage and processing of large amounts of data by breaking the work down into smaller workloads that run in parallel across a cluster
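The idea of breaking a job into smaller workloads can be sketched in plain Python as a word count done in a map step and a reduce step. This is not Hadoop's API; the helper names and sample lines are assumptions, and everything here runs in a single process rather than on a cluster.

```python
from collections import Counter
from functools import reduce

def split_into_chunks(lines, n_chunks):
    """Deal the input lines out into n_chunks smaller workloads."""
    return [lines[i::n_chunks] for i in range(n_chunks)]

def map_chunk(chunk):
    """Map step: count words within one chunk independently of the others."""
    counts = Counter()
    for line in chunk:
        counts.update(line.lower().split())
    return counts

def reduce_counts(left, right):
    """Reduce step: merge two partial counts into one."""
    return left + right

# Hypothetical input data.
lines = [
    "big data big clusters",
    "data nodes store data",
    "clusters process big data",
]
partials = [map_chunk(chunk) for chunk in split_into_chunks(lines, 2)]
totals = reduce(reduce_counts, partials, Counter())
print(totals.most_common(2))
```

Because each chunk is counted independently, the map calls could in principle run on different machines, with only the small partial counts sent back to be merged.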
D3.js
Open-source JavaScript library used to create dynamic, interactive data visualizations in modern web browsers
Type 1 Error
Rejection of the null hypothesis when it is actually true (a false positive)
Type 2 Error
Failure to reject the null hypothesis when it is actually false (a false negative)
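The two error types can be told apart by comparing the truth of the null hypothesis with the decision made about it. The small helper below is a hypothetical illustration of that two-by-two table.

```python
def decision_outcome(null_is_true, rejected_null):
    """Classify a hypothesis-test decision against the (usually unknown) truth."""
    if null_is_true and rejected_null:
        return "Type 1 error"   # rejected a true null hypothesis
    if not null_is_true and not rejected_null:
        return "Type 2 error"   # kept a false null hypothesis
    return "correct decision"

for truth in (True, False):
    for rejected in (True, False):
        print(truth, rejected, decision_outcome(truth, rejected))
```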
Descriptive Analytics
Focuses on summarizing and describing historical data to provide insights into past trends and patterns
Diagnostic Analytics
Analyses past data to identify the root causes of specific outcomes or events
Predictive Analytics
Uses historical data to forecast future outcomes
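A minimal sketch of forecasting from historical data, assuming a simple straight-line trend fitted by least squares. The monthly sales figures and the function name `fit_line` are made up for illustration; real predictive models are usually more involved.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical history: sales for months 1 to 4.
months = [1, 2, 3, 4]
sales = [100.0, 110.0, 120.0, 130.0]

slope, intercept = fit_line(months, sales)
forecast = slope * 5 + intercept  # predicted sales for month 5
print(forecast)
```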
Prescriptive Analytics
Recommends actions that can be taken to optimize or improve a situation
Exploratory Analytics
Explores and analyzes data to identify potential trends, patterns, and relationships
Alpine Miner
Provides a GUI for creating analytic workflows
OpenRefine
Free, open-source tool for cleaning and transforming messy data
Data Wrangler
Interactive tool for data cleaning and transformation