APPENDIX D- DATA ANALYSIS Flashcards
What is Descriptive Statistics? (5)
Mode- most often
Mean
Median
Skewness- how symmetrical a range of numbers are
Standard Deviation- how dispersed a range of numbers are from the mean
Prescriptive Analysis
Using data to guide analysis
Causal Analysis
Using data to find underlying factors that led to an event
What do you use for Predictive Analysis
Using algorithms and machine learning
Descriptive Analysis
Using numbers to describe qualities of a data set
Exploratory Analysis
Technique to find patterns or trends in data set
Inferential Statistics
(Sample)
Aim to make predictions based on a sample and population-based findings
Such as info on height differences between kids grouped by gender
Social Network Analysis
Analyses patterns and relationships between people + groups
Behaviour Profiling
Used to identify anomalies in computer systems
e.g IP address not equating to the typical region of user login
ports and application use
Device usage
Network traffic
Data Dredging
Produces patterns that occur by chance
Need to contextualise data to make actionable insights
Differs from Data Mining as it lacks a hypothesis
Forecasting vs Predictions
Prediction is not based on prior information/data
Forecasting only applicable with prior information/data
Circular Reporting
Multiple sources reporting the same bad intelligence from each other
Supervised Machine Learning aspects
Labels are provided to train.
Classifies unseen data into established categories
Unsupervised Machine Learning
Raw and unlabelled data
Identifies patterns and trends
Used to cluster data sets
Unstructured Data Analysis
Natural Language Processing
Metadata- EXIF data
Images