APPENDIX D- DATA ANALYSIS Flashcards

1
Q

What is Descriptive Statistics? (5)

A

Mode- most often
Mean
Median
Skewness- how symmetrical a range of numbers are
Standard Deviation- how dispersed a range of numbers are from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Prescriptive Analysis

A

Using data to guide analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Causal Analysis

A

Using data to find underlying factors that led to an event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What do you use for Predictive Analysis

A

Using algorithms and machine learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Descriptive Analysis

A

Using numbers to describe qualities of a data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Exploratory Analysis

A

Technique to find patterns or trends in data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Inferential Statistics
(Sample)

A

Aim to make predictions based on a sample and population-based findings
Such as info on height differences between kids grouped by gender

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Social Network Analysis

A

Analyses patterns and relationships between people + groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Behaviour Profiling

A

Used to identify anomalies in computer systems
e.g IP address not equating to the typical region of user login
ports and application use
Device usage
Network traffic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Data Dredging

A

Produces patterns that occur by chance
Need to contextualise data to make actionable insights
Differs from Data Mining as it lacks a hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Forecasting vs Predictions

A

Prediction is not based on prior information/data
Forecasting only applicable with prior information/data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Circular Reporting

A

Multiple sources reporting the same bad intelligence from each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Supervised Machine Learning aspects

A

Labels are provided to train.
Classifies unseen data into established categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Unsupervised Machine Learning

A

Raw and unlabelled data
Identifies patterns and trends
Used to cluster data sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Unstructured Data Analysis

A

Natural Language Processing
Metadata- EXIF data
Images

How well did you know this?
1
Not at all
2
3
4
5
Perfectly