B01 Unstructured Data Analytics Flashcards

1
Q

Define Structured Data

A
Data that is organized and
stored in a pre-defined
format. Examples include
relational databases,
tabular data (csv or xls),
etc.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define Semi-Structured Data

A
Data that has a selfdescribing
structure
(usually with tags) but also
doesn’t follow a predefined
format. Examples
include HTML, XML, JSON,
etc.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define Unstructured Data

A
Data that does not have a
pre-defined data model
and/or format. Examples
include images, text,
audio, video, sensor data,
etc.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Unstructured Data Analytics?

A
“Unstructured data
analytics are a set of
techniques focused on
the extraction of useful
insights from data with
unpredictable or
inconsistent form.”
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define the Unstructured Analytics approaches we will cover

  • Text Analytics
A
Text Analytics is the process
of extracting high quality
insights from textual data. It
is sometimes referred to as
text mining.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define the Unstructured Analytics approaches we will cover

  • Sentiment Analysis
A

Extracting an author’s emotional

intent from text.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define the Unstructured Analytics approaches we will cover

  • Topic Modelling
A

Discovering the abstract “topics”
that occur in a collection of
documents.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define the Unstructured Analytics approaches we will cover

-Naïve Bayes

A

Quantifying the probability of
events and how those probabilities
should be revised in the light of
additional information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define the Unstructured Analytics approaches we will cover

-Support Vector Machines

A

Separating categories of data by
representing the data as points in
multi-dimensional space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define the Unstructured Analytics approaches we will cover

-Neural Networks

A

Recognizing patterns in data by
using a method that loosely models
the neurons in a biological brain.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly