General Definitions Flashcards

1
Q

Define data science

A

Gathering and manipulating data
“translating” data into presentable forms
Extracting insights from data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define data

A

quantifiable
Raw information prior to analysis
large collections/sets of strings, ints, floats, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do data scientists do?

A

case studies
deduce meaning from data
predictions, recommendations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define unstructured data

A

Requires preprocessing prior to interpretation.

Otherwise not directly conducive to analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define structured data

A
tabular data
(the focus of this class)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define big data

A

high-dimensional data

(excessively) large amounts of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are some challenges with big data

A

Sheer scale - time
difficulty in visualization
can be noisy, untrustworthy, or otherwise problematic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define little data

A

low-dimensional data

small number of datapoints

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are some challenges with little data

A

May identify spurious patterns or trends

insufficient size for reliable machine learning/analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define numerical data

A

Data represented by numbers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define categorical data

A

Data represented by categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly