Intro to DataScience Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What is structured data?

A

Data that is recorded within a spreadsheet, has data, tables etc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is unstructured data?

A

Data like twitter, images, newspapers, and stuff that doesn’t necessarily have a data structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What kind of data structure is email?

A

Most likely structured, to, from, content, time, etc. The only unstructured part is the actual writing of content

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Data science is interdisciplinary

A

Programming/ coding, math and stats, business logic, statistical methods, math and algorithms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Data Analytics

A

Analyzing big data - based on history - RAW DATA BEING USED TO TO DERIVE MEANINGFUL INFORMATION (MEAN MEDIAN AND MODE)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Data Science

A

Can analyze future DEALS WITH THE FUTURE – uses data from data analytics to process information and predict

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Machine Learning

A

distance based, regression algo’s and also deep learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Deep learning

A

Works with deep neural networks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

AI

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Data science is interdisciplinary.

A

Programming/ coding, math, and stats, business logic, statistical methods, math, and algorithms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Big Data, 4 V’S OF BIG DATA

A

With too much data to fit into a machine, high volume (amount) and velocity (how fast, aka tweets 150000 per second), and or high variety (could be text, image), data is generated super quickly and veracity(data accurately represents reality) (, of information assets that demand cost-effective, innovative forms of information processing that enable insight, decision making, and process automation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Descriptive analytics

A

What happened? eg. Starbucks sells 500 amount of coffee every day (deals with history)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Diagnostic

A

Why did it happen? eg star only sold 2000 in a day, more than (was there an event nearby? free cup of coffe? good add?) – deals with history

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Predictive analytics

A

What will happen? we will sell this many Starbucks tomorrow, most likely

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Prescriptive analytics

A

What do we need to do? Can you write a formula so Starbucks sells 1500 a day?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

External data

A

finding external data (like why prices may change for houses) schools? stock market price, economy, neighborhood

17
Q

External data

A

Figure out algorithm error possibly because of like school ranking, stock market, etc

18
Q

internal data

A

gathering data from your own database in company

19
Q

except, exception, ValueError

A

except - catches all errors, exception - like catch error, tells you your error, ValueError, catches just the issue with the value