DS LAMS Flashcards

1
Q

Stepss in ds pipeline

A

Problem formulation
Data visualisation
ML
Statisticl inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which steps relies heavily on algoritjmiv optimisation?

A

ML

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How much
How many

A

Numeric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Isit type A or B

A

Classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How isit organised?

A

Structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Isit weird behaviour?

A

Anomaly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What shld be done next?

A

Action

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

To predict numeric

A

Regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

To predict classes

A

Classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

To detect cluster

A

Clustering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Detect anomaly

A

Anomaly detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Predict action

A

Adaptive learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Types of structured data

A

Numeric
Categorical
Mixed data
Time series
Network

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Unstructured

A

Text
Image
Voice
Videos

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Central tendency

A

Mean
Mediam

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Dispersion

A

Standard deviation
Quantiles

17
Q

Central tendency

A

Median
Mean

18
Q

Box plot

A

3 lines
Q1,2,3
25,50,75

19
Q

Correlation

A

Draw from x,y=0
Correlation is not casuality

20
Q

Swarm plot

A

Only 1 variable is v confident

Confidence increase when data is more clear cut

21
Q

Gini calculation

A

x/n(1-x/n)x 2