Quiz 1 Flashcards

1
Q

Steps of the data science pipeline

A
Data selection
Data Preprocessing
Data Transformation
Data Mining
Evaluation/Interpretation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Ways to measure central tendency

A

Mean
Median
Midrange
Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Ways to measure dispersion or spread

A
Range
Quartiles
Variance
Standard Deviation
Interquartile Range
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Unsupervised Learning

A

Includes Clustering

Find groups in data without provided labels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

types of supervised learning problems

A

Regression

Classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Classifier

A

Discovers a pattern that can predict a class that a new data instance falls into

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is clustering points used for?

A

Anomaly Detection
Based on similarities between them
Does not require labeled data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Supervised learning examples

A

Examine a web page, and classify whether the content on the web page should be considered “child friendly” or “adult.”

In farming, given data on crop yields over the last 20 years, learn to predict next year’s crop yields.

Learn from historical data and determine whether a new user will respond to an add campaign (or not).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Data discretization is part of data reduction

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Scatter plot is not an effective graphical method to look for correlation between two numerical variables

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Truths about correlation

A

If correlation is equal to -1 then two features are perfectly negatively correlated

Correlation between two features ranges between [-1, 1]

If correlation is equal to 1 then two features are perfectly positively correlated

If correlation is equal to 0 then two features have no correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Scatter plot

A

Can handle multiple Y values per X value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Bar Chart

A

Good for categorical X values and cases where the Y value is ratio scaled.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Line Graph

A

Implies some importance of the connection between the data points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly