Quiz 1 Flashcards

1
Q

tasks of classification and prediction as well as pattern discovery

A

predictive analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

examine unknown categories or predict future

A

classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

predicting value of a numerical variable (ex go up 5%)

A

prediction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

designed to find general association patterns between items in large databases, for whole population

A

association rules/affinity analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

uses individual users’ preferences and tastes given their historic measurable behaviors indicative of preference, for user

A

collaborative filtering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

analytical methods in PA

A

classification, prediction, collaborative filtering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

process of consolidating a large number of records/cases into a smaller set (rows)

A

data reduction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

methods of reducing the number of cases

A

clustering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

reducing the number of variables in a set (columns)

A

dimension reduction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is one of the earliest stages of engaging with data?

A

exploring it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

exploration of data by creating charts and dashboards

A

data visualization/visual analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

used in classification and prediction, must have data available in which the value of the outcome of interest is known (yes/no)

A

supervised learning algorithms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

data from which classification or prediction algorithms learns about the relationship between prediction variables and outcome variable

A

training data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

data which the algorithm is applied to to test how well it does and fixing any issues

A

validation data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

data used for evaluating chosen model

A

test data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

used when there is no outcome variable to predict or classify, no learning from when outcome variable is known

A

unsupervised learning algorithms

17
Q

methods are trained on a set of training data and then their performance is evaluated on a separate set of validation data

A

data partioning

18
Q

predicted values of training data, they are for the records on which the model was fit

A

fitted values

19
Q

what does SEMMA stand for?

A

sample, explore, modify, model, assess

20
Q

nominal categorical variables that have been decomposed into a series of binary variables (yes/no)

A

dummy variables

21
Q

including all four dummy variables in an algorithm

A

one-hot encoding

22
Q

knowledge of the particular application being considered, not the algorithm

A

domain knowledge

23
Q

two ways to standardize data

A
  1. subtract the mean from each value and divide by standard deviation
  2. subtract by minimum and divide by range (for 0 to 1 standardization)
24
Q

entire set of operations that was performed on a dataset

A

workflow

25
Q

maintenance triggered by advance warnings that are predictive of failure

A

proactive preventive maintenance

26
Q

steps in machine learning process

A
  1. develop understanding of ML project
  2. obtain dataset
  3. explore, clean, and preprocess data
  4. reduce data dimensions if necessary
  5. determine ML task
  6. partition data (for supervised tasks)
  7. choose ML techniques
  8. use algorithms to perform task
  9. interpret results
  10. deploy the model