Chapter 1 Continued Flashcards

1
Q

What is the purpose of data transformation?

A

Data transformation aims to map the values of an attribute to new replacement values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the common techniques used in data transformation?

A

Common techniques for data transformation include data normalization, standardization, and conversion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why might you want to combine attributes during data transformation?

A

Combining attributes can create more useful ratios or relationships between them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does scaling data benefit data mining algorithms?

A

Scaling attributes to the same approximate scale improves the performance of many data mining algorithms and results in better models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the purpose of data conversion in data transformation?

A

Data conversion can involve converting categorical data to numeric values and discretizing continuous data, making it more intuitive and improving algorithm performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why is feature scaling important in data mining?

A

Feature scaling is essential in data mining because variables with widely varying ranges can lead to biases in the results, favoring attributes with larger ranges.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the primary goal of feature scaling in data mining?

A

The main objective of feature scaling is to ensure that all variables or features are within the same scale, preventing attributes with large ranges from dominating those with smaller ranges.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is feature scaling?

A

Feature scaling is essential in data mining because variables with widely varying ranges can lead to biases in the results, favoring attributes with larger ranges.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the primary goal of feature scaling in data mining?

A

The main objective of feature scaling is to ensure that all variables or features are within the same scale, preventing attributes with large ranges from dominating those with smaller ranges.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the two common methods of feature scaling?

A

Normalization and standardization are two common methods of feature scaling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is normalization?

A

Normalization scales the values of a feature to a range between 0 and 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is standardization?

A

Standardization scales the values to have a mean of 0 and a standard deviation of 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When is normalization useful?

A

Normalization is useful when the distribution of the feature is not Gaussian.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When is standardization useful?

A

Standardization is useful when the distribution of the feature is Gaussian.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why are normalization and standardization used?

A

Both techniques are used to improve the performance of machine learning algorithms by ensuring that all features have equal importance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How is Min-Max Normalization calculated for a value like $73,000 in the income range of $12,000 to $98,000?

A

To normalize $73,000 using Min-Max Normalization, you calculate it as (73,000 - 12,000) / (98,000 - 12,000), which results in 0.716.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the formula for Z-Score Standardization?

A

The formula for Z-Score Standardization is (x - mean) / standard deviation (sd), where x represents the data point.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is data conversion?

A

Changing data from one format to another

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are some DM techniques that can handle categorical variables without transforming them?

A

Naïve Bayes and decision tree

Other techniques (such as neural nets and regression) require only numeric inputs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is data conversion encoding?

A

Ordinal to Numeric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How is a single categorical variable with m categories typically transformed?

A

m-1 dummy variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Why is data conversion important?

A

To make data usable across different systems or applications

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Why do we need to convert nominal fields into numeric values for techniques like neural nets and regression?

A

These techniques require only numeric inputs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

How can ordinal data be converted to numbers?

A

Preserving natural order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What values do the dummy variables take?

A

0 or 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is the purpose of using dummy variables when converting a categorical predictor with k ≥ 3 possible values?

A

To create k - 1 dummy variables and use the unassigned category as the reference category.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is an example of converting ordinal data to numbers?

A

Grade: A → 4.0, A- → 3.7, B+ → 3.3, B- → 3.0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What does a value of 1 represent in a dummy variable?

A

yes” category”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is discretization?

A

Transforming continuous attributes into discrete ones

In the data mining field, many learning methods –like association rules can handle only discrete attributes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is the problem when creating dummy variables for nominal to numeric data conversion with many values?

A

Too many variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

How many dummy variables can be created for a nominal variable with few values?

A

Two dummy variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Why is discretization necessary in data mining?

A

To handle learning methods that can only handle discrete attributes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is the solution to reduce the number of variables when converting nominal to numeric data with many values?

A

Combine similar values and create dummy variables for the new values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Can you give an example of discretization?

A

Transforming the age attribute into child and adult categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Color=Red, Yellow or Green.

What does it mean if C_red = 0 and C_yellow = 0?

A

The color is green

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is the purpose of C_red dummy variable?

A

To represent if the color is red or not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is the purpose of C_yellow dummy variable?

A

To represent if the color is yellow or not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

How many common methods are there for binning numerical predictors?

A

Four

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What is the purpose of binning numerical variables?

A

To discretize data into categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What are the 3 categories created using equal width binning?

A

Low, Medium, High

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What is the purpose of k-means clustering?

A

To identify partition

42
Q

Why would we need to partition numerical predictors into bins?

A

To use them as categorical predictors for certain algorithms

43
Q

What is equal width binning?

A

Partitioning data into equal width categories.

44
Q

Dataset: {1, 1, 1, 1, 1, 2, 2, 11, 11, 12, 12, 44}

Using equal frequency binning, we have n = 12, k = 3, and n/k = 4. The partition is

A

Low: Contains the first four data values, all X = 1.
Medium: Contains the next four data values, X = {1,2, 2, 11}.
High: Contains the last four data values, X = {11, 12,12, 44}.

Note that one of the medium data values equals a
data value in the low category, and another
equals a data value in the high category. This
violates what should be a self-evident heuristic:
Equal data values should belong to the same
category.

45
Q

Suppose we have the following tiny data set,
which we would like to discretize into 3
categories:
X = {1, 1, 1, 1, 1, 2, 2, 11, 11, 12, 12, 44}.
Using equal width binning, we partition X into what of the following categories of equal width:

A

Low: 0 ≤ X < 15, which contains all the data values except
one.
Medium: 15 ≤ X < 30, which contains no data values at all.
High: 30 ≤ X < 45, which contains a single outlier.

46
Q

What is the last method of Binning Numerical values?

A

The partition identified by k-means clustering

47
Q

Data Modeling

A

Data modeling refers to a group of processes in which datasets are analyzed to uncover relationships or patterns among the variables.

48
Q

Goal of Data Modeling

A

The goal of data modeling is to use past data to generate predictions on new data and make inferences about these relationships.

It helps in understanding how dependent variables change concerning independent variables.

49
Q

:

Data modeling involves two cultures of reaching conclusions from data. What are they?

A

One culture assumes data is generated by a stochastic model, while the other employs algorithmic models and treats the data mechanism as unknown.

50
Q

Stochastic Model

A

A stochastic model is a tool for estimating probability distributions of potential outcomes by allowing for random variation in one or more inputs over time. It is a common approach in data modeling.

51
Q

Algorithmic Modeling

A

Algorithmic modeling, a rapidly developing field, is used in data modeling on both large and small data sets. It is an alternative approach that treats data mechanisms as unknown and can provide more accurate results.

52
Q

Elaborate on:

Data Modeling Techniques

A

Data modeling encompasses various techniques originating from the two cultures of modeling. No single learning algorithm is universally superior. It is common practice to experiment with multiple models to find the one best suited to a specific problem.

53
Q

What is important to consider when selecting a modeling technique?

A

types of variables involved and relevant business considerations. Model selection is driven by the problem’s specific requirements and characteristics.

54
Q

Define:

Occam’s Razor Principle

A

When presented with two models that predict equally well, it is often wise to choose the simpler of the two. This principle, known as Occam’s Razor, suggests that simpler models are preferred unless there is a clear advantage to using a more complex one.

55
Q

What are the key differences between traditional programming and machine learning?

A

Traditional Programming:

Input: Data and Program
Process: Computer processes data using the program.
Output: Result or output based on the program’s logic.

Machine Learning:

Input: Data and Output (desired result)
Process: Computer learns from the data to generate a program (model).
Output: The model can predict or generate new outputs based on input data.

56
Q

What is the definition of machine learning according to Tom Mitchell?

A

A computer program is said to learn from
experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.

57
Q

What are the three components of a machine learning task: T, P, and E?

A

E: Experience
P: Performance measure
T: class of tasks

58
Q

What is the name of the system that improves performance from experience according to Herbert Simon?

A

A computer system learns from data, which
represent some “past experiences” of an
application domain.

59
Q

Machine Learning can simple be identitified as

A

algorithms that improve their
performance at some task with experience.

60
Q

Describe:

The machine learning process

A

Flow from training data to algorithm, then to learning, resulting in a trained model, and finally producing results.

61
Q

What is the data used for the machine learning task of predicting loan approval?

A

Loan application data

62
Q

What is the performance measure used for the machine learning task of predicting loan approval?

A

Accuracy

63
Q

How is the model used to classify future loan applications?

A

By assigning them to either Yes (approved) or No (not approved)

64
Q

What is the machine learning task of predicting credit card approval?

A

A classification problem

65
Q

What are the features used for the machine learning task of predicting credit card approval?

A

Information about an applicant, such as age, marital status, annual salary, outstanding debts, credit rating, etc.

66
Q

What are the labels used for the machine learning task of predicting credit card approval?

A

Two categories, approved and not approved

67
Q

How is the model trained for the machine learning task of predicting credit card approval?

A

By using a supervised learning algorithm that learns from a set of labeled examples

68
Q

List:

Different types of machine learning

A
  1. Supervised Machine Learning
  2. Unsupervised Machine Learning
  3. Semi-Supervised Learning
  4. Reinforcement Learning
69
Q

What is the difference between supervised and unsupervised learning in terms of data labels?

A

Supervised learning uses data with corresponding labels or desired outputs, while unsupervised learning uses data with no labels or outputs.

70
Q

What is the goal of semi-supervised learning?

A

To combine labeled and unlabeled data for training and improve the performance of the model.

71
Q

What is the main idea of reinforcement learning?

A

To learn from interactions with an environment and receive rewards or punishments for the actions taken.

72
Q

What is the key characteristic of supervised learning?

A

A: In supervised learning, we know the correct labels during training which helps the algorithm learn from historical examples.

73
Q

How can we categorize algorithms within supervised learning?

A

We can divide them into classification and regression based on the nature of the output they produce.

74
Q

What do supervised machine learning techniques automatically learn based on historical examples or instances?

A

A model of the relationship between a set of descriptive features and a target feature

75
Q

What is a regression problem?

A

Predicting a real valued number

76
Q

What is an example of a regression problem?

A

Predicting final exam scores based on homework scores

77
Q

What is supervised machine learning?

A

Learning with labeled data

78
Q

What is the goal of supervised machine learning?

A

Predicting an output variable associated with each input item

79
Q

What are some examples of supervised machine learning tasks?

A

Email classification, image classification, regression for predicting real-valued outputs

80
Q

How does supervised machine learning work?

A

Using input features to predict a target variable

81
Q

What is labeled data?

A

Data that has known and assigned outcomes or labels

82
Q

What is unsupervised machine learning?

A

Discovering patterns in unlabeled data

83
Q

What is an example of unsupervised learning?

what do I mean by structure?

A

Clustering similar data points

84
Q

What is a typical example of unsupervised learning?

A

Clustering

85
Q

What is an example of an application for unsupervised machine learning?

A

Categorizing or grouping customers into different types

86
Q

In the context of E-commerce:

What are some potential customer groups that can be discovered through unsupervised learning?

A

Power users, quick browser-type users, careful researcher type users

87
Q

How can unsupervised learning be used to discover different customer groups?

A

By analyzing data of how people interact with the site

88
Q

Why is tailing site offerings to different groups useful?

A

Improved user experience and increased likelihood of purchase

89
Q

What is a classic example of unsupervised learning?

A

Problem with no labeled examples

90
Q

What is an example of an unsupervised learning problem related to web security?

A

Flagging abnormal access to a web server

91
Q

Why is it difficult to use supervised learning for the web security

For security reasons, you might want to be notified if a website user is making requests that could be a cyber attack or is somehow very different from typical user behavior on your site.

A

Lack of reliable training labels

Since there can be many different types of hacking or intrusion attempts to break into the server or exploit in some way

92
Q

What approach do we need for outlier detection?

A

Unsupervised approach

93
Q

What does outlier detection not assume?

A

Future attacks will be of the same form as previous attacks

94
Q

What does outlier detection assume?

A

Features of attacks will look different from average user’s behavior

95
Q

What is the difference between supervised and unsupervised learning?

A

Supervised learning uses labeled data to discover patterns that relate data attributes with a target attribute. Unsupervised learning uses unlabeled data to explore the data and find some intrinsic structures in them.

96
Q

What are the two types of supervised learning problems?

A

The two types of supervised learning problems are regression and classification. Regression predicts numerical values, while classification predicts categorical values or labels.

97
Q

What are some examples of unsupervised learning techniques?

A

Some examples of unsupervised learning techniques are clustering, association, link prediction, and data reduction.

Clustering groups data according to “distance”, association finds frequent co-occurrences, link prediction discovers relationships in data, and data reduction projects features to fewer features.

98
Q

What is the target attribute in supervised learning?

A

The target attribute in supervised learning is the output variable that we want to predict based on the input data. It can be either numerical or categorical.

99
Q

Fraud Detection is an application in which of the
following
(A) Unsupervised Learning: Regression
(B) Supervised Learning: Classification
(C) Unsupervised Learning: Clustering
(D) Reinforcement Learnin

A

B

100
Q

Customer Segmentation is an application in which of the
following
(A) Supervised Learning: Classification
(B) Unsupervised Learning: Clustering
(C) Unsupervised Learning: Regression
(D) Reinforcement Learning

A

B

101
Q
  1. Customer Retention is an application in which of the
    following
    (A) Unsupervised Learning: Regression
    (B) Supervised Learning: Classification
    (C) Unsupervised Learning: Clustering
    (D) Reinforcement Learning
A

B

102
Q

Image classification is a popular problem in the
computer vision field. Here, the goal is to predict what
class an image belongs to. In this set of problems, we are
interested in finding the class label of an image.
A. Supervised Learning
B. Unsupervised Learning

A

A