Fundamentals of AI Flashcards

1
Q

What is Machine Learning?

A

A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E.

2
Q

What is Top Down?

A

Model all the different functions explicitly and wire all the ‘agents’ together (deduction).

3
Q

What is Bottom Up?

A

Give the system a lot of data so that it can discover the concepts itself (induction).

4
Q

Three Pillars of Machine Learning

A

Models and algorithms, powerful and cheaper computation, massive data warehouses

5
Q

What is Data Mining?

A

Exploration and analysis of large quantities of data in order to discover valid, novel, useful and understandable patterns in the data.

6
Q

What is Supervised Learning?

A

Infers a function from labelled training data, where each example consists of an input and its expected output (Classification and Regression)

7
Q

What is Unsupervised Learning?

A

Infers a function to describe hidden structure from unlabelled data (Clustering and Association)

8
Q

What can be done in Data pre-processing?

A

Fill in missing data, find outliers, feature selection.

9
Q

Unsupervised Learning - Clustering

A

Given: Unlabelled data set and a similarity/distance metric
Goal: Find ‘natural’ partitioning, or groups of similar data points

10
Q

K-means clustering

A

Choose the number of clusters k and initialise k cluster centroids randomly. Assign each data point to the nearest centroid (based on distance), then update each centroid to the mean of all the data points assigned to it. Repeat the assignment and update steps until the assignments stop changing. Output the final cluster assignments and centroids.
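
The steps above can be sketched in plain Python. This is a minimal illustration, not a reference implementation: it assumes 2-D points, squared Euclidean distance, initial centroids drawn from the data itself, and a hypothetical `iterations` cap and `seed` parameter that are not part of the card.

```python
import random

def kmeans(points, k, iterations=100, seed=0):
    """Cluster 2-D points into k groups; returns (assignments, centroids)."""
    rng = random.Random(seed)
    # Assumption: initial centroids are k distinct points sampled from the data.
    centroids = rng.sample(points, k)
    assignments = None
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        new_assignments = [
            min(range(k),
                key=lambda c: (p[0] - centroids[c][0]) ** 2
                            + (p[1] - centroids[c][1]) ** 2)
            for p in points
        ]
        if new_assignments == assignments:
            break  # no point changed cluster, so the result is stable
        assignments = new_assignments
        # Update step: move each centroid to the mean of its assigned points.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = (
                    sum(x for x, _ in members) / len(members),
                    sum(y for _, y in members) / len(members),
                )
    return assignments, centroids

points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
assignments, centroids = kmeans(points, k=2)
```

With these six points the first three and the last three end up in separate clusters; which cluster gets index 0 depends on the random initialisation.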

11
Q

Applications of k-means clustering

A

Anomaly detection, Social Media Analysis

12
Q

Unsupervised Learning - Association

A

Discover correlations between any two or more variables.
Given: a set of records containing items
Goal: Produce dependency rules that predict the occurrence of variable X from the occurrence of variable Y

13
Q

Categorical data

A

Learn to predict which set an instance belongs to, based on pre-labelled (classified) instances (Classification)

14
Q

Continuous data

A

Finds a linear relationship between the variable X and the variable Y (Regression)

15
Q

Supervised Learning: Regression

A

Based on the given data, find the function that best fits the samples by minimising the mean squared error.
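
As a rough illustration, the sketch below fits a straight line by ordinary least squares, which is the line that minimises the mean squared error. The function names and the sample values are made up for the example, and a linear model is an assumption here.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the samples."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Closed-form solution: slope = covariance(x, y) / variance(x)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def mean_squared_error(xs, ys, slope, intercept):
    """Average squared difference between the predictions and the samples."""
    return sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Illustrative data only
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.1, 4.9, 7.2, 8.8]
slope, intercept = fit_line(xs, ys)
best_mse = mean_squared_error(xs, ys, slope, intercept)
```

Any other line, for example one with a slightly different slope, gives a mean squared error at least as large as `best_mse` on the same samples.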

16
Q

Overfitting

A

The model describes the errors (noise) in the dataset instead of the underlying relationship between the variables

17
Q

Regression Pros and Cons

A

Pros: short training time, easy to implement, easy to interpret
Cons: sensitive to noise and outliers (overfitting), cannot handle complicated relationships (linear only)

18
Q

Supervised Learning: Decision Tree

A

Internal nodes: decision rules on features
Branches: course of decision or action
Leaf nodes: a predicted class label (output)
Iteratively partition the decision space of chosen features.
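
The structure can be illustrated with a small hand-built tree. Everything in it (the feature names `outlook` and `windy`, the branch values, the class labels) is hypothetical and chosen only to show how internal nodes, branches and leaf nodes fit together; the sketch does not learn the tree from data.

```python
# Hypothetical hand-built tree: internal nodes are dicts, leaf nodes are strings.
tree = {
    "feature": "outlook",                      # internal node: rule on a feature
    "branches": {
        "sunny": {                             # branch leading to another internal node
            "feature": "windy",
            "branches": {True: "stay in", False: "play"},
        },
        "rainy": "stay in",                    # branch leading directly to a leaf
    },
}

def predict(node, instance):
    """Follow the branches that match the instance until a leaf label is reached."""
    while isinstance(node, dict):
        value = instance[node["feature"]]
        node = node["branches"][value]
    return node

label = predict(tree, {"outlook": "sunny", "windy": False})
```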

19
Q

Decision Tree Pros and Cons

A

Pros: reasonable training time, can handle a large number of features, easy to implement, easy to interpret
Cons: only simple decision boundaries, problems with missing data, cannot handle complicated relationships, over-complex tree (overfitting)

20
Q

Neural Networks Pros and Cons

A

Pros: can learn more complicated class boundaries, can be more accurate, can handle large number of features
Cons: hard to implement: trial and error for choosing parameters and network structure, slow training time, can overfit the data, hard to interpret

21
Q

Supervised Learning: Neural Networks

A

Set of neurons connected by directed, weighted edges
A positive weight encourages the neuron to fire, while a negative weight inhibits firing. Each neuron has a fixed threshold t.
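
A single threshold neuron can be sketched as follows. The function name is made up for the example, and firing when the weighted sum is greater than or equal to t (rather than strictly greater) is an assumption, since the card does not say which convention is used.

```python
def neuron_fires(inputs, weights, threshold):
    """Return True if the weighted sum of the inputs reaches the threshold t."""
    total = sum(x * w for x, w in zip(inputs, weights))
    return total >= threshold  # assumption: fire at >= t

# Two positive weights and t = 2: the neuron fires only when both inputs are on (AND).
and_result = neuron_fires([1, 1], weights=[1, 1], threshold=2)

# A negative weight on the second input inhibits firing when that input is on.
inhibited = neuron_fires([1, 1], weights=[1, -1], threshold=1)
```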

22
Q

Linearly Separable

A

Where the output data points can be separated using a linear boundary. A perceptron can only represent linearly separable functions.
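
A small experiment makes the limitation concrete. The sketch below uses the classic perceptron learning rule; the function name, the `epochs` cap and the learning rate are illustrative choices. AND is linearly separable, so training reaches an epoch with no errors. XOR is not, so no setting of the weights classifies all four points and training never succeeds.

```python
def train_perceptron(samples, epochs=50, lr=1):
    """Train a two-input perceptron; returns (weights, bias, converged)."""
    w = [0, 0]
    b = 0
    for _ in range(epochs):
        errors = 0
        for (x1, x2), target in samples:
            output = 1 if w[0] * x1 + w[1] * x2 + b > 0 else 0
            delta = target - output
            if delta != 0:
                # Perceptron rule: nudge the boundary towards the misclassified point.
                w[0] += lr * delta * x1
                w[1] += lr * delta * x2
                b += lr * delta
                errors += 1
        if errors == 0:
            return w, b, True   # every sample classified correctly
    return w, b, False          # no error-free epoch within the cap

AND = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
XOR = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]

_, _, and_converged = train_perceptron(AND)
_, _, xor_converged = train_perceptron(XOR)
```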

23
Q

Bayes Rule

A

The fundamental notion is that of conditional probability
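
For reference, the rule itself in its standard form, where A and B are any two events with P(B) > 0 (the symbols are not taken from the card):

```latex
P(A \mid B) = \frac{P(B \mid A)\, P(A)}{P(B)}
```

It follows from writing the joint probability in two ways, P(A and B) = P(A | B) P(B) = P(B | A) P(A), and dividing by P(B).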

24
Q

KRR

A

Knowledge Representation and Reasoning

25
Q

Knowledge Based Systems

A

A system built around a knowledge base, i.e. a collection of knowledge taken from a human and stored in such a way that the system can reason with it
