AIML Flashcards by lenckhbu ope

What is supervised learning, and why is it used?

Learning from examples that have labels attached. The model is trained to generalize, so it can predict the labels of new, unseen examples.

What is unsupervised learning, and why is it used?

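Learning from features with no labels. Used to discover structure in the data, such as clusters. A hypothesis such as y = p1(x) + p0 belongs to linear regression, which is trained on labeled data and is therefore supervised.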

What is cost?

How poorly the fitted line matches the data. A lower cost means a better fit.
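
As a rough illustration, cost for a fitted line is often measured as the mean squared error between the line's predictions and the targets. The sketch below assumes that choice and a line of the form y = p1*x + p0; the function name `mse_cost` and the sample data are made up for the example.

```python
def mse_cost(p0, p1, xs, ys):
    """Mean squared error of the line y = p1*x + p0 over the data."""
    errors = [(p1 * x + p0 - y) ** 2 for x, y in zip(xs, ys)]
    return sum(errors) / len(errors)

xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]  # made-up data lying exactly on y = 2x + 1

good = mse_cost(1.0, 2.0, xs, ys)  # the line that generated the data
bad = mse_cost(0.0, 1.0, xs, ys)   # a poorer line, so a higher cost
```

Here `good` is zero because the line passes through every point, while `bad` is larger because every prediction misses.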

What is the K-NN algorithm?

Supervised learning algorithm that classifies new incoming data by the labels of its K nearest stored examples. Low complexity to implement.

What are three advantages of the K-NN algorithm?

Simple to implement.
Flexible: works with any features and any distance function.
Easily handles multi-class data.

What are three disadvantages of the K-NN algorithm?

Finding the nearest neighbors is a large search problem and can be computationally intensive
Requires a large amount of stored data, especially with many classes
Needs a distance function that is meaningful for the features

How does K-NN operate?

Choose some K as the number of neighbors to take.
Locate the K nearest neighbors of the unclassified example. K should not be a multiple of the number of classes, to avoid tied votes. Optimize K by observation.
The class with the most votes wins.
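
The steps above can be sketched in a few lines. This is a minimal illustration, assuming Euclidean distance and a training set small enough to sort in full; the name `knn_classify` and the sample points are hypothetical.

```python
from collections import Counter
import math

def knn_classify(train, query, k):
    """train: list of (features, label) pairs; query: a feature tuple."""
    # Sort the stored examples by Euclidean distance to the query.
    by_distance = sorted(train, key=lambda item: math.dist(item[0], query))
    # Count the labels of the k closest examples; most votes wins.
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]

train = [
    ((1.0, 1.0), "a"), ((1.2, 0.8), "a"), ((0.9, 1.1), "a"),
    ((5.0, 5.0), "b"), ((5.2, 4.9), "b"), ((4.8, 5.1), "b"),
]
label = knn_classify(train, (1.1, 1.0), k=3)
```

Sorting every stored example for each query is the "large search problem" in its plainest form; real implementations usually swap in a spatial index.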

What is the K-means Clustering algorithm?

Unsupervised learner for clustering (not classification or regression, which need labels)
Finds K groups in the set, defined by centroids.
Guaranteed to converge on a result, though it may be a local optimum rather than the global optimum

What are three advantages of the K-means clustering algorithm?

Can be used to group any data with numeric features
Because each cluster is summarized by its centroid, new data is easily assigned to the nearest cluster
Clustering finds groups that have formed organically, without predefined labels

What are three disadvantages of the K-means clustering algorithm?

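Sensitive to outliers, which pull centroids away from the true cluster centers
Struggles with complicated cluster types, such as non-spherical or unevenly sized clusters
Cluster assignments can change from run to run because of the random initialization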

How does K-means clustering work?

Randomly generates K centroid locations in the feature space
Each point is assigned to its nearest centroid
Each centroid is relocated to the mean of its assigned points
Iterates until a stopping condition is met: no change in assignments, sum of distances no longer decreasing, iteration cap reached, etc.
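
A minimal sketch of that loop follows. It makes two assumptions not stated on the card: the initial centroids are drawn from the data points themselves (a common way to avoid empty starting clusters), and the only stopping rules are "no change in assignments" and an iteration cap. The names `kmeans`, `max_iters`, and `seed` are hypothetical.

```python
import math
import random

def kmeans(points, k, max_iters=100, seed=0):
    rng = random.Random(seed)
    # Assumption: start from k distinct data points as the centroids.
    centroids = rng.sample(points, k)
    assignment = None
    for _ in range(max_iters):
        # Assign each point to its nearest centroid.
        new_assignment = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        if new_assignment == assignment:
            break  # no change, so the loop has converged
        assignment = new_assignment
        # Move each centroid to the mean of its assigned points.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # an empty cluster keeps its old centroid
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return centroids, assignment

points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
centroids, assignment = kmeans(points, k=2)
```

Changing `seed` can change which cluster gets which index, and on less well-separated data it can change the grouping itself, since the loop only finds a local optimum.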

What is the search space?

The set of all candidate solutions, often pictured as a graph of how good each solution is. A search tries to find the global optimum rather than settling on a local optimum.

What are the components of a neuron?

Synapses, receiving numerical inputs
Summation sub-unit sums the weighted inputs into a single value
Activation sub-unit maps that value to the output
Output passes through the axon to all connected neurons
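
The components map onto a very small function. This sketch assumes a sigmoid activation and an optional bias term, neither of which is specified on the card; the name `neuron` and the example weights are hypothetical.

```python
import math

def neuron(inputs, weights, bias=0.0):
    # Summation sub-unit: weighted inputs reduced to a single value.
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    # Activation sub-unit: sigmoid is an assumed choice here.
    return 1.0 / (1.0 + math.exp(-total))

# The weighted inputs cancel out, so the sigmoid sits at its midpoint.
out = neuron([1.0, 0.0, 1.0], [0.5, -0.3, -0.5])
```

The returned value is what would be sent along the axon to every connected neuron.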

What is a hidden layer?

Any layer of neurons between the input layer and the output layer.

What is feed-forward?

The input is passed forward through the network layer by layer: each connection weights its value, each neuron sums and activates, and the final layer produces the output.
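
A small sketch of a forward pass, assuming fully connected layers, a step activation, and hand-picked weights (all hypothetical, chosen so the network computes XOR through one hidden layer):

```python
def step(z):
    return 1 if z > 0 else 0

def layer(inputs, weights, biases):
    """One fully connected layer: each row of weights feeds one neuron."""
    return [
        step(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]

def feed_forward(inputs, network):
    """Pass the values forward through each layer in turn."""
    values = inputs
    for weights, biases in network:
        values = layer(values, weights, biases)
    return values

# Hidden neurons act as OR and AND; the output neuron fires for
# "OR but not AND", which is XOR.
network = [
    ([[1, 1], [1, 1]], [-0.5, -1.5]),  # hidden layer
    ([[1, -1]], [-0.5]),               # output layer
]
outputs = [feed_forward([a, b], network)[0] for a in (0, 1) for b in (0, 1)]
```

Nothing flows backwards here: each layer only reads the values produced by the layer before it.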

What is linear separability?

Whether a single straight line (a hyperplane in higher dimensions) can be drawn that separates all positive examples from all negative examples. XOR is not linearly separable.
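
One way to get a feel for this is to search for a separating line directly. The sketch below tries a small grid of integer weights for a single threshold unit; the grid, the function name `find_line`, and the AND table used for contrast are assumptions for illustration. A failed grid search is not a proof, but for XOR no line exists at any scale.

```python
from itertools import product

def step(z):
    return 1 if z >= 0 else 0

def find_line(truth_table, grid=range(-3, 4)):
    """Search small integer weights (w1, w2, b) for a separating line."""
    for w1, w2, b in product(grid, repeat=3):
        if all(step(w1 * x1 + w2 * x2 + b) == y
               for (x1, x2), y in truth_table.items()):
            return w1, w2, b
    return None

AND = {(0, 0): 0, (0, 1): 0, (1, 0): 0, (1, 1): 1}
XOR = {(0, 0): 0, (0, 1): 1, (1, 0): 1, (1, 1): 0}

and_line = find_line(AND)  # some (w1, w2, b) is found
xor_line = find_line(XOR)  # None: nothing on the grid separates XOR
```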

Define bias vs variance.

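Bias is the error from a model that is too simple and fails to capture relationships within the data (underfitting).
Variance is the error from a model that is too complex and has overfitted to the data, fitting its noise and producing noisy results on new data.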

What is regularization?

A group of techniques that push a model towards simpler solutions, reducing overfitting (variance) at the cost of a little bias and so aiding the bias-variance trade-off.

Give three examples of regularization with brief descriptions.

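L1 - Prevents overfitting by adding a penalty on the absolute values of the weights to the loss, driving unimportant weights to zero. Gives smaller, sparser models.
L2 - Adds a penalty on the squared weights to the loss. Drives weights towards smaller values without forcing them to zero.
Dropout - Randomly shuts off neurons during training. Forces robust learning that does not rely on any single neuron.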
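
The three techniques can be sketched as small helper functions. The names, the strength parameter `lam`, the example weights, and the use of "inverted" dropout scaling are assumptions for illustration rather than details taken from the card.

```python
import random

def l1_penalty(weights, lam):
    """L1: lam times the sum of absolute weights."""
    return lam * sum(abs(w) for w in weights)

def l2_penalty(weights, lam):
    """L2: lam times the sum of squared weights."""
    return lam * sum(w * w for w in weights)

def dropout(activations, rate, rng):
    """Zero each activation with probability `rate` (training time only)."""
    keep = 1.0 - rate
    # Inverted dropout: scale survivors so the expected value is unchanged.
    return [a / keep if rng.random() < keep else 0.0 for a in activations]

weights = [0.5, -2.0, 0.0, 1.0]
data_loss = 0.25  # hypothetical unregularized loss
loss_l1 = data_loss + l1_penalty(weights, lam=0.1)
loss_l2 = data_loss + l2_penalty(weights, lam=0.1)
dropped = dropout([1.0, 2.0, 3.0, 4.0], rate=0.5, rng=random.Random(0))
```

Note how the large weight (-2.0) dominates the L2 penalty because it is squared, while the L1 penalty charges every unit of weight equally.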

What is a genetic algorithm?

A probabilistic search method modeled on natural selection.

List four benefits of a genetic algorithm.

Supports multi-objective optimization
Good for noisy data
Always provides an answer
Easily parallelizable

How does a genetic algorithm represent solutions?

Encoded, often using binary bit strings (chromosomes)

Describe the broad steps of a genetic algorithm.

Initialize a population of solutions
Evaluate each using an objective function
Create new solutions via selection, crossover, mutation and elitism
Replace old population and repeat
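
Those steps fit in one short loop. The sketch below is only one possible arrangement: it assumes bit-string solutions, a toy objective that counts 1 bits, fitness-proportional selection, single-point crossover, and per-bit mutation. All names and parameter values are hypothetical.

```python
import random

def fitness(bits):
    # Hypothetical objective ("OneMax"): count the 1 bits.
    return sum(bits)

def evolve(pop_size=20, length=16, generations=40, mutation_rate=0.02,
           elite=2, seed=0):
    rng = random.Random(seed)
    # 1. Initialize a population of random bit strings.
    pop = [[rng.randint(0, 1) for _ in range(length)] for _ in range(pop_size)]
    for _ in range(generations):
        # 2. Evaluate each solution and rank the population.
        pop.sort(key=fitness, reverse=True)
        # 3a. Elitism: the best few pass through unmodified.
        new_pop = [ind[:] for ind in pop[:elite]]
        # Small offset keeps the total weight positive for random.choices.
        weights = [fitness(ind) + 1e-6 for ind in pop]
        while len(new_pop) < pop_size:
            # 3b. Selection: fitness-proportional choice of two parents.
            mum, dad = rng.choices(pop, weights=weights, k=2)
            # 3c. Crossover: splice the parents at a random cut point.
            cut = rng.randint(1, length - 1)
            child = mum[:cut] + dad[cut:]
            # 3d. Mutation: flip each bit with a small probability.
            child = [1 - b if rng.random() < mutation_rate else b for b in child]
            new_pop.append(child)
        # 4. Replace the old population and repeat.
        pop = new_pop
    return max(pop, key=fitness)

best = evolve()
```

Because the elite are copied across untouched, the best fitness in the population can never drop from one generation to the next.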

Define selection in a genetic algorithm.

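Each solution is ranked by its fitness value
The total fitness of the population is calculated
Each solution's share (%) of the total fitness is calculated
A biased roulette wheel is built from these probabilities and spun to pick parents, so fitter solutions are chosen more often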
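
As a worked illustration, the wheel can be built from cumulative fitness totals. This sketch assumes non-negative fitness values with a positive total; the function names and the fitness values are made up.

```python
import random
from itertools import accumulate

def selection_probabilities(fitnesses):
    """Each solution's share of the total fitness."""
    total = sum(fitnesses)
    return [f / total for f in fitnesses]

def spin(fitnesses, rng):
    """Spin the biased wheel once and return the chosen index."""
    wheel = list(accumulate(fitnesses))  # cumulative slice boundaries
    pointer = rng.uniform(0, wheel[-1])
    for index, boundary in enumerate(wheel):
        if pointer <= boundary:
            return index
    return len(wheel) - 1  # guard against floating-point edge cases

fitnesses = [10, 30, 60]  # made-up fitness values
probs = selection_probabilities(fitnesses)
rng = random.Random(0)
picks = [spin(fitnesses, rng) for _ in range(1000)]
```

With these values the third solution owns 60% of the wheel, so it should turn up in `picks` far more often than the first, which owns only 10%.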

Define crossover in a genetic algorithm.

Choose two parents using the selection roulette wheel, then swap one or more corresponding bits (or bit segments) between them to produce offspring.
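
A minimal sketch of one common variant, single-point crossover, where everything after a random cut point is swapped. The function name and the all-zeros and all-ones parents are hypothetical, and the parents are assumed to have equal length.

```python
import random

def one_point_crossover(parent_a, parent_b, rng):
    """Swap the tails of two equal-length bit strings at a random cut."""
    cut = rng.randint(1, len(parent_a) - 1)
    child_a = parent_a[:cut] + parent_b[cut:]
    child_b = parent_b[:cut] + parent_a[cut:]
    return child_a, child_b

a = [0, 0, 0, 0, 0, 0]
b = [1, 1, 1, 1, 1, 1]
child_a, child_b = one_point_crossover(a, b, random.Random(0))
```

Each child starts with one parent's bits and ends with the other's, so between them the two children still hold every bit the parents had.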

Define elitism in a genetic algorithm.

Select a few of the best candidate solutions and copy them into the next generation without modification.

Define mutation in a genetic algorithm.

Random alteration or flip of a bit in a solution.

aiml (27 cards)