Lecture 2 Flashcards

1
Q

What is rote learning?

A

Learning by memorization, so like these flashcards but also used in the machine learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is supervised learning?

A

Use specific examples to reach general conclusions or extract general rules. So we give (X, Y) training set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is unsupervised learning?

A

Unsupervised identification of natural groups in data. So we give X values and group the results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is reinforcement learning?

A

Feedback (positive or negative reward) given at the end of a sequence of steps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why is Rote learning limited?

A

It memorizes the input/output pairs and performs matching with new inputs. If the computer has not seen the precise case before, it cannot apply its experience.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is evaluationary learning?

A

Inspired by evolution: adapts to environment to improve the survival rate and chance of having offspring

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the most important factor in learning?

A

Generalization - we want computers to generalize from prior experience

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is explanatory modeling?

A

Application of statistical models to data for testing causal hypotheses about theoretical constructs

Useful description and explanation of why and how thing works

“what can we do about it”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is predictive modeling?

A

A process of applying a statistical model or data mining algorithm to data for the purpose of predicting new or future observations

“what is likely to happen”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a predictive model?

A

Any method that produces predictions, regardless of its underlying approach

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are different AI levels?

A
  1. Narrow AI (weak AI)
    - Designed for a specific task, often outperforming humans
    - Virtual assistants, image recognition systems
  2. General AI (strong AI)
    - Human like cognitive abilities, can understand, learn and apply knowledge across multiple tasks
    - Only theoretical currently
  3. Super AI (AGI - Artificial General Intelligence)
    - Surpasses human intelligence in virtually every aspect and can outperform humans in any economically valuable work
    - Autonomous learning, problem-solving, cretivity
    - Mainly speculative and associated with potential risks
  4. Singularity
    - Hypothetical point in the future where technological growth becomes uncontrrollable and irreversible, leading to unforeseeable changes in human civilization.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is pattern recognition?

A

A branch of machine learning that focuses on the recognition of patterns and regularities in data

For example recognizing is an email spam or non-spam

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How are pattern recognition systems trained?

A

Mainly from labeled training data (supervised learning), but when no labeled data available then unsupervised or semisupervised learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Whats the difference between pattern recognition and pattern matching algorithms?

A

Pattern recognition aims to provide “most likely” answer matching of the inputs

Pattern matching algorithm looks for exact matches in the input with pre-existing patterns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the classical way for pattern recognition?

A

sensor -> representation pattern -> feature selector/extractor -> feature pattern -> classifier -> decision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is feature vector?

A

Raw input data from sensors preprocessed to obtain a feature vector X, that describes all of the relevant features for classifying examples

X is a list of attribute, value pairs X=[color=Orange Shape=Round]

17
Q

What is a feature space?

A

Each example can be interpreted as a point in an n-dimensional feature space, where n is the number of attributes

18
Q

What is a pattern?

A

A pattern could be an object or event, like “hand”, “head”, “horse”, number etc.

19
Q

What are different goals in pattern recognition?

A
  1. Recognize objects within the input signal, for example determine that written 2 is 2.
  2. Segmentation of objects, for example partitioning image into multiple segments, like highlightin all the people from the image
  3. Region based approaches -> recognizes the image regions (aka boxes) of the object, so if there is a boat in the image it makes a box around it
  4. Semantic segmentation -> classifies each image pixel to its corresponding class, like people, cars, trains etc.
20
Q

What are different pattern classes?

A
  • Intra-class variability: for example different types of letter T
  • Inter-class variability: Letters/numbers that look similar, for example OD, B8, 2Z, I1
21
Q

How are pattern classes typically expressed?

A

In terms of a statistical model, for example histogram, probability density function etc.

22
Q

What are Pattern Recognition Objectives?

A
  • Hypothesize the models that describe each pattern class
  • Given a novel pattern assign it to the pattern class associated with the model (predictive modelling)
23
Q

What is the difference between classification and clustering?

A

Classification: categories are known, finds letters for “category A, category B”. Supervised

Clustering: Creates categories, for example separates all 1 and I to two categories. Unsupervised

24
Q

What are the stages in a pattern recognition problem?

A
  1. Formulation of the problem: gaining a clear understanding and planning the stages
  2. Data collection
  3. Initial examination of the data - summary statistics, plots etc.
  4. Feature selection or feature extraction for the task
  5. Unsupervised pattern classification or clustering (exploratory data analysis)
  6. Apply discrimination or regression procedures as appropriate
  7. Assessment of results, often in the form of confusion matrix
25
Q

What is statistical pattern recognition?

A

It attempts to classify patterns based on a set of extracted features and determine underlying statistical model for the generation of these patterns

26
Q

What is structural pattern recognition?

A

A field within pattern recognition that focuses on analyzing the structure of patterns or objets. Recognizes relationships and connections between different components of a pattern

For example: square
Statistical: “number of segments:4”
Structural: “horizontal -> vertical -> vertical -> horizontal”

27
Q

What are key concepts of structural pattern recognition?

A

Graph theory: Utilized for modeling and representing relationships in patterns

Hierarchical structures: Recognition of patterns organized in a hierarchical manner

Syntactic analysis: involves the use of formal grammars to describe and recognize patterns

28
Q

What is the cost of miss-classification?

A

The cost is different for miss-classifications. For example buying salmon and the machine has put sea bass in there -> people are angry. buying sea bass and getting salmon -> people are not that angry

29
Q

How many features should the model have?

A

No clear answer

Some features might be difficult and computationally expensive to extract

Correlated features do not in theory improve performance

30
Q

What is the curse of dimensionality?

A

Adding too many features can lead to worse performance. If each input feature is divided into M divisions, then the total number of cells M^d grows exponentially

31
Q

What might be the problem with using too complex models?

A

Overfitting, the model could be tuned too well for particular training samples

32
Q

How can we improve generalization performance?

A
  • More training examples
  • Simpler models
33
Q

How to evaluate model performance?

A

Simple: separate training and test data

Cross-validation

34
Q

What does pooling mean in pattern recognition?

A

Performance can be improved using a “pool” of classifiers -> combining multiple classifiers

35
Q

What is the limitation of classical approach in pattern recognition?

A

It uses hand-crafted models, which is very tedious and costly to develop, and highly dependent on one application

36
Q

How does deep learning work in pattern recognition?

A

Deep learning seeks to learn rich hierarchical representations automatically through multiple stage of feature learning process

picture of a tiger -> low level features -> mid-level features -> high-level features -> trainable classifier -> output

For example text:

  • Character -> word -> word group -> clause -> sentence -> story
37
Q

What is ViT - Vision transformer?

A

It divides data into smaller parts (patches) which are processed as a sequence. Then spatial information of the image is added with positional encoding so the model understands the position of the parts in the image.

The attention mechanism allows the model to emphasize important parts of the image and learn long-range dependencies