C9 Flashcards
why does pre-training work especially well on deeply layered architectures?
the knowledge in the layers goes from generic to specific: the lower layers contain more generic features (such as edges and textures in images) that are well suited for transfer to other tasks, while the higher layers become more task-specific
what are foundation models?
they are large models in a certain field (e.g. image recognition or NLP) that are trained extensively on large datasets; they contain general knowledge that can be specialized for a particular purpose
What is the reason for the interest in meta-learning and transfer learning?
we want to speed up learning a new task by using previous knowledge instead of learning from scratch. In transfer learning, we pretrain the network's parameters on a single task; in meta-learning, we use multiple related tasks.
what is transfer-learning?
networks trained on one dataset are used to speed up training for a different task, possibly with a much smaller dataset
what is meta-learning?
learning how to learn
How is meta-learning different from multi task learning?
In multi-task learning, more than one task is learned from one dataset. The tasks are often related, such as classification tasks over different, but related, classes of images (a minimal example is sketched below).
In meta-learning, both datasets and tasks are different, but not too different. A sequence of datasets and learning tasks is generalized to learn a new (related) task quickly. The aim is learning to learn
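A minimal multi-task sketch in PyTorch, assuming the common setup of a shared trunk with one output head per task (the layer sizes and task contents are made up for illustration):

```python
import torch.nn as nn

class MultiTaskNet(nn.Module):
    """Shared feature trunk with one classification head per task."""
    def __init__(self):
        super().__init__()
        self.trunk = nn.Sequential(nn.Linear(64, 128), nn.ReLU())
        self.head_a = nn.Linear(128, 10)  # e.g. 10 dog breeds
        self.head_b = nn.Linear(128, 5)   # e.g. 5 cat breeds
    def forward(self, x):
        h = self.trunk(x)           # features shared across tasks
        return self.head_a(h), self.head_b(h)
```

The shared trunk is what ties the tasks together: its weights receive gradients from both heads' losses.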
what is domain adaptation?
needed when there is a change in the data distribution between the training and test datasets, e.g. when items must be recognized against different backgrounds
goal: compensate for the variation between the two data distributions, so that information from a source domain can be reused on a different target domain
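One common way to compensate is to add a penalty on the discrepancy between source and target feature statistics; a minimal sketch of a linear-kernel MMD (maximum mean discrepancy) term, with hypothetical feature arrays (MMD is a standard technique, not one named in this card):

```python
import numpy as np

def mmd_linear(source_feats, target_feats):
    """Squared distance between the mean feature vectors of the two
    domains; adding this term to the task loss pushes the network
    toward domain-invariant features."""
    return np.sum((source_feats.mean(axis=0) - target_feats.mean(axis=0)) ** 2)
```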
what is the difference between meta-learning and machine learning?
machine learning learns the parameters of a function that approximates the data; meta-learning learns hyperparameters of the learning algorithm itself, across tasks
what is few-shot learning?
test whether a learning algorithm can recognize examples from classes of which it has seen only a few examples during training. Prior knowledge from other classes is already present in the network.
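A minimal sketch of how N-way K-shot episodes are typically sampled for few-shot training and evaluation (function and parameter names are illustrative):

```python
import random
from collections import defaultdict

def make_episode(dataset, n_way=5, k_shot=1, n_queries=5):
    """Sample one N-way K-shot episode from a list of (example, label) pairs.
    Returns a support set (for adaptation) and a query set (for evaluation),
    with labels relabeled to 0..n_way-1 within the episode."""
    by_class = defaultdict(list)
    for x, y in dataset:
        by_class[y].append(x)
    classes = random.sample(list(by_class), n_way)
    support, query = [], []
    for new_label, c in enumerate(classes):
        examples = random.sample(by_class[c], k_shot + n_queries)
        support += [(x, new_label) for x in examples[:k_shot]]
        query += [(x, new_label) for x in examples[k_shot:]]
    return support, query
```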
what is the connection between meta-learning and curriculum learning?
Both approaches aim to improve the speed and accuracy of learning by learning from a set of subtasks.
Curriculum learning is thus a form of meta-learning in which the subtasks are ordered from easy to hard or, equivalently, meta-learning is unordered curriculum learning
Zero-shot learning aims to identify classes that it has not seen before. How is that possible?
Attribute-based zero-shot learning uses separate high-level attribute descriptions of the new categories, built from attributes learned on the categories in the training set.
E.g. recognize a red beak because we have learned the concepts “red” and “beak”
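A minimal sketch of attribute-based zero-shot classification, with made-up attribute signatures: a model trained on seen classes predicts per-attribute scores, and the unseen class whose signature best matches the scores is chosen:

```python
import numpy as np

# Hypothetical binary attribute signatures for classes never seen in training.
# Attribute order: [red, beak, stripes]
class_attributes = {
    "cardinal": np.array([1, 1, 0]),
    "zebra":    np.array([0, 0, 1]),
}

def zero_shot_classify(attribute_scores):
    """attribute_scores: per-attribute probabilities from a model trained on
    seen classes. Score each unseen class by rewarding matched attributes
    and penalizing attributes the class should not have."""
    return max(class_attributes,
               key=lambda c: attribute_scores @ class_attributes[c]
                             - attribute_scores @ (1 - class_attributes[c]))

print(zero_shot_classify(np.array([0.9, 0.8, 0.1])))  # -> "cardinal"
```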
is pre-training a form of transfer learning?
yes: some network layers are copied to initialize a network for the new task, followed by fine-tuning to improve performance on the new task, but with a smaller dataset
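A minimal fine-tuning sketch with PyTorch/torchvision (assuming torchvision ≥ 0.13; the 10-class target task is hypothetical): the pre-trained layers are copied and frozen, and only a new classification head is trained:

```python
import torch
import torch.nn as nn
from torchvision import models

# Load a network pre-trained on ImageNet.
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

# Freeze the copied layers: the lower layers hold generic features.
for p in model.parameters():
    p.requires_grad = False

# Replace the classification head for the (hypothetical) new task
# and train only the new head's parameters.
num_classes = 10
model.fc = nn.Linear(model.fc.in_features, num_classes)
optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
```

For more aggressive fine-tuning, the upper (more task-specific) layers can also be unfrozen, usually with a small learning rate.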
what is MAML?
Model-Agnostic Meta-Learning (Finn et al., 2017), a well-known deep meta-learning approach for few-shot learning: it learns a parameter initialization from which a new task can be learned in only a few gradient-descent steps
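A minimal second-order MAML sketch in PyTorch (assuming torch ≥ 2.0 for torch.func.functional_call), on a made-up sine-regression task family with one inner gradient step per task:

```python
import torch
import torch.nn as nn
from torch.func import functional_call

def sample_sine_task():
    # Hypothetical task family: regress y = a*sin(x + b) with random a, b.
    a = torch.rand(1) * 4.9 + 0.1
    b = torch.rand(1) * torch.pi
    def sample(n):
        x = torch.rand(n, 1) * 10 - 5
        return x, a * torch.sin(x + b)
    return sample

net = nn.Sequential(nn.Linear(1, 40), nn.ReLU(), nn.Linear(40, 1))
meta_opt = torch.optim.Adam(net.parameters(), lr=1e-3)
inner_lr, loss_fn = 0.01, nn.MSELoss()

for step in range(1000):           # outer (meta) loop
    meta_opt.zero_grad()
    meta_loss = 0.0
    for _ in range(4):             # meta-batch of tasks
        task = sample_sine_task()
        xs, ys = task(10)          # support set: adapt to the task
        xq, yq = task(10)          # query set: evaluate the adaptation
        params = dict(net.named_parameters())
        inner_loss = loss_fn(net(xs), ys)
        grads = torch.autograd.grad(inner_loss, params.values(),
                                    create_graph=True)  # keep graph: 2nd order
        adapted = {k: p - inner_lr * g
                   for (k, p), g in zip(params.items(), grads)}
        meta_loss = meta_loss + loss_fn(functional_call(net, adapted, (xq,)), yq)
    meta_loss.backward()           # gradient w.r.t. the shared initialization
    meta_opt.step()
```

The outer update improves the initialization so that one inner gradient step already fits a new task well; that is what makes the learned initialization few-shot capable.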
As the diversity of tasks increases, does meta-learning achieve good results?
For tasks that are related, good results are reported; when tasks are less related (such as images of animals from very different species), weaker results are reported.