lesson_1_flashcards
What is hierarchical compositionality in deep learning?
A principle where complex features are built by composing simpler ones layer by layer (e.g., edges combine into shapes, shapes into objects), mirroring the compositional structure of real-world data.
What is end-to-end learning in deep learning?
A learning method where models optimize directly from raw inputs to final outputs, automating feature extraction and classification in one process.
What is distributed representation in deep learning?
A feature representation where information is distributed across multiple neurons, enabling rich and generalizable representations.
What is gradient descent used for in deep learning?
An optimization algorithm that iteratively updates model parameters to minimize a loss function by moving in the direction of steepest descent.
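The update rule can be sketched on a toy one-dimensional loss; the learning rate (0.1), starting point, and quadratic loss are illustrative choices, not part of the definition:

```python
# Gradient descent on L(w) = (w - 3)**2, whose minimum is at w = 3.
def grad(w):
    return 2 * (w - 3)  # dL/dw

w = 0.0                  # arbitrary starting point
for _ in range(100):
    w -= 0.1 * grad(w)   # step opposite the gradient (steepest descent)
# w converges toward the minimizer w = 3
```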
What are mini-batches in gradient descent?
Mini-batches are small subsets of training data used to compute gradients and update weights, balancing computational efficiency and convergence stability.
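A minimal sketch of mini-batch iteration, assuming a toy dataset and a batch size of 20 (both illustrative); gradients would be computed per batch where the comment indicates:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))   # toy dataset: 100 samples, 5 features
y = rng.normal(size=100)

batch_size = 20
perm = rng.permutation(len(X))  # shuffle once per epoch
for start in range(0, len(X), batch_size):
    idx = perm[start:start + batch_size]
    X_batch, y_batch = X[idx], y[idx]
    # compute gradients on (X_batch, y_batch) and update weights here
```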
What is the role of softmax in classification tasks?
Softmax converts raw class scores into normalized probabilities, making them interpretable for classification.
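As a sketch, softmax exponentiates each score and normalizes by the sum; subtracting the max first is a standard numerical-stability trick that leaves the result unchanged:

```python
import numpy as np

def softmax(scores):
    # Shifting all scores by a constant does not change softmax,
    # so subtracting the max avoids overflow in np.exp.
    shifted = scores - np.max(scores)
    exps = np.exp(shifted)
    return exps / exps.sum()

probs = softmax(np.array([2.0, 1.0, 0.1]))
# probs sums to 1 and preserves the ordering of the raw scores
```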
What is cross-entropy loss?
A loss function used for classification tasks that penalizes incorrect predictions by comparing predicted probabilities with true labels.
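For a single example with one-hot labels, cross-entropy reduces to the negative log-probability of the true class; the probability vectors below are illustrative:

```python
import numpy as np

def cross_entropy(probs, true_class):
    # Penalty grows sharply as the probability of the true class shrinks.
    return -np.log(probs[true_class])

confident = cross_entropy(np.array([0.9, 0.05, 0.05]), 0)  # mostly right
unsure    = cross_entropy(np.array([0.4, 0.3, 0.3]), 0)    # hedged
# confident < unsure: the loss rewards probability mass on the true label
```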
How are computation graphs utilized in deep learning?
Computation graphs represent models as differentiable operations, enabling efficient backpropagation for optimization.
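The idea can be sketched by hand for a tiny graph computing L = (w*x - y)**2: the forward pass stores intermediates, and the backward pass applies the chain rule node by node (what autodiff frameworks automate):

```python
x, y, w = 2.0, 5.0, 1.5

# forward pass: each intermediate value is a node in the graph
p = w * x        # multiply node
d = p - y        # subtract node
L = d ** 2       # square node (the loss)

# backward pass: propagate dL/d(node) in reverse through the graph
dL_dd = 2 * d        # from L = d**2
dL_dp = dL_dd * 1.0  # from d = p - y
dL_dw = dL_dp * x    # from p = w * x  -> chain rule gives 2*(w*x - y)*x
```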
What are parametric models in machine learning?
Models that explicitly represent a function f(x, W) with parameters W, such as linear models or neural networks, whose parameters are optimized during training.
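The simplest parametric model is a linear classifier; the shapes below (3 classes, 4 features) and the bias term are illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.normal(size=(3, 4))  # parameters adjusted during training
b = np.zeros(3)

def f(x, W, b):
    # Linear model: one raw score per class.
    return W @ x + b

scores = f(rng.normal(size=4), W, b)
```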
What is the purpose of feature learning in deep learning?
Feature learning automates the process of extracting meaningful representations from raw data, reducing dependency on manual feature engineering.
What is supervised learning?
A learning paradigm where models are trained on labeled datasets to learn mappings from inputs (X) to outputs (Y).
What are regularization techniques used for in deep learning?
Regularization techniques like L1 or L2 prevent overfitting by penalizing large weight values, encouraging simpler models.
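Both penalties can be sketched in a few lines; the strength `lam` is an illustrative hyperparameter added to the data loss during training:

```python
import numpy as np

def l2_penalty(W, lam=0.01):
    # Penalizes squared magnitude: shrinks all weights smoothly.
    return lam * np.sum(W ** 2)

def l1_penalty(W, lam=0.01):
    # Penalizes absolute magnitude: tends to drive weights to exactly zero.
    return lam * np.sum(np.abs(W))

W = np.array([[0.5, -2.0], [1.0, 0.1]])
# larger weights incur larger penalties, nudging training toward simpler models
```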
What makes deep learning unique compared to traditional machine learning?
Deep learning uses hierarchical, end-to-end learning and distributed representations to generalize across tasks without manual feature engineering.
What are loss functions, and why are they important?
Loss functions measure the error between predicted outputs and ground truth, guiding optimization during training.
How does hierarchical compositionality mirror real-world data?
It reflects natural structures, such as edges forming shapes in images or phonemes forming words in speech.