w3 Flashcards
(45 cards)
What makes deep learning ‘deep’?
o a) The complexity of features learned
o b) The sophistication of the neural connections
o c) The number of layers in the network
o d) The size of the input data
c) The number of layers in the network
Which of the following is true about convolutional neural networks?
o a) They classify objects by matching pixel colors
o b) They apply filters to detect features in an image
o c) They use a single layer to capture complex features
o d) They don’t use activation maps
b) They apply filters to detect features in an image
What is the main function of the classification module in a ConvNet?
o a) To calculate the number of layers
o b) To output a set of activation maps
o c) To determine the class with the highest confidence score
o d) To increase network depth
c) To determine the class with the highest confidence score
What is an epoch in the context of training a neural network?
o a) A function for determining classification
o b) A measure of data accuracy
o c) One full pass through the entire training dataset
o d) The confidence score of the model
c) One full pass through the entire training dataset
What drives a ConvNet to converge?
o a) Random initialization of weights
o b) Lack of training data
o c) Weight adjustments stopping as the network stabilizes
o d) Reduction in the number of layers
c) Weight adjustments stopping as the network stabilizes
True or False
The activation maps in ConvNets are similar to activation patterns in the human brain’s visual system.
True
True or False
The number of layers in a neural network directly reflects the quality of learning, regardless of the network’s architecture.
False
True or False
In a ConvNet, the highest convolutional layer is used as the final output layer for classification.
False
True or False
Edge detection typically occurs in the first layer of a ConvNet.
True
Why is object recognition challenging for neural networks, despite it seeming easy for humans?
Object recognition is complex for neural networks because they must account for variations in shape, size, orientation, and lighting, whereas the human brain processes visual information with remarkable flexibility and adaptability.
Explain the role of convolutional layers in a ConvNet and how they process image data.
Convolutional layers apply filters to the input image, creating activation maps that highlight specific features such as edges, textures, or object parts.
Convolutional layers extract features from image data by applying filters (kernels) that slide over the input, These filters detect patterns such as edges, textures, or shapes by focusing on local regions of the image (local receptive fields) creating activation maps. By stacking multiple convolutional layers, the network learns hierarchical features, with early layers detecting simple patterns and deeper layers capturing complex structures, enabling the ConvNet to understand and classify images effectively.
What does the term ‘convergence’ mean in neural network training, and why is it important?
Convergence refers to the stabilization of the network’s weights as training progresses.
How does backpropagation help a ConvNet learn, and what role do epochs play in this process?
Backpropagation enables the ConvNet to adjust its weights and biases by propagating the error backward and using gradients to minimize the loss function.
Epochs determine how many times the model iterates over the dataset, allowing it to progressively refine its understanding of the data.
Together, backpropagation and multiple epochs allow a ConvNet to effectively learn from data and improve its performance over time.
Backpropagation calculates errors from the output layer and adjusts the weights in each layer to minimize these errors.
What are some ways ConvNets are similar to the human brain’s visual processing system?
ConvNets process images in a hierarchical manner, where each layer builds upon the previous one to identify increasingly complex features.
What was the purpose of the Amazon Mechanical Turk in building the ImageNet dataset?
a) To design deep learning algorithms
b) To crowdsource image labeling for training data
c) To create the 1,000 categories in ImageNet
d) To analyze model performance
b) To crowdsource image labeling for training data
What is the ‘top-5 accuracy’ metric in the ImageNet competition?
a) The model’s top five categories must be correct
b) The correct label must appear among the model’s top five predictions
c) Five categories are selected for each image
d) A model with five layers achieves higher accuracy
b) The correct label must appear among the model’s top five predictions
Why did ConvNets start outperforming other methods in image recognition?
a) Increased access to large datasets and faster parallel computing hardware
b) ConvNets required less data than other models
c) Better labeling techniques reduced training time
d) ConvNets used top-1 accuracy to improve classification
a) Increased access to large datasets and faster parallel computing hardware
Which model’s success in the ImageNet competition is considered a turning point for ConvNets?
c) AlexNet
What is one of the main challenges ConvNets face in comparison to human object recognition?
a) ConvNets consistently outperform humans
b) ConvNets struggle with localization tasks
c) ConvNets are unable to recognize any real-world images
d) ConvNets cannot achieve top-5 accuracy
b) ConvNets struggle with localization tasks
True or False
ConvNets are now considered fully human-equivalent in object recognition across all tasks.
False
True or False
The ImageNet competition used Amazon Mechanical Turk to improve GPU processing speeds.
False
True or False
ImageNet has 1,000 categories, and models are evaluated using top-5 accuracy.
True
True or False
Data snooping, seen during some ImageNet competitions, refers to a type of cheating where test data is improperly accessed.
True
What were the two key factors that allowed ConvNets to dominate the field of computer vision? Explain how each contributed.
The availability of large datasets (e.g., ImageNet) provided ample training data, allowing models to learn complex features. Additionally, advancements in GPU technology enabled faster parallel computing.