deck_15613364 Flashcards
FOL
First Order Logic
In terms of FOL, what is the following:
Objects:
Relations:
Functions:
- Objects: people, houses, numbers, theories, Ronald McDonald, colors, baseball games, wars, centuries, ...
- Relations: red, round, bogus, prime, multistoried, ..., brother of, bigger than, inside, part of, has color, occurred after, owns, comes between, ...
- Functions: father of, best friend, third inning of, one more than, end of
Ontological Engineering
General and flexible representations for complex
domains.
Upper ontology:
The general framework of concepts
Categories and Objects
Stuff: a significant portion of reality that seems to defy any obvious
individuation—division into distinct objects
Intrinsic: properties that belong to the very substance of the object, rather than to the object as a whole.
Extrinsic: properties of the object as a whole, e.g., weight, length, shape.
Substance: a category of objects that includes in its definition only intrinsic properties (mass noun).
Count noun: a category that includes any extrinsic properties in its definition.
Mental Objects
Mental objects are knowledge in someone’s head (or KB)
Propositional attitudes that an agent can have toward mental objects
* Eg: Believes, Knows, Wants, and Informs
Lois knows that Superman can fly:
Knows(Lois, CanFly(Superman))
Modal Logic
Such sentences about mental objects can become verbose and clumsy in plain FOL. Regular
logic is concerned with a single modality, the modality of truth.
Modal logic addresses this, with special modal operators that take sentences
(rather than terms) as arguments
Semantic networks
- convenient to perform inheritance reasoning
- Eg: Mary inherits the property of having two legs. Thus, to find out how many
legs Mary has, the inheritance algorithm follows the MemberOf link from Mary
to the category she belongs to and then follows SubsetOf links up the hierarchy
until it finds a category for which there is a boxed Legs link
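A minimal Python sketch of this inheritance algorithm, under toy assumptions: the dictionaries member_of, subset_of, and boxed_legs are invented stand-ins for the network's links.

```python
# Hypothetical toy sketch of the inheritance algorithm described above.
member_of = {"Mary": "FemalePersons"}
subset_of = {"FemalePersons": "Persons"}
boxed_legs = {"Persons": 2}                 # the "boxed" Legs link lives on Persons

def legs(obj):
    category = member_of[obj]               # follow the MemberOf link
    while category is not None:
        if category in boxed_legs:          # found a category with a boxed Legs link
            return boxed_legs[category]
        category = subset_of.get(category)  # follow SubsetOf up the hierarchy
    return None

print(legs("Mary"))  # 2, inherited from Persons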
Description logics
- notations that are designed to make it easier to describe definitions and properties of categories
- evolved from semantic networks in response to pressure to formalize what the networks mean while retaining the emphasis on taxonomic structure as an organizing principle
- Principal inference tasks:
- Subsumption: checking if one category is a subset of another by comparing their definitions
- Classification: checking whether an object belongs to a category
- The CLASSIC language (Borgida et al., 1989) is a typical description logic
- Eg: bachelors are unmarried adult males
- Bachelor = And(Unmarried, Adult, Male)
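A toy Python sketch of the two inference tasks, not real CLASSIC: if definitions are sets of conjoined primitive properties, subsumption reduces to set inclusion.

```python
# Toy sketch: CLASSIC-style conjunctive definitions as sets of properties.
definitions = {
    "Bachelor": {"Unmarried", "Adult", "Male"},
    "Male": {"Male"},
}

def subsumes(general, specific):
    # general subsumes specific if every property it requires is also required
    return definitions[general] <= definitions[specific]

def classify(obj_properties, category):
    # an object belongs to a category if it has all of its defining properties
    return definitions[category] <= obj_properties

print(subsumes("Male", "Bachelor"))                          # True
print(classify({"Unmarried", "Adult", "Male"}, "Bachelor"))  # True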
Belief revision:
inferred facts will turn out to be wrong and will have to be
retracted in the face of new information
Truth maintenance systems
or TMSs, are designed to handle these complications: any additional
sentences inferred from a retracted sentence must be retracted as well.
Justification-based truth maintenance system (JTMS)
- Each sentence in the knowledge base is annotated with a justification consisting of the set of sentences from which it was inferred
- Justifications make retraction efficient
- Assumes that sentences that are considered once will probably be
considered again
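A minimal JTMS-flavored sketch in Python, with invented sentences P, Q, R, S: each inferred sentence records the set of sentences it was inferred from, so retraction can propagate.

```python
# Each inferred sentence is annotated with its justification (support set).
justifications = {"Q": {"P"}, "R": {"Q", "S"}}  # Q inferred from P; R from Q and S

def retract(sentence, kb):
    kb.discard(sentence)
    for inferred, support in justifications.items():
        if sentence in support and inferred in kb:
            retract(inferred, kb)  # everything resting on the retracted sentence goes too

kb = {"P", "Q", "R", "S"}
retract("P", kb)
print(kb)  # {'S'}: Q and R depended, directly or indirectly, on P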
FOL basic elements
Events
NLP
Natural Language Processing
N-GRAMS
Smoothing n-gram models
Atomic Model in N-grams
Part-of-speech (POS) tagging
Part-of-speech (POS) tagging
* way to categorize words (lexical category/tag)
* POS allows language models to capture
generalizations such as “adjectives generally come
before nouns in English”
* useful first step in many other NLP tasks, such as
question answering or translation
The big Python libraries for this are NLTK and spaCy. Libraries exist in
other languages, but Python is generally regarded as the main NLP language
these days.
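A hedged tagging sketch with each library; it assumes NLTK's 'punkt' and 'averaged_perceptron_tagger' resources have been downloaded and spaCy's en_core_web_sm model is installed.

```python
# POS tagging with NLTK.
import nltk

tokens = nltk.word_tokenize("Adjectives generally come before nouns")
print(nltk.pos_tag(tokens))  # e.g. [('Adjectives', 'NNS'), ('generally', 'RB'), ...]

# The same idea with spaCy.
import spacy

nlp = spacy.load("en_core_web_sm")
print([(t.text, t.pos_) for t in nlp("Adjectives generally come before nouns")])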
Grammar
A grammar is a set of rules that defines the tree structure of allowable phrases
* A language is the set of sentences that follow those rules.
* Syntactic categories such as noun phrase or verb phrase help to constrain the
probable words at each point within a sentence
* The phrase structure provides a framework for the meaning or semantics of the
sentence
Probabilistic context-free grammar (PCFG)
* A probabilistic grammar assigns a probability to each string
* “context-free” means that any rule can be used in any context
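A tiny illustrative PCFG in NLTK; the grammar is a hypothetical toy, and the rule probabilities for each left-hand side must sum to 1.

```python
# The parser returns the most probable tree with the probability the
# grammar assigns to the string.
import nltk

grammar = nltk.PCFG.fromstring("""
    S -> NP VP [1.0]
    NP -> 'John' [0.5] | 'Mary' [0.5]
    VP -> V NP [1.0]
    V -> 'saw' [1.0]
""")
parser = nltk.ViterbiParser(grammar)
for tree in parser.parse("John saw Mary".split()):
    print(tree, tree.prob())  # probability 0.25 = 1.0 * 0.5 * 1.0 * 1.0 * 0.5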
Parsing
Parsing is the process of analyzing a string of words to uncover its phrase
structure, according to the rules of a grammar.
* Search for a valid parse tree whose leaves are the words of the string
* can start with the S symbol and search top down, or we can start with the
words and search bottom up.
* Inefficiency: If the algorithm guesses wrong, it will have to backtrack all
the way to the first word and reanalyze the whole sentence under the
other interpretation.
* Remedy (dynamic programming): every time we analyze a substring, store the
result so we won't have to reanalyze it later, as in the CYK-style sketch below.
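A pure-Python CYK-style sketch over a tiny hypothetical grammar in Chomsky normal form: every analyzed substring is stored in a table and never recomputed.

```python
from collections import defaultdict

binary = {("NP", "VP"): "S", ("V", "NP"): "VP"}     # A -> B C rules
lexical = {"John": "NP", "Mary": "NP", "saw": "V"}  # A -> word rules

def parses(words):
    n = len(words)
    table = defaultdict(set)        # (start, length) -> categories for that span
    for i, word in enumerate(words):
        table[(i, 1)].add(lexical[word])
    for length in range(2, n + 1):  # build longer spans from stored shorter ones
        for start in range(n - length + 1):
            for split in range(1, length):
                for b in table[(start, split)]:
                    for c in table[(start + split, length - split)]:
                        if (b, c) in binary:
                            table[(start, length)].add(binary[(b, c)])
    return "S" in table[(0, n)]     # is the whole string a sentence?

print(parses("John saw Mary".split()))  # True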
Parsing Tree
dependency grammar:
- assumes that syntactic structure is formed by binary relations between lexical items, without a need for syntactic constituents
- phrase structure tree is annotated with the head of each phrase
- recover dependency tree
- Convert dependency tree to phrase structure with arbitrary categories
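A short sketch of these binary head-dependent relations using spaCy (assumes the en_core_web_sm model is installed).

```python
import spacy

nlp = spacy.load("en_core_web_sm")
for token in nlp("Mary saw the dog"):
    # each word depends on a single head word; no constituents are needed
    print(token.text, "->", token.head.text, token.dep_)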
Learn Parser From Examples
Learning semantic grammars
Pragmatics:
- resolving the meaning of indexicals, which are phrases that refer directly to the current situation
- Example sentence: “I am in Boston today”; both “I” and “today” are indexicals. The word “I” would be represented by Speaker, a fluent that refers to different objects at different times
- interpreting the speaker’s intent
- The speaker’s utterance is considered a speech act, and it is up to the hearer to decipher what type of action it is (a question, a statement, a promise, a warning, a command, etc.)
Time and tense:
Ambiguity:
Disambiguation
Deep Learning
Feedforward network
- connections only in one direction (input to output)
- directed acyclic graph with designated input and output nodes
- No loops
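A minimal feedforward sketch in PyTorch: a directed acyclic graph of layers with connections flowing only from input to output (the layer sizes are arbitrary placeholders).

```python
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Linear(4, 8),  # input layer -> hidden layer
    nn.ReLU(),        # nonlinear activation
    nn.Linear(8, 1),  # hidden layer -> output node
)
y = model(torch.randn(1, 4))  # one forward pass, no loops
print(y.shape)                # torch.Size([1, 1])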
Recurrent network
- feeds its intermediate or final outputs back into its own inputs
- signal values within the network form a dynamical system that has internal
state or memory
Networks as complex functions
Different activation functions
Input encoding
Input Encoding For Image
back-propagation
back-propagation: the way that the error at the output is passed back
through the network – not going to get into this in a ton of detail today
In practice, when we train a neural network we see that a loss function
is minimized
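A hedged sketch of what "a loss function is minimized" looks like in PyTorch; the model, data, and learning rate here are purely illustrative.

```python
import torch
import torch.nn as nn

model = nn.Linear(2, 1)
loss_fn = nn.MSELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

x, target = torch.randn(8, 2), torch.randn(8, 1)
for step in range(100):
    loss = loss_fn(model(x), target)
    optimizer.zero_grad()
    loss.backward()   # the error at the output is passed back through the network
    optimizer.step()  # weights are nudged downhill on the loss
print(loss.item())    # smaller than at the start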
Recurrent neural networks (RNNs)
Recurrent neural networks (RNNs) allow cycles in the computation graph
each cycle has a delay
* units may take as input a value computed from their own output at an
earlier step
* RNN has an internal state/memory
* RNNs add expressive power compared to feedforward networks
Training a basic RNN
* An input layer x, a hidden layer z with recurrent connections, and an output layer y
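A sketch of this basic RNN in PyTorch: input x, recurrent hidden layer z, output layer y (all sizes are arbitrary placeholders).

```python
import torch
import torch.nn as nn

rnn = nn.RNN(input_size=4, hidden_size=8, batch_first=True)
readout = nn.Linear(8, 1)  # the output layer y
x = torch.randn(2, 5, 4)   # 2 sequences, 5 time steps, 4 features each
z, _ = rnn(x)              # hidden state carries memory across time steps
y = readout(z[:, -1])      # read the hidden state at the last step
print(y.shape)             # torch.Size([2, 1])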
Generalization
Choosing a network architecture
Some neural network architectures are explicitly designed to generalize well on
particular types of data
When comparing two networks with similar numbers of weights, the deeper network
usually gives better generalization performance.
Deep learning systems perform better than any other pure machine learning
approaches for high dimensional inputs (images, video, speech signals, etc)
Deep learning models lack the compositional and quantificational expressive power
of first-order logic.
They may also produce unintuitive errors, and tend to produce input–output mappings
that are discontinuous.
Neural architecture search
Neural architecture search
* neural architecture search to explore the state space of possible network
architectures.
* Some options to do this:
* Evolutionary algorithms: recombination (joining parts of two networks
together) and mutation (adding or removing a layer or changing a
parameter value)
* train one big network, search for subgraphs of the network that perform
better
Transfer learning and multitask learning
In transfer learning, experience with one learning task helps an agent learn
better on another task.
* Freeze the first few layers of the pretrained model, which serve as feature detectors
* Modify the parameters of the higher levels only
* The higher levels learn problem-specific features and do the classification
Common to start with a pretrained model such as the RoBERTa model
Followed by fine-tuning the model in two ways:
* giving it examples of the specialized vocabulary used in the desired domain
* training the model on the task it is to perform
Multitask learning is a form of transfer learning in which we simultaneously train
a model on multiple objectives
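A sketch of the freeze-and-fine-tune recipe in PyTorch; torchvision's resnet18 stands in for whatever pretrained model is used, and the 10-class head is an arbitrary assumption.

```python
import torch.nn as nn
from torchvision import models

model = models.resnet18(weights="IMAGENET1K_V1")  # pretrained feature detectors
for param in model.parameters():
    param.requires_grad = False                   # freeze the early layers
model.fc = nn.Linear(model.fc.in_features, 10)    # new problem-specific head
# Only the new head's parameters are updated during fine-tuning.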
Vision – CNNs (didn’t cover these today)
- The success of the AlexNet deep learning system in the 2012 ImageNet competition propelled deep learning into the limelight
- A supervised learning task with 1,200,000 images in 1,000 different categories
- The top-5 error rate has been reduced to less than 2%, below the error rate of a trained human (5%)
Natural language processing - RNNs
- machine translation and speech recognition
- end-to-end learning, the automatic generation of internal representations for the meanings of words, and the interchangeability of learned encoders and decoders
- end-to-end learning outperforms classical pipelines
- re-representing individual words as vectors in a high-dimensional space, so-called word embeddings
What is an NLM
Neural Language Model
What is a Corpus
What is a Token
Vocabulary
AI and the Cloud
- Need video memory
- Need time
- Difficult to run computers for extended periods (power supply, hardware)
- Training of big neural networks has moved to the cloud
Rant on GPUs
- Deep learning can be done on the CPU but it is slow
- Generally need a GPU (4060 Ti, 4080, 3090, 4090)
- Memory is the biggest factor for training models
- 16 GB+
FLAN-T5:
works if you ask it questions
directly
What is a hidden layer in the computation graph in deep learning
Intermediate computations before producing the output y.
Different representation for the input x.
Each layer transforms the representation produced by the preceding layer to produce
a new representation
In the process of forming all these internal transformations, deep networks often
discover meaningful intermediate representations of the data
The hidden layers of neural networks are typically less diverse than the output layers.
Neural network inputs
Properties of Neural Networks
Activation Function
Tensor operations in CNNs
Tensors (in deep learning terminology) are simply multidimensional arrays of any dimension.
Vectors and matrices are the one-dimensional and two-dimensional special cases of tensors.
Computational efficiency of tensor operations: given a description of a network as a
sequence of tensor operations, a deep learning software package can generate compiled
code that is highly optimized.
Tensor operations are run on GPUs (graphics processing units) or TPUs (tensor
processing units), which make available a high degree of parallelism.
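A quick illustration of tensors as multidimensional arrays, using PyTorch; the shapes are arbitrary examples.

```python
import torch

images = torch.randn(16, 3, 32, 32)  # 4-D tensor: (batch, channels, height, width)
matrix = torch.randn(32, 32)         # 2-D special case
vector = torch.randn(32)             # 1-D special case
print(images.dim(), matrix.dim(), vector.dim())  # 4 2 1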
Parameters (Weights)
tunable values learned during the training process
Epoch
full pass of your training data
Batch Size
N samples from your training data
Iteration
updating your model every batch
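A quick sanity check tying the three terms above together; all the numbers are hypothetical.

```python
num_samples = 10_000
batch_size = 32
iterations_per_epoch = num_samples // batch_size  # one update (iteration) per batch
print(iterations_per_epoch)  # 312 iterations make up one full pass (epoch)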
Hidden Layer
intermediate layer
Loss function
we want to minimize this to zero
Frameworks
Keras, PyTorch
Image classification with convolutional neural networks
With enough training data and enough training ingenuity,
CNNs produce very successful classification systems
Images can tolerate small alterations; local patterns can be quite informative.
Convolution followed by a ReLU activation function acts as a local pattern detector.
Composite patterns can be detected by applying another layer to the output of the
first layer.
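A sketch of stacked local pattern detectors in PyTorch; the channel counts and kernel size are arbitrary choices.

```python
import torch
import torch.nn as nn

detector = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),   # 16 learned 3x3 local filters
    nn.ReLU(),                                    # keep only positive responses
    nn.Conv2d(16, 32, kernel_size=3, padding=1),  # second layer: composite patterns
    nn.ReLU(),
)
features = detector(torch.randn(1, 3, 32, 32))
print(features.shape)  # torch.Size([1, 32, 32, 32])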
Convolution Filter
Data set augmentation
Training examples are copied and modified slightly.
Images can have small alterations without changing the identity of their contents.
* Randomly shift, rotate, or stretch an image by a small amount, or randomly shift the hue of the pixels by a small amount (see the sketch below)
* CNN-based classifiers are good at ignoring local patterns that aren’t discriminative
* Context: patterns that lie off the object might be discriminative
* e.g., a cat toy, a collar with a little bell, or a dish of cat food might actually help tell that we are looking at a cat
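A sketch of these augmentations with torchvision transforms; the exact magnitudes below are arbitrary "small amounts".

```python
from torchvision import transforms

augment = transforms.Compose([
    # randomly shift, rotate, or stretch by a small amount
    transforms.RandomAffine(degrees=10, translate=(0.1, 0.1), scale=(0.9, 1.1)),
    transforms.ColorJitter(hue=0.05),  # randomly shift the hue by a small amount
    transforms.ToTensor(),
])
# augmented = augment(pil_image)  # applied to each training image as it is loaded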
Detecting Objects
Object detectors find multiple objects in an image, report what class each object
is and also report where each object is by giving a bounding box around the
object
Building an object detector:
* Look at a small sliding window (a rectangle) onto the larger image
* At each spot, classify what we see in the window using a CNN classifier
Details:
* Decide on a window shape
* Build a classifier for windows
* Decide which windows to look at
* Choose which windows to report
* Report precise locations of objects using these windows
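A hypothetical sliding-window sketch in PyTorch covering these steps; the window shape, stride, and threshold are arbitrary, and `classifier` stands for any CNN that scores a fixed-size window.

```python
import torch

def detect(image, classifier, win=64, stride=32, threshold=0.9):
    _, height, width = image.shape  # image: (channels, H, W) tensor
    boxes = []
    for top in range(0, height - win + 1, stride):        # decide which windows to look at
        for left in range(0, width - win + 1, stride):
            crop = image[:, top:top + win, left:left + win].unsqueeze(0)
            score = classifier(crop)                      # classify what we see in the window
            if score.item() > threshold:                  # choose which windows to report
                boxes.append((top, left, win, win, score.item()))
    return boxes  # class score plus a bounding box for each reported object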
Robot Overview
- physical agents that perform tasks by manipulating the physical world
- equipped with effectors such as legs, wheels, joints, and grippers
- equipped with sensors, which enable them to perceive their environment
- Maximizing expected utility for a robot means choosing how to actuate its effectors to assert the right physical forces
- Robotic learning is constrained because the real world operates in real time
Applications of Robots
Home care: Robots have started to enter the home to care for older adults and
people with motor impairments
Health care: Robots assist and augment surgeons, enabling more precise,
minimally invasive, safer procedures with better patient outcomes
Services: Mobile robots help out in office buildings, hotels, and hospitals
Autonomous cars
Entertainment: Disney has been using robots (under the name animatronics) in
their parks since 1963.
Exploration and hazardous environments: Robots have gone where no human
has gone before, including the surface of Mars.
Industry: The majority of robots today are deployed in factories, automating
tasks that are difficult, dangerous, or dull for humans.
Deployment of AI: Considerations
Deploying AI: low stakes
Deploying AI: high stakes
Digital Content
No Free Lunch Theorem
AGI ethics
Transparency
Weak AI
- weak AI: the idea that machines could act as if they were intelligent
Strong AI
- strong AI: the assertion that machines that do so are actually consciously thinking (not just simulating thinking)
Good Old-Fashioned AI (GOFAI)
- simplest logical agent design
- qualification problem: difficult to capture every contingency of appropriate
behavior in a set of necessary and sufficient logical rules
The argument from disability
Measuring AI
Representing the state of the world
The mathematical objection
Deciding what we want
Resources
AI engineering
AI engineering
* The AI industry has not yet reached the level of maturity of other engineering disciplines.
* We do have a variety of powerful tools and frameworks, such as
TensorFlow, Keras, PyTorch, Caffe, scikit-learn, and SciPy.
* But many of the most promising approaches, such as GANs and deep
reinforcement learning, have proven to be difficult to work with—they
require experience and a degree of fiddling to get them to train properly in
a new domain
* start with a single huge system and, for each new task, extract from it the
parts that are relevant to the task
The future