Class 2: Introduction to artificial intelligence and its relationship to cognition Flashcards

Question

What are the future development directions of ChatGPT?

Answer 1

Future development directions for ChatGPT include better dialogue quality and efficiency, better emotion recognition, and multilingual support.

Answer 2

False. There is randomness in essay generation which makes it more "creative".

Answer 3

False (generally). It uses "tokens", which are linguistic units that could be whole words or segments like "pre" or "ing" or "ized".

Answer 4

When users rate ChatGPT's output, a new neural network model is created to predict user ratings. The new model then runs like a loss function on the original network, continually adjusting the network to user preferences.

Answer 5

The more capable a system is, the less trainable it becomes. Conversely, the more trainable a system is, the less capable it becomes.

Answer 6

Neural nets are a type of machine learning algorithm: simple idealizations of how human brains seem to work. Like a human brain, neural nets learn more through practice and repetition. The nodes, or neurons, that make up these neural nets perform mathematical operations on the input data to produce an output.

Answer 7

A temperature parameter determines how often a lower-ranked word (relative to the highest-ranked word calculated to be a probable match for responses) will be selected. For example, when ChatGPT responds to essay prompts, a 'temperature' of 0.8 yields the best results in essay generation.

Answer 8

It is a parameter that is used to adjust the randomness and creativity of ChatGPT's output. The ideal temperature is approximately 0.8; otherwise the text becomes repetitive. A temperature of 0.8 allows for a degree of randomness while still ensuring the output is relevant and coherent.

Answer 9

It operates in three basic stages. First, it takes the sequence of tokens that corresponds to the text so far, and finds an embedding (i.e. an array of numbers) that represents these. Then it operates on this embedding, in a "standard neural net way" with values "rippling through" successive layers in a network, to produce a new embedding (i.e. a new array of numbers). It then takes the last part of this array and generates from it an array of about 50,000 values that turn into probabilities for different possible next tokens. (And, yes, it so happens that there are about the same number of tokens used as there are common words in English, though only about 3000 of the tokens are whole words, and the rest are fragments.) However, according to ChatGPT's own answer, it is an AI language model created by OpenAI. It communicates with us through natural language processing (NLP) technology. It analyzes the text we enter and uses a combination of algorithms, statistical models, and machine-learning techniques to understand the meaning of our input and generate a response that best answers our question or fulfills our request.

Answer 10

FALSE. Generally, neural nets need to see a lot of examples and, at least for some tasks, the examples can be incredibly repetitive. It is standard strategy to show a neural net all the examples one has, over and over again. In each of these "training rounds" the neural net will be in at least a slightly different state, and somehow "reminding it" of a particular example is useful in getting it to "remember that example." However, it is normally also necessary to show the neural net variations of one example.

Answer 11

False. Always picking the highest ranked word can make a piece of text seem flat so lower ranked words are often used to make the text more interesting

Answer 12

False. Although there are steps and ways to reduce dependency on cognitive biases, it is not possible to completely eliminate them.

Answer 13

Neural net

Answer 14

A parameter that determines how often lower-ranked words will be used.

Answer 15

Different answers

Answer 16

Chat Generative Pre-Trained Transformer

Answer 17

to continue text in a "reasonable" way, based on what it's seen from the training it's had (which consists in looking at billions of pages of text from the web, etc.)

Answer 18

C: Human-written text from books, the web, and other sources

Answer 19

Take an input corresponding to a position (x,y) and to recognise it as whichever of the three points it is closer to
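
The target function described here can be sketched directly in Python. This is only an illustration of the task a neural net would be trained to approximate, not the net itself; the three coordinates in POINTS are hypothetical values chosen for the example.

```python
import math

# Hypothetical coordinates for the three reference points (not from the source).
POINTS = [(0.0, 1.0), (-1.0, -1.0), (1.0, -1.0)]


def nearest_point(x, y):
    """Return the index (0, 1, or 2) of the reference point closest to (x, y)."""
    distances = [math.hypot(x - px, y - py) for px, py in POINTS]
    return distances.index(min(distances))


# Classify one sample position.
print(nearest_point(0.1, 0.9))
```

A trained network would have to learn this mapping from examples of (x, y) positions and their correct labels, rather than being given the distance rule.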

Answer 20

"X" = Weights, the neural network relies on the weights to interpolate (generalise) between the given examples

Answer 21

As GPT is not perfect, it can make errors, generate biased or inappropriate responses, and lack a full understanding of language. It can also be limited by the resources it receives.

Answer 22

The fundamental goal of ChatGPT is to produce a "reasonable continuation" of a given text, based on what one might expect someone to write after seeing what people have written on billions of web pages, etc.

Answer 23

The discrepancy between the current values of the function and the desired function. This value is calculated in order to adjust the weight of the function to be able to reproduce the function that we want.

Answer 24

ChatGPT incorporates a temperature parameter to determine the frequency of utilising 'low-ranked words'.

Answer 25

Their ability to learn to do things

Answer 26

The basic operation of the (neural net) is also very simple, consisting essentially of passing input derived from the text it's generated so far "once through its elements" (without any loops, etc.) for every new word (or part of a word) that it generates.

Answer 27

True. The training method uses a loss function (how far away are the current weights from the desired end goal). This loss function will decrease progressively until the network reproduces the desired function (within an approximation margin).

Answer 28

Neural net.

Answer 29

ChatGPT is trying to produce a continuation of the text it has gotten so far with the available input it has access to.

Answer 30

Given the text so far, what should the next word be?

Answer 31

Syntactic grammar refers to the structure of language, whereas semantic grammar refers to the meaningfulness of language. For ChatGPT to properly grasp semantic grammar, it would need a "model of the world" to refer to, which could be achieved through coding.

Answer 32

Bias; discrimination; misinformation; manipulation; privacy; security.

Answer 33

The optimal temperature for essay generation refers to the temperature setting used in language models like GPT-3 to control the degree of randomness and creativity in the generated text. The temperature determines the degree to which the model is willing to take risks and produce unexpected outputs, versus sticking to more predictable and safe choices.

Answer 34

Reasonable continuation in this context alludes to what might be written after reviewing billions of readings. ChatGPT produces a list of ranked words that might follow the previous word, together with "probabilities", after reviewing the readings. However, a "temperature" parameter determines how often lower-ranked words will be used.

Answer 35

The loss function calculates the sum of the squared differences between a machine learning model's anticipated output and the actual output (the goal) for a given input. More importantly, it's an essential concept in machine learning because it's used in ChatGPT to direct training and gauge how well the model fits the training data.
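
A minimal sketch of the sum-of-squared-differences loss described in this answer. The function name and the sample numbers are illustrative assumptions, not taken from the source.

```python
def squared_error_loss(predicted, target):
    """Sum of squared differences between predicted and target values."""
    if len(predicted) != len(target):
        raise ValueError("predicted and target must have the same length")
    return sum((p - t) ** 2 for p, t in zip(predicted, target))


# The second prediction is off by 2, so the loss is 2 squared.
print(squared_error_loss([1.0, 2.0], [1.0, 4.0]))
```

A loss of zero would mean the outputs match the targets exactly; training tries to push this number down.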

Answer 36

The human brain

Answer 37

simple idealisations of how the brain works - specifically discussed is the process of how humans form a thought upon recognising something

Answer 38

Neural Network

Answer 39

Computational irreducibility is a concept that refers to the idea that some computational problems cannot be simplified or reduced in a meaningful way. This concept says that some problems do not have quick or predictable ways to solve them, essentially placing a limit on computational capabilities (such as those of neural networks). Rather, these problems must be studied through human intuition, experimentation, and observation.

Answer 40

Not only can they in principle do all sorts of tasks, but they can also be incrementally trained from examples to do those tasks.

Answer 41

The use of tokens instead of words makes it easier for ChatGPT to deal with rare, compound, or non-English words. Tokens can be words, and can also be parts of words, such as "ing", "pre", "anti" etc.
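
To illustrate how a word can be split into pieces such as "pre" and "ing", here is a toy sketch. It uses greedy longest-match against a tiny hypothetical vocabulary (VOCAB is made up for this example); this is a simplification and is not the tokenizer ChatGPT actually uses.

```python
# Hypothetical vocabulary of word pieces (an assumption for illustration).
VOCAB = {"pre", "train", "ing", "ized", "token", "un", "s"}


def tokenize(word):
    """Split a word into the longest vocabulary pieces, left to right.

    Characters that match no vocabulary entry become single-character tokens.
    """
    tokens = []
    i = 0
    while i < len(word):
        # Try the longest remaining substring first, then shorter ones.
        for j in range(len(word), i, -1):
            if word[i:j] in VOCAB:
                tokens.append(word[i:j])
                i = j
                break
        else:
            tokens.append(word[i])
            i += 1
    return tokens


print(tokenize("pretraining"))
print(tokenize("tokenized"))
```

Because rare or compound words decompose into known pieces, the model never needs a separate entry for every possible word.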

Answer 42

Embedding involves assigning numbers to text and words to represent their meaning, and grouping similar meanings to nearby numbers.

Answer 43

Approximately 175 billion

Answer 44

GPT stands for Generative Pre-Trained Transformer. ChatGPT is a language model that draws on the large dataset of text it was trained on to generate responses to prompts and questions an individual may ask.

Answer 45

After analysing the prompt that you give it, it then uses statistical probability of what it has learned to generate a response that is likely to be relevant and informative

Answer 46

ChatGPT is a large language model (LLM) that is trained on large amounts of human-created text. It then utilises this information to estimate probabilities and generate meaningful text when given a prompt.

Answer 47

bias from the datasets used for training, risk of generating inappropriate or offensive responses, lack of understanding of social norms or cultural context, relies on large amounts of data (not suitable for situations with little data or privacy restrictions)

Answer 48

The assignment of a number to a type of stimulus (in ChatGPT's case, common English words) that helps to group like stimuli with a similar "essence" together.

Answer 49

Temperature is a parameter used to control the level of randomness/unpredictability/creativity in the generated text. Higher temperatures result in more diverse and unpredictable output and lower temperatures result in more conservative and predictable outputs
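
The effect of temperature can be sketched as follows. One common formulation, assumed here, raises each probability to the power 1/T and renormalises (equivalent to dividing the model's raw scores by T before converting them to probabilities). The words and probabilities are hypothetical.

```python
import random


def apply_temperature(probs, temperature):
    """Rescale a probability list: T < 1 sharpens it, T > 1 flattens it."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [p ** (1.0 / temperature) for p in probs]
    total = sum(scaled)
    return [s / total for s in scaled]


def sample_word(words, probs, temperature, rng):
    """Pick one word at random using the temperature-adjusted probabilities."""
    adjusted = apply_temperature(probs, temperature)
    return rng.choices(words, weights=adjusted, k=1)[0]


# Hypothetical ranked candidates for the next word.
words = ["learn", "predict", "make"]
probs = [0.6, 0.3, 0.1]

for t in (0.5, 0.8, 2.0):
    print(t, [round(p, 3) for p in apply_temperature(probs, t)])
print(sample_word(words, probs, 0.8, random.Random(0)))
```

At low temperatures almost all of the probability goes to the top-ranked word; at high temperatures the lower-ranked words are chosen more often.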

Answer 50

False. Machine learning uses a plethora of examples to determine whether a new object fits within the constraints of the prior examples.

Answer 51

Loss functions are important for large language models like GPT as they provide a measure of how well the model is able to predict the next word or sequence of words in a given text, allowing the model to generate more accurate and contextually appropriate text by minimizing the loss function during training.

Answer 52

A neural network is a type of machine learning algorithm that is modelled on the human brain. It is made of layers of interconnected nodes, or neurons, which perform mathematical computations on input data and pass the results to the next layer until the final output is generated. Neural networks can model a wide variety of functions with high execution and training performance.
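
The computation performed by each node can be sketched as a weighted sum followed by an activation function. The ReLU activation and all the numbers below are assumptions chosen for illustration; the source does not specify them.

```python
def neuron(inputs, weights, bias):
    """One artificial neuron: weighted sum of inputs plus bias, then ReLU."""
    total = sum(x * w for x, w in zip(inputs, weights)) + bias
    return max(0.0, total)


def layer(inputs, weight_rows, biases):
    """A layer is several neurons that all read the same inputs."""
    return [neuron(inputs, w, b) for w, b in zip(weight_rows, biases)]


# Two inputs pass through a hidden layer of two neurons, then one output neuron.
hidden = layer([1.0, 2.0], [[0.5, 0.5], [0.5, -1.0]], [0.0, 0.25])
output = neuron(hidden, [1.0, 1.0], 0.0)
print(hidden, output)
```

Each layer's outputs become the next layer's inputs, which is the "passing results to the next layer" step in the answer.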

Answer 53

Neural networks are simplified models inspired by the workings of the human brain. Our brains are complex networks of nerve cells that are connected to assist in processing information. When we look at an image, photoreceptor cells at the back of our eyes convert the image into electrical signals that travel through layers of neurons to help us recognize the image. Neural networks use mathematical functions to simulate this process.

Answer 54

D. decreases, small

Answer 55

Backpropagation adjusts the weights and biases of the ANN to correct its random guesses and make them less wrong. The way an ANN learns is by making adaptive changes. The probability of making the right calculation improves with each backpropagation pass, and backpropagation is one of the most frequently used learning rules in many applications of artificial neural networks.

Answer 56

Having to make longer texts. Essay, stories etc.

Answer 57

When ChatGPT generates text, it selects its next word from a ranked list of words based on their probability of being next. It asks repeatedly what the next word should be and adds it. The temperature parameter (with a value of 0.8 considered optimal) determines how often lower-ranked words will be used. The randomness in ChatGPT's selection of lower-ranked words makes for more interesting writing.

Answer 58

A large language model (LLM) is a neural network trained on vast amounts of text to predict the likelihood of a given word or sequence of words occurring in a sentence. The LLM uses this training to generate more accurate and coherent language output, and complete tasks such as language translation, summarization, and question answering.

Answer 59

A neural net is a computational model that is inspired by the structure and function of the human brain. Neural nets consist of interconnected nodes or "neurons" that are arranged into a layered structure. They can then be trained to learn patterns and relationships in data using large data sets.

Answer 60

They provide feedback for ChatGPT's responses and help inform its future responses

Answer 61

It is because when ChatGPT generates a new token, it has to do a calculation involving every single one of the 175 billion weights.

Answer 62

Training the machine learning models used by these systems on diverse and representative datasets in order to generate high-quality, human-like responses.

Answer 63

A: By presenting a batch of examples and then adjusting the weights in the network to minimize the error

Answer 64

A large number of examples of input and output are to be given to the system. The system will then 'learn' from these examples. What we can do is find the weight that is suitable for the system to be able to reproduce these examples, by relying on its ability to generalise between the examples reasonably.

Answer 65

That it somehow captures a "human-like" way of doing things/thinking.

Answer 66

175 billion

Answer 67

Computational irreducibility is a phenomenon in which there are some computations that can't be reduced to simpler steps, and must be computed step-by-step in order to determine the outcome.

Answer 68

Idealisation of how brains seem to work.

Answer 69

ChatGPT is a natural language processing software that uses machine learning to generate responses to text inputs that mimic human-like communication.

Answer 70

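It uses a temperature of 0.8, which means that some of the time it will randomly select a word that isn't the highest-ranked word. (Temperature rescales the word probabilities; a value of 0.8 does not mean that exactly 20% of selections are lower-ranked words.)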

Answer 71

To create an essay that sounds more creative and less flat than it otherwise would.

Answer 72

Continue a piece of text that has been given

Answer 73

1. A moderate 2. Creative and more interesting

Answer 74

Neural net

Answer 75

1. The Architecture of a Neural Network needs to be considered for a particular task. 2. It is critical to obtain the necessary data to train the Neural Network 3. It is important to incorporate existing, trained Neural Networks or use them to generate training examples for a new Neural Network.

Answer 76

C) The model uses a probability-based ranking system to determine the most likely word to follow the given text based on billions of webpages and digitized books.

Answer 77

It's where the neural net must find patterns in the data on its own.

Answer 78

the GPT in ChatGPT stands for Generative Pre-trained Transformer, which is a type of language model based on deep learning

Answer 79

ability to generate longer and more complex responses, use of contextual information to generate more relevant responses, ability to generate responses that are more diverse and creative

Answer 80

False. (Pre-training is the process of training a model on a large dataset of unlabeled data before fine-tuning it on a specific task.)

Answer 81

175 Billion

Answer 82

Computational irreducibility is when you must go through each computational step to get the result, something that the brain presumably has chosen to avoid. Learnability 'compresses data by leveraging regularities,' whereas computational irreducibility implies that there are limits to the regularities that exist, contradicting learnability.

Answer 83

Computations that cannot be sped up by means of any shortcut are called computationally irreducible. The principle of computational irreducibility says that the only way to determine the answer to a computationally irreducible question is to perform, or simulate, the computation.

Answer 84

It is based on the probabilities. According to the author, it is explained that the probabilities are based on the frequency of the words and phrases in the training data and are adjusted by the neural network during training. It is also noted that the probabilities assigned by ChatGPT are influenced by the context of the input text and the system's previous responses.

Answer 85

Convenient linguistic units that might be whole words, or may be pieces, such as, "pre," or "ing," or "ized," etc.

Answer 86

1. ChatGPT first takes the sequence of tokens that corresponds to the existing text, finding an embedding. 2. ChatGPT then passes values through successive layers in a neural network, producing a new embedding. 3. It then takes this array of numbers, generating about 50,000 values. These values turn into probabilities for the possible next tokens, which represent the different possible words that ChatGPT will use to respond to a prompt.

Answer 87

To make it more interesting. If it chose the highest ranked word, the passage would be 'flat' with no 'creativity.'

Answer 88

GPT is pre-trained on a large corpus of text, and then fine-tuned on specific tasks like language translation or text completion

Answer 89

It uses weights in its algorithm to assign importance to certain features of language it has been exposed to, for example, the frequency of words, to create a response. Weights are ChatGPTs understanding of the patterns in the input data which help it generate believable text.

Answer 90

The loss function gives us the distance between the values we have got and the true values.

Answer 91

False. A higher temperature corresponds to a higher probability of choosing lower-ranked words.

Answer 92

ChatGPT produces a reasonable continuation of text by scanning billions of pages of human-written text and analyzing patterns in how words and phrases are used. It then generates a ranked list of words that might follow, along with probabilities based on how frequently they occur in similar contexts.

Answer 93

Transformers

Answer 94

Probabilities

Answer 95

An attention head allows ChatGPT to look back over previous words in its sequence, rather than just the last word, in order to make a better selection for the next word.

Answer 96

ChatGPT uses a large language model (LLM) to generate text. It produces a ranked list of words that might follow a given text, with associated probabilities. It adds words to the text by repeatedly asking which word to add next, based on the ranked list and a temperature parameter that introduces randomness. The highest-ranked word is not always chosen, as this can result in repetitive, uncreative text. The result is that ChatGPT can generate a variety of unique texts.

Answer 97

It runs through three fundamental phases. Initially, it looks for an embedding that corresponds to the sequence of tokens representing the text so far. Then it performs operations on this embedding to create a new embedding in a "typical neural net approach," with values "rippling across" subsequent layers of a network. The last element of this array is then used to create an array of around 50,000 values that represent probabilities for various possible following tokens.

Answer 98

False. It uses the given data to "learn" syntax implicitly.

Answer 99

False - When words with the highest probability are always used, the text comes out as flat and tends to iterate over itself.

Answer 100

1. It takes a sequence of tokens that correspond to the text so far and finds embedding. 2. Data goes through the neural network to create a new embedding. 3. This new embedding is then used to calculate the probabilities for the next word.

Answer 101

linguistic units ChatGPT operates with, such as whole words, or word pieces such as "pre" or "ing" or "ized".

Answer 102

The temperature parameter refers to how often lower-ranking words are used.

Answer 103

An embedding is a way to try to represent the "essence" of something by an array of numbers - with the property that "nearby things" are represented by nearby numbers. And so, for example, we can think of a word embedding as trying to lay out words in a kind of "meaning space" in which words that are somehow "nearby in meaning" appear nearby in the embedding. The actual embeddings that are used, say in ChatGPT, tend to involve large lists of numbers.
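
The "nearby in meaning, nearby in numbers" idea can be sketched with a toy example. The two-number embeddings below are invented for illustration; real embeddings are learned during training and use far longer lists of numbers.

```python
import math

# Hypothetical 2-number embeddings (made up for this example).
EMBEDDINGS = {
    "cat": (0.9, 0.8),
    "dog": (0.8, 0.9),
    "chair": (-0.7, 0.1),
}


def distance(word_a, word_b):
    """Euclidean distance between two words in the toy meaning space."""
    return math.dist(EMBEDDINGS[word_a], EMBEDDINGS[word_b])


def nearest(word):
    """Return the other word whose embedding lies closest to the given word."""
    others = (w for w in EMBEDDINGS if w != word)
    return min(others, key=lambda w: distance(word, w))


print(nearest("cat"))
```

Here "cat" and "dog" sit close together while "chair" is far away, which is the property the answer describes.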

Answer 104

FALSE! Although in a sense ChatGPT does "re-read" tokens it has previously generated, this information is never repeatedly reprocessed. Rather, each previous token is used only once by each computational element when generating a new token; to help understand the context and encoding of the passage. When a new token is being generated, the passage will be "fed back" and used to determine the next appropriate token; but never re-processed or changed.

Answer 105

Neural net - the core of ChatGPT, whose many layers of interconnected artificial neurons are able to produce recognisable human-like language.

Answer 106

Neural net training involves finding weights that make the network successfully reproduce input-output examples provided during the training process. To do this, a "loss function" is computed at each stage, measuring the difference between reported and true values. The weights are then adjusted to minimize the loss function, typically using a technique called steepest descent. The goal is to find a set of weights that minimizes the loss function and produces accurate outputs for new inputs.
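
The training loop described here can be sketched for the simplest possible case: a "network" with a single weight w that should learn y = w * x. The examples, learning rate, and step count are assumptions chosen so the sketch converges; real networks have many weights and use backpropagation to get the gradients.

```python
def loss(w, examples):
    """Sum of squared differences between the model's outputs and the true values."""
    return sum((w * x - y) ** 2 for x, y in examples)


def train(examples, learning_rate=0.01, steps=500):
    """Steepest descent: repeatedly move w a small step against the loss gradient."""
    w = 0.0
    for _ in range(steps):
        # Derivative of the loss with respect to w.
        gradient = sum(2 * (w * x - y) * x for x, y in examples)
        w -= learning_rate * gradient
    return w


# Hypothetical input-output examples generated by y = 2 * x.
examples = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
w = train(examples)
print(round(w, 4), loss(w, examples))
```

After training, the learned weight can be applied to an input that was not in the examples, which is the "accurate outputs for new inputs" goal in the answer.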

Answer 107

AI & deep learning

Answer 108

It is because they somehow capture a "human-like" way of doing things.

Answer 109

False. The same architecture of neural net can work for different types of tasks, as the neural net can typically capture general human-like processes.

Answer 110

The accuracy and validity of the information

Answer 111

False. They only generate text based on the relationships and patterns they have learned from the training data. For longer and more complex texts, they might "wander off" in non-human-like ways and hence might not always provide the most suitable/accurate responses.

Answer 112

neural nets

Answer 113

It produces a ranked list of words with probabilities.

Answer 114

Trajectory.

Answer 115

It calculates how far away the output is from the desired result (loss function) and adjusts the weights until the output is as close to the desired result as possible.

Answer 116

Embedding can be seen as a way to represent the "essence" of something by an array of numbers - with the property that "nearby things" are represented by nearby numbers.

Answer 117

A symbolic discourse language is a language that can describe the world and its concepts in a precise, unambiguous way. This language is essential for creating a semantic grammar that can understand the meaning of language beyond just its syntax. Computational language has the advantage of being precise and can be used to build a symbolic discourse language. Such a language could be used to generate text, ask questions about the world, and make assertions about the world. It would be a valuable tool for scientific research and artificial intelligence.

Answer 118

By taking a sample of English text and calculating the frequency of each letter
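
A minimal sketch of this letter-frequency calculation. The sample sentence is an arbitrary stand-in for a real sample of English text, and only the letters a to z are counted.

```python
from collections import Counter


def letter_frequencies(text):
    """Return each letter's share of all a-z letters in the text."""
    letters = [ch for ch in text.lower() if "a" <= ch <= "z"]
    total = len(letters)
    if total == 0:
        return {}
    counts = Counter(letters)
    return {letter: count / total for letter, count in counts.items()}


sample = "The quick brown fox jumps over the lazy dog"
frequencies = letter_frequencies(sample)
# Show the three most frequent letters in the sample.
print(sorted(frequencies.items(), key=lambda item: -item[1])[:3])
```

With a large enough sample, these shares can serve as estimated probabilities for generating text one letter at a time.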

Answer 119

Large Language Models

Answer 120

True. ChatGPT can use context from the prompt it receives to generate responses that are tailored to the specific situation.

Answer 121

1. As an AI, it can't truly understand the context it uses in chat. 2. Some uncommon words will confuse it because it has had less training on those words. 3. It is unreliable because it lacks human judgement and ethical consideration.

Answer 122

ChatGPT generates responses by using machine learning to analyze and learn from extensive amounts of data.

Answer 123

Computational irreducibility is when you must 'trace out' each computational step from the initial conditions to get the result. You are unable to get the result based off of the initial conditions alone.

Answer 124

a machine learning technique that reflects the behaviour of the human brain, composing layers of neurons that work together.

Answer 125

ChatGPT is significantly more advanced than earlier language models like ELIZA and ALICE, due to its use of large-scale transformer-based architectures and pre-training techniques, which allow it to generate more sophisticated and human-like responses to text-based prompts.

Answer 126

ChatGPT is a large language model that is trained to generate outputs based on contextual (word-based) prompts given by the user and the backlog of textual information it has been fed by the team at OpenAI. It then creates a chart of probabilistic answers that would fit the context and question it was asked, chooses a word, and does this word by word until a suitable answer is created.

Answer 127

True! Neural networks are a simple idealisation of how brains seem to work: neurons produce electrical pulses and pass electrical signals to thousands of other neurons, and whether an individual neuron sends a pulse depends on the pulses it receives.

Answer 128

The algorithm is trained only on the feature (input) variables, not the output variables. Rather than responding to feedback, the algorithm identifies commonalities in the data in order to classify it.

Answer 129

ChatGPT utilises a neural network that has undergone extensive training using large volumes of textual data. It is enhanced with attention mechanisms and transformer networks to produce responses to textual inputs that are both understandable and similar in style to those made by humans.

Answer 130

At a temperature of 0, the text is repetitive and can be confusing.

Answer 131

False. ChatGPT still experiences the same difficulties with those irreducible processes, it's just that human processes such as essay-writing have turned out to be simpler than first thought in computational terms.

Answer 132

False. To make the answers that it gives more "realistic" and "creative", ChatGPT has built in a certain amount of randomness when choosing words to make up its response, where it chooses lower ranked words some of the time

Answer 133

To create a reasonable continuation of the text it has received, through a ranked word list that includes the probability of each word.

Answer 134

Confirmation Bias

Answer 135

ChatGPT has the ability to learn from vast amounts of text data, ranging from books to web pages. This allows it to recognise patterns and generate text responses that directly answer the given prompts.

Answer 136

Neural nets can be perceived as a simple idealisation of how a brain works. They mimic the brain's network of neurons, where whether a neuron produces an electrical pulse at any given moment depends on the signals it receives from other neurons, with a 'weight' assigned to each connection.

Answer 137

transformer

Answer 138

GPT uses and learns language patterns through using a "multi-layer transformer neural network" which allows it to analyze and understand these complex language patterns to develop a response.

Answer 139

some kind of procedure for computing the answer rather than just measuring and remembering each case.

Answer 140

False, there are variations; often the highest probability word is chosen, but lower probability words are also used in a certain ratio.

Answer 141

Using its full capacity, a system makes use of computational irreducibility. By performing every task in the sequence to get to the result, as in computational irreducibility, it is less able to recognise patterns, skip steps, and in turn learn and be trained.

Answer 142

1. The current string of tokens is split in an embedding module, with different values being converted or generated, and ultimately added together to create an overall embedding vector. 2. These new embedding vectors are fed into the attention heads of the transformer, which help to repackage and reweight specific information from previous tokens. This creates a final embedding vector that is a collection of the sequence so far. 3. The final embedding of the collection is decoded and used to assign the probability of what token should logically follow in the sequence.

Answer 143

linguistic units, these can be pieces of a word or whole words

Answer 144

FALSE. Generally, neural nets need to see a lot of examples and, at least for some tasks, the examples can be incredibly repetitive. It is standard strategy to show a neural net all the examples one has, over and over again. In each of these "training rounds" the neural net will be in at least a slightly different state, and somehow "reminding it" of a particular example is useful in getting it to "remember that example." However, it is normally also necessary to show the neural net variations of one example.

Answer 145

ChatGPT is trained on vast amounts of text data, such as books and websites, to learn patterns in language usage and structure. This allows it to generate responses that are coherent and convincing.

Answer 146

ChatGPT is a technology tool that can automatically generate something that reads even superficially like human-written text

Answer 147

Neural Nets

Answer 148

False. ChatGPT tries to give the answer you would most reasonably expect, not necessarily the most accurate answer

Answer 149

An embedding is a way to represent something, for example words or sentences, through numbers. Words or sentences with similar meanings will therefore have similar numbers to represent them.

Answer 150

1. It takes a sequence of tokens/words that correspond to the current text and finds an embedding (i.e. an array of numbers) that represents these. 2. It then utilises the neural network to continue operating on this embedding working through the successive layers of the network implementing new values across the layers to give a new embedding. 3. It finally utilises the last part of the array (i.e. created in the new embedding) to generate about 50,000 values which are converted into probabilities for a range of possible next tokens
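
The final step, turning a list of values into probabilities, is commonly done with the softmax function, which is assumed here. The three scores below are hypothetical stand-ins for the roughly 50,000 values mentioned in the answer.

```python
import math


def softmax(values):
    """Convert arbitrary scores into probabilities that sum to 1."""
    largest = max(values)  # subtracting the maximum avoids overflow in exp
    exps = [math.exp(v - largest) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical scores for three candidate next tokens.
scores = [2.0, 1.0, 0.1]
print([round(p, 3) for p in softmax(scores)])
```

Higher scores always map to higher probabilities, so the ranking of candidate tokens is preserved.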

Answer 151

Anchoring Bias.

Answer 152

Language models like ChatGPT write an essay by repeatedly asking "given the text so far, what should the next word be?" The model uses a type of neural network called a transformer to predict the next word or token in a sequence based on the words or tokens that came before it. During training, the transformer is presented with a sequence of words or tokens and is trained to predict the next word or token in the sequence. When the language model is used to generate new text, it follows the same process by predicting the next word or token based on the patterns it has learned from the training data and adding it to the sequence until the desired length of text has been generated.
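
The "what should the next word be?" loop can be sketched with a toy stand-in for the model. Here a small hypothetical lookup table (NEXT_WORD_PROBS) plays the role of the transformer and looks only at the last word; the real model considers all of the text so far.

```python
import random

# Hypothetical next-word probabilities (invented for this example).
NEXT_WORD_PROBS = {
    "the": {"cat": 0.6, "dog": 0.4},
    "cat": {"sat": 0.7, "ran": 0.3},
    "dog": {"ran": 0.8, "sat": 0.2},
    "sat": {"down": 1.0},
    "ran": {"away": 1.0},
}


def generate(start, max_words, rng):
    """Repeatedly pick a next word and append it until max_words is reached."""
    words = [start]
    while len(words) < max_words:
        options = NEXT_WORD_PROBS.get(words[-1])
        if not options:
            break  # no known continuation for this word
        candidates = list(options)
        weights = [options[w] for w in candidates]
        words.append(rng.choices(candidates, weights=weights, k=1)[0])
    return " ".join(words)


print(generate("the", 4, random.Random(0)))
```

Each new word is chosen from the probabilities and then becomes part of the text used to choose the word after it.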

Answer 153

True: ChatGPT is able to learn from texts that it has been given by using covered versions of the texts as input, and the uncovered versions of text as output

Answer 154

It is because more complicated problems have a large number of weight variables, allowing minimisation to occur in a high-dimensional space with lots of different directions; meanwhile, the neural net may only reach a local minimum with simpler problems, as it has fewer variables to work with.

Answer 155

175 billion

Answer 156

1 Trillion Words

Answer 157

ChatGPT has a wide range of potential applications, including improving customer service through chatbots, generating natural language summaries of long documents, and even aiding in scientific research by helping to analyze and synthesize complex data sets.

Answer 158

As numbers

Answer 159

Transformers process sequences of input data; they apply the notion of 'attention' to weight specific sections of the sequence more heavily, consolidating the information needed to generate output.

Answer 160

Showing the network a variety of example images and objects and then having it learn to recognise and distinguish between them.

Answer 161

The sunk cost fallacy.

Answer 162

Machine learning is the process of training by giving many examples which allows the trained neural network to generalise from the given examples.

Answer 163

A 'large language model' (LLM) is a key component of ChatGPT that has been built to estimate the probabilities of sequences of words occurring.

Answer 164

The main goal of ChatGPT is to produce a "reasonable continuation" of a given text, where "reasonable" means what one might expect someone to write after seeing what people have written on billions of webpages.

Answer 165

No, although ChatGPT uses machine learning, there are definitely limitations on the accuracy of answers it provides.

Answer 166

Neural nets typically need to see a lot of examples to train well, and it's important to show variations of the same example for better training. Repetition of the same example is useful and variations in how the data is presented don't have to be sophisticated to be helpful.

Answer 167

construction of language from words

Answer 168

Efficiency

Answer 169

A neural network is a method in artificial intelligence that teaches computers to process data in a way that is inspired by the human brain. It is a type of machine learning process, called deep learning, that uses interconnected nodes or neurons in a layered structure that resembles the human brain.

Answer 170

The main challenges center around acquiring or preparing the necessary training data.

Answer 171

The breakthrough in "deep learning" was that when using neural nets it can be easier to solve complex problems than simple ones. The exact reason is unknown; however, it appears that more complex problems have more "weight variables" and therefore a higher-dimensional space with more directions that can lead to the global minimum. Simpler problems with fewer variables have fewer directions to the minimum and can get caught in a local minimum.

Answer 172

No, ChatGPT takes into account the probability for the next several pairs of words which creates a sequence of which words should come next.

Answer 173

A component of a neural network that allows a model to selectively focus on certain parts of the input text to help it generate the most suitable output text.
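One common way to implement this selective focus is scaled dot-product attention. The sketch below is a minimal single-query version; the function names and the tiny 2-number vectors in the usage note are made up for illustration.

```python
import math


def softmax(scores):
    """Turn arbitrary scores into weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys, values):
    """Weight each value by how well its key matches the query, then blend."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    width = len(values[0])
    blended = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(width)]
    return weights, blended
```

With `query = [1, 0]`, `keys = [[1, 0], [0, 1]]` and `values = [[10, 0], [0, 10]]`, the first position matches the query better, so it receives the larger weight and dominates the blended output.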

Answer 174

About a hundred billion neurons make up the human brain, and each one can emit an electrical pulse up to about a thousand times per second. Each neuron can transmit electrical signals to perhaps hundreds of other neurons. The neurons are interconnected in a complex network. Whether a particular neuron produces a pulse at a given moment depends on the pulses it receives from other neurons, with different connections contributing with different "weights".

Answer 175

Backpropagation is a technique whereby the strength of the connections between the neurons within a neural net is updated based on the feedback received from the training data, thus adjusting the network's performance. Backpropagation calculates the error between the expected output and the network's output and propagates that error back through the network, adjusting the weight of the connections between neurons as it goes in order to minimise this error. This is important as it allows the neural net to learn from its mistakes and optimise its performance on tasks over time. This in turn allows it to generate accurate and contextually relevant responses.
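A minimal sketch of the idea on the smallest possible network: one input, one hidden neuron (weight `w1`, tanh activation) and one output (weight `w2`). The starting weights, learning rate and step count are arbitrary assumptions; real networks apply the same chain-rule bookkeeping across billions of weights.

```python
import math


def loss(samples, w1, w2):
    """Mean squared error of the tiny network over all samples."""
    return sum((w2 * math.tanh(w1 * x) - t) ** 2 for x, t in samples) / len(samples)


def train(samples, steps=500, rate=0.1):
    w1, w2 = 0.5, 0.5  # arbitrary starting weights
    for _ in range(steps):
        for x, target in samples:
            # Forward pass: compute the network's output.
            hidden = math.tanh(w1 * x)
            output = w2 * hidden
            error = output - target
            # Backward pass: push the error back through each connection
            # using the chain rule (loss per sample is 0.5 * error ** 2).
            grad_w2 = error * hidden
            grad_hidden = error * w2
            grad_w1 = grad_hidden * (1 - hidden ** 2) * x
            # Nudge each weight in the direction that reduces the error.
            w1 -= rate * grad_w1
            w2 -= rate * grad_w2
    return w1, w2
```

Comparing `loss(samples, 0.5, 0.5)` with `loss(samples, *train(samples))` on a small sample set shows the error shrinking as the weights are adjusted.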

Answer 176

Because ChatGPT's neural net consists of millions of neurons, which means billions of connections and weights, and every time a new token is generated, ChatGPT has to do a calculation involving every single one of these weights.

Answer 177

175 billion connections, hence 175 billion weights.

Answer 178

Large language models like ChatGPT are trained using a technique called unsupervised learning on massive amounts of text data. During training, the model tries to predict the next word or sequence of words in a piece of text, given the words that come before it, which helps it learn patterns in language and generate coherent text.

Answer 179

Neural nets

Answer 180

Key limitations of ChatGPT include a need for large amounts of computing power and a potential for biased or misleading outputs.

Answer 181

Simple ideas about how brains seem to work. In AI, they are computing systems or code inspired by "natural" or biological neural networks (animal/human brains).

Answer 182

Temperature (0.8) allows ChatGPT to produce a more human-like piece of writing rather than choosing the highest-probability next word each time. Temperature (0.8) opens a possibility for creativity.

Answer 183

The concept of meaning space is linked to the process of word embedding. Word embedding can be conceptualised as "trying to lay out words in a kind of 'meaning space'". This means that words that are close in meaning are physically close in the embedding.
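A toy illustration of "close in meaning means close in the embedding", using cosine similarity as the measure of closeness. The 2-number coordinates in `TOY_EMBEDDING` are hand-picked for this example, not taken from any real model, whose embeddings have hundreds or thousands of numbers per word.

```python
import math

# Hand-picked 2-D coordinates, for illustration only.
TOY_EMBEDDING = {
    "cat": (0.9, 0.8),
    "dog": (0.8, 0.9),
    "car": (-0.7, 0.2),
}


def cosine_similarity(a, b):
    """1.0 means the vectors point the same way; lower means less alike."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))


def nearest(word):
    """Return the other word whose vector is most similar to `word`."""
    others = (w for w in TOY_EMBEDDING if w != word)
    return max(
        others,
        key=lambda w: cosine_similarity(TOY_EMBEDDING[word], TOY_EMBEDDING[w]),
    )
```

Here `nearest("cat")` returns `"dog"`, because the two animal vectors point in nearly the same direction while "car" points elsewhere.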

Answer 184

False; a "feedforward" mechanism means that each computational element or "neuron" in the neural network is used only once.

Answer 185

True. Cognitive Biases will impact the way you think, but with careful considerations and clear steps, you will be able to make informed decisions.

Answer 186

Lower temperatures tend to produce more conservative and predictable outputs.
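A minimal sketch of how a temperature can reshape next-word probabilities, assuming the common formulation in which each probability is raised to the power 1/T and the results are renormalised (equivalent to dividing the model's raw scores by T before the softmax). The function names are hypothetical, and the temperature must be positive.

```python
import random


def apply_temperature(probabilities, temperature):
    """Sharpen (T < 1) or flatten (T > 1) a probability distribution."""
    scaled = [p ** (1.0 / temperature) for p in probabilities]
    total = sum(scaled)
    return [s / total for s in scaled]


def sample_index(probabilities, temperature, rng):
    """Pick the position of the next word at random, using the reshaped odds."""
    weights = apply_temperature(probabilities, temperature)
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]
```

For ranked word probabilities `[0.6, 0.3, 0.1]`, a temperature of 0.5 pushes the top word's share up to roughly 0.78, while a temperature of 2.0 pulls it down to roughly 0.47, so lower-ranked words are chosen more often.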

Answer 187

The reason why ChatGPT can communicate well is because of its large pre-training data and its ability to understand the context well.

Answer 188

Some computations have shortcuts that allow them to be completed more quickly. Computations that cannot be done more quickly by using shortcuts are computationally irreducible. The only way to determine the answer to a computationally irreducible question is to perform each step of the computation.
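A frequently cited illustration is the "rule 30" cellular automaton, which is widely believed (though not proven) to have no shortcut: to learn what row n looks like, you run all n steps. The sketch below is a minimal version that wraps around at the edges, which is a simplifying assumption.

```python
def rule30_step(cells):
    """Apply rule 30 once: new cell = left XOR (centre OR right)."""
    n = len(cells)
    return [cells[(i - 1) % n] ^ (cells[i] | cells[(i + 1) % n]) for i in range(n)]


def run(cells, steps):
    """There is no known formula for the result; each step must be performed."""
    for _ in range(steps):
        cells = rule30_step(cells)
    return cells


row = [0, 0, 0, 1, 0, 0, 0]
for _ in range(3):
    print("".join("#" if c else "." for c in row))
    row = rule30_step(row)
```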

Answer 189

ChatGPT is a large language model developed by OpenAI, trained using deep learning algorithms to generate human-like responses to natural language inputs.

Answer 190

The answer is C. ChatGPT doesn't always just pick the highest-ranked word because this will result in a flat-sounding essay that may be repetitive, so at random it sometimes picks lower-ranked words.

Answer 191

A neural net is a computational model that is inspired by the structure and function of the human brain. It consists of interconnected neurons, and the output of each neuron is determined by the input it receives from other neurons, weighted by specific values. This is similar to how neurons in the human brain communicate with each other through electrical signals.

Answer 192

ChatGPT sometimes picks lower-ranked words instead of always picking the highest-ranked word because doing so can lead to a "more interesting" essay with greater creativity.

Answer 193

There is a fundamental logic to human language, expressed through "laws of logic" and "laws of thought". ChatGPT can follow these laws, patterns, and regularities of human writing and speech and replicate them.

Answer 194

By showing them multiple examples. When "teaching" a neural network to distinguish between things, we don't have to write programs that spell out features (e.g. nose, eyes, mouth). Instead, multiple pictures can be shown, and the network then generalises from these examples.

Answer 195

the way in which information is presented can influence our judgments and decisions

Answer 196

A neural net is a set of computational "neurons" that are capable of recognising and analysing patterns in large amounts of data. These "neurons" are layered and are taught how to analyse and recognise patterns that we deem as important by feeding it information and having it churn out responses. These responses are then fine-tuned by introducing weights at each level that make the output more and more acceptable to us.

Answer 197

Data used to train ChatGPT and other neural networks is not directly stored anywhere once used. The node weights are adjusted every time data is fed through, influencing how the network makes decisions. It is 'some kind of distributed encoding of the aggregate structure of all that text'.

Answer 198

ChatGPT works by using a combination of deep learning techniques, including unsupervised learning, to analyze and understand patterns in vast amounts of text data. It then generates responses based on the input it receives, using its understanding of natural language to create human-like responses.

Answer 199

Neural net training works by repeatedly showing the neural net lots of examples and variations of those examples, gradually updating the weights so that the net best captures the training examples it has been given.

Answer 200

No, at this stage it is simply unknown whether ChatGPT has "discovered" underlying systems such as geodesics in its decision-making processes relating to human language.

Answer 201

Generative Pre-trained Transformer

Answer 202

GPT generates text by predicting the probability of the next word in a sequence based on the context of the preceding words

Answer 203

A neural net is a way of organising information in a way that is similar to the processes of the human brain.

Answer 204

C. It doesn't always just pick the highest-ranked word because this will result in a flat-sounding essay that may be repetitive, so at random it sometimes picks lower-ranked words.

Answer 205

A few hundred billion

Answer 206

Computational irreducibility refers to the idea that there are some computations that cannot be simplified or shortened in any meaningful way, and must be carried out step-by-step to determine their outcome.

Answer 207

b) temperature parameter

Answer 208

A parse tree is a graphical representation of the syntactic structure of a sentence or a piece of code in a programming language. It is also known as a derivation tree or a syntax tree.
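A minimal sketch of a parse tree as nested tuples, each holding a label followed by its children. The sentence, the grammar labels (`S`, `NP`, `VP` and so on) and the helper name `leaves` are chosen only for illustration.

```python
# Each node is (label, child, child, ...); a leaf node is (label, word).
TREE = (
    "S",
    ("NP", ("Det", "the"), ("N", "cat")),
    ("VP", ("V", "sat")),
)


def leaves(tree):
    """Read the words back off the tree, left to right."""
    label, *children = tree
    if len(children) == 1 and isinstance(children[0], str):
        return [children[0]]
    words = []
    for child in children:
        words.extend(leaves(child))
    return words
```

Reading the leaves in order recovers the original sentence, while the nesting records which words group together into phrases.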

Answer 209

ChatGPT sometimes uses lower-ranked words when generating essays to make them more interesting. A 'temperature' parameter determines how often lower-ranked words are used; for essay generation, a value of 0.8 is found to work best.

Answer 210

Embeddings are the translation of concepts into numbers so that computers can more easily establish relationships between different concepts.

Answer 211

September 2021

Answer 212

ChatGPT picks its next words based on a probability distribution generated by analyzing vast amounts of text data. It uses a neural network architecture that is trained on a large corpus of text data to predict the likelihood of each possible word following a given sequence of words.

Answer 213

"Trained from examples" through neural nets.

Answer 214

B. Transformer Neural Network

Answer 215

Syntax and logic.

Answer 216

Things that match in meaning.

Answer 217

Neural networks are designed to mimic the processes of the human brain. For ChatGPT, this means responding to prompts by breaking data down into multiple small steps (simulating neurons), each passing its output to others to generate more accurate, specific responses. The neural connections in the network also vary in their significance, with each output contributing with a different weight to the neurons that follow. These connections create a network where information is processed through multiple layers of "neurons," which allows responses to be fine-tuned to specific prompts and data.

Answer 218

Computational irreducibility

Wolfram, S. L. (2023 February 14). What is ChatGPT doing... And why does it work? Stephen Wolfram Writings. https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-doing-and-why-does-it-work (300 cards)