AI Flashcards
What is XAI (Explainable AI)
XAI refers to models that aim to perform well while remaining interpretable. An XAI system is designed so that a human user can understand how it reached a particular decision or outcome.
Why is XAI important?
XAI is important because it provides trust and transparency. This matters most in high-stakes areas such as medicine. Furthermore, it allows for adjustments and improvements: if we know how a model works, we can identify problems and fix them.
What is interpretability in AI?
Interpretability refers to the level of understanding of HOW an outcome is reached.
What is explainability in AI?
The level of explanation there is FOR the outcome of a model. (Also acts as an accuracy proxy of the model)
What is fidelity in AI?
Fidelity refers to how WELL the explanation matches the actual behaviour of the model.
What is comprehensibility?
The EASE with which users can understand a provided explanation.
What are the four types of evaluations humans perform to assess the usability of XAI?
Binary forced choice, forward simulation/prediction, counterfactual simulation, surveys
What are the two approaches for XAI and what is their focus?
Intrinsic: more interpretable
Post-hoc: better explanations
What are examples of intrinsic models? and why is this?
Linear models, decision trees, k-nearest neighbours, rule-based systems, personalised interpretability systems. These are more interpretable to human users as long as they are relatively small and not too complex: we can understand how they reached their outcomes by looking at them.
What can we use to reduce the number of features in models? Why do we want to minimise weights?
Lasso regression. It implicitly pushes as many weights to zero as possible, which increases understanding because fewer features contribute to the prediction.
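A minimal sketch of this idea, assuming scikit-learn and NumPy are available; the data is synthetic and purely illustrative (only two of the ten features actually matter):

```python
# Hedged sketch: Lasso drives many coefficients to exactly zero,
# leaving a smaller, more interpretable set of features.
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 10))                                     # 10 candidate features
y = 3 * X[:, 0] - 2 * X[:, 1] + rng.normal(scale=0.1, size=100)    # only 2 features matter

model = Lasso(alpha=0.1).fit(X, y)
print(model.coef_)  # most weights are ~0; the surviving ones are the features worth explaining
```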
What does k-Nearest neighbours do?
uses existing instances to explain new instances
What are some approaches to maximise intrinsic interpretability in neural networks?
add interpretability constraints, visualise network attentions, employ attention matrix, forcing weights and parameters to zero, model-specific complexity measures, consider meaning of intermediate outputs
What is the con of more localised sampling and the con of generalised sampling?
More localised sampling might not capture enough information about a complex model.
More generalised sampling might produce behaviour that is too complex for a simple (e.g. linear) proxy model to capture.
What are the two types of AI systems social media incorporates?
Recommender systems and harmful content classifiers
What are recommender systems?
Recommender systems are what push content at you. They curate your feed using a positive feedback loop.
What are harmful content classifiers?
Harmful content classifiers withhold harmful content from users by removing or downranking risky content. This content moderation works using supervised learning and a harmful content rating.
What types of AI are “coming” for human jobs?
Generative AI (LLMs) is the leading impact. Gig platforms such as Uber and DoorDash are too, because they replace human organisation/management. AI can effectively sort and compare information, which also makes it a helpful tool in recruitment: it can sort through CVs and judge applicant responses in AI interviewing systems.
What is a concern for the AI recruitment process?
Bias. There is an ethical concern that AI may perpetuate bias in its recruitment process due to misrepresentation in its data, causing it to overlook some individuals.
What are the three possible scenarios for the future of AI and jobs?
“Enabling AI”
- benefiting humans, causing us to be more productive and work more effectively. no loss to jobs, just less workload
“Replacing onshore”
Workers are displaced into lower-value work, but some value can be recovered because the AI systems are NZ-owned and can be taxed.
“Replacing offshore”
Workers are again displaced into lower-value work, but the AI systems are owned offshore, so the value cannot be recovered locally.
What is a solution for reducing further inequality with the rise of AI in jobs?
tech taxes.
What laws have been set up across the world to ensure safety around AI?
EU's AI Act (EU)
Bletchley Park AI Safety Summit (UK)
Executive Order on AI (US)
Interim Measures on Gen AI (China)
What is Maori Algorithmic Sovereignty (MASov)?
MASov acknowledges that any data that is about Maori, by Maori, or about the Maori environment should be subject to Maori governance. It recognises that if data under- or misrepresents Maori, models will perpetuate bias and stereotypes that can further worsen outcomes for Maori. Maori should have control over how their data is collected, stored and used.
Explain the principle of Whakapapa in AI?
Transparency: there should be transparency of the algorithm - who is involved, the motives, deployment, explainability etc. Maori participants should be given transparency on their data.
Data relationship: how is Maori data used throughout the algorithm. This aligns with the MASov.
Sustainability: should ensure that the algorithms output benefits Maori for the long term and that there is a sustainable positive outcome.
What is ‘one hot’ encoding?
An encoding for words in which there is one unit (vector dimension) per word in the vocabulary: the unit for the encoded word is 1 and all others are 0.
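A minimal sketch in plain Python with a toy three-word vocabulary (the vocabulary and word choices are illustrative only):

```python
# Hedged sketch: a one-hot vector has one unit per vocabulary word,
# with a 1 in the position of the encoded word and 0 everywhere else.
vocab = ["cat", "dog", "bird"]

def one_hot(word, vocab):
    vec = [0] * len(vocab)        # one unit per word in the vocabulary
    vec[vocab.index(word)] = 1    # switch on the unit for this word
    return vec

print(one_hot("dog", vocab))      # [0, 1, 0]
```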
What does implementing an attention mechanism in a neural network do?
It allows the model better access to previous context than a recurrent network. This leads to learning which words in the context are the most important for predicting the next word.
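A minimal NumPy sketch of scaled dot-product attention, the standard core of an attention mechanism; the matrices are random toy values and the dimensions are illustrative, not taken from the lectures:

```python
# Hedged sketch: each position's output is a weighted mix of all context positions,
# with weights reflecting which context words matter most for the current prediction.
import numpy as np

def attention(Q, K, V):
    scores = Q @ K.T / np.sqrt(K.shape[-1])                                # query-key similarity
    weights = np.exp(scores) / np.exp(scores).sum(axis=-1, keepdims=True)  # softmax over context
    return weights @ V                                                     # weighted sum of values

# 4 context words, each represented by an 8-dimensional vector (toy values)
Q = K = V = np.random.rand(4, 8)
print(attention(Q, K, V).shape)   # (4, 8)
```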
What did OpenAI use to remove harmful content from GPT-4's training data before it was trained?
Text classifiers
What should you do if you are building a decision tree and at a particular node, all the data belongs to a single class?
You should mark this node as a leaf node and assign it the class label.
Is clustering an example of a reinforcement learning task?
No, clustering is an example of unsupervised learning as it learns to group similar data points together based on their features without any prior labelling or guidance.
In regression tasks, is the output continuous?
Yes, the output is a continuous numerical value, e.g. house prices, which can take any value on a continuous scale (measurable down to cents).
Describe artificial intelligence
Artificial Intelligence is a broad field of computer science that focuses on creating intelligent agents that can reason, learn, and act autonomously. AI aims to simulate human intelligence and problem-solving capabilities in machines. There is narrow AI, general AI and superintelligence. Some AI systems are built on the transformer architecture; these are often trained broadly on large amounts of data and then fine-tuned to specialise in a particular task (GPT-4 is a transformer-based model). Self-driving cars such as Tesla's are applications of AI too.
Describe machine learning
Machine learning is a subset of artificial intelligence that focuses on developing algorithms and models that allow computers to learn from data and improve their performance on a specific task without being explicitly programmed. In essence, machine learning enables computers to "learn" from experience. An example is the recommender systems seen in social media, which push content matching people's preferences at them.
Describe deep learning
Deep learning is a subset of machine learning that uses neural networks with multiple layers to learn complex patterns from data. AI Image generation is an example of a deep learning model.
Describe the relationship between artificial intelligence, machine learning and deep learning
Artificial intelligence is a field of computer science -> machine learning is a subset of AI -> deep learning is a subset of machine learning.
AI is the general field of creating intelligent agents.
ML is a specific approach within AI that focuses on learning from data.
DL is a specialized technique within ML that uses deep neural networks.
What is overfitting in the context of supervised learning and why does it occur?
Overfitting happens when a model fits too closely and memorises the patterns of the training data, causing it to be unable to generalise to unseen data.
Explain how to reduce the chance that a decision tree overfits
To stop a decision tree from overfitting, keep the model from becoming too complex and from fitting 'noise' in the training data. Check generalisation on a held-out validation set rather than tuning until it perfectly reproduces the training data (a pruning sketch follows after this list).
Prune: set a maximum depth or minimum number of samples per leaf to limit the tree’s growth.
Remove irrelevant features: Select only the most relevant features to reduce the complexity of the model.
Feature engineering: Create new features that are more informative and less prone to overfitting.
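A minimal sketch of the pruning idea, assuming scikit-learn; the iris dataset and the specific depth/leaf values are illustrative choices, not from the course:

```python
# Hedged sketch: limiting depth and leaf size is one way to keep a decision tree
# from growing complex enough to memorise the training data.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

tree = DecisionTreeClassifier(max_depth=3, min_samples_leaf=5).fit(X_train, y_train)
print("validation accuracy:", tree.score(X_val, y_val))   # checked on held-out data
```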
Where does the data come from for ChatGPT and what form does its training data take?
ChatGPT is trained on a wide variety of data sources, such as the internet, books, websites, articles. The data is typically in text sequences or tokens, and the model learns to predict the next word based on the preceding words.
Does ChatGPT use supervised or unsupervised methods to learn and how does this learning happen?
ChatGPT is trained with a supervised-style objective: predicting the next token, where the "labels" come from the text itself (often called self-supervised learning). The main steps are:
Data Preparation: A massive dataset of text is collected and preprocessed to remove noise, inconsistencies, and other issues.
Tokenisation: The text is broken down into smaller units, such as words or phrases.
Model Architecture: A neural network architecture, such as a transformer, is used.
Training: During training, the model is fed a sequence of tokens and is asked to predict the next token. The model's predictions are compared to the actual next token, and the model's parameters are adjusted to minimise the error (see the sketch after this list).
Fine-tuning: Once the model has been trained on a large dataset, it can be further fine-tuned on specific tasks or domains.
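A minimal sketch of how tokenisation and next-token prediction fit together; the sentence is a toy example and the word-level tokens are a simplification (real systems use subword tokens):

```python
# Hedged sketch: a token sequence becomes (context, next-token) training pairs
# for next-word prediction.
tokens = "the cat sat on the mat".split()   # toy word-level tokenisation

pairs = [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]
for context, target in pairs:
    print(context, "->", target)            # the model is trained to predict `target` from `context`
```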
Why and how does ChatGPT produce different output texts each time to the same input prompt several times?
ChatGPT predicts a probability distribution over possible next tokens and samples from it (stochastic selection), so the same prompt can lead to different token choices each time. Because it has learned from many varied examples, several continuations can fit the same context, and different runs pick different ones.
What is involved in fine tuning a deep network like GPT-4? How is fine-tuning different from training from scratch?
Fine-tuning takes a transformer model like GPT-4 that has already been trained on large amounts of general data and continues training it on a smaller, task- or domain-specific dataset, adjusting the existing weights so it focuses on one thing. Training from scratch starts from random weights and requires far more data and compute.
What kind of training data is used to fine-tune GPT-4 to encourage ‘helpful’ responses?
Human-written example responses and human rankings of candidate outputs, used for reinforcement learning from human feedback (RLHF).
Describe three reasons why XAI is important.
Trust, transparency, adjustability (through interpretability and explainability).
What components are in a simple convolution neural network?
Input, convolution, pooling, output (fully connected)
Describe how a caption can be used to generate an image using a text-to-image AI generator. You can use an analogy.
A caption is the input; the more detail given, the more closely the generated image will fit your idea. A caption to the generator is like a brush to a canvas.
Describe one usage of AI text-to-image generation that could be helpful to society and one that could be harmful.
interior design ideas and sexual exploitation via deepfakes
Describe ways to determine whether an image is a deepfake
unnaturally smooth texture, facial deformities, visible tags or watermarks, unrealistic depth perception, blurred or inconsistent backgrounds, and detail errors such as too many fingers or garbled text on clothing labels
When creating an AI agent for playing the Super Mario game using reinforcement learning, briefly explain how to define the state, action and reward.
State = Current screen
Action = movements
Reward = the outcome signal (e.g. win/loss, progress through the level)
State one classification task example and one regression task example
Classification: email spam detection
Regression: predicting house prices
Predictive AI models, trained using supervised learning, have many applications in healthcare. Describe two such applications, indicating in each case how the predictive model would be trained.
Sensor/monitoring systems: send alerts (e.g. if vitals drop, first responders are notified), provide summary data, and send reminders. The predictive model would be trained on labelled historical sensor readings (e.g. vitals paired with known outcomes).
Hospital-management systems: predict patient in/out flow, bed use and patient recovery time, and help organise administration. The model would be trained on labelled historical admission, discharge and resource-use records.
Describe three ways in which AI technology is currently impacting society, indicating for each whether it is positive or negative.
AI is positively benefiting workers in the workplace as a complementary tool for efficiency. Likewise, AI is a complementary tool for students, positively benefiting their research and idea generation. Negatively, however, AI raises ethical concerns with its ability to generate convincing fake videos of people saying or doing things they didn't (deepfakes).
describe a regulatory mechanism that is currently being used by governments, or that could be used in the future, to provide some measure of control.
There are regulations around which AI systems reach the market, such as the EU's AI Act, which prevents harmful AI models from being disseminated.
Further, some models use harmful content classifiers that block users from requesting inappropriate misuse of output.
Choose one of the Maori data sovereignty principles that we introduced in lectures.
Describe how this principle applies to AI.
Discuss how you could ensure you follow this principle when building an AI system.
Use examples.
Whakapapa: Relationships
The principle of whakapapa recognises the right of Maori to control the collection, storage and distribution of data about Maori, ensuring it positively benefits them sustainably and for the long term. If data collected about Maori under- or misrepresents Maori individuals/groups, this can perpetuate bias in the model, worsening Maori outcomes from AI.
- by ensuring Maori are in control and have full transparency throughout the collection and use of their data, and by making sure the data is representative of Maori, so that in the long run it benefits them rather than embedding bias/stereotypes.
What is supervised learning?
The algorithm learns from LABELLED training data and then makes predictions based on the learned relationships.
What is unsupervised learning?
The data is UNLABELLED; therefore, the algorithm must find patterns by itself. E.g. recommenders in social media, which push content at us similar to what we have 'interacted' with.
How does ML perform tasks?
The ML algorithm maps inputs (training data) to an output (the model).
A human writes the ML program, and the ML program creates the model from the data.
What type of learning was the animal classification in assignment 3?
Supervised - as it is given training data that is labelled (cat, dog, etc with pictures)
what are the two main ways of finding patterns in unsupervised learning?
clustering: identifying different types of input
dimensionality reduction: simplifying the way data is represented (see the sketch below)
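A minimal sketch of both approaches, assuming scikit-learn and NumPy; the data is random and purely illustrative:

```python
# Hedged sketch: k-means clustering groups similar unlabelled instances,
# while PCA simplifies how the data is represented (fewer dimensions).
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA

X = np.random.rand(200, 5)                                      # 200 unlabelled instances, 5 features

clusters = KMeans(n_clusters=3, n_init=10).fit_predict(X)       # clustering: assign each instance a group
X_2d = PCA(n_components=2).fit_transform(X)                     # dimensionality reduction: 5 features -> 2
print(clusters[:10], X_2d.shape)
```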
What is an example of clustering (discussed in class)?
A database of ‘customer transactions’, what items do different types of customers purchase together
What is an example of reinforcement learning (discussed in class)?
AI that plays games - AlphaGo etc
What is transfer learning?
A model that is trained for one task and re-trained for a new, similar task.
Explain tabular data and the areas
Tabular data refers to the structure of training data as a table.
Each row is a training instance. (single piece of data)
Each instance has many features.
Each column holds one of these features.
One feature is the output, and the others are inputs.
What does feature engineering include and why do we need to do it?
Our data often doesn’t arrive in a neat table - we have to create the table.
Therefore, we need to create suitable categories, normalise features (make sure one feature doesn't dominate the others), and fix any missing or incorrect data (see the normalisation sketch below).
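A minimal NumPy sketch of min-max normalisation; the two features (a height in cm and a 1-3 rating) are made-up values to show the scale problem:

```python
# Hedged sketch: min-max normalisation so no feature dominates distance
# calculations just because it is measured on a larger scale.
import numpy as np

X = np.array([[170.0, 1], [180.0, 3], [160.0, 2]])               # height in cm vs. a 1-3 rating
X_norm = (X - X.min(axis=0)) / (X.max(axis=0) - X.min(axis=0))   # rescale each column to [0, 1]
print(X_norm)
```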
How do we geometrically represent datasets with just one feature?
A one-dimensional space - a line
How do we geometrically represent a dataset with two features?
Scatterplot
How do we geometrically represent a dataset with three features?
A set of points in 3D, using different colours (of dots) for different output labels. The boundaries between output classes can be thought of as lines or planes; learning often involves finding the right boundary.
How do ML algorithms represent training items?
As points in a n-dimensional feature space
What does Pythagoras' theorem refer to in ML algorithms?
The (Euclidean) distance between two points in a feature space (see the sketch below).
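A minimal NumPy sketch with made-up points, showing Pythagoras generalised to n dimensions:

```python
# Hedged sketch: Euclidean distance between two training instances in feature space.
import numpy as np

a = np.array([1.0, 2.0, 3.0])
b = np.array([4.0, 6.0, 3.0])
distance = np.sqrt(np.sum((a - b) ** 2))   # square the per-feature differences, sum, square-root
print(distance)                            # 5.0
```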
What is the validation set?
The set we use to explore over/underfitting, so that the test set is kept for one final use only. We can experiment with different parameters on the validation set to see which ones generalise best.
k=1 and k=5 in k-nearest neighbours will cause what effects?
k=1 will cause overfitting, as the model pays too much attention to single (possibly noisy) items.
k=5 will capture general trends but may smooth over real local patterns, which tends toward underfitting.
How does k-NN work?
k-NN is an ‘instance-based’ classification algorithm that classifies new instances based on similarity (small distance)
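A minimal sketch comparing k=1 and k=5 as discussed above, assuming scikit-learn; the iris dataset stands in for any labelled dataset:

```python
# Hedged sketch: k-NN classifies a new instance by the majority class of its k
# nearest (smallest-distance) training instances.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

for k in (1, 5):
    knn = KNeighborsClassifier(n_neighbors=k).fit(X_train, y_train)
    print(f"k={k}: validation accuracy =", knn.score(X_val, y_val))
```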
What are the parts of a decision tree?
a tree is a structure of nodes (boxes) connected by arcs (lines)
each node represents one of the features to consider, and each node's arcs represent all the possible values for that feature
What is a decision tree?
A classifier that maps input features to an output. It is a structured process that represents a sequence of decisions. Decision trees are very easy for humans to understand.
How do you train a decision tree?
pick an input feature
and add a node for that feature in the tree
split the training set by the values of that feature
and create an arc from the node for each value
at each arc, build a new decision tree with the remaining features (see the sketch after this list)
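A minimal plain-Python sketch of this recursive procedure. It simplifies by picking features in order rather than by information gain or Gini impurity (which real learners use), and the tiny weather dataset is made up:

```python
# Hedged sketch of recursive decision-tree building.
from collections import Counter

def build_tree(instances, features):
    # instances: list of (feature_dict, label); features: list of feature names
    labels = [label for _, label in instances]
    if len(set(labels)) == 1 or not features:
        return Counter(labels).most_common(1)[0][0]                     # leaf node: (majority) class label
    feature = features[0]                                               # pick an input feature
    node = {"feature": feature, "branches": {}}                         # add a node for that feature
    for value in {feats[feature] for feats, _ in instances}:            # one arc per value of the feature
        subset = [(f, l) for f, l in instances if f[feature] == value]  # split the training set
        node["branches"][value] = build_tree(subset, features[1:])      # recurse with remaining features
    return node

data = [({"weather": "sunny", "wind": "low"}, "play"),
        ({"weather": "rainy", "wind": "high"}, "stay"),
        ({"weather": "sunny", "wind": "high"}, "play")]
print(build_tree(data, ["weather", "wind"]))
```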
What is a binary classifier, how does it work ?
a binary classifier predicts whether an input item is ‘in’ a class or not. its output is either positive (yes its in the class) or negative (no its not in the class).
we can also use a confusion matrix with a binary classifier to provide extra information about its positive and negative predictions (see the sketch below)
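A minimal plain-Python sketch with made-up labels, showing the four counts a confusion matrix summarises:

```python
# Hedged sketch: counting TP/FP/TN/FN for a binary classifier's predictions.
actual    = [1, 0, 1, 1, 0, 0, 1]   # 1 = positive (in the class), 0 = negative
predicted = [1, 0, 0, 1, 0, 1, 1]

tp = sum(a == 1 and p == 1 for a, p in zip(actual, predicted))   # correctly predicted positives
tn = sum(a == 0 and p == 0 for a, p in zip(actual, predicted))   # correctly predicted negatives
fp = sum(a == 0 and p == 1 for a, p in zip(actual, predicted))   # negatives wrongly called positive
fn = sum(a == 1 and p == 0 for a, p in zip(actual, predicted))   # positives wrongly called negative
print([[tp, fn], [fp, tn]])                                      # a 2x2 confusion matrix
```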
How do deepfakes work/what network do they use?
Generative adversarial networks
generator network - makes fake images
discriminator - judges if an image is fake or real
How do you train the discriminator network?
the discriminator classifies both real data and fake data from the generator
it is penalised for misclassifying a real instance as fake or a fake instance as real
the discriminator then updates its weights through backpropagation
How do you train a generator network?
produce generator output from random noise
use the discriminator to classify this output as a real or fake image
penalise the generator if its output is classified as fake
update the generator's weights through backpropagation (a minimal training-loop sketch follows below)
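A hedged PyTorch sketch of the alternating discriminator/generator steps described above; the tiny fully connected networks, 4-dimensional "images" and random "real" data are toy stand-ins to keep it short:

```python
# Hedged sketch of one GAN training loop (toy dimensions, not real images).
import torch
import torch.nn as nn

G = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 4))                 # generator
D = nn.Sequential(nn.Linear(4, 16), nn.ReLU(), nn.Linear(16, 1), nn.Sigmoid())   # discriminator
loss = nn.BCELoss()
opt_G, opt_D = torch.optim.Adam(G.parameters()), torch.optim.Adam(D.parameters())

real = torch.rand(32, 4)                        # stand-in for a batch of real images
for _ in range(100):
    # Discriminator step: penalised for misclassifying real as fake or fake as real.
    fake = G(torch.randn(32, 8)).detach()
    d_loss = loss(D(real), torch.ones(32, 1)) + loss(D(fake), torch.zeros(32, 1))
    opt_D.zero_grad(); d_loss.backward(); opt_D.step()

    # Generator step: penalised when the discriminator calls its output fake.
    fake = G(torch.randn(32, 8))
    g_loss = loss(D(fake), torch.ones(32, 1))   # generator wants its fakes labelled "real"
    opt_G.zero_grad(); g_loss.backward(); opt_G.step()
```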
what is the outcome/goal of training the generator and discriminator?
for the generator to create images so realistic that the discriminator can't tell they're fake
whats an example from class about when a deepfake was used?
The Gabon deepfake controversy - a video of President Ali Bongo released in 2018
What is the kaka AI implementation example in ecology and how does the pipeline work?
Kaka - identification of whether it's a new or known individual
pipeline: input visuals through data collection
then pre processed
then object detection
then image segmentation
then feature matching
then knowledge base
then output of known or new individual
What is the other AI implementation in ecology example?
Modelling plant stress responses, and modelling species distribution: predicting where a species is likely to move and using the ML to understand why.
We can do this with ML models that take predictor variables (climate, elevation, etc.) and training data (NVS surveys), and then predict how the area occupied by a species will change.
Any further ways AI is used in ecology?
prediction -
using climate and geographical features as inputs to try and predict the abundance changes (species increase or decrease)
What are the two types of AI systems in the medical field? what things can they do?
generative and predictive systems
medical imaging to diagnose diseases and guide treatment
sensing technology - alerts, summary data
predicting patient outcomes, patient in/out flow, bed use
What are examples of post-hoc approaches?
proxy interpretable models, visualisation, counterfactual explanation, feature importance
what does a post-hoc model focus on doing? and how does it do this?
giving an explanation for the model.
this can be either a global (overall model structure/behaviour) or local (how/why the model makes a particular decision) explanation
can also be model specific or model agnostic
an explanation can look like a simpler model, a visualisation, data points, a text explanation, or feature importance
In a simplified linear model, what would be of importance? (in terms of feature importance)
coefficients
these show the positive/negative relationship of each feature with the output, but as a measure of importance they ignore the different scales of the features (unless the features are normalised)
What is SHAP?
SHAP is Shapley Additive exPlanations
It refers to the use of Shapley values, where the contribution of each feature to the overall result is calculated by considering every possible feature subset.
SHAP approximates Shapley values to give a measure of feature importance (see the sketch below).
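A minimal plain-Python sketch that computes exact Shapley values for a tiny hand-made model by enumerating every feature subset (which SHAP only approximates for real models); the feature names, toy model and baseline values are all made up for illustration:

```python
# Hedged sketch: exact Shapley values = each feature's marginal contribution,
# averaged (with the Shapley weighting) over every possible subset of the other features.
from itertools import combinations
from math import factorial

features = ["age", "income", "bmi"]
x = {"age": 50, "income": 3, "bmi": 30}          # instance being explained
baseline = {"age": 40, "income": 2, "bmi": 25}   # "feature absent" reference values

def model(v):                                    # toy model standing in for any black box
    return 0.1 * v["age"] + 2.0 * v["income"] + 0.5 * v["bmi"]

def value(subset):                               # model output with only `subset` features present
    v = {f: (x[f] if f in subset else baseline[f]) for f in features}
    return model(v)

n = len(features)
for f in features:
    others = [g for g in features if g != f]
    phi = 0.0
    for k in range(len(others) + 1):
        for S in combinations(others, k):
            weight = factorial(len(S)) * factorial(n - len(S) - 1) / factorial(n)
            phi += weight * (value(set(S) | {f}) - value(set(S)))   # marginal contribution of f
    print(f, round(phi, 3))                      # each feature's contribution to this prediction
```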
what does counterfactual mean?
It is causal: if X had not occurred, then Y would not have occurred.
a counterfactual explanation of a prediction describes the smallest change of the feature values that changes the prediction output
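A minimal plain-Python sketch of that idea: a brute-force search for the smallest change to one feature that flips the prediction. The loan-approval rule and numbers are a toy stand-in for a real model:

```python
# Hedged sketch: find the smallest single-feature change that changes the prediction.
def approve(income, debt):
    return income - debt > 50             # toy decision rule standing in for a trained model

income, debt = 60, 20                      # current applicant: approve() returns False
for change in range(1, 100):
    if approve(income + change, debt):     # smallest income increase that flips the outcome
        print(f"Counterfactual: if income were {income + change}, the loan would be approved")
        break
```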
What is the purpose of CNNs?
CNNs are for handling image data:
- they capture spatial relationships in images and avoid the flattening issue (making them more similar to human vision)
What are the parts of a CNN?and what happens between them?
Input -> (convolution) -> feature maps -> (pooling) -> feature maps -> (convolution) -> feature maps -> (pooling) -> feature maps -> (fully connected) -> output
What do the pooling, convolution and fully connected layers of a CNN do?
convolution -
has filters, that allow recognition of edges, textures etc
each filter slides across the image to create a feature map
pooling -
reduces dimensionality (width and height) of the feature map
this reduces sensitivity to small shifts, lowers computational cost, and moves us closer to the size of the output layer
fully connected -
connects all neurons from the previous layer to each node in the fully connected layer
allows the network to produce an output (final decision/classification) - see the sketch below
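A hedged PyTorch sketch of the layer sequence above; the filter counts, 28x28 grayscale input and 10-class output are illustrative choices, not from the course:

```python
# Hedged sketch: convolution and pooling layers produce feature maps,
# and a fully connected layer produces the final classification scores.
import torch
import torch.nn as nn

cnn = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3, padding=1),   # convolution: 8 filters -> 8 feature maps
    nn.ReLU(),
    nn.MaxPool2d(2),                             # pooling: halve the width and height
    nn.Conv2d(8, 16, kernel_size=3, padding=1),  # convolution over the pooled feature maps
    nn.ReLU(),
    nn.MaxPool2d(2),
    nn.Flatten(),
    nn.Linear(16 * 7 * 7, 10),                   # fully connected: produce 10 class scores
)
print(cnn(torch.rand(1, 1, 28, 28)).shape)       # a 28x28 grayscale image -> (1, 10)
```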
How do you train a CNN?
human designs the network architecture
train the weights in the network (filters and standard weights) with backpropagation
needs a lot of labelled data - supervised learning
Give an analogy for a caption to a text-to-image generator
A caption is to the generator what a draft full of ideas is to an essay.
Explain why ChatGPT produces different responses for the same text
Stochastic selection: it chooses an output based on probabilities. When you give ChatGPT a prompt, it's like pointing to a region of its training data and saying "produce text from here"; the more detail you give, the more precise the region you're pointing to. Because it has seen many varied examples, several continuations fit, and it chooses among them based on their probabilities (see the sketch below).
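A minimal NumPy sketch of stochastic selection; the candidate words and their probabilities are made-up toy values:

```python
# Hedged sketch: sampling from a probability distribution over next words,
# which is why the same prompt can produce different outputs on different runs.
import numpy as np

words = ["mat", "sofa", "roof", "moon"]
probs = np.array([0.6, 0.25, 0.1, 0.05])     # toy next-word probabilities for one prompt

for _ in range(3):
    print(np.random.choice(words, p=probs))  # different completions across runs
```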