AI / ML Information Fundamentals Flashcards
What are the three main types of AI?
Machine learning
Deep learning
Generative AI
What are multi-modal models?
They do not rely on a single type of input or output. They can support (text, or images, or audio)
What does GPT mean?
Generative pre-trained Transformer. It generates human text or computer code based on input prompts
What does BERT mean?
Bidirectional Encode Representations from Transformers. Similar to GPT, but reads the text in two directions.
What does RNN mean?
Recurrent Nureal Network. It is meant for sequential data such as time series or test. Useful in speech recognition.
What does GAN mean?
Generative Adversarial Network. Used to generate synthetic data.
What is labeled data?
Data that includes both input features and output labels. For example, where images of animals are also labeled with the name of the animal.
What is unlabeled data?
It is data where an input feature is defined, but no output labels are included. For example, I uploaded images of animals, but never provided the name of those animals as an output label. Used for unsupervised learning.
What is structured data?
Tabular data like MS SQL and Excel.
What is unstructured data?
Data that does not follow a specific structure. Articles, social media posts, customer reviews.
Is image data structured or unstructured data?
Unstructured.
What is supervised learning
It can predict the output for new unseen input data.
Does supervised learning require labeled or unlabeled data?
Labeled
What is regression?
Used to predict a numeric value based on input data. For example, the cost of a house based on the size, the weight of a person based on their height.
What is classification?
It is used to predict the categorical label of input data.
Does classification have to be Binary?
No it can have multiple class classifications and multiple label classifications. E.g., “Acction” and “Comedy” for a movie or “Mammal” and “bird”
What is a validation set?
It is a subset of your training data used to tune and validate performance.
What is a test set?
It is used to evaluate the final model performance.
What is feature engineering?
Transformation of your raw data. example, converting birth date to age. You can create new labels with calculated values
What is unsupervised learning?
Discovery of inherent patterns, structures, or relationships within the input data.
What is clustering?
Used to group similar data points together based on their features.
What is association rule learning?
It is performed on unsupervised data to find products frequently bought together.
What is anomaly detection?
It is used for example to find fraud by finding outliers in data.
What is semi-supervised learning?
It uses both labeled and unlabeled data for training.
What is self-supervised learning?
It’s when a model generates labels for data without having humans label it first.
What are pre-test tasks in self-supervised learning?
They solve simple tasks and learn patterns in the dataset.
What is re-enforcement learning?
A type of machine learning where and agent learns to make decisions by performing actions to maximize cumulative rewards.
What is an agent in reinforcement learning?
It is the learner or decision maker. example, deep racer.
What is RLHF?
It is reinforcement learning from human feedback.
What is model overfitting?
When the model performs well on the training data, but badly on the evaluation data.
What is underfitting?
When the model performs poorly on the training data. It could mean the model is too simple or has poor data features.
If a model is underfitting, what can you do?
Perform feature engineering.
What is Bias?
The difference or error between the predicted and actual values.
What is high bias?
When the model doesn’t closely match the training data. This means model underfitting.
What can be done to reduce bias?
Increase the number of features or use a more complex model.
What is model varience?
How much the performance changes if trained on a different dataset.
What does high variance mean?
The model is overfitting and sensitive to changes in the training data.
How do you reduce model varience?
Reducing feature selection and splitting your training and tests data sets multiple times.
What are confusion matrices?
A metric to evaluate your model. True positive, false positive, true negative, false negative.
What are Precision, Recall, F1, and Accuracy used for?
Metrics used for Binary classification.
What is AUC-ROC used for?
Finding the best model for binary classification.
What is the mean absolute error
mean between predicted and actual values
What is the mean absolute percentage error?
mean in percentage between predicted and actual values
What is inferencing?
When a model is making a predication on new data.
What is the downside of real-time inferencing?
Speed is more important than accuracy. Example, a chat bot.
What is the benefit of batch processing in the context of inferencing.
Accuracy is valued above speed.
What kind of language models can run on the edge?
A small language model (SLM)
What is the learning rate hyperparameter?
How large or small the steps are when updating the model’s weight. Too high can overshoot the correct answer and too low may take longer, but have more accuracy.
What is the Batch Size hyperparameter?
The number of training examples used to update the model weights in one iteration. Smaller batches are more stable, but take more time. Larger batches are faster, but less stable.
What is the Number of epochs hyperparameter?
The number of times the model will iterate over the dataset. too few is underfitting, too many is overfitting.
What can be done to prevent overfitting?
Increase the training data size, stopping the training data early, feature engineering.