AI / ML Information Fundamentals Flashcards
What are the three main types of AI?
Machine learning
Deep learning
Generative AI
What are multi-modal models?
They do not rely on a single type of input or output. They can support (text, or images, or audio)
What does GPT mean?
Generative pre-trained Transformer. It generates human text or computer code based on input prompts
What does BERT mean?
Bidirectional Encode Representations from Transformers. Similar to GPT, but reads the text in two directions.
What does RNN mean?
Recurrent Nureal Network. It is meant for sequential data such as time series or test. Useful in speech recognition.
What does GAN mean?
Generative Adversarial Network. Used to generate synthetic data.
What is labeled data?
Data that includes both input features and output labels. For example, where images of animals are also labeled with the name of the animal.
What is unlabeled data?
It is data where an input feature is defined, but no output labels are included. For example, I uploaded images of animals, but never provided the name of those animals as an output label. Used for unsupervised learning.
What is structured data?
Tabular data like MS SQL and Excel.
What is unstructured data?
Data that does not follow a specific structure. Articles, social media posts, customer reviews.
Is image data structured or unstructured data?
Unstructured.
What is supervised learning
It can predict the output for new unseen input data.
Does supervised learning require labeled or unlabeled data?
Labeled
What is regression?
Used to predict a numeric value based on input data. For example, the cost of a house based on the size, the weight of a person based on their height.
What is classification?
It is used to predict the categorical label of input data.
Does classification have to be Binary?
No it can have multiple class classifications and multiple label classifications. E.g., “Acction” and “Comedy” for a movie or “Mammal” and “bird”
What is a validation set?
It is a subset of your training data used to tune and validate performance.
What is a test set?
It is used to evaluate the final model performance.
What is feature engineering?
Transformation of your raw data. example, converting birth date to age. You can create new labels with calculated values
What is unsupervised learning?
Discovery of inherent patterns, structures, or relationships within the input data.