Exploring AI use cases and applications Flashcards
When are AI/ML solutions appropriate?
- Coding the rules is challenging - e.g. determining if an email is spam or not.
- Scale of the task is challenging - e.g. scanning millions of emails to determine which one is spam.
What are the different types of ML model learning techniques?
- Supervised Learning - use of labeled data
- Unsupervised - unlabeled data, machines recognize patterns inherent in the data
- Reinforcement - given performance score; Reinforcement learning is broadly useful when the reward of a desired outcome is known, but the path to achieving it isn’t—and that path requires a lot of trial and error to discover
What are the two sub-categories of supervised learning?
- Classification - model trained on a labeled dataset and then used to predict unforeseen data. - e.g. recognize a new car
- Regression - supervised learning, model the relationship between dependent and independent variables and then use them to predict the dependent variable - e.g. what I did in South Carolina with the powerplant sensor readings.
What are the two types of unsupervised learning?
- Clustering - A kind of algorithm that groups data into different clusters based on similar features or distances between the data point to better understand the attributes of a specific cluster
- Dimensionality Reduction - reduce the number of features or dimensions in a dataset while preserving the most important information or patterns
What are the capabilities of Gen AI?
- Adaptability - to various tasks and domains
- Responsiveness - generate content in real-time; useful in chatbots
- Simplicity - simplifies complex tasks
- Creativity and Exploration - generates novel ideas
- Data efficiency - train on a small amount of data and generate additional synthetic data
- Personalization - to suit individual preferences
- Scalability - producing content at scale
Challenges of Gen AI
- Regulatory risks - e.g. disclosure of PII; mitigate through strict control of training data
- Social risk - unwanted content; mitigate via testing and evaluation prior to deployment
- Data security and privacy concerns - mitigate via encryption/firewalls
- Toxicity - inflammatory content; mitigate via guardrails.
- Hallucinations - inaccurate responses not consistent with training data ; mitigation - users much check output and must not over-rely on the model
- Interpretability - being able to explain how a model arrived at the decision that it did.
- Non-determinism - model generates different outputs for the same input.
What factors are in play when choosing a Gen AI model?
- Model Types - different models have different capabilities - e.g. text generation, summarization, chat, images, etc. - which may suit different use cases.
- Performance Requirement - accuracy, reliability
- Constraints - computational power, data availability, onprem vs. cloud
- Capabilities - text vs. multi-modal vs. text-to-images etc.
- Compliance - biases, privacy issues, potential for abuse
- Cost - larger models are precise, but cost more. Smaller models are cheaper and faster.
What are the key business metrics for Gen AI?
It depends on the use case, but, in general:
* User Satisfaction - with AI generated content
* Average revenue per user - generated by use of Gen AI
* Cross-domain performance - ability to perform across domains and industries
* Conversion rate - e.g. rate of visitors to a website that turn into buyers
* Efficiency - how much gen AI improves resource utilization, computation time, scalability - e.g. on a manufacturing site.