Module 4 Flashcards
List 5 security risks related to generative AI
- Hallucinations
- Deepfakes
- Training data poisoning
- Data leakage
- Filter bubbles (echo chambers)
What are hallucinations?
Instances where a generative AI model creates content that either contradicts the source or creates factually incorrect output under the appearance of fact
What is the problem with deepfakes?
You don’t know what is real and what is not
What is training data poisoning?
- Common approach when using AI to hack other AI models
- Hackers try to poison your training data pools
Where is data leakage common?
Federated learning
What are filter bubbles (or echo chambers)?
- The AI model repeats back to you what you already believe and what you have already told it
- It is not producing new insights
List 6 security risks of general AI
- AI can concentrate the power on a few individuals or organizations
- Overreliance on AI which provides a false sense of security
- AI systems are vulnerable to attack
- Misuse of AI
- AI algorithms that are used to attack other AI systems
- Storing training data outside of production
Why is it bad for AI to concentrate power on a few individuals or organizations?
It erodes individual freedom
What do you call it when an attacker manipulates input data to obtain a different output?
Adversarial machine learning attack
What is the potential impact of an adversarial machine learning attack?
Can lead to incorrect decisions which could lead to security breaches or data loss
What is the potential impact of the misuse of AI?
- May lead to security risks
- Like in transfer of learning attacks
Why should you never store training data outside of production?
- Those environments do not have the same level of security
- You may put this data out in a less secure environment
What operational risks are associated with AI?
How you use the AI
- High cost to run the AI algorithm - hardware (CPUs, GPUs, …)
Operating and running the AI
- Environmental costs
- Data corruption and poisoning
What environmental costs could AI generate?
- Increased carbon footprint
- Greater resource utilization
- Costs for running green
List 4 privacy risks associated with AI
- Data persistence
- Data repurposing
- Data spillover
- Data collected or derived from AI itself
Describe privacy issues related to data persistence
- Data exists longer than the human subjects that created it
- Once the data subject is gone, best practice would be to delete it
Describe privacy issues related to data repurposing
- Data being used beyond the originally specified purpose
- May be intentional or not