Guest Lectures on AI Development Flashcards

1
Q

What percentage of AI projects fail to reach production in 2024?

A

80% of AI projects do not go into production, twice the failure rate of non-AI projects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why do AI systems fail to get into production?

A

AI systems are system-based, not just software-based.

They rely on statistical techniques with inherent uncertainty.

They require a broader concept of quality.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the two main components of an AI application?

A

AI Portion: Includes the knowledge base and inference engine.

Non-AI Portion: Supports and integrates the AI components.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the main roles in AI system development?

A

Data Scientists: Develop the knowledge base and inference engine.
Developers: Create the non-AI portion and integrate components.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why are AI development teams interdisciplinary?

A

They require expertise from multiple domains, including software development, data science, and domain-specific knowledge.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are common challenges in interdisciplinary teams?

A

Communication barriers: Different terminologies and styles.

Cultural clashes: Conflicting norms and values.

Power struggles: Dominant disciplines asserting control.

Resistance to change: Hesitancy to adopt new methods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How can interdisciplinary challenges be mitigated?

A

Education and training in unfamiliar disciplines.

Learning vocabulary and concepts from other fields.

Time investment to build and mature teams (Tuckman’s Model: Forming, Storming, Norming, Performing).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the two primary types of AI models?

A

Narrow Machine Learning (ML) Models – Task-specific models.

Foundation Models (FMs) – General-purpose models trained on extensive, unlabeled data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are narrow ML models used for?

A

Classification: Assigning categories (e.g., spam detection).

Regression: Predicting continuous values (e.g., time estimation).

Clustering: Grouping similar data (e.g., customer segmentation).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are common challenges with narrow ML models?

A

Ethical concerns & bias.

Interpretability & explainability.

Generalization & overfitting.

Robustness against adversarial attacks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How can challenges with narrow ML models be mitigated?

A

Bias mitigation: Diverse datasets, ethical review boards.

Explainability: XAI techniques (LIME, SHAP, visualizations).

Overfitting reduction: Regularization, cross-validation, data augmentation.

Adversarial defense: Adversarial training, input validation, feature noise injection.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are Foundation Models (FMs)?

A

Trained on massive, diverse, unlabeled datasets.

General-purpose but customizable for specific tasks.

Large Language Models (LLMs) are a subset of FMs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are common use cases for FMs?

A

Natural Language Processing (e.g., text summarization, translation).

Image generation & classification.

Code generation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the key components of FM architecture?

A

Vector Spaces: Sentences are tokenized and represented as high-dimensional vectors.

Attention Mechanism: Determines the importance of different tokens for extracting meaning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How can Foundation Models be customized?

A

Prompt Engineering: Modifying input queries.

Retrieval-Augmented Generation (RAG): Adding external data sources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are major risks of Foundation Models?

A

Data privacy & security.

Misuse & misinformation.

Deepfakes & fake content.

17
Q

How can risks of Foundation Models be mitigated?

A

Implementing guardrails to monitor and reject problematic inputs/outputs.

Rejecting sensitive data or misinformation requests.

18
Q

Why is achieving quality in AI systems harder than in traditional software?

A

AI introduces data quality issues that affect performance.

AI models require additional preparation steps.

Quality is impacted by both software engineering and model training.

19
Q

How does AI system quality differ from traditional software quality?

A

Traditional Software Quality: Determined by software architecture, code quality, and development processes.

AI System Quality: Adds model quality and data quality as crucial factors.

20
Q

What are key AI quality attributes?

A

Performance: Accuracy, latency, throughput.

Security: Defense against data poisoning and adversarial attacks.

Reliability: Stability despite data/environmental shifts.

21
Q

How can data challenges be mitigated?

A

Data drift & environmental drift: Continuous monitoring and retraining.

Regulatory changes: Organizational unit to track legal developments.

22
Q

What additional development practices impact AI quality?

A

Data preparation: Cleaning, resolving missing values, handling outliers.

Model training: Selecting features, hyperparameter tuning.

Testing: Checking for bias and data distribution shifts.

Tool Support: Data lineage tracking, model packaging, and deployment tools.

23
Q

What is the role of software architecture in AI systems?

A

It isolates model changes using API layers.

It ensures system robustness despite AI model modifications.

24
Q

What are the three main contributors to AI deployment failures?

A

Achieving AI system quality is difficult.

AI development requires interdisciplinary collaboration.

AI models are based on statistical methods, introducing inherent uncertainty.

25
Q

How can AI deployment success rates improve?

A

Recognizing and mitigating common challenges in AI development.

Improving data quality, model robustness, and team collaboration.

Leveraging best practices in software and AI engineering.