Module 1 Vocab Flashcards

1
Q

Technical Vocabulary

A

Definition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Egeria

A

An open-source data governance platform for integrating and managing metadata across tools. It provides automated metadata exchange, compliance enforcement, and lineage tracking to ensure data quality and discovery.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Kylo

A

An open-source data lake management platform that simplifies the development of data ingestion pipelines. Built on Apache NiFi, it offers features like data quality monitoring and integrated security.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Atlas

A

Apache Atlas is an open-source metadata management and data governance tool. It integrates with the Hadoop ecosystem to support metadata classification, auditing, and lineage tracking.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Git

A

A distributed version control system widely used to manage changes in source code. It allows multiple developers to collaborate on a single project while maintaining version history.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

GitLab

A

A comprehensive DevOps platform that extends Git with CI/CD pipelines, issue tracking, and access control. It supports both cloud-hosted and self-hosted deployments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

AI Fairness 360

A

An open-source toolkit designed to measure and mitigate bias in machine learning models. It provides fairness metrics and mitigation algorithms for equitable predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

AI Explainability 360

A

An open-source library offering tools to interpret and explain machine learning models. It includes methods for feature importance, rule-based explanations, and counterfactual reasoning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Adversarial Robustness 360

A

A toolkit for testing and improving machine learning models against adversarial attacks. It includes defenses like adversarial training and preprocessing techniques.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Prometheus

A

An open-source monitoring system that collects and stores metrics, provides powerful querying capabilities, and supports alerting. Ideal for cloud-native environments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

ModelDB

A

A tool for tracking, versioning, and visualizing machine learning experiments. It integrates with frameworks like TensorFlow and PyTorch to ensure reproducibility.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Apache Spark

A

An open-source distributed computing system for batch processing. It provides scalability and efficiency in handling large datasets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Apache Flink

A

An open-source platform for stream processing and real-time analytics. It is optimized for low-latency data streams.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Node-RED

A

A visual programming tool for connecting APIs, hardware, and services. It enables event-driven workflows through an intuitive drag-and-drop interface.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

TensorFlow Lite

A

A lightweight version of TensorFlow designed for deploying machine learning models on mobile and embedded devices.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Apache Kafka

A

A distributed event-streaming platform for building real-time data pipelines and applications. It ensures fault tolerance and scalability.