Chapter 2: Infrastructure and Tools for AI Flashcards

Question

What differentiates Deep Learning (DL) from Machine Learning (ML)?

Answer 1

DL is based on neural network algorithms, while ML also encompasses many other kinds of algorithms ## Footnote DL is considered a subset of ML.

Answer 2

The neural networks learn to pick up patterns or features from the data autonomously ## Footnote This contrasts with traditional ML where engineers select features.

Answer 3

Humans label the data, and machines attempt to label current or future data points ## Footnote This method involves continuous training to improve model accuracy.

Answer 4

* Classification models (e.g., spam filters)
* Regression models (e.g., predicting trends)

## Footnote These applications require continuous training and updating.

Answer 5

Classification problems ## Footnote It considers each feature in the dataset as an independent variable.

Answer 6

Support Vector Machine ## Footnote An SVM classifies data by finding the boundary that best separates the dataset into two classes.

Answer 7

To predict future data points based on one or more variables ## Footnote It finds the best line that fits the data.
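As a rough illustration of "finding the best line", here is a minimal sketch of a one-variable least-squares fit using only the Python standard library. The function name `fit_line` and the sample numbers are assumptions made up for this example, not part of the deck.

```python
def fit_line(xs, ys):
    """Ordinary least squares for a single input variable."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance(x, y) / variance(x); the shared 1/n factors cancel.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data that happens to lie exactly on y = 2x + 1.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]
slope, intercept = fit_line(xs, ys)

# Predict a future data point at x = 5.
prediction = slope * 5 + intercept
```

With real data the points would not sit exactly on a line, and the fitted line would be the one that minimizes the squared vertical distances to them.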

Answer 8

A future binary categorical state ## Footnote Examples include predicting loan defaults.
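Predicting a binary state such as a loan default can be sketched with a tiny logistic regression trained by gradient descent. This is a minimal sketch under assumptions: the feature `debt_ratio`, the labels, the learning rate, and the epoch count are all invented for illustration.

```python
import math


def sigmoid(z):
    """Squash any real number into a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic(xs, ys, lr=1.0, epochs=5000):
    """Fit weight w and bias b by batch gradient descent on the log loss."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        grad_w = 0.0
        grad_b = 0.0
        for x, y in zip(xs, ys):
            error = sigmoid(w * x + b) - y
            grad_w += error * x
            grad_b += error
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b


# Hypothetical training data: 1 means the borrower defaulted.
debt_ratio = [0.1, 0.2, 0.3, 0.7, 0.8, 0.9]
defaulted = [0, 0, 0, 1, 1, 1]

w, b = train_logistic(debt_ratio, defaulted)
p_low = sigmoid(w * 0.15 + b)   # estimated default probability, low debt ratio
p_high = sigmoid(w * 0.85 + b)  # estimated default probability, high debt ratio
```

The model outputs a probability; turning it into a yes/no prediction means choosing a cutoff, commonly 0.5.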

Answer 9

It uses nodes and branches to learn from past data to predict future values ## Footnote Decision trees are popular due to their interpretability.
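The idea of nodes and branches can be sketched with the smallest possible tree: a single decision node (a "stump") with two branches. This is a minimal sketch, not a full decision tree algorithm; the names `fit_stump` and `predict` and the study-hours data are assumptions for illustration.

```python
def gini(labels):
    """Gini impurity of a list of 0/1 labels (0.0 means perfectly pure)."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)


def majority(labels):
    """Most common 0/1 label; ties go to 1."""
    return 1 if 2 * sum(labels) >= len(labels) else 0


def fit_stump(xs, ys):
    """Learn from past data: pick the threshold whose branches are purest."""
    best = None
    for threshold in sorted(set(xs)):
        left = [y for x, y in zip(xs, ys) if x <= threshold]
        right = [y for x, y in zip(xs, ys) if x > threshold]
        if not left or not right:
            continue  # a split must send data down both branches
        impurity = (len(left) * gini(left) + len(right) * gini(right)) / len(ys)
        if best is None or impurity < best[0]:
            best = (impurity, threshold, majority(left), majority(right))
    _, threshold, left_label, right_label = best
    return threshold, left_label, right_label


def predict(stump, x):
    """Follow the branch that matches x and return that branch's label."""
    threshold, left_label, right_label = stump
    return left_label if x <= threshold else right_label


# Hypothetical past data: hours studied and whether the student passed.
hours_studied = [1, 2, 3, 6, 7, 8]
passed = [0, 0, 0, 1, 1, 1]
stump = fit_stump(hours_studied, passed)
```

The learned rule reads directly as "if hours studied is at most the threshold, predict fail, otherwise predict pass", which is the interpretability the footnote refers to. A full tree repeats this split inside each branch.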

Answer 10

An ensemble of decision trees used for both categorical and numerical predictions ## Footnote It averages or votes for predictions based on multiple trees.
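The voting and averaging step can be sketched on its own. In this minimal sketch the individual trees are not trained; their outputs are hard-coded as hypothetical values, and the helper names are assumptions for illustration.

```python
from collections import Counter
from statistics import mean


def forest_classify(tree_votes):
    """Categorical prediction: return the label most trees voted for."""
    return Counter(tree_votes).most_common(1)[0][0]


def forest_regress(tree_outputs):
    """Numerical prediction: return the average of the trees' outputs."""
    return mean(tree_outputs)


# Hypothetical outputs from three already-trained trees.
label = forest_classify(["spam", "ham", "spam"])
value = forest_regress([10.0, 12.0, 14.0])
```

In an actual random forest each tree is trained on a different random sample of the rows and features, which is what makes combining their outputs useful.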

Answer 11

The data is unlabeled, and machines find patterns independently ## Footnote This method is often used when the correct answers are unknown.

Answer 12

* Clustering
* Dimensionality reduction

## Footnote Clustering identifies groups in data, while dimensionality reduction simplifies datasets.
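To make "identifies groups in data" concrete, here is a minimal sketch of k-means clustering on one-dimensional data. k-means is one common clustering method chosen for this example; the deck does not name a specific algorithm. The starting centroids and the data are assumptions.

```python
def kmeans_1d(points, centroids, iterations=10):
    """Alternate between assigning points to the nearest centroid
    and moving each centroid to the mean of its assigned points."""
    clusters = []
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # An empty cluster keeps its previous centroid.
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters


# Hypothetical unlabeled data with two visible groups.
points = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
centroids, clusters = kmeans_1d(points, centroids=[0.0, 5.0])
```

No labels are supplied at any point; the two groups emerge only from the distances between the points.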

Answer 13

Principal Component Analysis; reduces dimensions while preserving as much of the data's variance as possible ## Footnote PCA is useful for analyzing large datasets.
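A minimal sketch of PCA reducing two dimensions to one, using the closed-form eigenvalues of a 2x2 covariance matrix so that no external library is needed. The function name `pca_2d_to_1d` and the sample points are assumptions for illustration; real projects would typically use a numerical library instead.

```python
import math


def pca_2d_to_1d(points):
    """Project 2-D points onto their first principal component.

    Returns the 1-D coordinates and the share of total variance they keep.
    """
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    var_x = sum((x - mean_x) ** 2 for x, _ in points) / (n - 1)
    var_y = sum((y - mean_y) ** 2 for _, y in points) / (n - 1)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in points) / (n - 1)

    # Eigenvalues of the covariance matrix [[var_x, cov_xy], [cov_xy, var_y]].
    half_trace = (var_x + var_y) / 2
    spread = math.sqrt(((var_x - var_y) / 2) ** 2 + cov_xy ** 2)
    top, bottom = half_trace + spread, half_trace - spread

    # Direction of greatest variance (the first principal component).
    angle = 0.5 * math.atan2(2 * cov_xy, var_x - var_y)
    axis = (math.cos(angle), math.sin(angle))

    projected = [(x - mean_x) * axis[0] + (y - mean_y) * axis[1] for x, y in points]
    return projected, top / (top + bottom)


# Hypothetical data: two strongly correlated measurements.
points = [(1.0, 1.1), (2.0, 1.9), (3.0, 3.2), (4.0, 3.8)]
projected, explained = pca_2d_to_1d(points)
```

The `explained` ratio is the fraction of the original variance that survives the reduction. It is close to 1 for strongly correlated data like this, but some information is still discarded.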

Answer 14

A model trained with both labeled and unlabeled data ## Footnote It helps guide the model towards finding patterns when labeled data is scarce.

Answer 15

When a model fits too well to a specific dataset and performs poorly on new data ## Footnote This is a common issue in data science.
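Overfitting can be sketched by comparing error on the training data with error on new data. In this minimal sketch, `memorizer` is a deliberately extreme, made-up model that simply looks up the training answers, and `simple_rule` is a hypothetical simpler model; the data are invented.

```python
def mse(model, data):
    """Mean squared error of a model over (x, y) pairs."""
    return sum((model(x) - y) ** 2 for x, y in data) / len(data)


# Hypothetical data that roughly follows y = 2x.
train = [(1, 2.1), (2, 3.9), (3, 6.2)]
new_data = [(4, 8.1), (5, 9.8)]

lookup = dict(train)


def memorizer(x):
    """Fits the training set perfectly but knows nothing about unseen inputs."""
    return lookup.get(x, 0.0)


def simple_rule(x):
    """Slightly wrong on the training set but captures the overall trend."""
    return 2 * x


memorizer_train = mse(memorizer, train)
memorizer_new = mse(memorizer, new_data)
simple_train = mse(simple_rule, train)
simple_new = mse(simple_rule, new_data)
```

The memorizing model has zero training error and a large error on new data, while the simpler rule has small errors on both. A large gap between the two errors is the usual warning sign.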

Answer 16

A learning method that uses a small amount of labeled data alongside a larger amount of unlabeled data to improve model performance ## Footnote The process involves predicting on unlabeled data and checking accuracy against the labeled data.
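The process in the footnote can be sketched as self-training (pseudo-labeling): fit on the few labeled points, predict labels for the unlabeled ones, then refit on everything. The nearest-centroid classifier, the function names, and the data below are assumptions chosen to keep the example short.

```python
def fit_centroids(labeled):
    """Model = the average position of each label's points."""
    sums, counts = {}, {}
    for x, label in labeled:
        sums[label] = sums.get(label, 0.0) + x
        counts[label] = counts.get(label, 0) + 1
    return {label: sums[label] / counts[label] for label in sums}


def predict(model, x):
    """Assign x to the label whose centroid is closest."""
    return min(model, key=lambda label: abs(x - model[label]))


def self_train(labeled, unlabeled):
    """Use the labeled data to pseudo-label the rest, then refit on both."""
    model = fit_centroids(labeled)
    pseudo_labeled = [(x, predict(model, x)) for x in unlabeled]
    return fit_centroids(labeled + pseudo_labeled)


def accuracy(model, labeled):
    """Share of the human-labeled points the model gets right."""
    return sum(predict(model, x) == label for x, label in labeled) / len(labeled)


# Hypothetical data: two labeled points and four unlabeled ones.
labeled = [(1.0, "low"), (9.0, "high")]
unlabeled = [2.0, 3.0, 7.0, 8.0]

model = self_train(labeled, unlabeled)
check = accuracy(model, labeled)
```

Practical versions usually keep only the pseudo-labels the model is confident about and repeat the loop several times; this sketch does a single pass.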

Answer 17

It learns through trial and error, adapting its approach based on past behavior and optimizing for rewards ## Footnote Commonly used in robotics to help machines adjust to real-world parameters.
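Trial and error with rewards can be sketched as a small epsilon-greedy "multi-armed bandit", one of the simplest reinforcement learning setups. This is a minimal sketch under assumptions: the fixed reward per action, the exploration rate, and the name `run_bandit` are invented for illustration.

```python
import random


def run_bandit(rewards, steps=100, epsilon=0.1, seed=0):
    """Learn which action pays best by trying actions and tracking rewards."""
    rng = random.Random(seed)
    values = [0.0] * len(rewards)  # running estimate of each action's reward
    counts = [0] * len(rewards)    # how often each action was tried
    for step in range(steps):
        if step < len(rewards):
            arm = step  # try every action once first
        elif rng.random() < epsilon:
            arm = rng.randrange(len(rewards))  # explore a random action
        else:
            arm = max(range(len(rewards)), key=lambda a: values[a])  # exploit
        reward = rewards[arm]
        counts[arm] += 1
        # Incremental average: adapt the estimate based on past behavior.
        values[arm] += (reward - values[arm]) / counts[arm]
    return values, counts


# Hypothetical environment: three actions with fixed rewards.
values, counts = run_bandit([0.2, 1.0, 0.5])
best_arm = max(range(len(values)), key=lambda a: values[a])
```

Real environments return noisy rewards and have states that change in response to actions, so the estimates take longer to settle than in this fixed-reward sketch.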

Answer 18

* Choosing models and use cases
* Managing production
* Updating models
* Keeping data fresh and clean
* Organizing experiments
* Validating and testing

## Footnote This process is complex and often a challenge for companies outside the tech industry.

Answer 19

To make layers of data and metadata available for AI/ML models to ingest and derive insights from ## Footnote ETL stands for Extract, Transform, Load.
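The three ETL stages can be sketched with the Python standard library. This is a minimal sketch under assumptions: the CSV content, the `users` table, and the cleaning rules (drop rows with no signup date, normalize the plan name) are all invented for illustration.

```python
import csv
import io
import sqlite3

# Hypothetical raw export with a missing date and inconsistent plan names.
RAW_CSV = """user_id,signup_date,plan
1,2024-01-05, Pro
2,,free
3,2024-02-11,FREE
"""


def extract(text):
    """Extract: read the raw rows from the source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop incomplete rows and normalize values."""
    cleaned = []
    for row in rows:
        if not row["signup_date"]:
            continue  # skip rows with no signup date
        plan = row["plan"].strip().lower()
        cleaned.append((int(row["user_id"]), row["signup_date"], plan))
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows where models and analysts can query them."""
    conn.execute("CREATE TABLE users (user_id INTEGER, signup_date TEXT, plan TEXT)")
    conn.executemany("INSERT INTO users VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), conn)
plans = [row[0] for row in conn.execute("SELECT plan FROM users ORDER BY user_id")]
```

Production pipelines add scheduling, logging, and error handling around these same three stages.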

Answer 20

* Testing/validating code and components
* Continuous code updates
* Continuous learning from new data
* Monitoring model performance

## Footnote This ensures that models do not become stale and maintain effective performance.
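The monitoring item can be sketched as a simple check that compares recent accuracy with the accuracy measured at deployment. This is a minimal sketch; the function name `needs_retraining`, the 0.05 tolerance, and the sample outcomes are assumptions, and real monitoring would track more than one metric.

```python
def needs_retraining(baseline_accuracy, recent_outcomes, tolerance=0.05):
    """Flag a model as stale when recent accuracy drops below the baseline.

    recent_outcomes is a list of booleans, True when a prediction was correct.
    """
    if not recent_outcomes:
        return False  # nothing to judge yet
    recent_accuracy = sum(recent_outcomes) / len(recent_outcomes)
    return recent_accuracy < baseline_accuracy - tolerance


# Hypothetical recent results for a model that scored 0.9 at deployment.
healthy = [True] * 9 + [False]       # 90% correct
degraded = [True] * 7 + [False] * 3  # 70% correct

healthy_flag = needs_retraining(0.9, healthy)
degraded_flag = needs_retraining(0.9, degraded)
```

A flag like this would typically trigger retraining on fresh data, which ties the monitoring step back to continuous learning.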

Answer 21

It impacts the performance of models and the overall product ## Footnote Poor data storage can lead to significant operational issues.

Answer 22

* Data Lake: Stores raw data in its native format
* Database: Structured data for easy access and querying
* Data Warehouse: Centralizes structured data for analysis and insights

## Footnote Each serves different purposes based on organizational needs.

Answer 23

It allows for quick leverage of insights and trends across various business units ## Footnote Essential for organizations looking to implement AI/ML across multiple functions.

Answer 24

False ## Footnote Relational databases can struggle with aligning schemas when combining data from different sources.

Answer 25

Iterating the process ## Footnote Stagnation can lead to outdated models and ineffective performance.

Answer 26

Loss of jobs or unfair outcomes, such as incorrect mortgage rates or prison sentences ## Footnote This highlights the importance of responsible AI management.

(50 cards)