NVIDIA AI final Flashcards
Question
Answer
Unit 4. Employs algorithms and statistical models that enable computer systems to find patterns in massive amounts of data, and then uses a model that recognizes those patterns to make predictions or descriptions on new data. Is this Deep, Machine, Neural, Deep Neural Network learning.
Unit 4. Machine Learning
Unit 4. This framework is an essential tool for Data Scientists. This is a Computer Vision, Natural Language Processing, Speech and Audio Processing, Robot learning more. Interface. Library or Tool. What is this framework? AI, ML, DNN, MDL framework.
Unit 4. Machine Deep Learning Frameworks
Unit 4. PyTorch Geometric DGL and others rely on these libraries such as cuDNN, NCCL and DALI to deliver high-performance, accelerated training. What is this type of accelerated training. Is this Deep Learning, Machine accelerated, GPU accelerated, AI?
Unit 4. GPU Accelerated
Unit 4. This framework offers building blocks for designing, training and validating deep neural networks through a high-level programming interface. Widely used frameworks such as PyTorch and TensorFlow. Is this AI, DL DNN, or ML frameworks.
Unit 4. Deep Learning Frameworks
Unit 4. A sub-class of Machine Learning. It uses neural networks to train a model. Using very large data sets. In the range of Terabytes or more of data. Is the answer: Machine Learning, AI, or Deep Learning or Deep Neural Network approach.
Unit 4. Deep Learning Approach
Unit 4. This type of Neural Network model are Algorithms that mimic the human brain in understanding complex patterns. Once trained, on new images, it can make predictions. What is this type of Neural Network Model?
Unit 4. Deep Neural Network Model
Unit 4. What is this type of training data? It is a set of data with “_ _ _ _ _ ” that help the neural network learn. These “ _ _ _ _ _” can be the objects in the images: cars, trucks, cranes. The error that the classifier makes on the training data are used to incrementally improve the network structure.
…Name this type of training data (- - - - - -)
Unit 4. ‘Labels’ as in labeled Training Data
Unit 4. Once the neural network based model is trained it can make this type of “predictions” on new images. Once trained the network and classifier are deployed against previously unseen data, which is not labeled. If the training was done correctly, the network will be able to apply its feature representation to correctly classify similar classes in different situations. These “predictions” are also referred to a certain “class”
Unit 4. Object Class Predictions
Unit 4. A modern Open Source Machine Deep learning framework used to train and deploy deep neural networks. It is scalable allowing for fast model training, and supports a flexible programming modem and multiple languages. This type of library is portable and can scale to multiple GPU’s and multiple machines.
Unit 4. Machine Deep Learning Frameworks - MXNet
Unit 4. Machine DL Frameworks. This free software machine learning scientific library (framework) for Python Program language features various classification, regression and clustering algorithms. Choose mxnet, scikits-learn or tensorflow.
Unit 4. Machine Deep Learning Frameworks - SciKit Learn
…and is designed to interoperate with the Python numerical and scientific libraries.
Unit 4. This is an essential tool for Data Scientist in the Machine Deep Learning Framework. It is also a popular Open source software library (framework) for dataflow programming across a range of tasks. It is a symbolic math library and is commonly used for deep learning applications.
Is it MXNet, or SciKit-learn or TensorFlow
Unit 4. Machine Deep Learning Frameworks - Tensor Flow
Unit 4. This Nvidia Deep Learning Software Stack is comprised of Host OS and NVIDIA Driver, NGC Container, DL Frameworks
Unit 4. Nvidia Deep Learning Software Stack
Unit 4. This Nvidia Deep Learning Software Stack “OS” enables the deep learning framework to use the GPU functions
Unit 4 Host OS and Nvidia Drive
Unit 4. These publicly available containers, are optimized to run NVIDIA GPU’s in the Nvidia Deep Learning Software Stack.
Unit 4 NGC Container
Unit 4. This popular type of framework(s) is available inside the containers for Nvidia Deep Learning Software Stack. Is it ML, AI, DL, DNN?
Unit 4. DL or deep learning Frameworks
Unit 4. Nvidia Deep Learning Software Stack - The name for Nvidia’s groundbreaking parallel programming model that provides essential optimization for deep learning.
Unit 4. A CUDA MATADA
Unit 4 Accelerate data preparation, Model Training, Visualization with this type of software stack
Unit 4 Machine Learning Software Stack
Unit 4 Machine Learning Software Stack “Columnar name” in memory data structure “_ _ _ _ _ _” arrow
Unit 4 Apache arrow (Machine Learning Software Stack) which Delivers efficient and fast data interchange with the flexibility to support complex data models. What is the Columnar name referred to as
Unit 4. A suite of open source software libraries and API’s which offers the ability to execute end to end data science and analytics for executing data science pipelines, entirely on GPU’s. And can “reduce” training times from days to minutes. Built on NVIDIA® CUDA-X AI.
Unit 4. RAPIDS (Machine Learning Software Stack)
(Unit 4) A framework and collection of graph analytics libraries that seamlessly integrates into the RAPIDS data science platform Tensor RT
Unit 4. CUGRAPH (Machine Learning Software Stack) Nvidia GPU Software Ecosystem.
Unit 4. A Dataframe manipulation library based on Apache Arrow that accelerates loading, filtering and manipulation of data for model training data preparation. dask, cudf, cuml, cudnn
Unit 4. CUDF (Machine Learning Software Stack)
Unit 4. A collection of GPU accelerated machine learning libraries that will provide GPU versions of all machine learning algorithms available, including SciKit-learn Knn, Kmeans, Random Forest and Regressions. Is it rapids, cuml, dask, python
Unit 4. CUML (Machine learning software stack)
Unit 4. Give users the ability to run jobs in the map reduce style of programming. Which allows pipelines to stage data in main memory, if everything doesn’t fit in GPU memory. cuml, cudf, dask, cugraph
Unit 4. DASK (Machine Learning Software Stack)
Unit 4. Developers use this language which is a simple programming language, to develop models using the above libraries.
Unit 4. Python
Unit 4. This Is a collection of software acceleration libraries built on top of CUDA and over 13 other libraries. Increase productivity.
Unit 4. Nvidia CUDA-X AI Ecosystem - CUDA-X AI
Unit 4 Nvidia Deep Learning Software Stack for Accelerating Deep learning primitives. Is this cuDL or cuDNN or cuML or python?
Unit 4. Nvidia CUDA-X AI Ecosystem - CUDA-X-AI - cuDNN
Unit 4. Accelerating data science workflows and “Machine Learning” algorithms ecosystem. Name this Nvidia CUDA-X AI Ecosystem - CUDA-X-AI
Unit 4. Nvidia CUDA-X AI Ecosystem - CUDA-X-AI - cuML
Unit 4. Nvidia Deep Learning Software Stack Nvidia Optimizing Trained Models for Inference
Unit 4. Nvidia CUDA-X AI Ecosystem - CUDA-X-AI-NVIDIA TensorRT
Unit 4. Nvidia Deep Learning Software Stack This Nvidia CUDA hardware provides a data frame manipulation library ecosystem. Name this CUDA-X-AI-NVIDIA DL software stack.
Unit 4. Nvidia CUDA-X AI Ecosystem -CUDA-X-AI-NVIDIA cuDF
Unit 4. Nvidia Deep Learning Software Stack For performing high-performance analytics on Graphs
Unit 4. Nvidia CUDA-X AI Ecosystem-CUDA-X-AI-NVIDIA cuGraph
Unit 4. Nvidia Deep Learning Software Stack cuDNN, cuML, TensorRT,cuDF, cuGraph, together they work seamlessly with this Nvidia product to accelerate the development and deployment of AI based applications.
Is it DL, ML, AI, Tensore core
Unit 4. Nvidia CUDA-X AI Ecosystem -CUDA-Xi-AI Nvidia Tensor Core GPU
Unit 4. Nvidia Deep Learning Software Stack This type of Nvidia CUDA-X framework is used for Desktops, workstations, servers, cloud computing deployments and software acceleration libraries.
Unit 4. Nvidia CUDA-X AI Ecosystem - Frameworks, Cloud ML, Deployments
Unit 1. Clinical care, Operational Efficiency (no-shows), Precision Medicine (radiomics, one size fits all), Drug Discovery (monitoring). Is it ML, DL, or AI
Unit 1. AI in Healthcare
Applications include Radiomics (biomarker), At-Risk Patients, Medical billing, Disease/Genetic correlation, Medical Transcription, Drug Interactions, Cancer Detection.
Applications of AI in Healthcare
A broad field of study focused on using computers to do things that require human-level intelligence
Artificial Intelligence
An approach that uses statistical learning Algorithm
Machine Learning
A techinque inspired by how human beings learn.
Deep Learning
Where Computations can run on CPI cores and on GPU’s
Compute Nodes (AI Cluster Components)
Where data is stored
Storage Nodes (AI Cluster Components)
6.1 These types of nodes for Multi System AI cluster are used for system monitoring, provisioning, troubleshooting. Services required can include user authentication, network proxies, workload, data, fabric, system management and monitoring and general user and acess and services.
Tip:Containerization tools such as Docker are often used to separate and manage devices. Reliable, resilient and robust servers are often required to ensure a highly available system.
6.1 Management Nodes (AI Cluster Components)
6.1 What is used for AI Cluster Components, that connects compute nodes, storage nodes, and management network services. It is also used specifically for when the nodes are powered off. (In Band, Out of band, rubber band, or GUI, IPMI Networking)
6.1 Out of Band Networking (AI Cluster Components)
6.1 These type of nodes are for GPU based servers provide most of the computational resources and more power efficient. All components must keep up. Sharing data across multiple systems and multiple users.
6.1 Compute nodes (AI Cluster)
6.1 Provides nodes functionality rack of servers into a system. Service required user authentication, network proxies, workload, data, fabric, system management and monitoring. Is it Network, Switch, Storage, Out of band
6.1 Management Nodes (AI Cluster)
Connects Compute Nodes
Computer Network (AI Cluster)
Connects storage nodes
Storage Network (AI Cluster)
Used by all services necessary for system to operate
Management Network (AI Cluster)
Provides Best Practices to Design Systems for AI Workloads. Provides proven designs that organizations can leverage for their own needs as well as a recipe for getting started. (Model, Container, Reference)
Reference Architectures
This Nvidia DGX device for Reference Archie. With two - eight DGX A100 systems, compute servers, Nvidia storage partners
NVIDIA DGX POD
This Nvidia DGX device uses Configurations starting with 20, infused with expertise, designed to support widest range of DL and HPC workloads.
NVIDIA DGX SuperPOD
(Unit 6.2) Training DL and ML Models Requires “Massive Datasets” to obtain high accuracy.This increase in complexity leads to increased accuracy. What is this type of consideration for AI Workloads. Is it DL, ML, AI, DNN, RA, cuFL, storage?
Storage for AI Workloads Unit 6.2
6.2 Data should be visible, labeled, resiliency, recall, reconstructed, controls, vet, monitor, robust, end user needs, high perf, shared, data stewardship. These are the AI Characteristics of this type of “data” systems.
6.2 Storage Systems Characteristics for AI
6.2 These questions should be asked when deciding on this one, specific, type of data solution…How often will it be accessed, How often will it be written too, How often will it be read, when will it retired, what if there are system failures, Will this be fast storage, Is the data private…once again, these are questions for a very particular type of data solution.
6.2 Deciding on a storage solution. When deciding on a storage solution the full life cycle data should be considered.
(Unit 6.2) This is a type of storage…
Simpler than Traditional shared f/s
Scale storage massively (PB)
High level of data protection via data replication
Traditionally used in large cloud data storage repos
No directory structure, files are referenced by keys
Files are accessed via a REST API
not a standards, or pplications
must be re-written to directly access data
Object storage Unit 6.2 Storage considerations.
6.2 SQL, NoSQL, SQL-Like databases. Unique perfo. Charact. Access methods. Not as general as other fs types belong in this category of data storage systems…not parallel or distributed but “- - - - -“ data storage systems.
6.2 Other Data Storage Systems
6.2 This type of storage file system (in a data hierarchy) can share data, group servers, scale out, and it can offer the highest read and write speeds…It is not NFS or Local. It is:
6.2 Parallel and Distributed fs…Storage systems data hierachy