Explore data and train models (35–40%) Flashcards
What is automatically created when a workspace is provisioned?
Azure storage account, azure key vault, application insights, azure container registry
What are the ways to create a workspace
The user interface in Azure portal, Azure Resource Manager (ARM) template, Azure CLI, Azure ML python SDK
workspace class has what params?
name, display_name, location, description
How to give workspace access to others?
Use role-based access control (RBAC)
What are the three general built-in roles?
Owner, contributor, reader
Owner vs contributor
Contributor can do everything but give access to others
What are the two built-in roles in AML?
AML data scientist: can do workspace actions except altering or deleting computes and editing workspace settings, AML compute operator: can create, change, and manage access the compute resources
What are the 4 different computes in AML workspace?
Compute Instance, Compute Cluster, Inference Cluster, Attached Compute
Compute Instance details
Managed by workspace, good for small work
Compute cluster details
Workspace-managed on-demand clusters of CPU or GPU nodes. Automatically scaling
Inference Cluster details
Azure Kubernetes Service cluster for deployed ML models in production
Attached compute details
Allows you to use Databricks or spark pools
What are AML assets
Models, environments, data, and components
Why do you want to register your model in the workspace
It will be available in your workspace as opposed to only your local comp, and it gets a version number
What are environments for
Environments provide any necessary components that are needed to execute the code, such as packages or environment variables. Environments need a name and a version to be created
What do you need to create a data asset
Name, version, and path to the asset
What’s the point of components
Allows you to reuse and share code. To create; needs name, version, code, and environment
What are the 4 options for model training in AML
Designer, Automated ML, Jupyter notebook, run a script as a job
What’s good about designer
Low code drag and drop components that are easy to manage and visualize
What’s good about automated
Automated will iterate through hyperparameters and algorithms to find the best selection for your use case
What are the different types of jobs?
Command, sweep, pipeline
What is a command job
Job that executes a single script
What is a sweep job
Command job with hyperparameter tuning
What is a pipeline job
Multiple steps to a job
ML studio: Author tab options?
Notebooks, Automated ML, Designer
ML studio: Assets tab options?
Data Jobs, Components, Pipelines, Environments, Models, Endpoints
ML studio: Manage tab options?
Compute, Linked Services, Data Labeling
What do you need to authenticate into a workspace?
subscription_id, resource_group, workspace_name
how is MLClient() used after authenticating?
need to call MLClient whenever you connect to the workspace, like creating or updating assets or resources
What can you do on the azure cli?
Basically, manage anything. It is also good for automating tasks. see: https://learn.microsoft.com/en-us/cli/azure/ml?view=azure-cli-latest
What are the params of a command job?
code: path to model, command: training script, environment: environment for script, compute: compute for script, display_name, experiment_name
How to create a job with MLClient()
mlClient().create_or_update(job)
What is a URI?
Uniform Resource Identifier