W2-Model Serving Architecture Flashcards by Mahsa Zamanifard

What are the two main places to run infrastructure for machine learning models?

Model Serving Architecture - GPT summary

On-premises, or outsourced to a cloud provider.

There are two main ways of scaling: vertical and horizontal. Vertical scaling is using ____, whereas horizontal scaling is ____.

Scaling Infrastructure - GPT summary

bigger and more powerful hardware
adding more devices to the network

Why is horizontal scaling generally recommended?

Scaling Infrastructure - GPT summary

Because it provides elasticity: capacity can be added or removed as load changes, without taking the app offline to upgrade hardware. Moreover, it can be done automatically by the system, which helps keep overall latency low as traffic grows.
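
To make the "done automatically" part concrete, here is a minimal sketch of how an autoscaler might choose a replica count. It is an illustration, not something from the course material: the function name, parameters, and the min/max bounds are all hypothetical, and the proportional rule is modeled on the one described for the Kubernetes Horizontal Pod Autoscaler.

```python
import math


def desired_replicas(current_replicas, current_load, target_load,
                     min_replicas=1, max_replicas=10):
    """Return how many replicas to run so per-replica load nears the target.

    Hypothetical helper for illustration only.
    current_load: observed average load per replica (e.g. requests/sec).
    target_load:  the per-replica load we want to hold.
    """
    if current_replicas < 1 or target_load <= 0:
        raise ValueError("need at least one replica and a positive target")
    # Proportional rule: scale the replica count by how far the observed
    # load is from the target, rounding up so we never under-provision.
    raw = math.ceil(current_replicas * current_load / target_load)
    # Clamp to the configured bounds (assumed limits, not from the source).
    return max(min_replicas, min(max_replicas, raw))


if __name__ == "__main__":
    # 4 replicas each seeing 90 req/s against a 60 req/s target: scale out.
    print(desired_replicas(4, 90.0, 60.0))
    # Load drops to 20 req/s per replica: scale in.
    print(desired_replicas(4, 20.0, 60.0))
```

Because replicas are added or removed while the others keep serving, no downtime is needed, which is the elasticity the answer refers to. A vertical upgrade, by contrast, usually means replacing or restarting the machine.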

(3 cards)