W2-Model Serving Architecture Flashcards
Two main places to run infrastructure for machine learning models?
Model Serving Architecture- GPT Summary
on-premises or outsourcing to a cloud provider.
There are two main ways of scaling: vertical and horizontal. Vertical scaling is using ____ , whereas horizontal scaling is ____.
Scaling Infrastructure- GPT summary
bigger and more powerful hardware
adding more devices to the network
Horizontal scaling is generally recommended, why?
Scaling Infrastructure- GPT summary
Because it provides elasticity and avoids taking the app offline to upgrade hardware resources. Moreover, it can be done automatically by the system, which reduces overall latency.