W2-Model Serving Architecture Flashcards

1
Q

Two main places to run infrastructure for machine learning models?

Model Serving Architecture- GPT Summary

A

on-premises or outsourcing to a cloud provider.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

There are two main ways of scaling: vertical and horizontal. Vertical scaling is using ____ , whereas horizontal scaling is ____.

Scaling Infrastructure- GPT summary

A

bigger and more powerful hardware
adding more devices to the network

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Horizontal scaling is generally recommended, why?

Scaling Infrastructure- GPT summary

A

Because it provides elasticity and avoids taking the app offline to upgrade hardware resources. Moreover, it can be done automatically by the system, which reduces overall latency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly