ML Systems Flashcards
HLS
HTTP Live Stream, effectively chops up video into MP4 chunks
pub/sub
publish - subscribe pattern (messaging)
asynchronous communication model that makes it easy for developers to build highly functional and architecturally complex applications in the cloud. In modern cloud architecture, applications are decoupled into smaller, independent building blocks called services. Pub/sub messaging provides instant event notifications for these distributed systems. It supports scalable and reliable communication between independent software modules.
Publishers => Messages (with topic) => subscribers
Apache Kafka / Amazon Kinesis
One open source, another AWS products to handle real-time data streaming. highly scalable and distributed streaming platforms, which handle batch or real time ingestion from multiple sources.
OLTP / OLAP
Online transaction processing vs Online analytical processing.
1. Online transaction processing - system that handles near real-time business processes, for example a database.
2. near real time analytical processes, for example reporting, auditing, business intelligence.
Zookeeper
Apache Zookeeper - is a centralized service that provides a place for distributed applications to store data, communicate, and coordinate activities. It’s used in distributed systems to coordinate processes and services.
It has a simple architecture and API and works well with workloads where reads are more common than writes.
ZooKeeper provides services such as:
- Maintaining configuration information
- Providing distributed synchronization
- Providing group services
- Offering a hierarchical key-value store
ZooKeeper nodes are called Z nodes. They can be persistent or ephemeral. Persistent Z nodes stay alive and are remembered by ZooKeeper. Ephemeral Z nodes disappear if the app disconnects.