Observability patterns Flashcards

Question

Distributed tracing: problem

Answer 1

How to understand the behavior of an application and troubleshoot problems?

Answer 2

* External monitoring only tells you the overall response time and number of invocations - no insight into the individual operations * Any solution should have minimal runtime overhead * Log entries for a request are scattered across numerous logs

Answer 3

Instrument services with code that * Assigns each external request a unique external request id * Passes the external request id to all services that are involved in handling the request * Includes the external request id in all log messages * Records information (e.g. start time, end time) about the requests and operations performed when handling a external request in a centralized service This instrumentation might be part of the functionality provided by a Microservice Chassis framework.

Answer 4

* It provides useful insight into the behavior of the system including the sources of latency * It enables developers to see how an individual request is handled by searching across aggregated logs for its external request id

Answer 5

Aggregating and storing traces can require significant infrastructure

Answer 6

Log aggregation - the external request id is included in each log message

Answer 7

You have applied the Microservice architecture pattern. The application consists of multiple services and service instances that are running on multiple machines. Errors sometimes occur when handling requests. When an error occurs, a service instance throws an exception, which contains an error message and a stack trace.

Answer 8

How to understand the behavior of an application and troubleshoot problems?

Answer 9

* Exceptions must be de-duplicated, recorded, investigated by developers and the underlying issue resolved * Any solution should have minimal runtime overhead

Answer 10

Report all exceptions to a centralized exception tracking service that aggregates and tracks exceptions and notifies developers.

Answer 11

It is easier to view exceptions and track their resolution

Answer 12

The exception tracking service is additional infrastructure

Answer 13

Log aggregation - exceptions should be logged as well as reported to a tracking service

Answer 14

You have applied the Microservice architecture pattern. Sometimes a service instance can be incapable of handling requests yet still be running. For example, it might have ran out of database connections. When this occurs, the monitoring system should generate a alert. Also, the load balancer or service registry should not route requests to the failed service instance.

Answer 15

How to detect that a running service instance is unable to handle requests?

Answer 16

* An alert should be generated when a service instance fails | * Requests should be routed to working service instances

Answer 17

A service has an health check API endpoint (e.g. HTTP /health) that returns the health of the service. The API endpoint handler performs various checks, such as: - the status of the connections to the infrastructure services used by the service instance - the status of the host, e.g. disk space - application specific logic A health check client - a monitoring service, service registry or load balancer - periodically invokes the endpoint to check the health of the service instance.

Answer 18

The health check endpoint enables the health of a service instance to be periodically tested

Answer 19

The health check might not sufficiently comprehensive or the service instance might fail between health checks and so requests might still be routed to a failed service instance

Answer 20

Service registry - the service registry invokes the health check endpoint

Answer 21

You have applied the Microservice architecture pattern.

Answer 22

How to understand the behavior of an application and troubleshoot problems?

Answer 23

It useful to see when deployments and other changes occur since issues usually occur immediately after a change

Answer 24

Log every deployment and every change to the (production) environment.

Answer 25

Enables deployments and changes to be easily correlated with issues leading to faster resolution.

Observability patterns Flashcards

(49 cards)