Prometheus Flashcards

Question

Alert management: silencing

Answer 1

a straightforward way to simply mute alerts for a given time.

Answer 2

1. Counter 2. Gauge 3. Histogram 4. Summary

Answer 3

a cumulative metric that represents a single monotonically increasing counter whose value can only increase or be reset to zero on restart.

Answer 4

a metric that represents a single numerical value that can arbitrarily go up and down.

Answer 5

samples observations (usually things like request durations or response sizes) and counts them in configurable buckets. The buckets are cumulative.

Answer 6

1. _bucket: cumulative counters for the observation buckets 2. _sum: the total sum of all observed values 3. _count: the count of events that have been observed
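The three series above can be sketched in plain Python. This is a minimal illustration, not the client library's implementation; the function name `histogram_series` and the sample bucket bounds are assumptions made up for the example.

```python
import math


def histogram_series(observations, upper_bounds):
    """Return the _bucket, _sum and _count values for a list of observations."""
    # Assumption for this sketch: an implicit +Inf bucket is appended last.
    bounds = sorted(upper_bounds) + [math.inf]
    # Each bucket counts every observation less than or equal to its bound,
    # so the counts are cumulative and never decrease from bucket to bucket.
    buckets = {le: sum(1 for v in observations if v <= le) for le in bounds}
    return {
        "bucket": buckets,
        "sum": sum(observations),
        "count": len(observations),
    }


result = histogram_series([0.05, 0.2, 0.2, 0.7, 3.0], upper_bounds=[0.1, 0.5, 1.0])
for le, n in result["bucket"].items():
    print(f'le="{le}" {n}')
print("sum", result["sum"], "count", result["count"])
```

Note that the +Inf bucket always equals _count, because every observation falls under it.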

Answer 7

1. Similar to a histogram, a summary samples observations 2. It also calculates configurable quantiles over a sliding time window.

Answer 8

1. streaming φ-quantiles (0 ≤ φ ≤ 1) of observed events, exposed as {quantile="<φ>"} 2. the total sum of all observed values, exposed as _sum 3. the count of events that have been observed, exposed as _count

Answer 9

1. an endpoint you can scrape 2. usually corresponding to a single process

Answer 10

A collection of instances with the same purpose, for example a process replicated for scalability or reliability

Answer 11

1. job 2. instance

Answer 12

1. online-serving 2. offline-processing 3. batch jobs

Answer 13

1. one where a human or another system is expecting an immediate response 2. should be monitored on both the client and server side

Answer 14

1. number of performed queries 2. errors 3. latency 4. number of in-progress requests

Answer 15

1. no one is actively waiting for a response, and batching of work is common 2. there may also be multiple stages of processing

Answer 16

For each stage 1. track the items coming in 2. how many are in progress 3. the last time you processed something 4. how many items were sent out 5. track batches going in and out

Answer 17

do not run continuously, which makes scraping them difficult.

Answer 18

1. last time it succeeded 2. how long each major stage of the job took 3. overall runtime and the last time the job completed (successful or failed) 4. overall job-specific statistics (e.g. total number of records processed)

Answer 19

1. Libraries 2. Logging 3. Failures 4. Threadpools 5. Caches 6. Collectors

Answer 20

Every time series is uniquely identified by its metric name and optional key-value pairs called labels. 1. Changing any label value, including adding or removing labels, creates a new time series. 2. Labels with an empty label value are considered equivalent to labels that do not exist.
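The identity rule can be made concrete with a small Python sketch. The helper `series_identity` is hypothetical and only models the rule stated above; it is not how Prometheus stores series internally.

```python
def series_identity(metric_name, labels=None):
    """Build a hashable identity from the metric name and its non-empty labels."""
    labels = labels or {}
    # Labels with an empty value are treated as if they were absent.
    kept = frozenset((k, v) for k, v in labels.items() if v != "")
    return (metric_name, kept)


a = series_identity("http_requests_total", {"method": "post", "code": "200"})
b = series_identity("http_requests_total", {"method": "post", "code": "500"})
c = series_identity("http_requests_total", {"method": "post", "code": "200", "env": ""})

print(a == b)  # a changed label value means a different series
print(a == c)  # an empty label is equivalent to a missing one
```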

Answer 21

1. a float64 value 2. a millisecond-precision timestamp

Answer 22

1. Text-based format 2. OpenMetrics Text Format 3. Protobuf format

Answer 23

1. All lines for a given metric must be provided as one single group, with the optional HELP and TYPE lines first 2. Each line must have a unique combination of a metric name and labels.
Example: http_requests_total{method="post",code="200"} 1027 1395066363000
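As a rough sketch of the grouping rule, the Python below renders one metric as a single group with HELP and TYPE first. The function names `format_sample` and `format_metric` are assumptions for illustration, and escaping of the HELP text is left out for brevity.

```python
def format_sample(name, labels, value, timestamp_ms=None):
    """Render one sample line: name{label="value",...} value [timestamp]."""

    def escape(text):
        # Label values escape backslash, double quote and line feed.
        return text.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")

    label_part = ""
    if labels:
        pairs = ",".join(f'{k}="{escape(v)}"' for k, v in labels.items())
        label_part = "{" + pairs + "}"
    line = f"{name}{label_part} {value}"
    if timestamp_ms is not None:
        line += f" {timestamp_ms}"
    return line


def format_metric(name, metric_type, help_text, samples):
    """Render HELP and TYPE first, then every sample of the metric as one group."""
    lines = [f"# HELP {name} {help_text}", f"# TYPE {name} {metric_type}"]
    lines += [format_sample(name, labels, value, ts) for labels, value, ts in samples]
    return "\n".join(lines) + "\n"


text = format_metric(
    "http_requests_total",
    "counter",
    "The total number of HTTP requests.",
    [
        ({"method": "post", "code": "200"}, 1027, 1395066363000),
        ({"method": "post", "code": "400"}, 3, 1395066363000),
    ],
)
print(text, end="")
```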

Answer 24

1. calculates the difference between the first and last value of each time series element in a range vector 2. should only be used with gauges and native histograms

Answer 25

1. calculates the per-second derivative of the time series in a range vector v, using simple linear regression 2. should only be used with gauges
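The regression step can be sketched as an ordinary least-squares slope over (timestamp, value) pairs. This is a simplified stand-alone illustration with a hypothetical function name, not the PromQL engine's code.

```python
def least_squares_slope(samples):
    """samples: list of (timestamp_seconds, value). Returns the slope per second."""
    n = len(samples)
    if n < 2:
        return None  # a slope needs at least two points
    mean_t = sum(t for t, _ in samples) / n
    mean_v = sum(v for _, v in samples) / n
    covariance = sum((t - mean_t) * (v - mean_v) for t, v in samples)
    variance = sum((t - mean_t) ** 2 for t, _ in samples)
    if variance == 0:
        return None  # all samples share one timestamp
    return covariance / variance


# A gauge rising by 3 every 15 seconds.
gauge = [(0, 10.0), (15, 13.0), (30, 16.0), (45, 19.0)]
print(least_squares_slope(gauge))
```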

Answer 26

1. calculates the per-second instant rate of increase of the time series in the range vector. 2. This is based on the last two data points 3. should only be used when graphing volatile, fast-moving counters

Answer 27

1. calculates the per-second average rate of increase of the time series in the range vector. 2. should only be used with counters and native histograms where the components behave like counters. It is best suited for alerting, and for graphing of slow-moving counters. 3. cannot be used with gauge metrics
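To contrast the average rate with the instant rate based on the last two data points, here is a simplified Python sketch. The names `simple_rate` and `simple_irate` are hypothetical. The sketch treats any drop in a counter as a reset to zero, and it deliberately omits the extrapolation to the window boundaries that Prometheus applies to the average rate.

```python
def increase_with_resets(samples):
    """Sum the increases between consecutive samples of a counter."""
    total = 0.0
    prev = samples[0][1]
    for _, value in samples[1:]:
        # A drop is treated as a reset to zero, so the new value is the increase.
        total += value if value < prev else value - prev
        prev = value
    return total


def simple_rate(samples):
    """Average per-second increase over the whole window (no extrapolation)."""
    if len(samples) < 2:
        return None
    elapsed = samples[-1][0] - samples[0][0]
    if elapsed <= 0:
        return None
    return increase_with_resets(samples) / elapsed


def simple_irate(samples):
    """Per-second increase computed from the last two samples only."""
    if len(samples) < 2:
        return None
    (t0, v0), (t1, v1) = samples[-2], samples[-1]
    if t1 <= t0:
        return None
    increase = v1 if v1 < v0 else v1 - v0
    return increase / (t1 - t0)


# (timestamp_seconds, value) pairs; the counter resets between t=15 and t=30.
counter = [(0, 100.0), (15, 130.0), (30, 10.0), (45, 40.0)]
print(simple_rate(counter), simple_irate(counter))
```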

Answer 28

A set of metrics in an instrumented application that will be returned when scraped

Answer 29

1. Alert labels should be used for metadata that uniquely identifies an alert 2. Annotations should be used for longer form descriptive content of an alert

Answer 30

queries the alerts being evaluated by Prometheus

Answer 31

1. may contain ASCII letters, digits, underscores, and colons 2. must match the regex [a-zA-Z_:][a-zA-Z0-9_:]*
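The regex can be checked directly with Python's `re` module. The helper name `is_valid_metric_name` is an assumption for this sketch; `fullmatch` is used so the whole name must match the pattern.

```python
import re

# The pattern quoted above, compiled once.
METRIC_NAME_RE = re.compile(r"[a-zA-Z_:][a-zA-Z0-9_:]*")


def is_valid_metric_name(name):
    """Return True when the whole name matches the metric name pattern."""
    return METRIC_NAME_RE.fullmatch(name) is not None


for candidate in ["http_requests_total", "job:request_rate:5m", "5xx_total", "http-requests"]:
    print(candidate, is_valid_metric_name(candidate))
```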

Answer 32

1. should have a (single-word) application prefix naming the domain the metric belongs to (prometheus_, process_, http_) 2. should have a suffix describing the unit, in plural form (single base unit) 3. should represent the same logical thing-being-measured across all label dimensions.

(56 cards)