l7 monitoring Flashcards

1
Q

White / Black Box Monitoring

A
  • Blackbox Monitoring: Look from outside
    into a “blackbox”
  • Classical User Perspective
  • Source for alerts
  • External View
  • Define Output:
  • Latency
  • Return Values / Return Codes
  • Prober OSI-Layer aware:
  • Level 7: HTTP/s / DNS / gRPC
  • Level 4: TLS / IMAP
  • Level 3: ICMP
    HTTP Response Time
  • Whitebox Monitoring: Transparent look
    into a system:
  • Internal Metrics like CPU Usage
  • Traces
  • Logs
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Monitoring should adress two questions

A
  • What is broken -> Symptom
    Immediate Alerts
  • Why is it broken -> Cause
    Proactive Prevention
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

4 Golden Signals

A

Two monitoring methods related to these signals
* RED: Rate (Traffic), Error, Duration (Latency)
* Application focus, no infrastructure or external systems
* USE: Utilization, Saturation, Errors
* Focus on all resources, even external resources

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Saturation

A

Is the system overutilized or not?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Metric Types

A

Counter
Monotonic Increase: A counter is a cumulative metric that only increases over time. It is used to track the number of occurrences of specific events or actions.
Reset on Restart: Counters typically reset to zero when the system restarts or the application restarts.

Gauge
Arbitrary Values: A gauge is a metric that can go up and down, representing a current value at a specific point in time.
Instantaneous Measurement: It measures the current value of some variable that can fluctuate, such as temperature, memory usage, or active connections.

Histogram:
Buckets: Measures distribution over predefined buckets.
Examples: Response time distribution, event duration distribution.
Use Cases: Analyzing frequency and distribution of values.

Summary:
Quantiles: Provides quantile estimates and total counts/sums.
Examples: Request latency quantiles, performance analysis.
Use Cases: Tracking latency, analyzing performance characteristics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Saving Metrics

A

Time Series Databases
* Cleaning up old data:
- Roll up: Summarizing values with coarser granulariy after given number of days
- Clean Up: Removing old data from the storage
- Archive: Put old data to slower storage
* Continuously capturing data over time
* Data Stream: Data is inserted all the time, Updated are seldom (or not occuring after all)
* Data is time-centric
* All data is timestamped

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

PromQL

A

The storage metrics can then be used to be queried
with an own defined query language: PromQL
* Non-SQL style
* Only read, never update/write
* Vector-based
* Build in operators:
* +, -, and, or, unless, >, <, =~, !=,
* Aggregation timestamps: avg, sum, rate, stddev
Base on Time Range
* Return data for fixed defined timestamp
* Is the base for Dashboards as well as for al

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain scrape config

A

kubernetes_sd_config: * Matches the kind of service discovery including
element to be scraped
relabel_configs: * Modifying the labels with source_labels (which
label is demanded), action (what should happen
with the label?), target_label (label to be
generated)
*Very powerful to enrich monitoring data with
further information like: * namespace, pod, all labels * Adapting urls-suffic (metrics path) * Filtering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly