Monitoring and Logging Flashcards
What is Azure Monitor?
Azure Monitor aggregates and correlates data across multiple Azure subscriptions and tenants, in addition to hosting data for other services. Because this data is stored together, it can be correlated and analyzed using a common set of tools.
What is the structure of Azure Monitor in regards to services provided?
Give some examples of each service provided
- Insights (Apps, containers, VM, Monitoring solutions)
- Visualization (Dashboards, Views, Power BI, Webhooks)
- Analyze (Metric analysis, log analytics)
- Response (alerts and autoscale)
- Integration (Logic Apps, Export APIs)
Two fundelmental types of data used by Azure Monitor
- Metrics
- Logs
Kinds of data collected
- Data from applications (performance and functionality)
- Data about the OS
- Data about the operation of an Azure Resourse
- Operation of Active Directory at the tenant level
- Azure Monitor can also collection data through an API from resource on-premises
What is Application Insights?
Application Insights is an extensible application performance management (APM) service for developers and DevOps professionals.
What are the capabilities of Application Insights?
- Check performance of server machines like CPU and memory usages
- Detect thrown exceptions in the application source code
- Add custom events and metrics in the client or server code, to track business events
- Collect request rates, response times, and failure rates
- Collection page views and load performance reported by user’s browser
Also supports Mobile and Desktop Apps Support (App Center Diagnostics)
How do I see telemetry from my applications?
- Smart detection (automatically warns yoy of potentional performance problems and failures)
- Application Map
- Live Metrics
- Failures
What are some monitoring availability tools
- Web Test (URL Ping test, Custom Track Availabilty Test, and Multi-step web test)
- Alerts and Action Groups
An Action Group contains the following properties: Type (notification or action performed), Name (name of group) , Action (email, webhook)
Transient Fault Challenges
What are the challenges for transient faults?
- Application must detect the fault
- Application must retry the operation
- Application MUST use the approperiate retry policy including the number of retries
Transiet Fault Challenges
What retry policies are there?
Polly, an open-source library
** Retry Policy**
* Send the request again after a specific time
* Wait before sending next request
* Configure retry times and intervals
Circuit Breaker
* Service is unavailable and cannot respond to a request
* Avoid sending request for some time
* When the circuit is opened, no request is end until it is closed again
Timeout
* Set a timeout for waiting for response
* If timeout reached, send a cancellation token and release resources
* Do not use in conjunction with retry policy