Cloud Monitoring Flashcards

1
Q

Amazon CloudWatch

A

Is a monitoring and observability service built

• Metrics: monitor the performance of AWS services and billing metrics
• Alarms: automate notification, perform EC2 action, notify to SNS based on metric
• Logs: collect log files from EC2 instances, servers, Lambda functions…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

CloudWatch Metrics

A

• CloudWatch provides metrics for every services in AWS
• Metric is a variable to monitor
• Metrics have timestamps
• Can create CloudWatch dashboards of metrics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Important CloudWatch Metrics

A

• EC2 instances: CPU Utilization, Status Checks, Network (not RAM)
• EBS volumes: Disk Read/Writes
• S3 buckets: BucketSizeBytes, NumberOfObjects, AllRequests
• Billing: Total Estimated Charge (only in us-east-1)
• Service Limits: how much you’ve been using a service API
• Custom metrics: push your own metrics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

CloudWatch Alarms

A

Alarms are used to trigger notifications for any metric
• Can choose the period on which to evaluate an alarm
• Alarm States: OK. INSUFFICIENT_DATA, ALARM

Alarms actions…
• Auto Scaling: increase or decrease EC2 instances “desired” count
• EC2 Actions: stop, terminate, reboot or recover an EC2 instance
• SNS notifications: send a notification into an SNS topic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

CloudWatch Logs (Hybrid)

A

Enables real-time monitoring of logs
• Adjustable CloudWatch Logs retention

• CloudWatch Logs can collect log from:
• Elastic Beanstalk: collection of logs from application
• ECS: collection from containers
• AWS Lambda: collection from function logs
• CloudTrail based on filter
• CloudWatch log agents: on EC2 machines or on-premises servers
• Route53: Log DNS querie

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Amazon EventBridge (formerly CloudWatch Events)

A

Serverless event bus that makes it easier to build event-driven applications at scale using events generated from your applications

• Schedule: Cron jobs (scheduled scripts)
• Event Pattern: Event rules to react to a service doing something
• Trigger Lambda functions, send SQS/SNS messages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Amazon EventBridge

A

• Schema Registry: model event schema
• You can archive events (all/filter) sent to an event bus (indefinitely or set period)
• Ability to replay archived events

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

AWS CloudTrail

A

Provides governance, compliance and audit for your AWS Account
• Get an history of events / API calls made within your AWS Account
• Can put logs from CloudTrail into CloudWatch Logs or S3
• A trail can be applied to All Regions (default) or a single Region.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

CloudTrail Events

A

• Management Events= Provide information about management operations that are performed on resources, like Registering devices and Configuring security.
• Data Events = Provide information about the resource operations performed on or in a resource, like Amazon S3 object-level API activity and AWS Lambda function execution activity.
• CloudTrail Insights Events

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

CloudTrail Insights

A

Enable CloudTrail Insights to detect unusual activity in your account
• inaccurate resource provisioning
• hitting service limits
• Bursts of AWS IAM actions
• Gaps in periodic maintenance activity

• CloudTrail Insights analyzes normal management events to create a baseline
• And then continuously analyzes write events to detect unusual patterns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

CloudTrail Events Retention (Days)

A

• Events are stored for 90 days in CloudTrail
• To keep events beyond this period, log them to S3 and use Athena

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

AWS X-Ray

A

Helps developers analyze and debug production, distributed applications, such as those built using a microservices architecture.
• Troubleshooting performance (bottlenecks)
• Understand dependencies in a microservice architecture
• Pinpoint service issues • Review request behavior
• Find errors and exceptions
• Are we meeting time SLA? • Where I am throttled?
• Identify users that are impacted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Amazon CodeGuru

A

A Machine Learning powered service for automated code reviews and application performance recommendations

• CodeGuru Reviewer: automated code reviews for static code analysis (development)
• CodeGuru Profiler: visibility/recommendations about application performance during
runtime (production)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Amazon CodeGuru Reviewer

A

Identify critical issues, security vulnerabilities, and hard-to-find bugs

• Integrates with GitHub, Bitbucket, and
AWS CodeCommit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Amazon CodeGuru Profiler

A

Helps understand the runtime behavior of your
application
• Support applications running on AWS or on- premise

Features:
• Identify and remove code inefficiencies
• Improve application performance (e.g., reduce CPU utilization)
• Decrease compute costs
• Provides heap summary (identify which objects using up memory)
• Anomaly Detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

AWS Status - Service Health Dashboard

A

Shows all regions, all services health

• Shows historical information for each day

17
Q

AWS Personal Health Dashboard

A

Provides alerts and remediation guidance when AWS is experiencing events that may impact you.

• Personalized view into the performance and availability of the AWS services underlying your AWS resources
• The dashboard displays relevant and timely information to help you manage events in progress and provides proactive notification to help you plan for scheduled activities.

18
Q

Monitoring Summary

A

• CloudWatch:
-Metrics: monitor the performance of AWS services and billing metrics
-Alarms: automate notification, perform EC2 action, notify to SNS based on metric
-Logs: collect log files from EC2 instances, servers, Lambda functions…
-Events (or EventBridge): react to events in AWS, or trigger a rule on a schedule
• CloudTrail: audit API calls made within your AWS account
• CloudTrail Insights: automated analysis of your CloudTrail Events
• X-Ray: trace requests made through your distributed applications
• Service Health Dashboard: status of all AWS services across all regions
• Personal Health Dashboard: AWS events that impact your infrastructure
• Amazon CodeGuru: automated code reviews and application performance recommendation