Cloud Monitoring Section Flashcards

1
Q

Amazon CloudWatch Metrics

A
  • Provides metrics with timestamps for every AWS service

* Can create CloudWatch dashboards of metrics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Metrics in AWS

A

• EC2 instances: CPU Utilization, Status Checks, Network (not RAM)
Default metrics every 5 minutes
Option for Detailed Monitoring ($$$): metrics every 1 minute
• EBS volumes: Disk Read/Writes
• S3 buckets: BucketSizeBytes, NumberOfObjects, AllRequests
• Billing:Total Estimated Charge (only in us-east-1)
• Service Limits: how much you’ve been using a service API
• Custom metrics: push your own metrics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Amazon CloudWatch Alarms

A

• Alarms are used to trigger notifications for any metric

Alarms actions:
• Auto Scaling: increase or decrease EC2 instances “desired” count
• EC2 Actions: stop, terminate, reboot or recover an EC2 instance
• SNS notifications: send a notification into an SNS topic

• Alarm States: OK. INSUFFICIENT_DATA, ALARM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Amazon CloudWatch Logs

A

Can be collected from:
• Elastic Beanstalk: collection of logs from application
• ECS: collection from containers
• AWS Lambda: collection from function logs
• CloudTrail based on filter
• CloudWatch log agents: on EC2 machines or on-premises servers
• Route53: Log DNS queries

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

CloudWatch Logs on EC2

A
  • By default, no logs from your EC2 instance will go to CloudWatch
  • You need to run a CloudWatch agent on EC2 to push the log files you want
  • Make sure IAM permissions are correct
  • The CloudWatch log agent can be setup on-premises too
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Amazon CloudWatch Events

A
  • Schedule: Cron jobs (scheduled scripts)
  • Event Pattern: Event rules to react to a service doing something
  • Trigger Lambda functions, send SQS/SNS messages…
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Amazon EventBridge

A
  • EventBridge is the next evolution of CloudWatch Events
  • Default event bus: generated by AWS services (CloudWatch Events)
  • Partner event bus: receive events from SaaS service or applications (Zendesk, DataDog, Segment, Auth0…)
  • Custom Event buses: for your own applications
  • Schema Registry: model event schema
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

AWS CloudTrail

A
  • Provides governance, compliance and audit for your AWS Account
  • CloudTrail is enabled by default!
  • Get an history of events / API calls made within your AWS Account by:
  • Console
  • SDK
  • CLI
  • AWS Services
  • Can put logs from CloudTrail into CloudWatch Logs or S3
  • A trail can be applied to All Regions (default) or a single Region
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

CloudTrail Events - Management Events

A

• Operations that are performed on resources in your AWS account

Examples:
• Configuring security (IAM AttachRolePolicy)
• Configuring rules for routing data (Amazon EC2 CreateSubnet)
• Setting up logging (AWS CloudTrail CreateTrail)

  • By default, trails are configured to log management events.
  • Can separate Read Events (that don’t modify resources) from Write Events (that may modify resources)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

CloudTrail Events - Data Events

A
  • By default, data events are not logged (because high volume operations)
  • Amazon S3 object-level activity (ex: GetObject, DeleteObject, PutObject): can separate Read and Write Events
  • AWS Lambda function execution activity (the Invoke API)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

CloudTrail Events - Cloudtrail Insights Events

A
Enable CloudTrail Insights to detect unusual activity in your account:
• inaccurate resource provisioning 
• hitting service limits
• Bursts of AWS IAM actions
• Gaps in periodic maintenance activity

• CloudTrail Insights analyzes normal management events to create a baseline

  • And then continuously analyzes write events to detect unusual patterns:
  • Anomalies appear in the CloudTrail console
  • Event is sent to Amazon S3
  • An EventBridge event is generated (for automation needs)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

CloudTrail Events Retention

A
  • Events are stored for 90 days in CloudTrail

* To keep events beyond this period, log them to S3 and use Athena

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

AWS X-Ray

A
  • Visual analysis of application
  • Troubleshooting performance (bottlenecks)
  • Understand dependencies in a microservice architecture
  • Pinpoint service issues
  • Review request behavior
  • Find errors and exceptions
  • Are we meeting time SLA?
  • Where I am throttled?
  • Identify users that are impacted
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Amazon CodeGuru

A

• An ML-powered service for automated code reviews and application performance recommendations

Provides two functionalities:
• CodeGuru Reviewer: automated code reviews for static code analysis (development)
• CodeGuru Profiler: visibility/recommendations about application performance during
runtime (production)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Amazon CodeGuru Reviewer

A

• Identify critical issues, security vulnerabilities, and hard-to-find bugs
• Example: common coding best practices, resource leaks, security detection, input
validation
• Uses Machine Learning and automated reasoning
• Hard-learned lessons across millions of code reviews on 1000s of open-source
and Amazon repositories
• Supports Java and Python
• Integrates with GitHub, Bitbucket, and AWS CodeCommit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Amazon CodeGuru Profiler

A
  • Helps understand the runtime behavior of your application
  • Example: identify if your application is consuming excessive CPU capacity on a logging routine

Features:
• Identify and remove code inefficiencies
• Improve application performance (e.g., reduce CPU utilization)
• Decrease compute costs
• Provides heap summary (identify which objects using up memory)
• Anomaly Detection

  • Support applications running on AWS or on- premise
  • Minimal overhead on application
17
Q

AWS Status - Service Health Dashboard

A
  • Shows all regions, all services health
  • Shows historical information for each day
  • Has an RSS feed you can subscribe to
18
Q

AWS Personal Health Dashboard

A

• AWS Personal Health Dashboard provides alerts and remediation guidance when AWS is experiencing events that may impact you
personalized
• view into the performance and availability of the AWS services underlying your AWS resources
• proactive notification to help you plan for scheduled activities.