Troubleshooting and Maintenance Flashcards
Q: What is AWS CloudWatch used for in troubleshooting?
A: It monitors AWS resources and applications, providing metrics, logs, and alarms for performance and operational insights.
Q: How does AWS CloudTrail help with troubleshooting?
A: By providing a detailed record of API calls and account activity for auditing and debugging.
Q: What are the common steps for troubleshooting in AWS?
- Identify the issue (logs, metrics).
- Analyze recent changes.
- Check configurations.
- Review network connectivity.
- Test solutions incrementally.
Q: What are the steps to troubleshoot EC2 connectivity issues?
- Verify security group and NACL rules.
- Check the instance’s public/private IP.
- Confirm route table configurations.
- Test network paths using ping and traceroute.
Q: How do you troubleshoot VPC network connectivity issues?
- Verify route tables, NACLs, and security groups.
- Check VPC peering or Transit Gateway configurations.
- Analyze VPC Flow Logs for dropped traffic.
Q: How do you troubleshoot AWS Lambda function errors?
- Check the execution logs in CloudWatch Logs.
- Verify IAM roles and permissions.
- Review the function timeout and memory settings.
- Analyze error messages and stack traces.
Q: How do you troubleshoot “Access Denied” errors in S3?
- Check bucket policies and IAM user permissions.
- Verify object ACLs.
- Ensure requests are signed with valid credentials.
- Enable and review S3 access logs.
Q: What are common causes of RDS connectivity issues?
- Incorrect security group settings.
- Misconfigured subnet or route table.
- Database not publicly accessible.
- Exceeding max connections limit.
Q: How do you resolve IAM permission issues?
- Use the IAM Policy Simulator to test policies.
- Check for conflicting explicit denies in policies.
- Verify trust relationships for cross-account roles.
Q: How do you troubleshoot autoscaling issues?
- Check scaling policies and alarms.
- Verify instance launch templates/configurations.
- Analyze failed lifecycle hooks or insufficient capacity errors.
Q: What are common steps to troubleshoot ELB issues?
- Verify target group health checks.
- Check listener and rule configurations.
- Analyze access logs for errors.
- Ensure security group rules allow ELB traffic.
Q: How do you troubleshoot DNS issues in Route 53?
- Verify DNS record configurations.
- Test DNS resolution using nslookup or dig.
- Check health check status for failover records.
- Confirm TTL values for propagation delays.
Q: How do you troubleshoot CloudFormation stack errors?
- Review the stack events for error messages.
- Validate templates with the AWS CLI.
- Check resource quotas for exceeded limits.
- Resolve dependency errors in resources.
Q: How do you address application performance issues?
- Monitor metrics in CloudWatch (e.g., CPU, memory).
- Scale resources using Auto Scaling or Spot Instances.
- Optimize database queries and indexing.
- Use caching solutions like ElastiCache.
Q: What is AWS Trusted Advisor?
A: A tool that provides recommendations for cost optimization, security, fault tolerance, performance, and service limits.