Amazon EMR | Debugging Flashcards
How quickly does Amazon EMR retire support for old Hadoop versions?
Debugging
Amazon EMR | Analytics
The Amazon EMR service retires support for old Hadoop versions several months after deprecation. However, the Amazon EMR APIs are backward compatible, so tools you build on top of these APIs will continue to work even when Amazon EMR updates the Hadoop version it uses.
How can I debug my cluster?
Debugging
Amazon EMR | Analytics
First select the cluster you want to debug, then click the “Debug” button to open the debug a cluster window in the AWS Management Console. This window lets you track progress and identify issues in the steps, jobs, tasks, or task attempts of your clusters. Alternatively, you can SSH directly into the Amazon Elastic Compute Cloud (Amazon EC2) instances that are running your cluster and use your favorite command-line debugger to troubleshoot the cluster.
What is the cluster debug tool?
Debugging
Amazon EMR | Analytics
The cluster debug tool is a part of the AWS Management Console where you can track progress and identify issues in steps, jobs, tasks, or task attempts of your clusters. To access the cluster debug tool, first select the cluster you want to debug and then click on the “Debug” button.
How can I enable debugging of my cluster?
Debugging
Amazon EMR | Analytics
To enable debugging, set the “Enable Debugging” flag when you create a cluster in the AWS Management Console. Alternatively, pass the --enable-debugging and --log-uri flags to the Command Line Client when creating a cluster.
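As a rough illustration of the two flags working together, the sketch below assembles a cluster-creation command in Python without running it. It assumes the AWS CLI (`aws emr create-cluster`) rather than the older Command Line Client the answer refers to; the helper name `build_create_cluster_command`, the bucket, the release label, and the instance settings are all hypothetical placeholders.

```python
import shlex


def build_create_cluster_command(name, log_uri,
                                 release_label="emr-5.36.0",  # placeholder
                                 instance_type="m5.xlarge",   # placeholder
                                 instance_count=3):           # placeholder
    """Return an argument list for creating a cluster with debugging enabled.

    Hypothetical helper: it only builds the command, it does not run it.
    """
    # Debugging depends on logs being written somewhere, so the log URI
    # is checked up front.
    if not log_uri.startswith("s3://"):
        raise ValueError("log_uri should be an Amazon S3 location (s3://...)")
    return [
        "aws", "emr", "create-cluster",
        "--name", name,
        "--release-label", release_label,
        "--instance-type", instance_type,
        "--instance-count", str(instance_count),
        "--use-default-roles",
        "--enable-debugging",    # turn on the cluster debug tool
        "--log-uri", log_uri,    # where the cluster's logs are stored
    ]


if __name__ == "__main__":
    cmd = build_create_cluster_command("debug-demo", "s3://example-bucket/logs/")
    # Print a shell-safe version of the command for review before running it.
    print(shlex.join(cmd))
```

Building the command as a list keeps each flag and value separate, so the result can be inspected or handed to `subprocess.run` once the placeholders are replaced with real values.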
Where can I find instructions on how to use the debug a cluster window?
Debugging
Amazon EMR | Analytics
Refer to the AWS Management Console section of the Developer’s Guide for instructions on how to access and use the debug a cluster window.
What types of clusters can I debug with the debug a cluster window?
Debugging
Amazon EMR | Analytics
You can debug all types of clusters currently supported by Amazon EMR, including custom JAR, streaming, Hive, and Pig.
Why do I have to sign up for Amazon SimpleDB to use cluster debugging?
Debugging
Amazon EMR | Analytics
Amazon EMR stores state information about Hadoop jobs, tasks, and task attempts under your account in Amazon SimpleDB. To use this data, your account must be subscribed to Amazon SimpleDB.
Can I use the cluster debugging feature without an Amazon SimpleDB subscription?
Debugging
Amazon EMR | Analytics
You can still browse cluster steps and step logs, but you cannot browse Hadoop jobs, tasks, or task attempts unless you are subscribed to Amazon SimpleDB.
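The split described in this answer can be summarized as a small lookup. This is only a study aid that restates the answer in Python; the function and constant names are hypothetical and do not correspond to any Amazon EMR API.

```python
# Items the answer says are always browsable in the debug tool.
ALWAYS_BROWSABLE = ("steps", "step logs")
# Items whose state is stored in Amazon SimpleDB.
REQUIRES_SIMPLEDB = ("Hadoop jobs", "tasks", "task attempts")


def browsable_items(simpledb_subscribed):
    """Return what can be browsed, given the SimpleDB subscription state."""
    if simpledb_subscribed:
        return ALWAYS_BROWSABLE + REQUIRES_SIMPLEDB
    return ALWAYS_BROWSABLE


if __name__ == "__main__":
    print(browsable_items(False))
    print(browsable_items(True))
```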