AWS Data Pipeline | Getting Started Flashcards
How do I install a Task Runner on my on-premises hosts?
Getting Started
AWS Data Pipeline | Analytics
You can install the Task Runner package on your on-premises hosts using the following steps:
1. Download the AWS Task Runner package.
2. Create a configuration file that includes your AWS credentials.
3. Start the Task Runner agent via the following command:
java -jar TaskRunner-1.0.jar --config ~/credentials.json --workerGroup=[myWorkerGroup]
When defining activities, set the activity to run on [myWorkerGroup] in order to dispatch them to the previously installed hosts.
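As a minimal sketch of that last step, the Python snippet below builds a pipeline definition fragment in which one activity is routed to the worker group. The object id `MyOnPremActivity`, the `echo hello` command, and the helper name `activity_for_worker_group` are hypothetical. The `objects` list, the `ShellCommandActivity` type, and the `workerGroup` field are assumed from the pipeline definition file syntax, so verify them against the current documentation before relying on them.

```python
import json


def activity_for_worker_group(activity_id, command, worker_group):
    """Return a pipeline object that dispatches a shell command to a worker group."""
    return {
        "id": activity_id,  # hypothetical id, chosen for illustration
        "type": "ShellCommandActivity",
        "command": command,
        # Assumed to match the --workerGroup value passed to Task Runner.
        "workerGroup": worker_group,
    }


definition = {
    "objects": [
        activity_for_worker_group("MyOnPremActivity", "echo hello", "myWorkerGroup")
    ]
}

# Emit the fragment as JSON, the format used by pipeline definition files.
print(json.dumps(definition, indent=2))
```

This is only a fragment: a complete definition would normally also carry a schedule and any other objects the pipeline needs.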
How can I get started with AWS Data Pipeline?
To get started with AWS Data Pipeline, simply visit the AWS Management Console and go to the AWS Data Pipeline tab. From there, you can create a pipeline using a simple graphical editor.
What can I do with AWS Data Pipeline?
With AWS Data Pipeline, you can schedule and manage periodic data-processing jobs. You can use it to replace simple systems that are currently managed by brittle, cron-based solutions, or to build complex, multi-stage data-processing jobs.
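To make the cron-replacement and multi-stage ideas concrete, here is a hedged Python sketch that assembles a definition fragment with one daily schedule and two dependent stages. The ids `DailySchedule`, `Extract`, and `Load`, the script names, the start date, and the helper `stage` are all hypothetical placeholders. The `Schedule` type and the `period`, `startDateTime`, `schedule`, and `dependsOn` fields are assumed from the pipeline definition file syntax and should be checked against the documentation.

```python
import json

# A daily schedule, standing in for a cron entry (start date is a placeholder).
schedule = {
    "id": "DailySchedule",
    "type": "Schedule",
    "period": "1 day",
    "startDateTime": "2024-01-01T00:00:00",
}


def stage(stage_id, command, depends_on=None):
    """Build one activity on the shared schedule, optionally after another stage."""
    obj = {
        "id": stage_id,
        "type": "ShellCommandActivity",
        "command": command,  # hypothetical script name
        "schedule": {"ref": schedule["id"]},
    }
    if depends_on is not None:
        # Run this stage only after the referenced stage has finished.
        obj["dependsOn"] = {"ref": depends_on}
    return obj


pipeline = {
    "objects": [
        schedule,
        stage("Extract", "./extract.sh"),
        stage("Load", "./load.sh", depends_on="Extract"),
    ]
}

print(json.dumps(pipeline, indent=2))
```

The fragment omits where each activity runs (for example, a worker group), which a real definition would need to specify.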