Amazon EMR is a web service that makes it easy to process large amounts of data efficiently. Amazon EMR uses Hadoop processing combined with several AWS products to do such tasks as web indexing, data mining, log file analysis, machine learning, scientific simulation, and data warehousing.

Management Guide
Describes key concepts of Amazon EMR and provides instructions for using the platform.
HTML | PDF


API Reference
Describes the Amazon EMR API operations, including sample requests, responses, and errors for the supported web services protocols.
HTML | PDF

  Release Guide
Provides information about Amazon EMR releases, including installed cluster software such as Hadoop and Spark.
HTML | PDF
Developer Guide (Releases 2.x and 3.x)
Provides a conceptual overview of Amazon EMR and includes detailed development instructions for using the various features of releases 2.x and 3.x.
PDF
   

For older versions of this documentation: