Overview
WHAT YOU'LL LEARN
- Apache Hadoop in the context of Amazon EMR
- The architecture of an Amazon EMR cluster
- Launch an Amazon EMR cluster using an appropriate Amazon Machine Image and Amazon EC2 instance types
- Appropriate AWS data storage options for use with Amazon EMR
- Ingesting, transferring, and compressing data for use with Amazon EMR
- Use common programming frameworks available for Amazon EMR including Hive, Pig, and Streaming
- Work with Amazon Redshift to implement a big data solution
- Leverage big data visualization software
- Appropriate security options for Amazon EMR and your data
- Perform in-memory data analysis with Spark and Shark on Amazon EMR
- Options to manage your Amazon EMR environment cost-effectively
- Benefits of using Amazon Kinesis for big data
CALENDAR
Check out our Big Data on AWS schedule of upcoming sessions delivery dates for this course.
PREREQUISITES
We recommend that attendees of this course have the following prerequisites:
- Familiarity with big data technologies, including Apache Hadoop and HDFS
- Knowledge of big data technologies such as Pig, Hive, and MapReduce is helpful but not required
- Working knowledge of core AWS services and public cloud implementation
- Students should complete the AWS Essentials course or have equivalent experience
- Basic understanding of data warehousing, relational database systems, and database design
RELATED CERTIFICATIONS
AWS Certified Data Analytics Specialty
FOLLOW-ON COURSES
Depending on your role, some of the following specialty trainings may help you to deepen your knowledge and skills in the field:
Sold by | Global Knowledge France |
Categories | |
Fulfillment method | Professional Services |
Pricing Information
This service is priced based on the scope of your request. Please contact seller for pricing details.
Support
Need more information ? Contact us : info@globalknowledge.fr | +33 (0)1 78 15 34 00