Sign in
Categories
Your Saved List Partners Sell in AWS Marketplace Amazon Web Services Home Help

Apache Hadoop 3.3.1 RHEL 7.9 (ENA Enabled) With 24/7x365 Support

Apache Hadoop 3.3.1 RHEL 7.9 (ENA Enabled) With 24/7x365 Support

By: cloudimg Latest Version: v1.0.0
Linux/Unix
Linux/Unix

Product Overview

Apache Hadoop is an open source framework that is used to efficiently store and process large datasets ranging in size from gigabytes to petabytes of data. Instead of using one large computer to store and process the data, Hadoop allows clustering multiple computers to analyze massive datasets in parallel more quickly.

Hadoop consists of the below main modules

Hadoop Distributed File System (HDFS) : A distributed file system that runs on standard or low-end hardware. HDFS provides better data throughput than traditional file systems, in addition to high fault tolerance and native support of large datasets.

Yet Another Resource Negotiator (YARN) : Manages and monitors cluster nodes and resource usage. It schedules jobs and tasks.

MapReduce : A framework that helps programs do the parallel computation on data. The map task takes input data and converts it into a dataset that can be computed in key value pairs.

Hadoop Common : Provides common Java libraries that can be used across all modules.

Version

v1.0.0

Operating System

Linux/Unix, Red Hat Enterprise Linux 7.9

Delivery Methods

  • Amazon Machine Image

Pricing Information

Usage Information

Support Information

Customer Reviews