Apache Hadoop: Data Processing and Storage For Big Data
Linux/Unix
Linux/Unix
Product Overview
Apache Hadoop is an open source, Java-based software platform that manages data processing and storage for big data applications. The platform works by distributing Hadoop big data and analytics jobs across nodes in a computing cluster, breaking them down into smaller workloads that can be run in parallel. Hadoop processes structured and unstructured data and scale up reliably from a single server to thousands of machines.