AWS Partner Network (APN) Blog

Tag: MapReduce

Mactores-AWS-Partners

Lower TCO and Increase Query Performance by Running Hive on Spark in Amazon EMR

Learn how Mactores helped Seagate Technology to use Apache Hive on Apache Spark for queries larger than 10TB, combined with the use of transient Amazon EMR clusters leveraging Amazon EC2 Spot Instances. It was imperative for Seagate to have systems in place to ensure the cost of collecting, storing, and processing data did not exceed their ROI. Moving to Hive on Spark enabled Seagate to continue processing petabytes of data at scale with significantly lower TCO.

Machine Learning-4

How to Use Amazon SageMaker to Improve Machine Learning Models for Data Analysis

Amazon SageMaker provides all the components needed for machine learning in a single toolset. This allows ML models to get to production faster with much less effort and at lower cost. Learn about the data modeling process used by BizCloud Experts and the results they achieved for Neiman Marcus. Amazon SageMaker was employed to help develop and train ML algorithms for recommendation, personalization, and forecasting models that Neiman Marcus uses for data analysis and customer insights.