Using AWS, it's never been easier or more affordable to solve business problems and uncover new opportunities using data. Now, businesses of all sizes and across all industries can take advantage of big data technologies and easily collect, store, process, analyze, and share their data.

To help you get started, check out the following qualified big data partner solutions, validated by the AWS Partner Competency Program to help you build a complete big data solution. 

Reduce the effort to extract, transform, and load data and to manage data processing workflows.


Any input, quick integration, and manage schema changes

Alooma is a real-time data pipeline as a service. With Alooma you can integrate any data source, such as databases, applications, and APIs, with your own Amazon Redshift.


Empower analysts with data blending in Amazon Redshift leading to insights in hours, not weeks.

Alteryx offers fast in-database blending of multiple, disparate, and high volume data sources. Cut your data prep time by 80%, all through an easy, intuitive drag and drop workflow.


High-performance and easy-to-use, intuitive user interfaces.

Attunity CloudBeam enables organizations to simplify, accelerate, and automate data transfer to, from, and across AWS regions. Supported services include Amazon Redshift, RDS, S3, and EC2.


Enterprise Grade Data Replication & ELT Integration for Amazon Redshift

Bryte empowers users to load, integrate and blend data with just a few clicks with a comprehensive software suite featuring Enterprise Grade Data Replication and ELT / ETL integration technologies. Supported AWS services include RDS, EC2, S3, Redshift, Kinesis and Hadoop on Amazon EMR.


CloudBasic offers a cross-region and cross-cloud replication engine for all versions of SQL Server.

With CloudBasic, S3 BI data lakes, Redshift integrations, and data migrations can be set up in minutes. Clustered deployments with fast failover are also available. CloudBasic specializes in the development of enterprise multi-cloud AlwaysOn/Geo-Replicated relational database technologies focused on MS SQL Server. CloudBasic customers have implemented solutions for DR, high availability, BI, data warehousing, and reporting.


Etleap is an ETL solution that lets you create perfect data pipelines from day one.  

Set up data pipelines and transformations (ETL) from many sources into Redshift in less than 10 minutes, without writing any code. It's the powerful and analyst-friendly ETL solution for Redshift.

Informatica Cloud

Accelerate and scale high-performance data management for AWS, with ease and confidence.

Informatica Intelligent Data Platform accelerates AWS data management for data warehouse, data lake, data migration and data cataloging with Redshift, S3, RDS/Aurora and EMR, leveraging ETL/ELT, 100s of connectors, templates and intuitive development.


ironSource allows the customer to collect any data type from any location with our multi-region deployment. 

Atom is an infinitely scalable big data flow management solution that transfers your logs to Redshift in a secure environment, with multiple backups along the way. Atom handles all your data, regardless of origin and type, while giving you full control.


Matillion ETL for Redshift. Data management in Amazon Redshift has never been simpler.

Simplify data loading, transformation and orchestration in Amazon Redshift. Matillion delivers a modern ETL/ELT solution built specifically for Amazon Redshift, available on AWS Marketplace. Integrates with Redshift, S3, RDS and SQS.


Data Self Service platform for intelligently and quickly preparing raw data into clean, contextual, ready-to-use information at scale.

Paxata delivers an interactive, visual, analyst-centric data preparation experience powered by an intelligent and unified set of technologies for comprehensive data integration, data quality, semantic enrichment, collaboration, and governance.


The SnapLogic Enterprise Integration Cloud empowers enterprises to make informed decisions by visualizing large volumes of data and uncovering trends, patterns, and behavior.

SnapLogic’s platform-agnostic approach supports a distributed, web-oriented architecture for large datasets on-premises, in the cloud, or both – giving maximum visibility to Spark, Hadoop, ETL, etc.


Automate AWS integration, and execute native Spark code using 900+ connectors and components from Talend

Natively supporting big data, Talend’s open and unified solutions take the complexity out of any integration project and equip IT to be more responsive to the demands of the business, at a predictable cost.


Xplenty is a data delivery platform that allows organizations to easily integrate, transform and process data from all of their major sources.

Xplenty is a platform that simply gets your data delivered and frees you from the hassle of traditional data integration.

Store, manage, process, and analyze different types of datasets. 

C3 IoT

The C3 IoT Platform is currently in large-scale production at over 20 leading enterprises in North America, Europe, and Australia.

The C3 IoT Platform leverages the full power of AWS for the rapid design, development, deployment, and operation of next generation IoT and big data SaaS applications – applying AI at scale across a multiplicity of data sources to generate and manifest predictive insights in real time.


Cloudera delivers the modern platform for machine learning and advanced analytics built on the latest open source technologies.

The world’s leading organizations trust Cloudera to help solve their most challenging business problems by efficiently capturing, storing, processing and analyzing vast amounts of data.


Digital transformation is being driven by evolving customer expectations for extraordinary experiences with the brands with which they choose to interact.

Couchbase is a full-featured Engagement Database. Built on the most powerful NoSQL technology, Couchbase Server gives you the flexibility to constantly reinvent the customer experience.


Dataguise provides visibility into all of an enterprise's sensitive data, whether structured, semi-structured, or unstructured. Using this metadata, DgSecure can precisely mask and encrypt sensitive data and monitor it for unauthorized access.

Dataguise aims to enable secure business execution for data-driven enterprises by delivering data-centric security solutions that detect and protect sensitive data assets in real time, wherever they live and move across all repositories.


MemSQL is a real-time data warehouse designed for cloud and on-premises that delivers immediate insights across your live and historical data.

MemSQL provides an adaptable database for real-time applications that require transactions and analytics in a single high performance platform. The distributed solution uses scalable SQL to enable real-time analytics required of modern applications from enterprises like Uber, Kellogg's, Dell/EMC, Comcast, and more.


Real-time insights from operational workloads on the leading NoSQL database.

MongoDB's expressive query language, aggregation framework, and native support for MapReduce allow users to extract near real-time business insights from their data. Both editions of MongoDB are available on AWS.


Panoply is a Smart Data Warehouse built for the cloud, using Redshift. Panoply delivers the industry’s fastest time to insights by eliminating the development and coding typically associated with transforming, integrating, and managing big data.


Eliminate the divide between data transactions and analytics in a single, modern architecture.

SAP HANA One accelerates transactional processing, operational reporting, OLAP, and predictive and text analysis, streamlining both transactional (OLTP) and analytical (OLAP) processing by working with a single copy of the data in an in-memory columnar data store.


Snowflake handles diverse data and analytics at any scale of data, workloads, and concurrency--without the cost and complexity of alternatives.

Snowflake provides a data warehouse built from the cloud up for today’s data and analytics. Uniquely architected for cloud, Snowflake brings together the flexibility of big data, the elasticity of the cloud, and the power of SQL in a single system.


Harness the power and freedom of Teradata software in AWS. Analyze anything, deploy anywhere, buy any way, and move anytime.

Teradata enables powerful hybrid analytical ecosystems with 100% software consistency across all deployment modes. Leverage your existing investments, bring workload portability to life, and analyze any kind of data wherever it resides – at scale.

Treasure Data

Collect, store and analyze your data with our plug and play analytics infrastructure in the cloud.

Reduce complexity and confidently scale your analytics infrastructure. With 300+ connectors, collect all of your data into our data lake without worrying about schema and efficiently load into Redshift.


Delivering Enterprise-Class Big Data Analytics on AWS for Any Size Organization

Vertica maximizes cloud economics for mission-critical big data analytical initiatives by delivering blazingly high-performance query access and elastic scalability for rapid deployments on AWS. Packed with the most comprehensive set of features and deep integration with S3, Vertica manages massive amounts of data quickly and reliably to give you fast analytical insight. With Vertica, you can perform queries much faster than other analytical databases, without breaking your budget.


Zaloni (ZDP) is a big data management, governance and self-service platform that operationalizes the data lake, catalogs metadata, orchestrates workflows and eliminates data silos for centralized management of all data sources to improve business insights. ZDP provides control throughout the data pipeline from ingestion to analytics, with data management, governance and self-service data preparation.

Sift through large datasets to uncover hidden patterns, correlations and other insights. 


Higher productivity for data teams, faster deployment of data pipelines, and democratized data access.

Databricks offers a cloud platform powered by Spark that makes it easy to turn data into value, from ingest to production, without the hassle of managing complex infrastructure, systems and tools.


Deloitte Analytics uses many technology solutions and tools to enable data analytics, including big data.

Many of the world’s leading businesses count on Deloitte to deliver powerful outcomes, not just insights, for their toughest challenges. The Deloitte Analytics practice is built around a wide range of client needs.


Qubole greatly simplifies, speeds, and scales big data analytics workloads.

Qubole is a big data processing service, with offerings for MapReduce, Hive, Spark, Pig, and Presto. Qubole provides a web UI and programmatic access via SDKs to run advanced analysis using Hadoop-based technologies with little to no setup time.


Splunk software scales to collect and index hundreds of terabytes of data per day, across multi-geography, multi-datacenter and hybrid cloud infrastructures. Because the insights from your data are mission critical, Splunk software provides the resilience you need, even as you scale out your low-cost, distributed computing environment.

Splunk software helps you unlock the hidden value of this data. And with the ability to bring in insights from your other tools, you can get value from the full spectrum of your data, not just a subset. Now you can collect, index, search, analyze and visualize all your data in one place. Splunk provides a unified way to organize and extract real-time insights from massive amounts of machine data from virtually any source.


Secure and scalable machine data analytics to help you build, run and secure your modern applications.

With Sumo Logic, you're up and running in minutes with an AWS-native advanced analytics platform powered by machine learning, discovering meaningful patterns in your log data and detecting performance, usage and security anomalies for faster issue resolutions.

Turn data into actionable insights via reports, charts, and graphs.


Stop spending time fulfilling data requests. Empower everyone in your company to explore their data without relying on the data team.

Quickly and easily visualize your Amazon Redshift data alongside all your business data with an interactive interface everyone in your company can use. Chartio is a powerful data analytics platform used by companies like Optimizely and Rackspace to help everyone query and analyze their data.


Domo is a business management platform that puts real-time data in the hands of everyone in the organization - from CEOs and business leaders to front-line employees.

With more than 400 pre-built Domo connectors (including Redshift, Athena and S3), anyone can securely access, consume and interact with the data they need to optimize business performance. Domo, built for the cloud, web and mobile, helps foster collaboration, improve insights, speed decision making, increase organizational productivity and, most importantly, optimize business results.


Looker was architected from the cloud up to take advantage of the new kinds of powerful analytic data stores.

Looker opens up a new way to think about business intelligence in the cloud. A modern approach to BI that's fast, agile, and easy to manage, Looker leverages your AWS infrastructure without needing to move or unsecure your data.


Enterprise-grade business intelligence - Full range of analytic functionality.

Get the full MicroStrategy software suite, including mobile, web, architect and all of the admin tools. Each MicroStrategy subscription includes standard technical support, which covers email and phone support as well as access to the MicroStrategy knowledge base and community forums.


Periscope Data brings data science and advanced analytics to the world of BI.

Built on AWS, Periscope Data allows organizations to gain deeper, more actionable insights across the business, enabling both data teams and their business stakeholders. Ingest, store, analyze, visualize, and report on data in one unified platform.


Embed analytics. Surface answers. Make happy customers.

Jaspersoft for AWS is a cloud reporting and analytics server that can be purchased and/or deployed on AWS. It is fully equipped to run standalone but is more commonly embedded into web and mobile-web applications. Automatically detect and connect to Amazon data sources like RDS, Redshift and EMR.

Tableau Software

All you need is your data and the questions you want to answer.

Tableau Software makes it fast and easy to create beautiful analytics from virtually any source of data (Redshift, EMR, and RDS being some of our most popular). Tableau is a natural fit for organizations that are looking to deploy with lightning speed.


WingArc’s user friendly dashboard enables creation of charts from different data sources into a single customizable board.

MotionBoard is a BI dashboard solution that helps visualize data from various sources, including databases, Excel, and the cloud. Users can see the real-time state of their business wherever they are, connecting the art and science of their business decisions.


Zoomdata is the fastest visual analytics for big data.

Designed for the cloud, Zoomdata’s microservices architecture and Data Sharpening™ technology deliver visual analytics of big datasets in seconds for real-time streaming and historical data, without the need to move or transform data.

Learn more about qualified consulting partners with AWS big data expertise in your region.

APN Partners interested in listing their Big Data product or solution must have achieved the Big Data Competency through the AWS Competency Program.

To learn more about the Competency Program and apply for the Big Data Competency, click here »

Note: All solutions on the Big Data Partner Solutions webpages are created, sold, and implemented by the third party.
