AWS Partner Network (APN) Blog

Tag: Data Pipeline

Cloudera-APN-Blog-121422-1

Building a Serverless Trigger-Based Data Movement Pipeline Using Apache NiFi, DataFlow Functions, and AWS Lambda

Organizations have a wide range of data processing use cases, collecting data from variety of sources, transforming it and loading it to different destinations to fulfill diverse business needs. Learn how DataFlow Functions, combined with the serverless compute services provided by AWS Lambda, enables developers to implement a wide spectrum of use cases using the low-code NiFi flow designer user interface, and deploy the flows as short-lived serverless functions.

Next-Caller-AWS-Partners

Building a Data Processing and Training Pipeline with Amazon SageMaker

Next Caller uses machine learning on AWS to drive data analysis and the processing pipeline. Amazon SageMaker helps Next Caller understand call pathways through the telephone network, rendering analysis in approximately 125 milliseconds with the VeriCall analysis engine. VeriCall verifies that a phone call is coming from the physical device that owns the phone number, and flags spoofed calls and other suspicious interactions in real-time.

Fivetran_AWS-Service-Ready

Enabling Customer Attribution Models on AWS with Automated Data Integration

Attribution models allow companies to guide marketing, sales, and support efforts using data, and then custom tailor every customer’s experience for maximum effect. Combined together, cloud-based data pipeline tools like Fivetran and data warehouses like Amazon Redshift form the infrastructure for integrating and centralizing data from across a company’s operations and activities, enabling business intelligence and analytics activities.

Datacoral_AWS Solutions

Building Serverless Data Pipelines on Amazon Redshift By Writing SQL with Datacoral

Amazon Redshift is a powerful yet affordable data warehouse, and while getting data out of Redshift is easy, getting data into and around Redshift can pose problems as the warehouse grows. Datacoral is a serverless data platform that manages metadata changes, data transformations, and orchestrating pipelines for data consumers. In this post, learn how to write Redshift SQL to represent data flow, and how serverless data pipelines get automatically generated for that data flow.