Semi-structured Data Pipeline

Quickly aggregate disparate files ingested into a data lake for analytics use cases. Make better data-driven business decisions using analytics across financial reports, customer trends, supply chain and other functions, while ensuring data security, quality and consistency.

Overview

Consistently aggregates disparate files (JSON, CSV, XML) securely and accurately into a data lake for analytics. Enables better data-driven decisions across financial reports, customer trends, supply chain and other functions. Ensures data security, quality and consistency.

Typical POC Length: 7 days

Up to 3 JSON, CSV, or XML files (mix and match)

Data Lake (S3)
  Three areas: ingestion, organized, analysis

Change Data Capture (S3 Event Notifications, Lambda Functions)
  Modes: incremental, cumulative, cumulative-ytd, cumulative-mtd
  The CDC process automatically updates the "organized" area of the data lake
  Data is stored in Parquet format for better performance and lower cost

Transformation (Glue Job, Python Shell)
  Data aggregation and analysis
  Transformed data can be explored and queried using Athena

Publishing
  Transformed data is pushed to RDS or Aurora
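The listing names four CDC modes but does not define them. The sketch below shows one plausible reading, purely as an illustration: "incremental" upserts rows by key, "cumulative" replaces everything with a full snapshot, and the -ytd and -mtd variants replace only the rows in the year or month covered by the incoming file. The function name apply_cdc, the key field "id", and the date field "as_of" are all hypothetical and are not taken from the vendor's implementation.

```python
from datetime import date


def apply_cdc(organized, incoming, mode, key="id", date_field="as_of"):
    """Return the rows of the "organized" area after applying one ingested file.

    organized / incoming are lists of dict rows. The meaning of each mode
    is an ASSUMPTION made for illustration, not the vendor's definition.
    """
    if mode == "cumulative":
        # Assumed: the file is a full snapshot and replaces all existing rows.
        return list(incoming)

    if mode == "incremental":
        # Assumed: the file holds only new or changed rows; upsert by key.
        merged = {row[key]: row for row in organized}
        merged.update({row[key]: row for row in incoming})
        return list(merged.values())

    if mode in ("cumulative-ytd", "cumulative-mtd"):
        if not incoming:
            return list(organized)
        # Assumed: the file restates the whole current period (year or month),
        # identified here by the latest date found in the file.
        latest = max(row[date_field] for row in incoming)

        def in_period(d):
            same_year = d.year == latest.year
            if mode == "cumulative-ytd":
                return same_year
            return same_year and d.month == latest.month

        # Keep rows outside the restated period, then append the new rows.
        kept = [row for row in organized if not in_period(row[date_field])]
        return kept + list(incoming)

    raise ValueError(f"unknown CDC mode: {mode}")
```

Under these assumptions, a cumulative-mtd file dated in February 2024 would leave January 2024 and earlier rows untouched and replace only the February rows.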

Technologies: Amazon S3, AWS Lambda, AWS Glue, Python, Amazon Athena, Amazon RDS or Amazon Aurora
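As a rough illustration of how S3 Event Notifications and Lambda might fit together here, the sketch below parses an S3 event and works out where each newly ingested file would land in the "organized" area as Parquet. The prefixes "ingestion/" and "organized/", the function name plan_targets, and the key layout are assumptions for this example. The actual conversion to Parquet is left out, and none of this is the vendor's code.

```python
import posixpath
from urllib.parse import unquote_plus

# File types named in the listing.
SUPPORTED = {".json", ".csv", ".xml"}


def plan_targets(event, src_prefix="ingestion/", dst_prefix="organized/"):
    """Map each object in an S3 event to a hypothetical Parquet target key."""
    targets = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys in S3 event notifications are URL-encoded.
        key = unquote_plus(record["s3"]["object"]["key"])
        stem, ext = posixpath.splitext(key)
        # Skip objects outside the ingestion area or of an unsupported type.
        if not key.startswith(src_prefix) or ext.lower() not in SUPPORTED:
            continue
        targets.append({
            "bucket": bucket,
            "source_key": key,
            "target_key": dst_prefix + stem[len(src_prefix):] + ".parquet",
        })
    return targets


def handler(event, context):
    # Lambda entry point: a real function would read each source object,
    # apply the CDC logic, and write Parquet to the target key.
    return plan_targets(event)
```

Filtering on the source prefix keeps the function from reacting to its own output if the same bucket holds both the ingestion and organized areas.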

Use Cases:

Drug Use & Health Analytics
Aggregate, store and process very large datasets from different agencies within a country government to study the effects of opioid use and understand how Human Services can better serve citizens.

Government Services
Ingest, cleanse, aggregate, analyze, publish, and present data from 26+ government agencies, in various data formats, to understand how citizens use the services provided by the county.

Highlights

Quickly and accurately aggregate data for analytics uses
Ensure data quality and security
Typical POC in 7 days or less

Details

Sold by

GDECA

Pricing

Custom pricing options

Pricing is based on your specific requirements and eligibility. To get a custom quote for your needs, request a private offer.

Legal

Content disclaimer

Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

Support

Vendor support

CTO igor.royzis@gdeca.net