Listing Thumbnail

    Data Engineering on AWS

     Info
    Learn how to design, build, and secure modern data solutions on AWS, including scalable data lakes, data warehouses, and batch or streaming data pipelines for end-to-end data engineering at scale.

    Overview

    Course Overview

    This comprehensive 3-day instructor-led training provides a deep dive into data engineering practices and solutions on Amazon Web Services (AWS). Participants will learn how to design, build, optimize, and secure data engineering solutions by using AWS services. Topics range from foundational concepts to hands-on implementation of data lakes, data warehouses, and both batch and streaming data pipelines. This course equips data professionals with the skills needed to architect and manage modern data solutions at scale.

    Start your AWS Essentials journey by accessing Official AWS e-Learning for FREE. Learn AWS Foundations: Getting Started with the AWS Cloud Essentials, Migrating to AWS: A high level introduction, Introduction to Amazon Elastic Compute Cloud (EC2) and more - GET STARTED 

    Level: Intermediate

    Duration: 3 Day/24 hours

    Delivery Type: Instructor-Led Training

    Course Objectives

    • Design and implement scalable data lakes and data warehouses on AWS.
    • Build, optimize, and secure batch data processing pipelines.
    • Develop and manage streaming data solutions.
    • Apply best practices for data governance and security.
    • Automate data engineering workflows by using AWS services.
    • Implement access control and security measures for data solutions.

    Who Should Go For This Training?

    • Data Engineer
    • DevOps Engineer
    • IT Professional
    • Data Analyst

    Pre-Requisites

    Recommended

    • Basic understanding of AWS services
    • Familiarity with database concepts
    • Basic programming or scripting knowledge
    • Understanding of data processing fundamentals

    Course Outline

    Data Engineering Roles and Key Concepts

    • The role of a data engineer
    • Data discovery for a data analytics system
    • AWS services for data workflows
    • Continuous integration and continuous delivery
    • Networking considerations

    Designing and Implementing Data Lakes

    • Data lake introduction
    • Data lake storage
    • Ingest data
    • Catalog data
    • Transform data
    • Serve data for consumption
    • Lab: Setting up a Data Lake on AWS

    Optimizing and Securing Data Lake Solutions

    • Optimizing performance
    • Security using Lake Formation
    • Setting permissions with Lake Formation
    • Security and governance
    • Troubleshooting
    • Lab: Automating Data Lake Creation using AWS Lake Formation Blueprints

    Data Warehouse Architecture and Design Principles

    • Introduction to data warehouses
    • Amazon Redshift overview
    • Ingesting data into Amazon Redshift
    • Processing data
    • Serving data for consumption
    • Lab: Setting up a Data Warehouse using Amazon Redshift Serverless

    Performance Optimization Techniques for Data Warehouses

    • Monitoring and optimization options
    • Data optimization in Amazon Redshift
    • Query optimization in Amazon Redshift
    • Data orchestration

    Security and Access Control for Data Warehouses

    • Authentication and access control in Amazon Redshift
    • Data security in Amazon Redshift
    • Lab: Working with Amazon Redshift

    Designing Batch Data Pipelines

    • Introduction to batch data pipelines
    • Designing a batch data pipeline
    • Ingesting batch data

    Implementing Strategies for Batch Data Pipelines

    • Processing and transforming data
    • Transforming data formats
    • Integrating your data
    • Cataloging data
    • Serving data for consumption
    • Lab: A Day in the Life of a Data Engineer

    Optimizing, Orchestrating, and Securing Batch Data Pipelines

    • Optimizing the batch data pipeline
    • Orchestrating the batch data pipeline
    • Securing the batch data pipeline
    • Lab: Orchestrating Data Processing in Spark using AWS Step Functions

    Streaming Data Architecture Patterns

    • Introduction to streaming data pipelines
    • Ingesting data from stream sources
    • Storing streaming data
    • Processing streaming data
    • Analyzing streaming data
    • Lab: Streaming Analytics with Amazon Managed Service for Apache Flink

    Optimizing and Securing Streaming Solutions

    • Optimizing a streaming data solution
    • Securing a streaming data pipeline
    • Lab: Access Control with Amazon Managed Streaming for Apache Kafka

    Compliance and Cost Optimization

    • Compliance considerations
    • Cost optimization tools

    Course Wrap-Up

    Highlights

    • Master end-to-end, scalable data engineering on AWS—from data lakes and warehouses to batch and streaming pipelines—in just 3 days of intensive, hands-on training.

    Details

    Delivery method

    Deployed on AWS

    Unlock automation with AI agent solutions

    Fast-track AI initiatives with agents, tools, and solutions from AWS Partners.
    AI Agents

    Pricing

    Custom pricing options

    Pricing is based on your specific requirements and eligibility. To get a custom quote for your needs, request a private offer.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Support

    Vendor support

    To learn more about our AWS trainings please visit NetCom Learning  or do not hesitate to contact our Sales Team: [aws@netcomlearning.com ] | (888)563-8266