AWS Big Data Blog

Deenbandhu Prasad

Author: Deenbandhu Prasad

Getting started with AWS Glue Data Quality for ETL Pipelines

June 2023: This post was reviewed and updated with the latest release from AWS Glue Data Catalog. Today, hundreds of thousands of customers use data lakes for analytics and machine learning. However, data engineers have to cleanse and prepare this data before it can be used. The underlying data has to be accurate and recent […]

Effective data lakes using AWS Lake Formation, Part 1: Implementing cell-level and row-level security

July 2023: This post was reviewed for accuracy. We announced the general availability of AWS Lake Formation transactions, cell-level and row-level security, and acceleration at AWS re: Invent 2021. In this post, we focus on cell-level and row-level security and show you how to enforce business needs by restricting access to specific rows. Effective data […]