Architecture Best Practices for Analytics & Big Data
Learn architecture best practices for cloud data analysis, data warehousing, and data management on AWS.
Featured Content
Data Lakes
- Architecture Monthly: Data Lakes on AWS
- Whitepaper: Building Big Data Storage Solutions (Data Lakes) for Maximum Flexibility
- Workshop: AWS Lake Formation Workshop
- Video: Build and Automate a Modern Serverless Data Lake on AWS
- Blog: A Public Data Lake for Analysis of COVID-19 Data
Data Analytics
- Blog: How Wind Mobility Built a Serverless Data Architecture
- Technical Guide: Amazon EMR Migration Guide
- Workshop: Data Engineering Immersion Day
- Workshop: Streaming Analytics Workshop
- Blog: Analyzing Google Analytics Data with Amazon AppFlow and Amazon Athena
Data Warehousing
- Blog: Develop an Application Migration Methodology to Modernize Your Data Warehouse with Amazon Redshift
- Workshop: Redshift Immersion Labs
- Video: Deep Dive and Best Practices for Amazon Redshift
- Blog: Amazon Redshift Update – Next-Generation Compute Instances and Managed, Analytics-Optimized Storage
- Blog: Lower Your Costs with the New Pause and Resume Actions on Amazon Redshift
Data Management
- Blog: Enable Fine-Grained Permissions for Amazon QuickSight Authors in AWS Lake Formation
- Blog: Restrict Amazon Redshift Spectrum External Table Access to Amazon Redshift IAM Users and Groups Using Role Chaining
- Blog: Enforce Column-Level Authorization with Amazon QuickSight and AWS Lake Formation
- Blog: Achieve Finer-Grained Data Security with Column-Level Access Control in Amazon Redshift
- Blog: Load Data Incrementally and Optimized Parquet Writer with AWS Glue
Latest Trends
- Blog: Best Practices from Delhivery on Migrating from Apache Kafka to Amazon MSK
- Blog: Introducing Amazon EMR Managed Scaling – Automatically Resize Clusters to Lower Cost
- Blog: Best Practices for Amazon Redshift Federated Query
- Blog: Federate Access to Your Amazon Redshift Cluster with Active Directory Federation Services
Most Popular
- Reference Implementation: Centralized Logging
- Reference Implementation: Data Lake on AWS
- Reference Implementation: Fraud Detection Using Machine Learning
- Blog: Test Data Quality at Scale with Deequ
- Quick Start: Tableau Server on AWS
Didn't find what you were looking for? Let us know.
Filter by:
Clear all filters
Analytics & Big Data Blog Posts