This Guidance shows how to build an automated geospatial insights engine on AWS to improve supply and demand forecasting and risk management. Many industries depend on intelligence and insights gained from processed geospatial and Earth-observation data from sources like satellite and aerial imagery and remote sensing. This Guidance couples this data with mechanistic and AI-powered models so that you can enhance forecasting, automate risk mitigation, and comply with regulations.
Please note: see the Disclaimer section at the end of this page.
Architecture Diagram
Overview
This architecture diagram shows an overview of the automated geospatial analysis engine, specifically the key modules and their interactions.
Step 1
The UI module is responsible for transforming analysis results into displayable visual assets. It uses a geospatial mapping service to obtain base map data, and it renders interactive maps on the frontend using a map-rendering library.
Step 2
The region module manages the hierarchical structures of groups, regions, polygons, and states.
Step 3
The scheduler module manages the scheduling of engine processing tasks based on each region’s processing configuration.
Step 4
The scheduler module also subscribes to new satellite images using an Amazon Simple Notification Service (Amazon SNS) topic hosted in Open Data on AWS.
Step 5
The executor module manages the execution of the engine analysis, invoked either by the scheduler module or by new Sentinel images in Amazon Simple Storage Service (Amazon S3) that match the created regions.
Step 6
The results module acts as an intermediary between the region and executor modules, transforming and publishing region data to a SpatioTemporal Asset Catalog (STAC) server and mapping execution results to their corresponding region.
Step 7
The notification module manages your subscriptions to notifications generated by other modules.
Step 8
Note: The STAC server module is an AWS Cloud Development Kit (AWS CDK) port of stac-server, an implementation of the STAC API specification for searching and serving metadata for geospatial data, including but not limited to satellite imagery.
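Because the modules are deployed with the AWS CDK, you can wire them together in a single CDK app. The following is a minimal, hypothetical sketch of that wiring; the stack and construct names are illustrative assumptions, not the Guidance's actual code.

```typescript
import { App, Stack, StackProps } from 'aws-cdk-lib';
import { EventBus } from 'aws-cdk-lib/aws-events';
import { Construct } from 'constructs';

// Illustrative stack for the region module; the real Guidance's stack
// names and props will differ.
class RegionModuleStack extends Stack {
  public readonly eventBus: EventBus;

  constructor(scope: Construct, id: string, props?: StackProps) {
    super(scope, id, props);
    // Shared bus carrying region-change events consumed by the scheduler,
    // results, and notification modules.
    this.eventBus = new EventBus(this, 'RegionEventBus');
  }
}

const app = new App();
const regionModule = new RegionModuleStack(app, 'RegionModule');
// Downstream module stacks would take regionModule.eventBus as a prop here.
app.synth();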
UI Module
This architecture diagram shows how to set up the user interface (UI), manage authentication and authorization, and query analysis results.
Step 1
Load the sample React application from the S3 bucket, using Amazon CloudFront to maintain low latency and high performance.
Step 2
The React application renders maps using Amazon Location Service.
Step 3
The React application overlays the base map with the processed satellite images by querying the tiler API on Amazon API Gateway.
Step 4
Amazon Cognito delivers a token for your authentication.
Step 5
Amazon Verified Permissions uses the role encoded in your token to perform fine-grained authorization.
Step 6
Once you have been authenticated, the AWS Lambda tiler-API function queries the STAC server for the metadata of the polygon execution results (based on the current viewport of the map).
Step 7
The Lambda tiler-API function generates signed URLs for all image assets specified in the STAC items and returns them to the React application for rendering.
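As a rough illustration of steps 6 and 7, the following hypothetical Lambda-side sketch rewrites the s3:// asset hrefs in a STAC item into time-limited signed URLs that the React application can render directly. The StacItem shape and the href parsing are assumptions, not the Guidance's actual code.

```typescript
import { S3Client, GetObjectCommand } from '@aws-sdk/client-s3';
import { getSignedUrl } from '@aws-sdk/s3-request-presigner';

const s3 = new S3Client({});

interface StacAsset { href: string; }                   // e.g. s3://bucket/key.tif
interface StacItem { id: string; assets: Record<string, StacAsset>; }

// Replace each s3:// asset href with a time-limited signed HTTPS URL.
async function signItemAssets(item: StacItem): Promise<StacItem> {
  for (const asset of Object.values(item.assets)) {
    // 's3://bucket/key' splits into ['s3:', '', 'bucket', ...key parts].
    const [, , bucket, ...keyParts] = asset.href.split('/');
    const command = new GetObjectCommand({ Bucket: bucket, Key: keyParts.join('/') });
    asset.href = await getSignedUrl(s3, command, { expiresIn: 900 }); // 15 minutes
  }
  return item;
}
```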
Region Module
This architecture diagram demonstrates the hierarchical structure of groups, regions, polygons, and states.
Step 1
Create group, region, and polygon resources using the region API through API Gateway. During the process of setting up a new region, you’ll be able to define the schedule and prioritization for the geospatial data processing tasks associated with it.
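For illustration, a hypothetical call to the region API might look like the following sketch. The /regions path, payload fields, and schedule format are assumptions; consult the deployed API for the actual contract.

```typescript
// Runs in Node.js 18+ or the browser, where fetch is globally available.
async function createRegion(apiBaseUrl: string, idToken: string) {
  const response = await fetch(`${apiBaseUrl}/regions`, {
    method: 'POST',
    headers: {
      Authorization: `Bearer ${idToken}`,   // Amazon Cognito ID token (see step 2)
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      name: 'field-42',                     // illustrative names and values
      groupId: 'farm-a',
      // Processing configuration consumed by the scheduler module.
      schedule: { rate: 'weekly' },
      priority: 'high',
    }),
  });
  if (!response.ok) throw new Error(`Region API returned ${response.status}`);
  return response.json();
}
```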
Step 2
Amazon Cognito delivers a token for your authentication.
Step 3
Verified Permissions uses the role encoded in your token to perform fine-grained authorization. Your role is stored in custom:role, a custom attribute of Amazon Cognito.
Step 4
Once you have been authenticated, API Gateway forwards the request to Lambda.
Step 5
The group and region data is stored in an Amazon DynamoDB table.
Step 6
Changes to a region's resources emit events to an Amazon EventBridge event bus. These events can be tracked to update other components of the framework.
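A minimal sketch of such an event emission follows; the bus name, event source, and detail shape are illustrative assumptions.

```typescript
import { EventBridgeClient, PutEventsCommand } from '@aws-sdk/client-eventbridge';

const eventBridge = new EventBridgeClient({});

// Publish a region-change event that the scheduler, results, and
// notification modules can consume.
async function emitRegionUpdated(regionId: string): Promise<void> {
  await eventBridge.send(new PutEventsCommand({
    Entries: [{
      EventBusName: 'region-event-bus',            // placeholder bus name
      Source: 'guidance.region-module',            // placeholder source
      DetailType: 'RegionUpdated',
      Detail: JSON.stringify({ regionId, updatedAt: new Date().toISOString() }),
    }],
  }));
}
```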
Step 7
Lambda records a segment with details about invoking and running the function and sends it to AWS X-Ray.
Scheduler Module
This architecture diagram shows how to schedule the engine's processing tasks, based on each region's processing configuration.
Step 1
The scheduler module subscribes to region events published by the region module.
Step 2
If a region is configured with a recurrent schedule, the scheduler module creates a processing schedule for that region using Amazon EventBridge Scheduler.
Step 3
When a schedule is invoked, EventBridge Scheduler puts a message into the job queue for the region to be processed.
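Steps 2 and 3 might look like the following hypothetical sketch, which creates a recurring EventBridge Scheduler schedule targeting the SQS job queue. The ARNs and the schedule expression are placeholders.

```typescript
import { SchedulerClient, CreateScheduleCommand } from '@aws-sdk/client-scheduler';

const scheduler = new SchedulerClient({});

async function scheduleRegion(regionId: string): Promise<void> {
  await scheduler.send(new CreateScheduleCommand({
    Name: `process-${regionId}`,
    ScheduleExpression: 'rate(7 days)',             // from the region's configuration
    FlexibleTimeWindow: { Mode: 'OFF' },            // fire at the exact time
    Target: {
      // Placeholder ARNs: the job queue and an IAM role that lets
      // EventBridge Scheduler send messages to it.
      Arn: 'arn:aws:sqs:us-east-1:111122223333:job-queue',
      RoleArn: 'arn:aws:iam::111122223333:role/scheduler-role',
      Input: JSON.stringify({ regionId }),          // message body for the queue
    },
  }));
}
```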
Step 4
The scheduler module also subscribes to new-satellite-image notifications through an Amazon SNS topic hosted in Open Data on AWS. Amazon Simple Queue Service (Amazon SQS) queues the messages.
Step 5
When a new satellite image becomes available in the Open Data on AWS bucket, Lambda queries the STAC server module to check whether the image covers the geographic area where your regions are located.
Step 6
In the event of spatial intersection, Lambda puts a processing-task message in the job queue.
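A hypothetical sketch of the intersection check in steps 5 and 6, using the standard STAC API intersects filter; the endpoint URL and collection name are assumptions.

```typescript
interface GeoJsonPolygon { type: 'Polygon'; coordinates: number[][][]; }

// Ask the STAC server whether any stored region polygon intersects the
// footprint of the newly arrived satellite image.
async function intersectsAnyRegion(
  stacUrl: string,
  imageFootprint: GeoJsonPolygon,
): Promise<boolean> {
  const response = await fetch(`${stacUrl}/search`, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({
      collections: ['regions'],       // assumed collection of region polygons
      intersects: imageFootprint,     // standard STAC API spatial filter
      limit: 1,                       // one match is enough to enqueue a job
    }),
  });
  const result = await response.json();
  return result.features.length > 0;
}
```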
Step 7
Lambda records a segment with details about invoking and running the function and sends it to X-Ray.
Executor Module
This architecture diagram demonstrates the execution of the region analysis invoked by the scheduler module.
Step 1
The executor module subscribes to the job queue of the scheduler module, which queues messages when a region is scheduled for processing.
Step 2
The queue invokes the executor module using Lambda, which then places the job in the appropriate AWS Batch priority queue based on the region configuration.
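A minimal sketch of this step follows, assuming illustrative queue and job-definition names.

```typescript
import { BatchClient, SubmitJobCommand } from '@aws-sdk/client-batch';

const batch = new BatchClient({});

// Route the job to the high- or low-priority queue based on the
// region's configuration; names are placeholders.
async function submitProcessingJob(regionId: string, priority: 'high' | 'low') {
  await batch.send(new SubmitJobCommand({
    jobName: `process-${regionId}`,
    jobQueue: priority === 'high' ? 'engine-queue-high' : 'engine-queue-low',
    jobDefinition: 'engine-processor',   // assumed Fargate-backed job definition
    containerOverrides: {
      environment: [{ name: 'REGION_ID', value: regionId }],
    },
  }));
}
```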
Step 3
AWS Batch picks up jobs from these queues and launches tasks using AWS Fargate for Amazon Elastic Container Service (Amazon ECS) to process the polygons associated with the region in parallel.
Step 4
The Amazon ECS tasks retrieve the satellite images from the S3 bucket hosted in Open Data on AWS.
Step 5
These tasks then process the images and store the results in an S3 bucket.
Step 6
Once the execution finishes, the Amazon ECS tasks publish the results’ metadata to an EventBridge event bus.
Results Module
This architecture diagram shows how to transform data into STAC items, publish them to the STAC server, and map execution results to corresponding regions.
Step 1
Interact with the results module through API Gateway.
Step 2
Amazon Cognito delivers a token for your authentication.
Step 3
Verified Permissions uses the role encoded in your token to perform fine-grained authorization.
Step 4
Once you have been authenticated, API Gateway forwards the request to Lambda.
Step 5
The data containing the execution result details—such as the unique ID, status, last updated time, and execution specifics—is stored in a DynamoDB table.
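A minimal sketch of persisting such a record, assuming an illustrative table name and key schema:

```typescript
import { DynamoDBClient } from '@aws-sdk/client-dynamodb';
import { DynamoDBDocumentClient, PutCommand } from '@aws-sdk/lib-dynamodb';

const ddb = DynamoDBDocumentClient.from(new DynamoDBClient({}));

async function saveExecutionResult(regionId: string, executionId: string) {
  await ddb.send(new PutCommand({
    TableName: 'execution-results',      // placeholder table name
    Item: {
      pk: `region#${regionId}`,          // partition key groups results by region
      sk: `execution#${executionId}`,    // sort key orders executions
      status: 'succeeded',
      updatedAt: new Date().toISOString(),
    },
  }));
}
```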
Step 6
The results module subscribes to groups- and regions-change events from the region module and region-analysis-result events from the executor module.
Step 7
The results module transforms these events into a STAC item and publishes it to the Amazon SNS STAC server module topic.
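stac-server ingests items published to its SNS topic, so this step might look like the following hypothetical sketch; the topic ARN and the item's fields are illustrative, and a production item would also carry links and a bbox.

```typescript
import { SNSClient, PublishCommand } from '@aws-sdk/client-sns';

const sns = new SNSClient({});

async function publishStacItem(executionId: string, geometry: object, cogHref: string) {
  // Minimal STAC item describing one polygon's analysis result.
  const item = {
    type: 'Feature',
    stac_version: '1.0.0',
    id: executionId,
    geometry,                                       // polygon of the processed region
    properties: { datetime: new Date().toISOString() },
    assets: { result: { href: cogHref, type: 'image/tiff; application=geotiff' } },
  };
  await sns.send(new PublishCommand({
    TopicArn: 'arn:aws:sns:us-east-1:111122223333:stac-ingest', // placeholder
    Message: JSON.stringify(item),
  }));
}
```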
Step 8
Lambda records a segment with details about invoking and running the function and sends it to X-Ray.
Notification Module
This architecture diagram shows how to manage your subscriptions to notifications generated by other modules.
Step 1
Create a subscription for region processing events through API Gateway.
Step 2
Amazon Cognito delivers a token for your authentication.
Step 3
Verified Permissions uses the role encoded in your token to perform fine-grained authorization.
Step 4
Once you have been authenticated, API Gateway forwards the request to Lambda.
Step 5
Your list of subscription data is stored in a DynamoDB table.
Step 6
Lambda subscribes you to an Amazon SNS region topic. (It will create the topic if it does not already exist.)
Step 7
The notification module subscribes to an EventBridge event bus for region-processing events published by the executor module.
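A minimal sketch of step 6's topic handling: CreateTopic is idempotent and returns the existing topic's ARN if the topic already exists. The topic naming and the email protocol are assumptions.

```typescript
import { SNSClient, CreateTopicCommand, SubscribeCommand } from '@aws-sdk/client-sns';

const sns = new SNSClient({});

async function subscribeToRegion(regionId: string, email: string) {
  // Creates the topic on first use; subsequent calls return the same ARN.
  const topic = await sns.send(new CreateTopicCommand({ Name: `region-${regionId}` }));
  await sns.send(new SubscribeCommand({
    TopicArn: topic.TopicArn!,
    Protocol: 'email',        // SNS sends the subscriber a confirmation message first
    Endpoint: email,
  }));
}
```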
Step 8
Lambda transforms the events into notification messages and publishes them to you (or other subscribed users) through Amazon SNS.
Step 9
Lambda records a segment with details about invoking and running the function and sends it to X-Ray.
Get Started
Deploy this Guidance
Well-Architected Pillars
The AWS Well-Architected Framework helps you understand the pros and cons of the decisions you make when building systems in the cloud. The six pillars of the Framework allow you to learn architectural best practices for designing and operating reliable, secure, efficient, cost-effective, and sustainable systems. Using the AWS Well-Architected Tool, available at no charge in the AWS Management Console, you can review your workloads against these best practices by answering a set of questions for each pillar.
The architecture diagram above is an example of a Solution created with Well-Architected best practices in mind. To be fully Well-Architected, you should follow as many Well-Architected best practices as possible.
Operational Excellence
X-Ray provides complete tracing and monitoring capabilities to help you identify performance bottlenecks and troubleshoot issues. It enables you to visualize and analyze the components of this Guidance, such as API calls, Lambda functions, and AWS Step Functions workflows. Finally, AWS CloudFormation provisions and manages the required resources, enabling automated deployments and changes.
Security
Amazon Cognito provides secure user authentication mechanisms so that only authorized users can access your applications and resources. AWS Identity and Access Management (IAM) policies and roles let you control access to resources according to the principle of least privilege. Additionally, Verified Permissions provides a centralized and efficient system for managing access to your custom applications. Through a policy-based approach that uses the Cedar policy language for defining fine-grained permissions, it promotes consistency and maintainability in the authorization process. It also separates the authorization process from the application code so that you can easily update and manage permissions.
Reliability
AWS Batch automatically scales the number of compute resources based on the number of jobs in the queue. As a result, this Guidance provisions the right amount of resources for satellite image processing, reducing the risk of job failures due to resource constraints. Amazon SQS and EventBridge maintain durability and persistence by decoupling components through asynchronous messaging and by storing messages redundantly across multiple Availability Zones. This makes applications more resilient: if an individual component fails, messages are temporarily stored in the queue and processed when the component recovers.
Performance Efficiency
DynamoDB lets you define a partition key (and an optional sort key) for your table to enhance performance efficiency. By using defined keys to distribute your data across servers and partitions in addition to using appropriate indexes, you can optimize data access patterns. Additionally, Amazon S3 offers seamless scalability, enabling your applications to handle high request rates effortlessly. For example, each partitioned Amazon S3 prefix can sustain at least 3,500 requests per second for PUT, COPY, POST, and DELETE operations. Each prefix can also sustain up to 5,500 requests per second for GET and HEAD operations. Because there are no restrictions on the number of prefixes you can create within a bucket, you can partition your data effectively to achieve optimal performance and scalability. And by encoding the region, polygon, or results ID in the file prefix when storing satellite analysis images, you can enable parallel processing and listing.
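For example, a hypothetical key scheme along these lines spreads results across prefixes so that reads and listings for different regions and polygons parallelize:

```typescript
// Encode region, polygon, and result IDs in the S3 key prefix.
// The path layout and IDs are illustrative assumptions.
function resultKey(regionId: string, polygonId: string, resultId: string): string {
  return `results/region=${regionId}/polygon=${polygonId}/${resultId}.tif`;
}

// Listing one polygon's results touches only its own prefix:
// resultKey('eu-field-7', 'p-003', '2024-05-01T10-20Z')
//   -> 'results/region=eu-field-7/polygon=p-003/2024-05-01T10-20Z.tif'
```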
Cost Optimization
The serverless and event-driven nature of Lambda, EventBridge, Amazon SQS, and AWS Batch means that they automatically scale based on demand and don’t overprovision capacity. This helps you optimize costs, because you only pay for the resources you use. Additionally, EventBridge and Amazon SQS remove the need for you to set up polling-based architectures, helping you further reduce costs.
Sustainability
This Guidance uses serverless services for compute and storage—including Lambda, AWS Batch, and DynamoDB—so they automatically scale to zero when not in use. This removes the need for always-on infrastructure and reduces the overall environmental impact of your workloads. Additionally, Lambda uses AWS Graviton2 processors, which are designed to be more energy-efficient than traditional x86-based processors. They consume less power while delivering comparable performance, reducing your carbon emissions.
Disclaimer
The sample code; software libraries; command line tools; proofs of concept; templates; or other related technology (including any of the foregoing that are provided by our personnel) is provided to you as AWS Content under the AWS Customer Agreement, or the relevant written agreement between you and AWS (whichever applies). You should not use this AWS Content in your production accounts, or on production or other critical data. You are responsible for testing, securing, and optimizing the AWS Content, such as sample code, as appropriate for production grade use based on your specific quality control practices and standards. Deploying AWS Content may incur AWS charges for creating or using AWS chargeable resources, such as running Amazon EC2 instances or using Amazon S3 storage.
References to third-party services or organizations in this Guidance do not imply an endorsement, sponsorship, or affiliation between Amazon or AWS and the third party. Guidance from AWS is a technical starting point, and you can customize your integration with third-party services when you deploy the architecture.