Guidance for Patient Entity Resolution with AWS HealthLake
Overview
How it works
These technical details feature an architecture diagram to illustrate how to effectively use this solution. The architecture diagram shows the key components and their interactions, providing an overview of the architecture's structure and functionality step-by-step.
Well-Architected Pillars
The architecture diagram above is an example of a Solution created with Well-Architected best practices in mind. To be fully Well-Architected, you should follow as many Well-Architected best practices as possible.
Operational Excellence
Use Step Functions to orchestrate your entire workflow as a state machine. Step Functions coordinates the processing of multiple Lambda functions, allowing you to perform operations as code for automated processing. You can also limit human error and enable consistent responses to events. EventBridge can schedule the Step Functions state machine to run automatically, reducing operational overhead and ensuring regular processing of your entity resolution process. Additionally, you can automate your extract, transfer, and load (ETL) process by using AWS Glue to crawl your patient dataset and populate the Glue Data Catalog. Finally, monitor the metrics and logs using Amazon CloudWatch, gaining operational visibility and simplifying troubleshooting.
Security
AWS Identity and Access Management (IAM) enforces least-privilege access and can integrate with Lake Formation to create and grant appropriate permissions to stakeholders. This allows your stakeholders to securely query your HealthLake data store using Athena. HealthLake has default encryption at rest and in transit to safeguard your data. You can further enhance your security posture by using Amazon S3 with encryption, access controls, and versioning.
Reliability
Amazon S3 offers durable data storage with automatic replication across multiple Availability Zones (AZs). You can use Athena for reliable and highly available access to your data in Amazon S3. In addition, orchestrate your workflows using Step Functions, which provide built-in error handling and retry mechanisms. And by running your services on the global infrastructure of AWS, which is designed for fault tolerance and high availability, you help ensure that issues in one Region do not impact services in other Regions.
Performance Efficiency
By using serverless technologies like EventBridge, Lambda, AWS Glue, Athena, and Amazon S3, this Guidance scales your configured resources based on your workload demands. Furthermore, with AWS Glue crawlers, you can automate your ETL process by streamlining data preparation and minimizing manual effort. Also, use the advanced matching capabilities of AWS Entity Resolution to accurately identify and link disparate patient records, optimizing resource utilization and reducing the need for manual intervention. You can then monitor the performance of your resources using CloudWatch, so you can identify and address potential bottlenecks or inefficiencies.
Cost Optimization
Athena, Amazon S3, Lambda, AWS Glue, and EventBridge scale on demand and only charge you for the resources you use. With Athena, you can analyze data in your HealthLake data store without provisioning or managing any infrastructure, eliminating idle resource costs. AWS Entity Resolution follows a pay-per-use model, where you only pay for the number of source records processed by your workflows.
Sustainability
EventBridge and Step Functions orchestrate workflows in a resilient, efficient manner with minimal resources. And by using Amazon S3, Lambda, Athena, and other serverless services that utilize the renewable energy infrastructure of AWS, your architecture is equipped to scale efficiently, optimizing energy usage.
Deploy with confidence
Ready to deploy? Review the sample code on GitHub for detailed deployment instructions to deploy as-is or customize to fit your needs.
Disclaimer
Did you find what you were looking for today?
Let us know so we can improve the quality of the content on our pages