
Overview
Diabetes, an important cause of ill health and a risk factor for other diseases in developed countries, is spreading rapidly in developing countries. Highest among the elderly, prevalence rates are rising among younger and productive populations in developing countries. Economic development has led to the spread of Western lifestyles and diet to developing countries, resulting in a substantial increase in diabetes. Without effective prevention and control programs, diabetes will likely continue to increase. This release contains diabetes prevalence (% of population ages 20 to 79) for all countries in the world.
The original publisher of this data is The World Bank. This content is published as World Bank Open Data and provides free and open access to global development data. This work is licensed under a Creative Commons Attribution 4.0 (CC-BY 4.0). This data is anonymized/aggregated.
More Information:
- Source - International Diabetes Federation, Diabetes Atlas
- Schema Definitions
- Sample Dataset
- Terms of Use
- World Bank Open Data Homepage
- Frequency: Annual
What's included?
You will receive access to the following:
- Diabetes prevalence (% of population ages 20 to 79) (diabetes-prevalence.csv)
- CloudFormation template that setups up automatic revision updates plus AWS analytics services such as AWS Glue and Amazon Athena (cloudformation.yaml)
- AWS Lambda code for revision updates (post-processing-code.zip)
Please note, in the post processing code, we use a Lambda layer that extends the AWS Python SDK (boto3) that is built into the Lambda Python runtime by adding the AWS Data Exchange and AWS Marketplace Catalog API SDKs as of November 13, 2019. Once the public SDKs are updated to include AWS Data Exchange APIs, we will update the code to remove this Lambda layer.
Deploy CloudFormation template to set up automatic revision updates and AWS Analytics services
Assuming you have subscribed to this product listing, below are the detailed steps to deploy CloudFormation template:
(Please note that you will need IAM permissions for CloudFormation, AWS Data Exchange, IAM, Lambda, Glue, Athena and QuickSight, in order to deploy the CloudFormation template.)
- Under the product listing, scroll down to Data sets section and click on the Data set name
- Under the Revisions section, click on the most recent revision
- Under Assets, checkmark diabetes-prevalence/automation/post-processing-code.zip and click Export to S3
- Choose the S3 Bucket where you would like to store the dataset. Make sure you only choose the S3 bucket. The asset comes with a pre-defined directory structure
- Under Assets, checkmark diabetes-prevalence/automation/cloudformation.yaml and click either Export to S3 or Export to computer
- If you exported the cloudformation.yaml to S3, go to the S3 UI on the AWS console and navigate to the location where the cloudformation.yaml is stored. In S3, click on the cloudformation.yaml and copy the url from the Object URL
- Now, from your AWS Management Console, log onto Amazon CloudFormation UI and click Create Stack
- Under Choose a template either provide the template via uploading from local computer or specify the S3 object url and click Next
- Provide a friendly stack name in the Stack name text box
- In the SourceS3Bucket field, input the S3 bucket name that you chose earlier to store the diabetes-prevalence/automation/post-processing-code.zip file
- Leave rest of the fields as is
- Click Next
- In the Options screen, click Next
- Tick mark the I acknowledge that AWS CloudFormation might create IAM resources. box
- Click Create
At a high level, CloudFormation will setup following resources automatically.
- Lambda function to setup automatic AWS Data Exchange revision updates for this dataset
- CloudWatch Event rule that will automatically trigger the Lambda function every time a new revision update is published
- Another Lambda function to setup AWS Glue and Amazon Athena
- Necessary IAM roles and permissions
If you are interested in looking at the AWS Lambda code or the CloudFormation template, feel free to inspect files inside diabetes-prevalence/automation/post-processing-code.zip and diabetes-prevalence/automation/cloudformation.yaml
Analytics & Visualizations
Apart from the source data, what we are also providing in this product listing is an easy way to interact and extract value out of the dataset. Native AWS Analytics services such as AWS Glue, Amazon Athena and Amazon QuickSight provide different ways to interact and visualize the data. The included AWS CloudFormation template sets up AWS Glue and Amazon Athena automatically in your AWS account.
Data Analysis - This diagram shows how all the AWS services interact