
Hepatitis Type A & B - Total number of cases in the US | CDC
Provided By: Rearc

Hepatitis Type A & B - Total number of cases in the US | CDC
Provided By: Rearc
Centers for Disease Control and Prevention provides free and open access to various health related data. This release contains total number of Hepatitis type A and B cases reported in the United States, by region and by states or territory. This table contains provisional cases of selected national notifiable diseases from the National Notifiable Diseases Surveillance System (NNDSS). The data is available for past 2 years.
Product offers
The following offers are available for this product. Choose an offer to view the pricing and access duration options for the offer. Select an offer and continue to subscribe. Your subscription begins on the date that your request is approved by the provider. Additional taxes or fees might apply.
Public offer
Payment schedule: Upfront payment | Offer auto-renewal: Supported
$0 for 12 months
Overview
CDC works 24/7 to protect America from health, safety and security threats, both foreign and in the U.S. Whether diseases start at home or abroad, are chronic or acute, curable or preventable, human error or deliberate attack, CDC fights disease and supports communities and citizens to do the same. As the nation’s health protection agency, CDC saves lives and protects people from health threats. To accomplish its mission, CDC conducts critical science and provides health information that protects against expensive and dangerous health threats, and responds when these arise.
This release contains total number of Hepatitis type A and B cases reported in the United States, by region and by states or territory. This table contains provisional cases of selected national notifiable diseases from the National Notifiable Diseases Surveillance System (NNDSS). The data is available for past 2 years. NNDSS data from the 50 states, New York City, the District of Columbia and the U.S. territories are collated and published weekly on the NNDSS Data and Statistics web page (https://wwwn.cdc.gov/nndss/data-and-statistics.html ). Cases reported by state health departments to CDC for weekly publication are provisional because of the time needed to complete case follow-up. Therefore, numbers presented in later weeks may reflect changes made to these counts as additional information becomes available. This data is anonymized/aggregated.
More Information:
- Source - Division of Health Informatics and Surveillance (DHIS), Centers for Disease Control and Prevention
- Schema Definitions
- Sample Dataset
- Terms of Use
- CDC Data Homepage
- Frequency: Annual
What's included?
You will receive access to the following:
- Total number of Hepatitis Type A & B cases reported in the US (hepatitis-ab-cdc.csv)
- CloudFormation template that setups up automatic revision updates plus AWS analytics services such as AWS Glue and Amazon Athena (cloudformation.yaml)
- AWS Lambda code for revision updates (post-processing-code.zip)
Please note, in the post processing code, we use a Lambda layer that extends the AWS Python SDK (boto3) that is built into the Lambda Python runtime by adding the AWS Data Exchange and AWS Marketplace Catalog API SDKs as of November 13, 2019. Once the public SDKs are updated to include AWS Data Exchange APIs, we will update the code to remove this Lambda layer.
Deploy CloudFormation template to set up automatic revision updates and AWS Analytics services
Assuming you have subscribed to this product listing, below are the detailed steps to deploy CloudFormation template:
(Please note that you will need IAM permissions for CloudFormation, AWS Data Exchange, IAM, Lambda, Glue, Athena and QuickSight, in order to deploy the CloudFormation template.)
- Under the product listing, scroll down to
Data sets
section and click on the Data set name - Under the
Revisions
section, click on the most recent revision - Under
Assets
, checkmarkhepatitis-ab-cdc/automation/post-processing-code.zip
and clickExport to S3
- Choose the S3 Bucket where you would like to store the dataset. Make sure you only choose the S3 bucket. The asset comes with a pre-defined directory structure
- Under
Assets
, checkmarkhepatitis-ab-cdc/automation/cloudformation.yaml
and click eitherExport to S3
orExport to computer
- If you exported the
cloudformation.yaml
to S3, go to the S3 UI on the AWS console and navigate to the location where thecloudformation.yaml
is stored. In S3, click on the cloudformation.yaml and copy the url from theObject URL
- Now, from your AWS Management Console, log onto Amazon CloudFormation UI and click
Create Stack
- Under
Choose a template
either provide the template via uploading from local computer or specify the S3 object url and clickNext
- Provide a friendly stack name in the
Stack name
text box - In the
SourceS3Bucket
field, input the S3 bucket name that you chose earlier to store the hepatitis-ab-cdc/automation/post-processing-code.zip file - Leave rest of the fields as is
- Click
Next
- In the
Options
screen, clickNext
- Tick mark the
I acknowledge that AWS CloudFormation might create IAM resources.
box - Click
Create
At a high level, CloudFormation will setup following resources automatically.
- Lambda function to setup automatic AWS Data Exchange revision updates for this dataset
- CloudWatch Event rule that will automatically trigger the Lambda function every time a new revision update is published
- Another Lambda function to setup AWS Glue and Amazon Athena
- Necessary IAM roles and permissions
If you are interested in looking at the AWS Lambda code or the CloudFormation template, feel free to inspect files inside hepatitis-ab-cdc/automation/post-processing-code.zip
and hepatitis-ab-cdc/automation/cloudformation.yaml
Analytics & Visualizations
Apart from the source data, what we are also providing in this product listing is an easy way to interact and extract value out of the dataset. Native AWS Analytics services such as AWS Glue, Amazon Athena and Amazon QuickSight provide different ways to interact and visualize the data. The included AWS CloudFormation template sets up AWS Glue and Amazon Athena automatically in your AWS account.
Data Analysis - This diagram shows how all the AWS services interact
Using AWS Glue and Amazon Athena to run interactive queries against the dataset
Once the CloudFormation template is successfully deployed, the data is immediately searchable, queryable, and available on Athena. You can go to the Athena UI from the AWS Management Console and run SQL queries on the dataset.
Here are some sample Athena SQL queries you can try on the dataset.
# list total no. of Hepatitis type A weekly cases reported for year 2018
SELECT "reporting_area", "mmwr_year", "mmwr_week", "hepatitis_viral_acute_by_type_a_cum_2018" FROM "hepatitis_ab_cdc"."data" ORDER BY "hepatitis_viral_acute_by_type_a_cum_2018" DESC;
# compare total no. of Hepatitis type A & B cases reported for year 2018 based on weekly data
SELECT "reporting_area", "mmwr_year", "mmwr_week", "hepatitis_viral_acute_by_type_a_cum_2018", "hepatitis_viral_acute_by_type_b_cum_2018" FROM "hepatitis_ab_cdc"."data" ORDER BY "reporting_area" ASC;
Setup Amazon QuickSight to create visualizations on the dataset
Below are the detailed steps to analyze dataset using Amazon QuickSight
- From your AWS Management Console, log onto Amazon QuickSight
- Click
Manage data
- Click
New data set
- If you ran the provided CloudFormation template, you should already have your database and table with schema created in AWS Glue and Athena
- Click on
Athena
to connect to your data source - Provide a name for your QuickSight
Data source name
and clickCreate data source
- In the
Database: contain sets of table
dropdown, choose database ashepatitis_ab_cdc
and underTables: contain the data you can visualize
, choose table asdata
- At this point, you can
Edit/Preview data
if you like - You can then click on
Select
- In the
Finish data set creation
screen, you can selectVisualize
to finish the creation of data set process - Visualize the data set by selecting the
Horizontal bar chart
from theVisual types
- Drag
reporting_area
field to theY axis
inField wells
and for e.g. draghepatitis_viral_acute_by_type_a_cum_2018
field in theValue
block to chart the data
You are now ready to start analyzing and visualizing the dataset.
Contact Information
If you have questions about the source data, please contact cdcinfo@cdc.gov. If you have any questions about the CloudFormation stack, Lambda code or any of the AWS services being used, please contact data@rearc.io.
About Rearc
Rearc is a cloud, software and services company. We believe that empowering engineers drives innovation. Cloud-native architectures, modern software and data practices, and the ability to safely experiment can enable engineers to realize their full potential. We have partnered with several enterprises and startups to help them achieve agility. Our approach is simple — empower engineers with the best tools possible to make an impact within their industry.
Provided By
Fulfillment Method
AWS Data Exchange
Data sets (1)
You will receive access to the following data sets
Revision access rules
All historical revisions | All future revisions
Name | Type | Data dictionary | AWS Region |
---|---|---|---|
Hepatitis Type A & B - Total number of cases in the US | CDC | Not included | US East (N. Virginia) |
Usage information
By subscribing to this product, you agree that your use of this product is subject to the provider's offer terms including pricing information and Data Subscription Agreement . Your use of AWS services remains subject to the AWS Customer Agreement or other agreement with AWS governing your use of such services.
Support information
Support contact email address
Support contact URL
Refund policy
Refunds Not Applicable
General AWS Data Exchange support