Select your cookie preferences

We use essential cookies and similar tools that are necessary to provide our site and services. We use performance cookies to collect anonymous statistics, so we can understand how customers use our site and make improvements. Essential cookies cannot be deactivated, but you can choose “Customize” or “Decline” to decline performance cookies.

If you agree, AWS and approved third parties will also use cookies to provide useful site features, remember your preferences, and display relevant content, including relevant advertising. To accept or decline all non-essential cookies, choose “Accept” or “Decline.” To make more detailed choices, choose “Customize.”

Sign in
We were unable to launch AWS Marketplace.
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

Synthetic Data Pack: 500 patients, 1 year of longitudinal data

Provided By: Interoperability Institute LLC

Synthetic Data Pack: 500 patients, 1 year of longitudinal data

Provided By: Interoperability Institute LLC

Using a statistic population health model generator, this data set is made up of highly realistic, but synthetic patient data that can be used for testing purposes without risk of disclosing PHI (protected health information). This dataset is for single organization use only. Please contact us for more information on synthetic datasets for multi-partner use, FHIR server options available for testing, or hand curated datasets to meet your needs.

Product offers

The following offers are available for this product. Choose an offer to view the pricing and access duration options for the offer. Select an offer and continue to subscribe. Your subscription begins on the date that your request is approved by the provider. Additional taxes or fees might apply.

Public offer

Payment schedule: Upfront payment | Offer auto-renewal: Supported
$1,999 for 12 months

Overview

This data pack is a synthetic healthcare dataset comprised of 500 patients with 1 year of longitudinal history.

Using a Monte Carlo simulation technique, each synthetic record is modeled to emulate clinically relevant treatment scenarios. During the generation synthetic patients progress through a series of healthcare encounters. It is these encounters and their events that are used to generate the dataset which is comprised of healthcare data messages across several HL7 messaging standards.

These records are highly realistic and even include gaps of information like a patient record in a real-world healthcare ecosystem.


Common conditions that may be contained in the data pack:

• Appendicitis

• Cancer

• Covid

• Deep venous thrombosis

• Diabetes

• Food Insecurity (SDOH)

• Hypertension

• Osteoporosis

• Pregnancy

• Pulmonary embolism

• STIs

• Zika


HL7 Message standards output that may be included for a synthetic patient record:

ADT

Admission, Discharge, Transfer (ADT) messages are used to communicate patient demographics, visit information and patient state at a healthcare facility.

This synthetic data set contains the following number of synthetic Admit, Discharge and Transfer (ADT) messages in HL7 messaging standard version 2.6 with the following event types:

• A01 - Admit / visit notification

• A03 - Discharge/end visit

• A04 - Register a patient

Message count in data set:

484 total A01

484 total A03

2,991 total A04


VXU

Unsolicited Vaccination Update (VXU) messages are used to receive and send patient’s vaccination information.

This synthetic data set contains synthetic Unsolicited Vaccination Record (VXU) messages in HL7 messaging standard version 2.5.1 with an event type of V04.

Message count in data set: 1,098


ORU

ORUs are unsolicited transmission of an observation message designed contain information about a patient's clinical observations and are used for transmitting patient’s laboratory results to other systems.

This synthetic data set contains synthetic Observation Result (ORU) messages in HL7 messaging standard version 2.5.1 with an event type of R01.

Message count in data set: 299


CCD

Continuity of Care Documents (CCD) are XML based markup standard built using HL7 Clinical Document Architecture (CDA) elements. CCD’s carry summary information about the patient within the broader context of the personal health record.

Current data fields in CCD’s:

• Patient demographics

• Medications

• Allergies

• Encounters

• Problem lists

• Diagnosis

• Lab results

• Immunization

• Social History

Message count in data set: 300


FHIR

Fast Healthcare Interoperability Resources (FHIR) is a modern standard for exchanging healthcare information electronically. FHIR leverages web standards like HTTP, RESTful APIs, and JSON to enable seamless communication between different healthcare systems, applications, and devices.

FHIR facilitates interoperability by providing a framework for representing and exchanging clinical data in a structured, standardized format, allowing healthcare stakeholders to easily access and share patient information across disparate systems, leading to improved care coordination, streamlined workflows, and enhanced patient outcomes.

The synthetic patient records generated by our statistic population health model generator are output in JSON FHIR version R4 resources.

Message count in data set: 300 bundles containing an average of 100 FHIR resources in each bundle (~30,000 total FHIR resources)

Fulfillment Method
AWS Data Exchange

Data sets (1)

You will receive access to the following data sets

Revision access rules
Last 1 revision | All future revisions
Name
Type
Data dictionary
AWS Region
Synthetic Data Pack: 500 patients with 1 year of longitudinal data history
Not included
US East (N. Virginia)

Usage information

By subscribing to this product, you agree that your use of this product is subject to the provider's offer terms including pricing information and Data Subscription Agreement . Your use of AWS services remains subject to the AWS Customer Agreement  or other agreement with AWS governing your use of such services.

Support information

Refund policy
Refunds are not offered for this product.
General AWS Data Exchange support