
Synthetic Data Pack: 2500 patients with 5 years of longitudinal data
Provided By: Interoperability Institute LLC

Synthetic Data Pack: 2500 patients with 5 years of longitudinal data
Provided By: Interoperability Institute LLC
Using a statistic population health model generator, this data set is made up of highly realistic, but synthetic patient data that can be used for testing purposes without risk of disclosing PHI (protected health information). This dataset is for single organization use only. Please contact us for more information on synthetic datasets for multi-partner use, FHIR server options available for testing, or hand curated datasets to meet your needs.
Product offers
The following offers are available for this product. Choose an offer to view the pricing and access duration options for the offer. Select an offer and continue to subscribe. Your subscription begins on the date that your request is approved by the provider. Additional taxes or fees might apply.
Public offer
Overview
This data pack is a synthetic healthcare dataset comprised of 2500 patients with 5 years of longitudinal history. Using a Monte Carlo simulation technique, each synthetic record is modeled to emulate clinically relevant treatment scenarios. During the generation synthetic patients progress through a series of healthcare encounters. It is these encounters and their events that are used to generate the dataset which is comprised of healthcare data messages across several HL7 messaging standards.
These records are highly realistic and even include gaps of information like a patient record in a real-world healthcare ecosystem.
Common conditions that may be contained in the data pack:
Appendicitis
Cancer
Covid
Deep venous thrombosis
Diabetes
Food Insecurity (SDOH)
Hypertension
Osteoporosis
Pregnancy
Pulmonary embolism
STIs
Zika
HL7 Message standards output that may be included for a synthetic patient record:
ADT
Admission, Discharge, Transfer (ADT) messages are used to communicate patient demographics, visit information and patient state at a healthcare facility.
This synthetic data set contains the following number of synthetic Admit, Discharge and Transfer (ADT) messages in HL7 messaging standard version 2.6 with the following event types:
A01 - Admit / visit notification
A03 - Discharge/end visit
A04 - Register a patient
Message count in data set:
8,518 total A01
33 total A02
8,518total A03
92,752 total A04
VXU
Unsolicited Vaccination Update (VXU) messaged are used to receive and send patient vaccination information. This synthetic data set contains synthetic Unsolicited Vaccination Record (VXU) messages in HL7 messaging standard version 2.5.1 with an event type of V04.
Message count in data set: 34,908
ORU
ORUs are unsolicited transmission of an observation message designed contain information about patient clinical observations and are used for transmitting patient laboratory results to other systems.
This synthetic data set contains synthetic Observation Result (ORU) messages in HL7 messaging standard version 2.5.1 with an event type of R01.
Message count in data set: 2,764
CCD
Continuity of Care Documents (CCD) are XML based markup standard built using HL7 Clinical Document Architecture (CDA) elements. CCDs carry summary information about the patient within the broader context of the personal health record.
Current data fields in CCDs: Patient demographics Medications Allergies Encounters Problem lists Diagnosis Lab results Immunization Social History
Message count in data set: 101,270
FHIR
Fast Healthcare Interoperability Resources (FHIR) is a modern standard for exchanging healthcare information electronically. FHIR leverages web standards like HTTP, RESTful APIs, and JSON to enable seamless communication between different healthcare systems, applications, and devices.
FHIR facilitates interoperability by providing a framework for representing and exchanging clinical data in a structured, standardized format, allowing healthcare stakeholders to easily access and share patient information across disparate systems, leading to improved care coordination, streamlined workflows, and enhanced patient outcomes.
The synthetic patient records generated by our statistic population health model generator are output in JSON FHIR version R4 resources.
Message count in data set: 49,514 bundles containing an average of 100 FHIR resources in each bundle (~4,951,400 total FHIR resources)
Data sets (1)
You will receive access to the following data sets
Name | Type | Data dictionary | AWS Region |
---|---|---|---|
Synthetic Data Pack: 2500 patients with 5 years of longitudinal data history | Not included | US East (N. Virginia) |
Usage information
By subscribing to this product, you agree that your use of this product is subject to the provider's offer terms including pricing information and Data Subscription Agreement . Your use of AWS services remains subject to the AWS Customer Agreement or other agreement with AWS governing your use of such services.