Synthetic Data Pack: 500 patients, 1 year of longitudinal data
Provided By: Interoperability Institute LLC
Synthetic Data Pack: 500 patients, 1 year of longitudinal data
Provided By: Interoperability Institute LLC
Using a statistic population health model generator, this data set is made up of highly realistic, but synthetic patient data that can be used for testing purposes without risk of disclosing PHI (protected health information). This dataset is for single organization use only. Please contact us for more information on synthetic datasets for multi-partner use, FHIR server options available for testing, or hand curated datasets to meet your needs.
Product offers
The following offers are available for this product. Choose an offer to view the pricing and access duration options for the offer. Select an offer and continue to subscribe. Your subscription begins on the date that your request is approved by the provider. Additional taxes or fees might apply.
Public offer
Payment schedule: Upfront payment | Offer auto-renewal: Supported
$1,999 for 12 months
Overview
This data pack is a synthetic healthcare dataset comprised of 500 patients with 1 year of longitudinal history.
Using a Monte Carlo simulation technique, each synthetic record is modeled to emulate clinically relevant treatment scenarios. During the generation synthetic patients progress through a series of healthcare encounters. It is these encounters and their events that are used to generate the dataset which is comprised of healthcare data messages across several HL7 messaging standards.
These records are highly realistic and even include gaps of information like a patient record in a real-world healthcare ecosystem.
Common conditions that may be contained in the data pack:
• Appendicitis
• Cancer
• Covid
• Deep venous thrombosis
• Diabetes
• Food Insecurity (SDOH)
• Hypertension
• Osteoporosis
• Pregnancy
• Pulmonary embolism
• STIs
• Zika
HL7 Message standards output that may be included for a synthetic patient record:
ADT
Admission, Discharge, Transfer (ADT) messages are used to communicate patient demographics, visit information and patient state at a healthcare facility.
This synthetic data set contains the following number of synthetic Admit, Discharge and Transfer (ADT) messages in HL7 messaging standard version 2.6 with the following event types:
• A01 - Admit / visit notification
• A03 - Discharge/end visit
• A04 - Register a patient
Message count in data set:
484 total A01
484 total A03
2,991 total A04
VXU
Unsolicited Vaccination Update (VXU) messages are used to receive and send patient’s vaccination information.
This synthetic data set contains synthetic Unsolicited Vaccination Record (VXU) messages in HL7 messaging standard version 2.5.1 with an event type of V04.
Message count in data set: 1,098
ORU
ORUs are unsolicited transmission of an observation message designed contain information about a patient's clinical observations and are used for transmitting patient’s laboratory results to other systems.
This synthetic data set contains synthetic Observation Result (ORU) messages in HL7 messaging standard version 2.5.1 with an event type of R01.
Message count in data set: 299
CCD
Continuity of Care Documents (CCD) are XML based markup standard built using HL7 Clinical Document Architecture (CDA) elements. CCD’s carry summary information about the patient within the broader context of the personal health record.
Current data fields in CCD’s:
• Patient demographics
• Medications
• Allergies
• Encounters
• Problem lists
• Diagnosis
• Lab results
• Immunization
• Social History
Message count in data set: 300
FHIR
Fast Healthcare Interoperability Resources (FHIR) is a modern standard for exchanging healthcare information electronically. FHIR leverages web standards like HTTP, RESTful APIs, and JSON to enable seamless communication between different healthcare systems, applications, and devices.
FHIR facilitates interoperability by providing a framework for representing and exchanging clinical data in a structured, standardized format, allowing healthcare stakeholders to easily access and share patient information across disparate systems, leading to improved care coordination, streamlined workflows, and enhanced patient outcomes.
The synthetic patient records generated by our statistic population health model generator are output in JSON FHIR version R4 resources.
Message count in data set: 300 bundles containing an average of 100 FHIR resources in each bundle (~30,000 total FHIR resources)
Provided By
Fulfillment Method
AWS Data Exchange
Data sets (1)
You will receive access to the following data sets
Revision access rules
Last 1 revision | All future revisions
Name | Type | Data dictionary | AWS Region |
---|---|---|---|
Synthetic Data Pack: 500 patients with 1 year of longitudinal data history | Not included | US East (N. Virginia) |
Usage information
By subscribing to this product, you agree that your use of this product is subject to the provider's offer terms including pricing information and Data Subscription Agreement . Your use of AWS services remains subject to the AWS Customer Agreement or other agreement with AWS governing your use of such services.
Support information
Support contact email address
Support contact URL
Refund policy
Refunds are not offered for this product.
General AWS Data Exchange support