
Overview
The Clinical De-Identification model is designed to recognize and anonymize PHI in English-language clinical notes. It employs state-of-the-art natural language processing techniques to detect sensitive information such as patient names, addresses, medical record numbers, and other identifiers. Once identified, the PHI is effectively masked or obfuscated, rendering the text safe for broader use while maintaining its informational integrity.
IMPORTANT USAGE INFORMATION:
After subscribing to this product and creating a SageMaker endpoint, billing occurs on an HOURLY BASIS for as long as the endpoint is running.
-Charges apply even if the endpoint is idle and not actively processing requests.
-To stop charges, you MUST DELETE the endpoint in your SageMaker console.
-Simply stopping requests will NOT stop billing.
This ensures you are only billed for the time you actively use the service.
Highlights
- Process up to 10M chars per hour in real-time and 18M chars per hour in batch mode. Key Features: -The model is finely tuned to identify a wide range of PHI elements in medical texts, ensuring comprehensive de-identification. -The de-id process aligns with HIPAA and other healthcare privacy regulations, aiding in legal compliance and data protection. -Ideal for research, analytics, and training purposes, safe utilization of medical texts without compromising patient privacy.
- Covered entities: LOCATION, CONTACT, PROFESSION, NAME, DATE, ID, AGE, MEDICALRECORD, ORGANIZATION, HEALTHPLAN, DOCTOR, USERNAME, LOCATION-OTHER, URL, DEVICE, CITY, ZIP, STATE, PATIENT, COUNTRY, STREET, PHONE, HOSPITAL, EMAIL, IDNUM, BIOID, FAX, LOCATION_OTHER, DLN, SSN, ACCOUNT, PLATE, VIN, LICENSE, IP
- This model is a useful asset in the healthcare and research sectors, where the protection of patient privacy is paramount. It allows for the ethical and legal use of valuable medical data, promoting research and analysis while upholding
Details
Unlock automation with AI agent solutions

Features and programs
Financing for AWS Marketplace purchases
Pricing
Free trial
Dimension | Description | Cost/host/hour |
|---|---|---|
ml.m4.xlarge Inference (Batch) Recommended | Model inference on the ml.m4.xlarge instance type, batch mode | $23.76 |
ml.m4.xlarge Inference (Real-Time) Recommended | Model inference on the ml.m4.xlarge instance type, real-time mode | $23.76 |
Vendor refund policy
No refunds are possible.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Amazon SageMaker model
An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
Version release notes
Model optimization
Additional details
Inputs
- Summary
The model expects as input a JSON object with three key-value pairs. The first key, "text", is associated with a string or text data that requires de-identification. The second key, "masking_policy" specifies the policy or method to be applied for de-identifying the text.
- Input MIME type
- application/json, application/jsonlines
Input data descriptions
The following table describes supported input data fields for real-time inference and batch transform.
Field name | Description | Constraints | Required |
|---|---|---|---|
text | Input Text that needs to be de-identified. | Type: FreeText | Yes |
masking_policy | De-identification policy the user wants to follow.
- masked: Default policy that masks entities with their type.
- obfuscated: Replaces sensitive entities with random values of the same type. | Default value: masked
Type: Categorical
Allowed values: masked, obfuscated | No |
Support
Vendor support
For any assistance, please reach out to support@johnsnowlabs.com .
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products

