
Overview
This advanced pipeline extracts DRUG entities from clinical texts and utilizes the sbiobert_base_cased_mli Sentence Bert Embeddings to map these entities to their corresponding Anatomic Therapeutic Chemical (ATC) codes.
Just pass in your clinical text document and get back:
-
Detected NER chunk and position
-
Named Entity Recognition (NER) chunk label and its confidence score
-
Resolution code of the NER chunk.
-
Resolution of the NER chunk.
-
Cosine distance score of the resolution.
-
All the other possible resolutions of the NER chunk.
-
All the concept class IDs for the all resolutions.
-
Codes of the resolutions
-
All the cosine distance scores of the all resolutions.
Use Cases:
- Medication Normalization for Clinical Data Pipelines
Clinical narratives often contain medication references in free text, with variations in spelling, brand names, and dosage descriptions.
The model can automatically extract drug mentions and map them to standardized ATC codes, enabling:
-
Clean, interoperable medication data for downstream analytics
-
Consistent drug categorization across EHR systems
-
Improved accuracy in clinical decision-support systems
This significantly reduces manual coding effort and enhances data quality across clinical workflows.
- Pharmacovigilance & Drug Utilization Analysis
Healthcare organizations and research teams need structured, comparable drug information to detect trends, measure drug usage, and identify potential safety signals.
By converting raw text into ATC-classified drug data, your model supports:
-
Automated detection of drug patterns in real-world clinical documents
-
Aggregation of medication usage by therapeutic class
-
More accurate monitoring of adverse drug events and outcomes
This enables faster, more reliable insights for safety monitoring and population-level studies.
IMPORTANT USAGE INFORMATION: After subscribing to this product and creating a SageMaker endpoint, billing occurs on an HOURLY BASIS for as long as the endpoint is running. -Charges apply even if the endpoint is idle and not actively processing requests. -To stop charges, you MUST DELETE the endpoint in your SageMaker console. -Simply stopping requests will NOT stop billing. This ensures you are only billed for the time you actively use the service.
Highlights
- Process up to 6 M chars per hour in real-time and 18 M chars per hour in batch mode.
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Financing for AWS Marketplace purchases
Pricing
Free trial
Dimension | Description | Cost/host/hour |
|---|---|---|
ml.m4.2xlarge Inference (Batch) Recommended | Model inference on the ml.m4.2xlarge instance type, batch mode | $47.52 |
ml.m4.xlarge Inference (Real-Time) Recommended | Model inference on the ml.m4.xlarge instance type, real-time mode | $23.76 |
Vendor refund policy
No refunds are possible.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Amazon SageMaker model
An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
Version release notes
johnsnowlabs_version: 6.0.4 Spark-NLP==6.0.4 Spark-Healthcare==6.0.4
Additional details
Inputs
- Summary
To use the model, you need to provide input in one of the following supported formats:
JSON Format Provide input as JSON. We support two variations within this format:
Array of Text Documents:
Use an array containing multiple text documents. Each element represents a separate text document.
{ "text": [ "Text document 1", "Text document 2", ... ] }
Single Text Document: Provide a single text document as a string.
{ "text": "Single text document" }
JSON Lines (JSONL) Format
Provide input in JSON Lines format, where each line is a JSON object representing a text document.
{"text": "Text document 1"} {"text": "Text document 2"}
- Input MIME type
- application/json, application/jsonlines
Resources
Vendor resources
Support
Vendor support
For any assistance, please reach out to support@johnsnowlabs.com .
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products

