
Overview
Upstage Document Parse is a powerful API designed to automatically convert any document to HTML. It detects layout elements such as paragraphs, tables, images, equations, charts and more to determine the structure of the document. The API then serializes the elements according to reading order, and finally converts the document into HTML.
Highlights
- ### Key Features **Text Recognition** detects text via OCR or PDF parsing, excelling in English and CJK documents, including digital-born PDFs. **Layout Element Detection (LED)** identifies paragraphs, figures, tables, and captions, arranging them in human reading order - great for complex layouts. **Table Structure Recognition (TSR)** converts complex tables to HTML, handling merged cells and hidden gridlines.
- ### Key Applications The Upstage Document Parse model enhances LLM-based document processing and information retrieval by preserving contextual information better than traditional OCR. It is valuable for scenarios where LLMs process documents, integrating RAG with Layout Analysis via embedding techniques. It excels in information extraction and recognizing document structures across various templates, making it ideal for handling the same type of documents in different formats.
- ### Key Tasks - Document OCR - Document Parsing - Layout Analysis - Information Extraction - Layout Element Detection - Table Structure Recognition
Details
Unlock automation with AI agent solutions

Features and programs
Financing for AWS Marketplace purchases
Pricing
Dimension | Description | Cost/host/hour |
|---|---|---|
ml.m5.12xlarge Inference (Batch) Recommended | Model inference on the ml.m5.12xlarge instance type, batch mode | $15.00 |
ml.g6.xlarge Inference (Real-Time) Recommended | Model inference on the ml.g6.xlarge instance type, real-time mode | $15.00 |
ml.p3.2xlarge Inference (Real-Time) | Model inference on the ml.p3.2xlarge instance type, real-time mode | $15.00 |
ml.g6.2xlarge Inference (Real-Time) | Model inference on the ml.g6.2xlarge instance type, real-time mode | $15.00 |
ml.g5.xlarge Inference (Real-Time) | Model inference on the ml.g5.xlarge instance type, real-time mode | $15.00 |
ml.g5.2xlarge Inference (Real-Time) | Model inference on the ml.g5.2xlarge instance type, real-time mode | $15.00 |
ml.g4dn.xlarge Inference (Real-Time) | Model inference on the ml.g4dn.xlarge instance type, real-time mode | $15.00 |
Vendor refund policy
We do not support any refunds currently.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Amazon SageMaker model
An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
Version release notes
âť— Important Notice It requires NVIDIA Driver version 525.0 or later and is optimized for ml.g6 instances. For users running ml.g5 instances, please use version 250404c.3 for compatibility.
🚀 Updates
-
Support for various form types has been significantly improved, enabling more accurate parsing across diverse document layouts.
-
Tables split across multiple pages are now automatically linked and merged into a single coherent structure.
-
Rotated documents are now handled correctly without requiring manual adjustments, improving robustness in real-world scenarios.
-
Patchify has been optimized for better performance and accuracy when processing long vertical images.
-
Added safeguards to prevent infinite loops, improving system stability.
-
Fixed an issue where errors incorrectly returned a 200 OK response, ensuring accurate HTTP status reporting.
-
Improved GPU memory management for resource efficiency.
Additional details
Inputs
- Summary
Provide input data in multipart form data View more detailed description hereÂ
- Input MIME type
- multipart/form-data
Resources
Vendor resources
Support
Vendor support
Contact us for model inquiries.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products
