Overview
Upstage Document Intelligence - AI Agent is an all-in-one REST API solution that lets you unlock the value of unstructured documents instantly. It includes two capabilities:
Document Parse
Transform unstructured PDFs, scanned files, or multi-layout reports into clean, layout-preserving HTML or Markdown - perfect for LLM pipelines, RAG indexing, summarization, or search.
- LLM-Ready Output: Converts scanned or digital PDFs into richly structured HTML, preserving tables, charts, layout, and visual context.
- Layout-Aware Parsing: Handles rotated pages, checkbox states, multi-page tables, and embedded visuals.
- Blazing Fast: Parses 100 pages in under 1 minute - up to 10x faster than alternatives.
- High Accuracy: Benchmarked with industry-leading performance on DP-Bench (TEDS score 93+).
Information Extract
Extract key data from documents as structured JSON - aligned to your custom or auto-generated schema. Ideal for automating workflows.
- Universal Document Compatibility: Extracts information from any document format, including complex PDFs, images, and Office documents.
- Schema-Agnostic Flexibility: Instantly adapts to different schemas, providing structured outputs without additional customization.
- Hidden and Implied Data Extraction: Surfaces not only visible text but also inferred values and line-item calculations.
- No Fine-Tuning Required: Works out-of-the-box without template creation or model adjustments.
- Efficient JSON Conversion: Converts extracted information directly into structured JSON key-value pairs for seamless integration.
Whether you're building a RAG system or automating business processes, Document Intelligence makes document understanding radically simple.
Typical use cases:
- Contract analysis
- Invoice & receipt extraction
- Loan & insurance form processing
- Academic paper summarization
- Document chunking for LLM ingestion
Highlights
- Two-in-One Intelligence - Choose between schema-aligned JSON (Information Extract) or HTML/Markdown output (Document Parse), depending on your use case.
- Zero Training Required - Works out-of-the-box with messy scans, complex tables, and diverse document layouts.
- Enterprise-Scale Performance - Handles up to 100 pages synchronously or up to 1,000 pages asynchronously, with blazing speed and audited accuracy.
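The page limits in the Enterprise-Scale Performance highlight suggest a simple routing rule on the client side. The sketch below is only an illustration under that assumption: `choose_mode` and the two constants are hypothetical names, and the code does not call the Upstage API or reflect its actual endpoints.

```python
# Page limits taken from the highlight above; the names are hypothetical.
SYNC_MAX_PAGES = 100
ASYNC_MAX_PAGES = 1000


def choose_mode(pages: int) -> str:
    """Pick a processing mode for a document with the given page count."""
    if pages < 1:
        raise ValueError("a document must have at least one page")
    if pages <= SYNC_MAX_PAGES:
        return "sync"
    if pages <= ASYNC_MAX_PAGES:
        return "async"
    # Beyond the stated async limit the document would need to be split first.
    raise ValueError("document exceeds the 1,000-page async limit")


if __name__ == "__main__":
    for n in (12, 100, 101, 1000):
        print(n, choose_mode(n))
```

A caller could use the returned string to decide whether to wait for an immediate response or to submit a job and poll for the result later.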
Pricing
| Dimension | Description | Cost/unit |
|---|---|---|
| InformationExtract__InferenceCount | Charged per page for each information-extract model inference. | $0.048 |
| DocumentParse__InferenceCount | Charged per page for each document-parse model inference. | $0.012 |
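Because both dimensions are charged per page, a rough cost estimate is just the page count multiplied by the rate. The following is a minimal sketch using the rates from the table above. `estimate_cost` is a hypothetical helper, and it assumes one inference per page with no other fees or discounts.

```python
from decimal import Decimal

# Per-page rates in USD, copied from the pricing table above.
RATES = {
    "InformationExtract__InferenceCount": Decimal("0.048"),
    "DocumentParse__InferenceCount": Decimal("0.012"),
}


def estimate_cost(dimension: str, pages: int) -> Decimal:
    """Estimate the charge in USD for processing `pages` pages once."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    # Decimal avoids the rounding noise that binary floats would introduce.
    return RATES[dimension] * pages


if __name__ == "__main__":
    print(estimate_cost("DocumentParse__InferenceCount", 1000))
    print(estimate_cost("InformationExtract__InferenceCount", 250))
```

Under these assumptions, parsing 1,000 pages and extracting from 250 pages each come to $12, since Information Extract costs four times as much per page as Document Parse.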
Vendor refund policy
We do not currently offer refunds, but you can submit a request through our support channel: https://get.support.upstage.ai/servicedesk/customer/portals