Overview
Getting high quality extraction and chunking for PDF, Microsoft Word (doc, docx), and PowerPoint (ppt, pptx) formats just got a lot easier. Aryn DocParse segments and labels complex documents into structured JSON or markdown output, does OCR, extracts tables and images, and more. Quickly improve the quality for your document processing workloads, ETL pipelines, retrieval-augmented generation (RAG) applications, and GenAI platforms.
DocParse runs the Aryn Partitioner and its state-of-the-art, open source deep learning DETR AI model trained on 80k+ enterprise documents. Up to 6x more accurate, 5x faster, and 5x cheaper than alternatives document parsing systems.
Highlights
- Up to 6x more accurate document chunking than other systems. Get better results from complex documents to power unstructured ETL workloads, RAG applications, document processing workflows, semantic search, and more.
- Supports high-quality OCR, table extraction, image processing, and layout parsing. Unlock the value from your unstructured data and accurately use it in your downstream use cases.
- Low cost, pay as you go model per page processed, and as low as a few hundred milliseconds of processing time per page.
Details
Features and programs
Financing for AWS Marketplace purchases
Pricing
Free trial
Dimension | Description | Cost/unit |
---|---|---|
Page | Page from documents processed by Aryn DocParse | $0.002 |
Vendor refund policy
We do not offer refunds at this time.
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Resources
Vendor resources
Support
Vendor support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.