Overview
Ingest and preprocess complex natural language data from any document, file type, or layout with Unstructured.
Under the hood, the Unstructured engine involves breaking a document into its constituent parts and identifying the document's structure, such as its header, tables, and body text. Unstructured provides diverse preprocessing strategies for documents each catering to different document types and requirements. Utilizing the optimal strategy enhances document element classification accuracy and extraction efficiency, which is crucial for image-based files and layout-intensive documents.
Click on Continue to Subscribe to start using Unstructured for your data preprocessing needs.
We are constantly improving our products and love feedback.
Highlights
- Transforms all your data for downstream analytics. Next-generation vision transformer for images, PDF, and table extraction
- Enhanced models for table extraction, document hierarchy, and element classification. Chunks your data for LLM applications
- Compatible with any embedding model, vector database, and LLM framework. API client libraries in multiple client languages (e.g. Python, Javascript)
Details
Typical total price
$4.60/hour
Features and programs
Financing for AWS Marketplace purchases
Pricing
- ...
Instance type | Product cost/hour | EC2 cost/hour | Total/hour |
---|---|---|---|
t2.nano | $1.10 | $0.006 | $1.106 |
t2.micro AWS Free Tier | $1.10 | $0.012 | $1.112 |
t2.small | $1.10 | $0.023 | $1.123 |
t2.medium | $2.20 | $0.046 | $2.246 |
t2.large | $2.20 | $0.093 | $2.293 |
t2.xlarge | $3.30 | $0.186 | $3.486 |
t2.2xlarge | $3.30 | $0.371 | $3.671 |
t3.nano | $2.20 | $0.005 | $2.205 |
t3.micro AWS Free Tier | $2.20 | $0.01 | $2.21 |
t3.small | $2.20 | $0.021 | $2.221 |
Additional AWS infrastructure costs
Type | Cost |
---|---|
EBS General Purpose SSD (gp3) volumes | $0.08/per GB/month of provisioned storage |
Vendor refund policy
We do not currently support refunds, but you can cancel at any time.
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
64-bit (x86) Amazon Machine Image (AMI)
Amazon Machine Image (AMI)
An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.
Version release notes
1.0.51
- Bump unstructured to 0.15.14-dev2
Additional details
Usage instructions
"To connect to the operating system, use SSH and the username rocky. You will need the same SSH Key Pair supplied during stack launch. For more details see the Unstructed API deployment guide here: https://docs.unstructured.io/api-reference/api-services/aws "
Resources
Vendor resources
Support
Vendor support
Please allow 24 hours. Join us in our Slack workspace for support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.