Overview
YomiToku is a proprietary document analysis engine specialized for Japanese documents. It combines AI OCR with layout analysis and table parsing models to accurately structure vertical text, multi-column documents, and complex business forms. Use cases include generating data for RAG and search, creating searchable PDFs, and extracting information from tables.
**This product is intended to be offered through a private offer, so please contact us for pricing.**
Highlights
- Each model is specifically trained for Japanese document images, supporting the recognition of over 7,000 Japanese characters, including vertical text and other layout structures unique to Japanese documents. (It also supports English documents.)
- Equipped with four AI models trained on Japanese datasets: text detection, text recognition, layout analysis, and table structure recognition. All models are independently trained and optimized for Japanese documents, delivering high-precision inference.
- By leveraging layout analysis, table structure parsing, and reading order estimation, it extracts information while preserving the semantic structure of the document layout.
Details
Pricing
| Dimension | Description | Cost/host/hour |
|---|---|---|
| ml.g4dn.xlarge Inference (Batch), recommended | Model inference on the ml.g4dn.xlarge instance type, batch mode | $10.00 |
| ml.g4dn.xlarge Inference (Real-Time), recommended | Model inference on the ml.g4dn.xlarge instance type, real-time mode | $10.00 |
| ml.g5.xlarge Inference (Batch) | Model inference on the ml.g5.xlarge instance type, batch mode | $10.00 |
| ml.g5.xlarge Inference (Real-Time) | Model inference on the ml.g5.xlarge instance type, real-time mode | $10.00 |
| ml.g6.xlarge Inference (Real-Time) | Model inference on the ml.g6.xlarge instance type, real-time mode | $10.00 |
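As a rough illustration of how the per-host hourly rate in the table adds up, the sketch below multiplies hours, host count, and the listed $10.00 rate. The function name `estimate_software_cost` is hypothetical, and the figure covers only the listed software rate: private-offer pricing may differ, and any AWS infrastructure charges are not included.

```python
from decimal import Decimal

# Listed software rate from the pricing table (USD per host per hour).
# Assumption: private-offer pricing may differ from this listed value.
LISTED_RATE_USD = Decimal("10.00")


def estimate_software_cost(hours, hosts=1, rate=LISTED_RATE_USD):
    """Estimate the software charge in USD for `hosts` instances running `hours` hours."""
    if hours < 0 or hosts < 0:
        raise ValueError("hours and hosts must be non-negative")
    # Convert via str() so float inputs such as 2.5 stay exact in Decimal arithmetic.
    total = Decimal(str(hours)) * hosts * rate
    return total.quantize(Decimal("0.01"))


if __name__ == "__main__":
    # One host running a 2.5-hour batch job at the listed rate.
    print(estimate_software_cost(2.5))
```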
Vendor refund policy
We do not support any refunds currently.
Delivery details
Amazon SageMaker model
An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
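A minimal sketch of turning a model package into a real-time endpoint with boto3 might look like the following. The model package ARN, IAM role ARN, and the resource name `yomitoku-demo` are placeholders rather than values from this listing, and enabling network isolation is an assumption based on what Marketplace model packages commonly require. The request payloads are built separately from the API calls so they can be inspected before anything is created.

```python
def build_deployment_requests(model_package_arn, role_arn, name="yomitoku-demo",
                              instance_type="ml.g4dn.xlarge", instance_count=1):
    """Build the three request payloads needed to deploy a model package."""
    create_model = {
        "ModelName": name,
        "ExecutionRoleArn": role_arn,
        # Reference the subscribed model package instead of a container image.
        "Containers": [{"ModelPackageName": model_package_arn}],
        # Assumption: Marketplace packages commonly require network isolation.
        "EnableNetworkIsolation": True,
    }
    create_endpoint_config = {
        "EndpointConfigName": name,
        "ProductionVariants": [{
            "VariantName": "AllTraffic",
            "ModelName": name,
            "InstanceType": instance_type,
            "InitialInstanceCount": instance_count,
        }],
    }
    create_endpoint = {"EndpointName": name, "EndpointConfigName": name}
    return create_model, create_endpoint_config, create_endpoint


def deploy(model_package_arn, role_arn, region, **kwargs):
    """Create the model, endpoint config, and endpoint; return the endpoint name."""
    import boto3  # imported lazily so the builder above works without boto3

    sagemaker = boto3.client("sagemaker", region_name=region)
    model_req, config_req, endpoint_req = build_deployment_requests(
        model_package_arn, role_arn, **kwargs
    )
    sagemaker.create_model(**model_req)
    sagemaker.create_endpoint_config(**config_req)
    sagemaker.create_endpoint(**endpoint_req)
    return endpoint_req["EndpointName"]
```

Endpoint creation is asynchronous, so the endpoint is not ready to serve requests as soon as `deploy` returns.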
Version release notes
- Minor bug fixes
- Improved efficiency in text recognition model processing
- Hyperparameter tuning of the text detection model
Additional details
Inputs
- Summary
Supported Content Types:
- application/pdf - PDF documents (multi-page supported)
- image/jpeg - JPEG images
- image/png - PNG images
- image/tiff - TIFF images
Request Body: Send the binary file data directly in the request body with the appropriate Content-Type header.
Example Usage:

```bash
aws sagemaker-runtime invoke-endpoint \
  --endpoint-name ${ENDPOINT_NAME} \
  --content-type "application/pdf" \
  --body fileb://demo/sample.pdf \
  --region ${AWS_REGION} \
  output.json
```
Sample Image Source: The sample images used in this documentation are from 「広報誌 令和 6 年 11 月号」 (public relations magazine, November 2024 issue) published by 総務省 (Ministry of Internal Affairs and Communications).
- Input MIME type
- application/pdf, image/jpeg, image/png, image/tiff
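The same request can be sketched in Python with boto3. The example below picks the Content-Type from the file extension and sends the raw file bytes as the request body. The extension-to-MIME mapping and the names `content_type_for` and `analyze_document` are assumptions for illustration; the listing specifies only the four MIME types. The response body is returned as raw bytes because the output schema is not described here.

```python
from pathlib import Path

# Assumed mapping from common file extensions to the four supported MIME types.
SUPPORTED_CONTENT_TYPES = {
    ".pdf": "application/pdf",
    ".jpg": "image/jpeg",
    ".jpeg": "image/jpeg",
    ".png": "image/png",
    ".tif": "image/tiff",
    ".tiff": "image/tiff",
}


def content_type_for(path):
    """Return the MIME type for `path`, or raise ValueError if it is unsupported."""
    suffix = Path(path).suffix.lower()
    try:
        return SUPPORTED_CONTENT_TYPES[suffix]
    except KeyError:
        raise ValueError(f"unsupported file type: {suffix or path!r}") from None


def analyze_document(path, endpoint_name, region):
    """Send a document to the endpoint and return the raw response bytes."""
    import boto3  # imported lazily so content_type_for works without boto3

    runtime = boto3.client("sagemaker-runtime", region_name=region)
    response = runtime.invoke_endpoint(
        EndpointName=endpoint_name,
        ContentType=content_type_for(path),
        Body=Path(path).read_bytes(),  # binary file data, sent as-is
    )
    return response["Body"].read()
```

Validating the extension before the call avoids sending a file type the endpoint does not list as supported.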