
Sold by: Cotonoha
Open data | Deployed on AWS
Japanese Tokenizer Dictionaries for use with MeCab.
Overview
Japanese Tokenizer Dictionaries for use with MeCab.
Features and programs
Open Data Sponsorship Program
This dataset is part of the Open Data Sponsorship Program, an AWS program that covers the cost of storage for publicly available high-value cloud-optimized datasets.
Pricing
This is a publicly available data set. No subscription is required.
Legal
Content disclaimer
Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.
Delivery details
AWS Data Exchange (ADX)
AWS Data Exchange is a service that helps AWS customers easily share and manage data entitlements from other organizations at scale.
Open data resources
Available with or without an AWS account.
- How to use: To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI).
- Description: Dictionary Files
- Resource type: S3 bucket
- Amazon Resource Name (ARN): arn:aws:s3:::cotonoha-dic
- AWS region: ap-northeast-1
- AWS CLI access (No AWS account required): aws s3 ls --no-sign-request s3://cotonoha-dic/
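The unsigned CLI listing above can also be approximated without the AWS CLI. The following is a minimal sketch, assuming the bucket permits anonymous ListObjectsV2 requests over HTTPS, as the `--no-sign-request` command implies. The helper names (`listing_url`, `parse_listing`, `list_bucket`) are illustrative and not part of any AWS library, and pagination beyond the first page of results is not handled.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

BUCKET = "cotonoha-dic"      # from the ARN arn:aws:s3:::cotonoha-dic
REGION = "ap-northeast-1"    # region given in the listing
S3_NS = "{http://s3.amazonaws.com/doc/2006-03-01/}"  # namespace of S3 list responses


def listing_url(bucket=BUCKET, region=REGION, prefix=""):
    """Build an unsigned ListObjectsV2 URL (virtual-hosted style)."""
    query = urllib.parse.urlencode(
        {"list-type": "2", "prefix": prefix, "delimiter": "/"}
    )
    return f"https://{bucket}.s3.{region}.amazonaws.com/?{query}"


def parse_listing(xml_text):
    """Return (sub-prefixes, [(key, size_in_bytes), ...]) from a list response."""
    root = ET.fromstring(xml_text)
    prefixes = [
        e.findtext(f"{S3_NS}Prefix") for e in root.findall(f"{S3_NS}CommonPrefixes")
    ]
    objects = [
        (e.findtext(f"{S3_NS}Key"), int(e.findtext(f"{S3_NS}Size")))
        for e in root.findall(f"{S3_NS}Contents")
    ]
    return prefixes, objects


def list_bucket(prefix=""):
    """Fetch and parse one page of the listing (network access required)."""
    with urllib.request.urlopen(listing_url(prefix=prefix), timeout=30) as resp:
        return parse_listing(resp.read())


# Example call (not run here because it needs network access):
# prefixes, objects = list_bucket()
```

Calling `list_bucket()` returns the top-level "directories" and files of the bucket, roughly what `aws s3 ls` prints. Only the standard library is used, so no AWS account or credentials are involved.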
Resources
Vendor resources
Support
Contact
Managed By
Cotonoha
How to cite
Japanese Tokenizer Dictionaries was accessed on DATE from https://registry.opendata.aws/cotonoha-dic .
License
Versions of UniDic offered here are available under the GPL, LGPL, or BSD license.
IPADic is offered under a unique BSD-like license. See below.
<https://github.com/polm/ipadic-py/blob/master/ipadic/dicdir/COPYING>
Similar products

Japanese dictionaries and pre-trained models (word embeddings and language models) for natural language processing.
SudachiDict is the dictionary for a Japanese tokenizer (morphological analyzer) Sudachi.
chiVe is Japanese pretrained word embeddings (word vectors), trained using the ultra-large-scale web corpus NWJC by National Institute for Japanese Language and Linguistics, analyzed by Sudachi.
chiTra is a library for using large-scale pre-trained language models with the Japanese tokenizer SudachiPy.
We, Works Applications, the authors and maintainers of Sudachi, a highly featured Japanese tokenizer and morphological analyzer, provide services to increase the accuracy of search in Japanese documents.