Amazon Elasticsearch Service now supports the Seunjeon plugin for Korean language analysis

Posted on: Mar 13, 2018

Amazon Elasticsearch Service now supports the Seunjeon plugin, a popular open-source Korean language text analyzer, which makes it easy for developers to implement full-text search on Korean documents. The plugin internally uses a Korean language dictionary and is capable of recognizing compound words and separating them into terms based on context. Developers can now use this plugin to perform Korean text analysis operations such as tokenizing (separating a string into words), stemming (converting the text to its root form), removing stop words (frequent, low-value terms), and matching based on synonyms.

The plugin is available for all new domains on Amazon Elasticsearch Service running Elasticsearch version 5.1 and above. The plugin is automatically installed when the Elasticsearch cluster is set up, enabling developers to directly refer to it in their index mappings without needing any prior installation steps. 

Amazon Elasticsearch Service is available across 17 regions globally: US East (N. Virginia, Ohio), US West (Oregon, N. California), AWS GovCloud (US), Canada (Central), South America (Sao Paulo), EU (Ireland, London, Frankfurt, Paris), Asia Pacific (Singapore, Sydney, Tokyo, Seoul, Mumbai), and China (Ningxia) operated by NWCD.