
Overview
Search applications are ubiquitous in today's digital world across various use cases of plagiarism systems, ranking products, items, text descriptions etc. This is a text search solution that is based on efficient index based approximate nearest neighbours algorithm which is much faster than a brute force search algorithm. It can be integrated with existing systems or applications which have large text data bases and a need to look for similar text descriptions. One of the applications of this solution is to search for potentially similar companies based on their description when conducting competitive market analysis.
Highlights
- This solution is tested on publicly available company descriptions dataset. It uses efficient indexing based on state of the art text similarity search algorithms and can be utilized for creating search indexes for any large text databases like news articles, websites, user profiles, blogs, product descriptions etc.
- This solution is primarily focused on textual descriptive data but can be repurposed for other use cases like image search, neural (multi-modal) search etc. It can aid in building plagiarism check tools, recommendation systems, ranking systems, search engines, etc.
- Mphasis DeepInsights is a cloud-based cognitive computing platform that offers data extraction & predictive analytics capabilities. Need Customized Deep learning and Machine Learning Solutions? Get in Touch!
Details
Unlock automation with AI agent solutions

Features and programs
Financing for AWS Marketplace purchases
Pricing
Dimension | Description | Cost |
|---|---|---|
ml.m5.large Inference (Batch) Recommended | Model inference on the ml.m5.large instance type, batch mode | $10.00/host/hour |
ml.m5.large Training Recommended | Algorithm training on the ml.m5.large instance type | $10.00/host/hour |
ml.m4.4xlarge Inference (Batch) | Model inference on the ml.m4.4xlarge instance type, batch mode | $10.00/host/hour |
ml.m5.4xlarge Inference (Batch) | Model inference on the ml.m5.4xlarge instance type, batch mode | $10.00/host/hour |
ml.m4.16xlarge Inference (Batch) | Model inference on the ml.m4.16xlarge instance type, batch mode | $10.00/host/hour |
ml.m5.2xlarge Inference (Batch) | Model inference on the ml.m5.2xlarge instance type, batch mode | $10.00/host/hour |
ml.p3.16xlarge Inference (Batch) | Model inference on the ml.p3.16xlarge instance type, batch mode | $10.00/host/hour |
ml.m4.2xlarge Inference (Batch) | Model inference on the ml.m4.2xlarge instance type, batch mode | $10.00/host/hour |
ml.c5.2xlarge Inference (Batch) | Model inference on the ml.c5.2xlarge instance type, batch mode | $10.00/host/hour |
ml.p3.2xlarge Inference (Batch) | Model inference on the ml.p3.2xlarge instance type, batch mode | $10.00/host/hour |
Vendor refund policy
Currently we do not support refunds, but you can cancel your subscription to the service at any time.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Amazon SageMaker algorithm
An Amazon SageMaker algorithm is a machine learning model that requires your training data to make predictions. Use the included training algorithm to generate your unique model artifact. Then deploy the model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
Version release notes
v1.0.0
Additional details
Inputs
- Summary
The trained model expects the following as input query in a json file called "model_input.json". It should contain the necessary keys as mentioned in input data descriptions.
- Limitations for input type
- The RAM requirement for performing the company description similarity search on a 2000 companies dataset is less than 8GiB (m5.large). Please setup your AWS instance according to your requirement.
- Input MIME type
- application/json
Input data descriptions
The following table describes supported input data fields for real-time inference and batch transform.
Field name | Description | Constraints | Required |
|---|---|---|---|
company_name | The name of the company for which we need similar companies. | Type: FreeText
Limitations: Provide only one company name for each inferencing job. | Yes |
k | The number of similar companies expected in output (k). | Type: Integer
Minimum: 1 | Yes |
Resources
Vendor resources
Support
Vendor support
For any assistance reach out to us at:
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products



