- Version v1
- By Cloud Software Group
This algorithm scores how similar a document or a single line of text is to each document in a corpus. It includes a TF-IDF text featurizer that builds n-gram features describing the text, then uses the scipy.spatial.distance module to compute the cosine distance between the new...
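The listing does not show the implementation, so the following is only a minimal sketch of the general technique it describes: TF-IDF weighting over n-grams, followed by cosine distance from scipy.spatial.distance. The function names (word_ngrams, fit_idf, tfidf_vector, similarity_scores), the choice of word unigrams and bigrams, the smoothed IDF formula, and the conversion of distance to a similarity score as 1 - distance are all assumptions made for illustration, not details taken from the product.

```python
import math
from collections import Counter

from scipy.spatial.distance import cosine


def word_ngrams(text, n_max=2):
    """Return lowercase word n-grams of length 1..n_max (assumed tokenization)."""
    tokens = text.lower().split()
    return [
        " ".join(tokens[i:i + n])
        for n in range(1, n_max + 1)
        for i in range(len(tokens) - n + 1)
    ]


def fit_idf(corpus, n_max=2):
    """Compute a smoothed inverse document frequency for every corpus n-gram."""
    doc_freq = Counter()
    for doc in corpus:
        # Count each n-gram at most once per document.
        doc_freq.update(set(word_ngrams(doc, n_max)))
    n_docs = len(corpus)
    return {
        gram: math.log((1 + n_docs) / (1 + df)) + 1.0
        for gram, df in doc_freq.items()
    }


def tfidf_vector(text, idf, vocab, n_max=2):
    """Map text to a TF-IDF vector over a fixed vocabulary order."""
    counts = Counter(word_ngrams(text, n_max))
    # N-grams outside the corpus vocabulary are ignored.
    return [counts[gram] * idf[gram] for gram in vocab]


def similarity_scores(query, corpus, n_max=2):
    """Return one score per corpus document, computed as 1 - cosine distance."""
    idf = fit_idf(corpus, n_max)
    vocab = sorted(idf)
    query_vec = tfidf_vector(query, idf, vocab, n_max)
    scores = []
    for doc in corpus:
        doc_vec = tfidf_vector(doc, idf, vocab, n_max)
        if not any(query_vec) or not any(doc_vec):
            # Cosine distance is undefined for an all-zero vector.
            scores.append(0.0)
        else:
            scores.append(1.0 - cosine(query_vec, doc_vec))
    return scores


if __name__ == "__main__":
    corpus = [
        "the cat sat on the mat",
        "dogs chase cats in the park",
        "stock markets fell sharply today",
    ]
    for doc, score in zip(corpus, similarity_scores("the cat sat", corpus)):
        print(f"{score:.3f}  {doc}")
```

In this sketch the IDF weights are fitted on the corpus only, so n-grams that appear solely in the new text contribute nothing to its vector. A query that shares no n-grams with the corpus therefore scores 0.0 against every document.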
Algorithm - Fulfilled on Amazon SageMaker