Alexander Arzhanov | Artificial Intelligence

Applying data loading best practices for ML training with Amazon S3 clients

In this post, we present practical techniques and recommendations for optimizing throughput in ML training workloads that read data directly from Amazon S3 general purpose buckets.

Implement semantic video search using open source large vision models on Amazon SageMaker and Amazon OpenSearch Serverless

In this post, we demonstrate how to use large vision models (LVMs) for semantic video search using natural language and image queries. We introduce some use case-specific methods, such as temporal frame smoothing and clustering, to enhance the video search performance. Furthermore, we demonstrate the end-to-end functionality of this approach by using both asynchronous and real-time hosting options on Amazon SageMaker AI to perform video, image, and text processing using publicly available LVMs on the Hugging Face Model Hub. Finally, we use Amazon OpenSearch Serverless with its vector engine for low-latency semantic video search.

Artificial Intelligence

Author: Alexander Arzhanov

Applying data loading best practices for ML training with Amazon S3 clients

Implement semantic video search using open source large vision models on Amazon SageMaker and Amazon OpenSearch Serverless

Learn

Resources

Developers

Help