- Version Inference-2023.12.26-NVIDIA-535.104-CUDA12.2.2-LLAMA.CPP-Ubu22
- By NI SP - High-End Remote Desktop and HPC
From $0.06/hr to $0.56/hr for software, plus AWS usage fees
The Inference server provides the full infrastructure to run fast inference on GPUs. It includes llama.cpp for inference, the latest CUDA release, and the NVIDIA Container Toolkit for GPU-enabled Docker containers. Leverage the multitude of freely available models and run inference with 8-bit or lower quantized models, which reduces memory requirements and makes inference feasible even on GPUs with limited memory.
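As an illustration, a quantized GGUF model can be run on the GPU through llama.cpp's Python bindings (llama-cpp-python). This is a minimal sketch, not the AMI's documented workflow: it assumes the bindings are installed with CUDA support and that a quantized model file has already been downloaded; the model path below is hypothetical.

```python
# Minimal sketch: run a quantized GGUF model on the GPU with llama-cpp-python.
# Assumes `pip install llama-cpp-python` (built with CUDA support) and a
# downloaded 8-bit quantized model; the file path below is hypothetical.
from llama_cpp import Llama

llm = Llama(
    model_path="/opt/models/llama-2-7b.Q8_0.gguf",  # hypothetical path to a quantized model
    n_gpu_layers=-1,  # offload all layers to the GPU
    n_ctx=4096,       # context window size
)

output = llm(
    "Q: Name the planets in the solar system. A:",
    max_tokens=64,
    stop=["Q:"],  # stop generation before the model starts a new question
)
print(output["choices"][0]["text"])
```

The quantization level (Q8_0 here) trades accuracy for memory; lower-bit variants such as Q4_K_M shrink the footprint further and let larger models fit on a single GPU.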
Linux/Unix, Ubuntu 22 - 64-bit Amazon Machine Image (AMI)