Sign in
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

Generation-Text (153 results) showing 51 - 60



This is a Text Generation model built upon a Transformer model from [Hugging Face](https://huggingface.co/distilgpt2). It takes a text string as input and predicts next words in the sequence.

Model Package - Fulfilled on Amazon SageMaker


Friendli Dedicated Endpoints is our SaaS service for deploying generative AI models that runs our flagship LLM serving engine, previously known as PeriFlow, on various cloud platforms. Our Friendli Engine cuts LLM inference serving costs by 40~80% while providing low latency and high throughput LLM...


VARCO-LLM is NCSOFT’s large language model, which can be applied to develop various NLP-based AI services such as Q&A, chatbot, summarization, information extraction etc. VARCO-LLM, trained with public pre-training data and internally constructed high-quality Korean data, boasts the highest...

Model Package - Fulfilled on Amazon SageMaker


Starting from $0.10/hr or from $899.00/yr (1% savings) for software + AWS usage fees

OpenChat 3.5 is the first 7B model that achieves comparable results with ChatGPT! It overcomes the challenges of larger models such as ChatGPT, provides comparable performance against much bigger models and it is served with vLLM out-of-the-box. Key Features of AMI: Effortless Deployment: Bid...

Linux/Unix, Ubuntu 22.04 - 64-bit Amazon Machine Image (AMI)

  • Version Llama2-7B: 7.0.0 Git:2.25.1 Python3:3.8.10
  • By Apps4rent LLC

Starting from $0.10 to $0.10/hr for software + AWS usage fees

This is a repackaged open-source software product wherein additional charges apply for technical support and maintenance by Apps4Rent. The Llama2-7B-Chat on Ubuntu20.04LTS pre-configured image is a great way to deploy a chat application on an Ubuntu 20.04 LTS server. The image includes all of the...

Linux/Unix, Ubuntu 22.04LTS - 64-bit Amazon Machine Image (AMI)


OctoAI is a compute service to run, tune, and scale generative AI models. With OctoAI, developers get the simplicity and reliability of closed-source API endpoint services for generative AI, with the flexibility to select and run your choice of models. OctoAI lets your team focus on your...


LHTM-Opt is an instruction-tuned Japanese Language Large Model developed by alt Inc., which has excellent Japanese knowledge and can be applied in various NLP tasks. alt Inc. is a venture firm with the mission of freeing humankind from non-creative/unproductive labor through the creation of P.A.I.®...

Model Package - Fulfilled on Amazon SageMaker


Starting from $0.10/hr or from $899.00/yr (1% savings) for software + AWS usage fees

Code Llama - Instruct 7B is an instruction fine-tuned and aligned variation of Code Llama. Instruction tuning continues the training process, but with a different objective. The model is fed a natural language instruction input and the expected output. This makes it better at understanding what...

Linux/Unix, Ubuntu 22.04 - 64-bit Amazon Machine Image (AMI)


This is a Extractive Question Answering model built upon a Text Embedding model from [PyTorch Hub](https://pytorch.org/hub/huggingface_pytorch-transformers/). It takes as input a pair of question-context strings, and returns a sub-string from the context as a answer to the question. The Text...

Model Package - Fulfilled on Amazon SageMaker