Listing Thumbnail

    Falcon 180B Chat: OpenAI & API Compatible

     Info
    Sold by: Meetrix.io 
    This is a single-click deployment AMI package of Falcon 180B, which is a super-powerful language model with 180 billion parameters, trained on 3.5 trillion tokens. It's currently at the top of the Hugging Face Leaderboard for pre-trained Open Large Language Models and is available for both research and commercial use.. This model performs exceptionally well in various tasks like reasoning, coding, proficiency, and knowledge tests, even beating competitors like Meta's LLaMA 2. Among closed source models, it ranks just behind OpenAI's GPT 4, and performs on par with Google's PaLM 2 Large, which powers Bard, despite being half the size of the model.
    Listing Thumbnail

    Falcon 180B Chat: OpenAI & API Compatible

     Info
    Sold by: Meetrix.io 

    Overview

    Falcon 180B is the largest openly available language model, with 180 billion parameters. It was trained on 3.5 trillion tokens using TII's RefinedWeb dataset. This represents the longest single-epoch pre-training for an open model.

    Falcon 180B scaled up for its predecessor Falcon 40B, with new capabilities such as multiquery attention for enhanced scalability. The model used 4096 GPUs on Amazon SageMaker and was trained on 3.5 trillion tokens. This is roughly around 7,000,000 GPU hours. This means that Falcon 180B is 2.5x faster than LLMs such as Llama 2 and was trained on 4x more computing.

    Key Features: Unmatched Pretrained Depth: Boasting an impressive 180 billion parameters, this AMI is set to produce results with unparalleled depth, precision, and richness.

    Quick Deployment: Overcome the challenges of intricate setups effortlessly. The AMI variant provides a clean, ready-to-use experience, simplifying the complexities associated with raw models.

    Effortless API Integration: Seamlessly connect with a multitude of applications. The integrated API design guarantees adaptable versatility and seamless operations.

    Highlights

    • Easy Integration: Designed for interoperability, this AMI effortlessly aligns with the OpenAI landscape and offers seamless API connections, ensuring smooth operations across diverse platforms.
    • Effortless Deployment: Designed for ease-of-use, this AMI provides users with a hassle-free setup experience, ensuring quick implementation without the complexities.
    • Pay-as-You-Go Flexibility: Enjoy cost efficiency with a pay-as-you-go pricing model.

    Details

    Delivery method

    Delivery option
    cloudformation template for Falcon deployment

    Latest version

    Operating system
    Ubuntu 22.04

    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Falcon 180B Chat: OpenAI & API Compatible

     Info
    Pricing is based on actual usage, with charges varying according to how much you consume. Subscriptions have no end date and may be canceled any time. Alternatively, you can pay upfront for a contract, which typically covering your anticipated usage for the contract duration. Any usage beyond contract will incur additional usage-based costs.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    Usage costs (1)

     Info
    Instance type
    Product cost/hour
    EC2 cost/hour
    Total/hour
    g4dn.metal
    $0.104
    $7.824
    $7.928

    Vendor refund policy

    We do not currently support refunds, but you can cancel at any time.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    cloudformation template for Falcon deployment

    Falcon is capable of generating text, answering questions, generating content, and performing a range of text-related tasks in response to the provided prompts

    CloudFormation Template (CFT)

    AWS CloudFormation templates are JSON or YAML-formatted text files that simplify provisioning and management on AWS. The templates describe the service or application architecture you want to deploy, and AWS CloudFormation uses those templates to provision and configure the required services (such as Amazon EC2 instances or Amazon RDS DB instances). The deployed application and associated resources are called a "stack."

    Version release notes

    First Release

    Additional details

    Usage instructions

    Click the "Continue to Subscribe" button. After subscribing, you will need to accept the terms and conditions. Click on "Accept Terms" to proceed. Please wait for a few minutes while the processing takes place. Once it's completed, click on "Continue to Configuration".

    IAM Role is set up and configured with the necessary permissions to assume the role for the Falcon service. The IAM Policy is created and responsible for accessing Route53 and creating Letsencrypt SSL certificates. Apart from that volume permissions also given for changing volume type of instance. Access the application via a browser at http://<your domain name>/docs or http://<Public IPv4 address>/docs.

    Product will try to setup SSL based on provided domain name, if domain hosted on route53. If the automatic SSL creation unsuccesful then you have to point domain name into server ip, ssh into server and run /root/certificate_generate_standalone.sh. Admin email is also using for generate SSL certificates.

    Support

    Vendor support

    Please contact us through support@meetrix.io . Please allow up to 12 hours for our support team to address your request.

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Similar products

    Customer reviews

    Ratings and reviews

     Info
    0 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    0%
    0%
    0%
    0%
    0%
    0 AWS reviews
    No customer reviews yet
    Be the first to write a review for this product.