Weights & Biases AI Development Platform for AWS logo

    Weights & Biases AI Development Platform for AWS

    Weights & Biases provides AI developers with the tools needed to build models faster, fine-tune LLMs, and develop GenAI applications with confidence for enterprises of all sizes in any vertical.

    Ratings and reviews

    4.6
    49 ratings
    3 star
    2 star
    1 star
    82%
    18%
    0%
    0%
    0%
    2 AWS reviews
    |
    47 external reviews
    External reviews are from G2  and PeerSpot .

    Filters

    Review type

    AWS Marketplace reviews
    External reviews
    Reviews (49)
    reviewer2859075

    Tracking model metrics and artifacts has improved workflows but documentation needs clarity

    Reviewed on Jun 20, 2026
    Review from a verified AWS customer

    What is our primary use case?

    My main use case for Weights & Biases is tracking metrics for my models when I'm training or testing new models.

    I can give a quick specific example of how I use Weights & Biases to track metrics during my model training or testing by setting up the script to send my metrics when I'm training models in my Jupyter Notebook.

    In addition to tracking metrics, I also track down errors, precisions, and recalls.

    What is most valuable?

    In my opinion, the best features Weights & Biases offers are that they are easy to adapt and navigate through inside their UI, and I can check the model artifacts by versions. Sometimes when it throws errors, I can check them easily, and it has access control that's a good fit for corporate usage.

    What I like about the UI and the access control features is that they are just easy to navigate, easy to understand, and finding is also easy.

    Weights & Biases has positively impacted my organization a lot because many users are using Weights & Biases for tracking models and seeing metrics. I believe it places a lot less weight on setting up this tracking method.

    I can share specific outcomes or metrics since using Weights & Biases. For example, I can check each epoch's model artifacts and see how effectively it worked on my evaluation set, and it can be fetched by the unique ID.

    What needs improvement?

    I think Weights & Biases can be improved by being more stable because sometimes we have a lot of network issues and it's hard to debug. Also, there are some minor UI bugs where some graphs in specific formats don't show.

    I would add that their documentation is not easy to go through, and when I'm trying to deploy to Kubernetes, their examples are huge and really difficult to understand.

    For how long have I used the solution?

    I have been using Weights & Biases for three years.

    What do I think about the stability of the solution?

    Weights & Biases is stable.

    What do I think about the scalability of the solution?

    Since Weights & Biases is running on AWS on our side, I think it's pretty scalable for now, but I want to expand it to Kubernetes.

    How are customer service and support?

    Their customer support is great because they have 24/7 support and created separate Slack channels for our company users.

    Which solution did I use previously and why did I switch?

    I didn't previously use a different solution; I was just using a notebook.

    Which other solutions did I evaluate?

    Before choosing Weights & Biases, I wasn't in a position to evaluate other options, but somebody did.

    What other advice do I have?

    My advice to others looking into using Weights & Biases is to try it out first and also look at the other options, as it's hard to switch when you are used to Weights & Biases's UI. Other than that, I think it's just about trying to understand the real metrics and concept.

    I would rate this product a 7 out of 10.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    reviewer2857155

    Centralized experiment tracking has guided our model selection for critical economic research

    Reviewed on Jun 15, 2026
    Review provided by PeerSpot

    What is our primary use case?

    My main use case for Weights & Biases was for my end-of-studies project with the Ministry of the Economy and Finance.

    As part of my end-of-studies project with the Ministry of the Economy and Finance, we used Weights & Biases mainly to track and compare our machine learning experiments in a structured way.

    The main goal of our project was to choose the best possible embedding model for the Ministry of the Economy and Finance, and through metric visualization and quick comparison, we were able to decide in a smooth and helpful way which was the best model.

    What is most valuable?

    In my opinion, the best features that Weights & Biases offers are the most useful ones, such as experiment tracking, metric visualization via dashboards, and quick comparison between multiple runs.

    Weights & Biases had a positive impact on my organization in that it allowed us to centralize our research, specifically centralizing our results, metric tracking, and visualization.

    Weights & Biases allowed each person to run their tests and then we could decide directly what decision to make next.

    What needs improvement?

    In my opinion, Weights & Biases could be improved by enhancing the tool with artificial intelligence to allow for faster research and a more intuitive experimentation process.

    For how long have I used the solution?

    I have been using this solution for a year.

    What's my experience with pricing, setup cost, and licensing?

    Regarding the pricing, setup cost, and license management of Weights & Biases, we stayed on the free plan, which allowed us to explore this tool and test all the features.

    What other advice do I have?

    My experience with Weights & Biases was very interesting and very intuitive. I do not have anything else to add about the features I appreciated in Weights & Biases. I do not have any other ideas or details about improvements that could be made to Weights & Biases. I have been able to cover everything, and I do not see any other areas for improvement for Weights & Biases. I would rate this product a 9 overall.

    Gouthami

    Experiment tracking has improved reproducibility while storage costs still need refinement

    Reviewed on Jun 11, 2026
    Review provided by PeerSpot

    What is our primary use case?

    My main use case for Weights & Biases is as a platform to understand machine learning experiments, including tuning model evolutions and managing data. I use it heavily when I need to write ML scripts for my data processing and data analyst responsibilities. It primarily functions as a research tool for tracking and visualizing, serving as a framework to support my data work.

    To provide a specific example of how I have used Weights & Biases in my work, I have a script in my current process that involves code I want to write in Python and PyTorch. I install Weights & Biases, log in to the website, and run queries by configuring it. I have used it to create graphs of the accuracy for the process I'm working on, specifically for a customer prediction model that uses Python scripting.

    When I used Weights & Biases for my customer prediction model in Python, it helped me not only with the visualization aspect but also in managing datasets and collaborating with my team members during model development and deployment. It compares the model and visualizes the metrics we want to derive from it by running ML scripts, helping us understand the project in a deeper way through image classifications and NLP models, monitoring accuracy and utilization, and managing hyperparameters while sharing experiments and comparing results.

    I have used Weights & Biases mainly for predictions, as it generates automated predictions based on the LLMs we create and stores those predictions, providing data with accuracy and creating charts to help us compare the data and run against future experiments.

    What is most valuable?

    Weights & Biases offers several best features including tracking, visualization, the registry part, datasets, dataset versioning, and monitoring framework integrations.

    Out of tracking, visualization, registry, dataset versioning, and monitoring, I find myself relying on data versioning the most because it stores changes in datasets for reproducibility, and the model registry keeps track of which prediction model is currently deployed, along with real-time monitoring during training. It also provides accuracy scores, functioning as an experimental tracking tool.

    These features help in ML engineering, comparing models, and collaborating with the team while maintaining reproducibility throughout the ML lifecycle.

    Weights & Biases has positively impacted my organization by improving the processes we build for the client, particularly in prediction projects. It has improved our datasets and metrics, resulting in faster model comparisons. It provides tracking and model management that helps maintain reproducibility throughout the ML lifecycle.

    What needs improvement?

    Weights & Biases can be improved in areas such as user experience, user interface, cost, and features.

    Regarding needed improvements, I believe the storage cost should be addressed because we want to store larger datasets and more items, so it can be enhanced in that aspect.

    For how long have I used the solution?

    I have been using Weights & Biases for a year.

    What do I think about the stability of the solution?

    Weights & Biases is a stable platform.

    What do I think about the scalability of the solution?

    In terms of scalability, it can support individuals and small teams effectively.

    Which solution did I use previously and why did I switch?

    Previously, we used MLflow, ClearML, and TensorBoard, but they lacked the experimental tracking and model management features that Weights & Biases offers, so we switched to it for its end-to-end ML platforms and storage capacity. It stands out from others due to its open-source nature and stronger model registry with good deployment support.

    What was our ROI?

    In terms of return on investment, it has helped reduce manual efforts, especially in areas such as fraud detection. It provides accuracy and validation by giving us precision metrics, regression models, and more.

    Which other solutions did I evaluate?

    I evaluated various options including MLflow, Neptune.ai, and TensorBoard before settling on Weights & Biases.

    What other advice do I have?

    My advice to others considering Weights & Biases is to leverage its storage registry and track all important metrics while using the model registry and production monitoring accurately. It maintains proper experiment tracking and is very useful for versioning and comparing features to make data-driven model decisions.

    Regarding Weights & Biases' AI capabilities, I believe it has solid governance and security measures.

    I find that audit logging is stronger than in other tools, but we still need to examine perspectives regarding unauthorized access and similar concerns.

    As for accuracy and reliability of outputs from Weights & Biases, it does not directly help with model accuracy but improves model development through experiments and comparisons, making it easier to identify which models deliver the highest accuracy indirectly, as it optimizes the data in a better way.

    I have given Weights & Biases an overall rating of 7 out of 10.

    reviewer2842122

    Experiment tracking has streamlined hyperparameter search and collaboration in daily model work

    Reviewed on May 16, 2026
    Review provided by PeerSpot

    What is our primary use case?

    My main use case for Weights & Biases is experiment tracking.

    What is most valuable?

    Weights & Biases is a very handy library when I want to track experiments and find the optimal parameters for training models or determine which model is the best when experimenting with multiple models. This library is very useful.

    When I need to find the optimal hyperparameter, I can use Weights & Biases to track different hyperparameters for training a model. Weights & Biases offers experiment tracking, hyperparameter optimization, and model artifact versioning.

    I rely the most on hyperparameter optimization in my daily work because it is very useful for training models.

    Weights & Biases is very useful when I need to review the past and see which model performed better or which parameters were the best. It provides good versioning and history, which is a feature I use frequently.

    I think it provides easier collaboration. Even if I want to share my model with someone, they can see the metrics that I am getting in that model.

    What needs improvement?

    I think there are not enough tutorials or training available for Weights & Biases. That would be much more beneficial. A better integration with cloud providers would also help.

    For how long have I used the solution?

    I have been using Weights & Biases for three years.

    What do I think about the stability of the solution?

    Weights & Biases is stable in my experience.

    What do I think about the scalability of the solution?

    Weights & Biases is scalable.

    Which other solutions did I evaluate?

    I evaluated MLflow and TensorBoard before choosing Weights & Biases.

    What other advice do I have?

    I chose a rating of 8 out of 10 for Weights & Biases because it is easy to use and a very good library for machine learning engineers. Weights & Biases is a very useful library if you want to track training experiments.

    reviewer2842017

    Experiment tracking has improved collaboration and has reduced time spent debugging workflows

    Reviewed on May 16, 2026
    Review from a verified AWS customer

    What is our primary use case?

    My main use case for Weights & Biases revolves around experiment tracking and model evaluation.

    In my previous job, I used Weights & Biases for experiment tracking, model evaluation, visibility, and collaboration between the different teams that we had at the company, mostly in product and engineering, while we were working on AI-driven workflows and AI features.

    How has it helped my organization?

    Weights & Biases has positively impacted my organization by improving operational efficiency and functional visibility.

    After adopting Weights & Biases, I noticed positive outcomes such as reduced time spent on debugging and rerunning experiments, allowing teams to quickly identify what configuration produced the best results.

    What is most valuable?

    One of the best features Weights & Biases offers is hyperparameter optimization because it lets us run large-scale hyperparameter searches using random or grid search.

    Hyperparameter optimization from Weights & Biases helped my team significantly by reducing the manual trial and error and improving model performance much faster.

    What needs improvement?

    For improvement, I would say cost and scalability could be addressed, and visibility could be improved further on AI workflows.

    For how long have I used the solution?

    I have used Weights & Biases for around a year in my last job.

    What do I think about the stability of the solution?

    Weights & Biases is stable.

    What do I think about the scalability of the solution?

    Scalability of Weights & Biases has not become a bottleneck in our training workflow.

    How are customer service and support?

    My experience with customer support for Weights & Biases was good.

    Which solution did I use previously and why did I switch?

    Weights & Biases was the first solution I used for experiment tracking and model management, although we were considering Arise AI or BrainTrust at some point.

    Before choosing Weights & Biases, I did evaluate other options, including Arise AI and BrainTrust.

    How was the initial setup?

    Weights & Biases was already in our system and we did not purchase it through AWS Marketplace.

    What about the implementation team?

    I had a good experience with pricing, setup cost, and licensing, and everything was smooth.

    What was our ROI?

    While I cannot share specific metrics due to confidentiality, the return on investment from using Weights & Biases has been really good.

    What's my experience with pricing, setup cost, and licensing?

    I had a good experience with pricing, setup cost, and licensing, and everything was smooth.

    What other advice do I have?

    My advice for others looking into using Weights & Biases is that they should use it and experience the benefits of this product. I would rate this review as a 9 out of 10.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Amazon Web Services (AWS)
    Mamoon K.

    Effortless Training Run Tracking Made Simple

    Reviewed on Dec 04, 2025
    Review provided by G2
    What do you like best about the product?
    It helps track training runs easily. I can see all the logged runs in one place without manually checking
    What do you dislike about the product?
    There should be an easy way to discard non useful runs.
    What problems is the product solving and how is that benefiting you?
    The problem of manually checking training runs again and again in the code makes it tedious. For people learning code like me Weights and Biases presents a unique alternative
    Research

    Very useful quite powerful tool

    Reviewed on Mar 24, 2025
    Review provided by G2
    What do you like best about the product?
    Easy of use, ease of implementation, the possibility to gather all my results, ease of sharing results with teammates, It can compare a lot of data interactively which in other cases could be hard to implement
    What do you dislike about the product?
    It is online approach which is both strong and weak side, sometimes servers are bit laggy.
    What problems is the product solving and how is that benefiting you?
    It makes easy for me to store and analyze experiments results, which in case of using own implementation approach using matplotlib for example would require quite a lot of work.
    Amir Masoud N.

    Seamless Integration and Reliable Support: A Daily Essential for Machine Learning

    Reviewed on Mar 08, 2025
    Review provided by G2
    What do you like best about the product?
    It is highly and well integrated with libraries I am using like PyTorch Lightning. Many times I have set that up because it was very easy, and after a while, it has happened that I lost part of my results, and W&B helped me to easily recover them through the logs, which without it, I probably wouldn't have. The next and very important feature to me is that I can use many different machines and servers at the same time and without being worried about gathering all results together, then using tools like TensorBoard, having them online without any effort(most of the time saves me when I am using supercomputers). It is part of my daily tools, and when I am teaching students machine learning, in very early sessions after teaching them visualization, I will have them use W&B to repeat whatever they have learned so far. I have never had any problem with W&B, but I have heard from one of my friends, whom I recommended he use W&B, that customer support is very fast and experienced.
    What do you dislike about the product?
    Sometimes it is bothering me when I am looking for very basic functionality of W&B and it doesn't provided good documentation for that.
    What problems is the product solving and how is that benefiting you?
    Monitoring training models is an integral part of my research ad W&B made it easy for me.
    Simon S.

    Review for wandb

    Reviewed on Jan 28, 2025
    Review provided by G2
    What do you like best about the product?
    I like the flexibility to log custom parameters, and the colorful comparisons between models in for instance confusion matrices.
    What do you dislike about the product?
    There are a couple of features I would have liked, such as the ability to set a global normalization flag, or controlling which windows stay on screen on reload, maybe these features exist, in that case I just haven't been able to find them
    What problems is the product solving and how is that benefiting you?
    We build image classification software, and for that it is very useful to track model performance
    Research

    The fastest way to log your training runs

    Reviewed on Jan 07, 2025
    Review provided by G2
    What do you like best about the product?
    Everything works almost out of the box and it has a "nice look" once logged in
    What do you dislike about the product?
    I really can't resize images and masked images the way I want inside the GUI and looking at images on mobile is quite a nightmare
    What problems is the product solving and how is that benefiting you?
    Logging my training runs. I could achieve the same result with tensorboard (and alike) + saving images and masks and periodically generating a webpage that shows them, but W&B already offers that and much more