Missing tool in Data Science pipeline

  • By Grzegorz M.
  • on 10/05/2019

Quilt simplified our data maintenance and versioning flow. It is now extremely easy to keep track of changes in a dataset and to refer to a specific revision reproducibly, without worrying that someone will overwrite the data. We have already integrated it into our flow, so dataset updates no longer interfere with model building.
The Quilt team provides us with ongoing support. Bugs happen in all software, but when we found a small bug, we received a fix in no time, so we could smoothly continue our work.
We spotted some drawbacks in Quilt Teams some time ago. These are mostly resolved here, and the remaining "wishes" are on the roadmap. It's really nice that the devs listen to our needs!
What we love most about Quilt is the caching feature. We reduced data transfer costs while keeping our scripts simple.
Our overall grade is 5/5, since this tool was sorely missing from our Machine Learning flow. At the moment we also use it for versioning models (especially since we generate models in a bunch of formats each time) and Jupyter Notebooks (for which Git isn't the best option).

