
Diffbot APIs

Diffbot | 1

Reviews from AWS Marketplace

0 AWS reviews
  • 5 star: 0
  • 4 star: 0
  • 3 star: 0
  • 2 star: 0
  • 1 star: 0

External reviews

29 reviews
from G2

External reviews are not included in the AWS star rating for the product.


    Justin W.

The Most Competent Web Crawling Service I've Used

  • February 03, 2023
  • Review verified by G2

What do you like best about the product?
Overall, Diffbot's tools are simple to use and understand outside of more complex use cases. We use several of their features to deliver content insights to our clients. I would recommend Diffbot to any person or organization that needs to pull large amounts of data from arbitrary web sources.

The first tool we use is Crawlbot, which we appreciate for being configurable and extremely capable. In most of our use cases, we just need to point it at a URL and have it repeat every so often to discover new content. After crawling, the data is available via an easy-to-parse JSON file.

We also use the Diffbot Knowledge Graph API. The powerful DQL language allows us to query a massive amount of data to find articles and entities. DQL is simple to use, and the GUI allows easy testing and iteration.

Diffbot's customer service is also exceptional. Our contact has been very attentive in helping us learn how to properly use Diffbot's services to meet our needs. He has organized one-off Zoom meetings to walk us through the appropriate method for creating DQL queries and has expedited bug fixes required for our use cases.
What do you dislike about the product?
Diffbot is a powerful tool, and with its numerous capabilities, it can be difficult for those unfamiliar with it to understand how to use it properly. Fortunately, Diffbot provides excellent customer service, which can help guide you through the process of determining the best practices for your use case.
What problems is the product solving and how is that benefiting you?
Diffbot offloads the complex and difficult process of web crawling, scraping and analysis/parsing. Rather than writing our own in-house web crawler, we can spend our time elsewhere building features for our clients.

Diffbot's Knowledge Graph allows us to find relationships between articles and entities across the web in near real-time. This feature has been invaluable in providing insightful information to our clients.


    Kurt L.

Diffbot is a game-changer.

  • December 07, 2022
  • Review verified by G2

What do you like best about the product?
Diffbot makes the difficult task of managing data and extracting useful information much easier. They provide access to a seemingly infinite amount of company and contact information and are continuously improving their user interface to add even more value. I use Diffbot every chance I can!
What do you dislike about the product?
Diffbot is very responsive and always willing to help. Their interface still needs some improvements, but I have been their client for over a year now and have seen vast improvements.
What problems is the product solving and how is that benefiting you?
Diffbot is a better version of ZoomInfo with more capabilities beyond primary company, industry and contact info. They have additional tools which allow for data enrichment and are progressing towards in-depth market analytics. Indeed a total-package solution.


    Computer Software

Diffbot Increases Efficiency

  • February 25, 2021
  • Review verified by G2

What do you like best about the product?
Prior to using Diffbot, we relied primarily on RSS feeds and a web scraping tool that is based on the visual layout and HTML of a webpage. We were very dependent on XPaths to get the data we wanted. We find that the Diffbot crawlers are more stable in the long term because they are not as impacted by website design changes. This saves us a lot of time that we would otherwise be spending on maintenance.
What do you dislike about the product?
The two issues that are most challenging for us are:

1. Diffbot does not recognize PDF documents, and we frequently would like to ingest them as articles.

2. We find it difficult to troubleshoot a crawler in situations where it is not bringing in data or it is not bringing in the data we are expecting.
What problems is the product solving and how is that benefiting you?
The biggest problem that Diffbot solved for us is reducing the amount of maintenance we have to do on our scraped websites. We rely heavily on Diffbot's full-text capability, and Diffbot's metadata is also useful for us. The metadata that we use most is Diffbot's language designation, which ensures that our clients see only articles in the languages that they choose.

We also see great potential for using the bulk API to become more efficient in our content ingest process and we are excited to continue to explore this option.


    Venture Capital & Private Equity

Great enrichment tool

  • February 12, 2021
  • Review provided by G2

What do you like best about the product?
1) Enrichment data
2) Ability to query data in aggregate
What do you dislike about the product?
1) Being charged based on entities
2) Being charged as we go (I wish there was a way to limit my queries)
What problems is the product solving and how is that benefiting you?
Lead enrichment
Lead sourcing
Customer profiling


    Tom W.

Excellent and reliable service over 4 years

  • January 21, 2021
  • Review verified by G2

What do you like best about the product?
High detection accuracy and uptime: most of the time we can send API requests and know that the responses from Diffbot will be valid.
What do you dislike about the product?
Some old versions of Python are used (<3.0) and could be upgraded.
What problems is the product solving and how is that benefiting you?
We have been using the Article and Analyze APIs as a core part of our pipeline. After doing a build-vs-buy comparison, we realized that it would be far preferable to leave this step to an external best-in-class solution, rather than to build (and importantly *maintain*) it in-house. Wherever the automated page structure analysis fails, our team can easily "teach" it the structure, and in the rare cases where that fails, the Diffbot team is very responsive in addressing issues.


    Nitin A.

social media and news monitoring

  • November 23, 2020
  • Review verified by G2

What do you like best about the product?
Diffbot provides great APIs, technical resources, and overall service. Their technical resources are among the most advanced and accurate. Diffbot's team keeps their APIs up to date with social media's rapid evolution. The customer support is equally helpful and very friendly. They are very willing to work with flexible scenarios, accommodate the needs and low budgets of small research groups, and provide demo and trial accounts to experiment with. Overall, they are the best social media data provider and analysis company in my experience of over a decade.
What do you dislike about the product?
This is more like a suggestion. Diffbot has several excellent capabilities and they are constantly improving and adding new features. Current customers and perhaps prospective ones too would benefit from a weekly/monthly newsletter, or social media updates, about these new developments.
What problems is the product solving and how is that benefiting you?
Social media and news monitoring.

Diffbot's services have allowed us to streamline our data collection method. Previously, we wrote our own web crawlers/scrapers for blog sites which would break quite frequently. Diffbot has removed that hurdle. We are now looking forward to using the NLP/AI capabilities provided by Diffbot.
Recommendations to others considering the product:
I would strongly recommend Diffbot. But if you are still undecided, contact their support staff for a demo/trial account. You won't regret it!


    Eddie C.

A very good service for anyone needing content extraction and much more.

  • September 01, 2020
  • Review provided by G2

What do you like best about the product?
Having tried a number of similar services in the past, we were very pleasantly surprised as to how good the content extraction is. The contacts we have dealt with at Diffbot have also been extremely helpful.
What do you dislike about the product?
There's not much to dislike. It does the job very well.
What problems is the product solving and how is that benefiting you?
Clean spidering and content extraction of websites.


    Georg H.

Great service for both quick MVPs and professional applications

  • June 10, 2020
  • Review verified by G2

What do you like best about the product?
The parsing quality is best of breed, and it adapts well to all sorts of websites.
What do you dislike about the product?
The scheduling and organizing of crawl jobs is not perfect yet; we are hoping to see some improvements there.
What problems is the product solving and how is that benefiting you?
We use Diffbot to crawl a large number of global news outlets.


    Oleg L.

Using Diffbot to analyze product pages

  • June 01, 2020
  • Review verified by G2

What do you like best about the product?
Diffbot is the best web crawler and analyzer on the market. You can get the structured data about any web page that you want.
What do you dislike about the product?
We have been using Diffbot for 5 months so far, and haven't had any issues with it.
What problems is the product solving and how is that benefiting you?
We are using Diffbot to get structured data about e-commerce web pages. It just does the job. One API call and you get the result.
Recommendations to others considering the product:
I recommend Diffbot.


    Information Technology and Services

Diffbot has been invaluable for news monitoring

  • June 01, 2020
  • Review provided by G2

What do you like best about the product?
We use Diffbot for news monitoring, and their article extraction capabilities are scalable, cost efficient and the right fit for our use case.
What do you dislike about the product?
There's not much to dislike for how we use Diffbot.
What problems is the product solving and how is that benefiting you?
Scalable news monitoring is difficult to accomplish when your solution is completely built or managed in-house. Diffbot's AI solves the technical challenges of article extraction from unstructured web pages, so that we get rich, structured public data.