
Airbyte
Centralized data pipelines have reduced costs and now power faster analytics and reporting
What is our primary use case?
A specific example of how my data analytics team uses Airbyte Cloud is by obtaining datasets from various sources, such as logs and metrics from multiple sources. These sources need to be managed centrally through a system that functions as a data warehouse or lake, which Airbyte accomplishes. Airbyte Cloud extracts data from applications and databases, such as Salesforce, Stripe, and APIs, and loads them into destinations such as Snowflake, BigQuery, and Redshift. Airbyte Cloud collects our data and keeps it synced automatically to multiple destination sources.
Airbyte Cloud functions as a data pipeline engine for a modern data stack.
What is most valuable?
Airbyte Cloud has positively impacted my organization by reducing the manpower required for managing the underlying resources of a data sync. It directly performs the job that a database engineer would do by managing a huge connector ecosystem with over 600 connectors across SaaS tools and databases, enabling faster integration. Whenever new data arrives, it automatically syncs to the destination source without requiring any engineer to manually copy or replicate the data. This approach helps our organization significantly.
What needs improvement?
There are some bugs in the user interface that could be improved.
For how long have I used the solution?
What do I think about the stability of the solution?
What do I think about the scalability of the solution?
How are customer service and support?
Which solution did I use previously and why did I switch?
We switched from Fivetran due to price constraints.
What was our ROI?
What's my experience with pricing, setup cost, and licensing?
Which other solutions did I evaluate?
What other advice do I have?
I rate Airbyte Cloud nine out of ten because I generally do not give perfect scores, as the technology is still evolving and still has some bugs in the user interface. Additionally, there is a lack of documentation for new users to understand the product quickly and utilize its functionality and features properly. My overall rating for this review is nine out of ten.
Streamlined data pipelines have transformed daily reporting and accelerated decision making
What is our primary use case?
My main use case for Airbyte Cloud is to move data automatically from multiple source systems into a centralized data platform for analytics, reporting, AI, and business intelligence. Airbyte Cloud is primarily used for data warehouse ingestion, moving data from business applications and databases. It also has business benefits such as providing a single source of truth, faster reporting, and better decision making. Additionally, it supports real-time or near real-time data using CDC, and Airbyte Cloud copies only changed data, which covers the main use cases from our side.
A quick example of a workflow with Airbyte Cloud is very simple: it starts with the source system, then data extraction, followed by Airbyte Cloud processing, data transformation, and ending with the destination system, along with monitoring and alerts. The detailed workflow includes connecting to the source, configuring the source connector, and then Airbyte Cloud connects using hostname, database name, username, and password. After that, Airbyte Cloud scans the database and discovers tables automatically. During every run, it copies all data for small database datasets, and it copies only new or modified records. The most common production use case is change data capture, which reads database transaction logs and captures inserts, updates, and deletes. Next, we select the destination and configure the frequency, then Airbyte Cloud automatically runs the pipeline. Finally, Airbyte Cloud continuously tracks sync success, sync failure, records processed, data volume, and duration.
What is most valuable?
The best features Airbyte Cloud offers are security features, including authentication, encryption, and enterprise controls. It has authentication with SSO support and role-based access. The encryption covers data in transit and data at rest, while enterprise controls help with governance, access management, and audit capabilities.
Airbyte Cloud has positively impacted our organization by reducing the time and effort required for our daily tasks. It decreases the cost for the company and has streamlined our daily work. Before using Airbyte Cloud, we received requests such as pulling data from Salesforce or Azure, syncing Jira tickets to Power BI, or moving PostgreSQL data to Snowflake. After we started using Airbyte Cloud, meeting business requirements became much easier. With Airbyte Cloud, we can select the source, choose the destination, and configure the sync schedule, all of which can be done in minutes. It is truly time-saving and has improved the traditional approach; previously, tasks took two to five days, but now with Airbyte Cloud, they are completed within 30 minutes to one hour.
What needs improvement?
Airbyte Cloud does have some limitations, particularly in terms of connector quality, as it varies, and not every connector has the same maturity level since many are community-driven. Additionally, the CDC setup often requires database log configuration and careful operation management. Some users report that large self-hosted deployments can require tuning and operational expertise. For complex enterprise workflows, troubleshooting connector-specific issues sometimes necessitates deeper investigation, which are areas that can be improved in the future.
I would suggest that if Airbyte Cloud could enhance its monitoring and troubleshooting capabilities, it would be very helpful for us.
For how long have I used the solution?
I have been using Airbyte Cloud for the last two years.
What do I think about the stability of the solution?
Airbyte Cloud is stable.
What do I think about the scalability of the solution?
The scalability of Airbyte Cloud is good; it is scalable.
How are customer service and support?
I don't have much information about customer support because we haven't needed it. However, when we initially used Airbyte Cloud, customer support was very helpful.
Which solution did I use previously and why did I switch?
I previously used some other cloud solutions, but they were not reliable, which is why we moved to Airbyte Cloud.
How was the initial setup?
My experience with pricing, setup cost, and licensing for Airbyte Cloud has been positive; it reduces the pricing for our organization, and the setup cost was not too high. It is easy to use and beneficial from a cost perspective. I already mentioned that before using Airbyte Cloud, some tasks needed two to five days, but after using Airbyte Cloud, they only take 30 minutes to one hour, significantly saving our time.
What was our ROI?
I can provide some specific metrics: before using Airbyte Cloud, one task would take two to five days, but now it can be completed within one hour, which saves a significant amount of time. It also reduces our effort spent on things such as API updates, version updates, and authentication changes, which used to take considerable time. After using Airbyte Cloud, the engineering team spends less time maintaining integrations because Airbyte Cloud maintains many connectors for us.
Which other solutions did I evaluate?
Before choosing Airbyte Cloud, we evaluated many options, but they were not sufficient for our needs, which is why we selected Airbyte Cloud.
What other advice do I have?
Airbyte Cloud has a huge connected ecosystem and it is open-source, which means everyone can use it. Furthermore, it features strong CDC support, making it excellent for modern data stacks.
The learning curve for new users adopting Airbyte Cloud is easy; it does not require much training. However, if someone is new to this tool, they may need some training, but not extensively. It is very easy to use.
The Airbyte Cloud connector ecosystem is broad enough for our needs, so I don't wish for any additional connectors.
The documentation and community support for Airbyte Cloud are helpful; they have been beneficial when we run into issues.
When handling large data volumes or high-frequency data transfers, we need to monitor to ensure correctness. Generally, it works well after monitoring, but we cannot fully rely on it for large data, so we believe it is essential to monitor from our side.
Airbyte Cloud's ease of integration with other tools or platforms is very good, as it is extremely useful for our needs due to its security features. It is positioning itself to become a data and context layer for AI agents by connecting business systems and making unified data available for AI applications.
Airbyte Cloud's AI capabilities are very much secure. The accuracy and reliability of Airbyte Cloud's AI output are considerable; it is scalable and reliable, but on large-scale operations, some users have reported that large self-hosted deployments require tuning and operational expertise. Therefore, while it is reliable, it has limitations in large-scale scenarios.
My advice to others considering Airbyte Cloud is that they should use it because it is very helpful and saves a lot of time in managing day-to-day tasks. Everyone should take advantage of Airbyte Cloud. I would rate my overall experience with Airbyte Cloud an 8 out of 10.
Automated data flows have unified sensor and app insights and now drive faster product decisions
What is our primary use case?
Our main use case for Airbyte Cloud is consolidating data from multiple sources: drone flight logs, RTs, soil sensors, weather APIs, mobile app backends, and CRM tools, all into one central data warehouse. As a product team, we use the unified data to track product usage patterns, monitor field performance, and make better decisions about future priorities.
We had a specific challenge where our drone data was stored in one database, farm engagement data was in another system, and weather data was coming from a third-party API. Our data analysts were manually downloading and combining this data every week, which was error-prone and slow. I helped implement Airbyte Cloud to automate all three data pipelines in our BigQuery warehouse within a two-week setup. Our analysts had a single source of trust, updating automatically every hour, and the weekly manual data merge process was completely eliminated.
What is most valuable?
The best features Airbyte Cloud offers are the huge connector library, automatic schema change detection, and scheduling and synchronized frequency control. The transformation support with dbt integration, and the clear monitoring dashboards that show sync status and error every time are also notable.
Definitely the pre-built connectors have been the most valuable feature for my team, and it has made my workflow easier. As a product manager intern, I don't have deep engineering resources to build custom data pipelines from scratch. Having a ready-made connector for tools such as Google Sheets, PostgreSQL, HubSpot, and various API tools means I can set up a new data pipeline in under one hour without writing a single line of code. The self-service capability has been incredibly empowering for the product team specifically.
Airbyte Cloud has positively impacted our organization by directly improving our product decision-making speed. Before, we were making feature decisions based on gut feelings or out-of-date weekly reports. Now we have nearly real-time data flowing into our dashboards, and we can see exactly how farmers are using our app, which drone features are being used the most, and where the drop-offs happen. This has made our product roadmap more evidence-based.
What needs improvement?
I give it an eight because of error messages. If they solve some error messages, that would help significantly. Sync failures can be technical and hard to understand for a non-engineer. A more user-friendly error explanation would be beneficial.
For how long have I used the solution?
We have been using Airbyte Cloud for approximately eight months now during a phase where our data is scattered across too many disconnected systems, and we need a reliable way to bring everything together.
What do I think about the stability of the solution?
Regarding accuracy and reliability, Airbyte Cloud's sync accuracy has been reliable in our experience. Data arrives complete and correctly structured almost every time. We have had very few incidents of data loss or corruption. The incremental sync feature is particularly very accurate as it only moves new or changed records, which keeps our warehouse clean and our data cost-controlled.
What do I think about the scalability of the solution?
Airbyte Cloud scales well as our data needs grow to a scale of ten.
Which solution did I use previously and why did I switch?
Airbyte Cloud compares favorably to other data integration tools I have used or evaluated, as it is more smooth and manageable, and you can set it up on your own without a developer.
How was the initial setup?
The experience of integrating Airbyte Cloud into our existing tech stack was much smoother than I expected, especially considering how complex our tech stack is at Adarsh Human. We have a fairly diverse setup, using PostgreSQL for our core application database.
What was our ROI?
Since using Airbyte Cloud, we save approximately seventy to seventy-five percent of the time our data team was spending on manual data preparation. That is roughly six to eight hours per week recovered. For a lean startup team, that is significant. We also avoid hiring a dedicated data engineer for pipeline maintenance, which has saved us a significant salary. Airbyte Cloud essentially covers that function at a fraction of the cost.
What other advice do I have?
Airbyte Cloud is already a good application and does not need improvement.
The learning curve for new users on our team is very easy to understand. It does not require coding skills to implement it, and users can use it very easily.
I would describe the documentation and resources provided by Airbyte Cloud as awesome. Their connectivity and core scale are good, and the complex parts, such as connectivity to IoT and APIs, are well documented. For a product intern such as myself who needs coordination and does not have deep developer skills, Airbyte Cloud made everything very manageable.
My advice for others looking into using Airbyte Cloud is that if they have multiple data flows, this is a great application and a great product for connectivity and all types of data in one system. Airbyte Cloud provides more complex customized IoT and API solutions, and I believe everyone should use Airbyte Cloud. I rate this product an eight overall.
Incremental data pipelines have accelerated analytics while observability and governance still improve
What is our primary use case?
I have been using Airbyte Cloud for the last year. I have mainly worked with Airbyte Cloud in the context of data integration organization workflows. My involvement has included validating data pipelines, monitoring sync jobs, troubleshooting data discrepancies, and ensuring data quality between source and destination systems.
I can describe how the incremental data extraction feature of Airbyte Cloud impacted my daily workflow. Before Airbyte, a lot of our validation effort was around full dataset comparisons, which was slow and expensive. Once we moved to Airbyte Cloud with incremental syncs, the workflow shifted. Instead of revalidating entire tables, I focused on delta-based validation, only validating new and updated records. I built SQL checks around max timestamp tracking, primary key plus updated at comparisons, and row count deltas per sync run. It also meant I had to think more about data consistency over time, not just snapshot correctness.
What is most valuable?
Airbyte Cloud has impacted us very positively in the perspective of faster time to data for analytics teams. Earlier, getting a new data source into the warehouse required engineering effort, custom scripts, and testing cycles. Now with Airbyte Cloud, new sources could be connected in hours instead of days or weeks. Analysts and product teams got access to fresh data much faster, and this improved decision-making speed, especially for campaign tracking and product usage metrics.
I can estimate how much time I saved with Airbyte Cloud. Before Airbyte, building a new ingestion pipeline typically took three to seven days. This included coding, testing, debugging, and deployment coordination. With Airbyte Cloud, most standard connectors were ready in a few hours to one day. My involvement shifted mostly to validation rather than setup. Earlier, full regression on data pipelines often took one to two days per release cycle because we had to validate full data sets and debug integration issues manually. Now, we moved to delta-based testing. Most validation cycles came down to a few hours per pipeline run. Faster failure detection reduced debugging time significantly.
What needs improvement?
Though Airbyte Cloud is a mature product, there is room for improvement. One limitation is that a sync being marked successful does not necessarily mean the data is correct. You can still get issues such as partial null ingestion, schema mismatches, or silent mapping problems. Airbyte Cloud focuses more on pipeline execution status than data correctness validation. One improvement that I would like is built-in data validation checks or anomaly detection at Airbyte Cloud level itself. Right now, teams like ours had to build these validations externally in test frameworks. Airbyte Cloud logs are useful, but sometimes not deep enough when debugging complex issues. It is hard to trace exact record-level failures in some cases. There is limited visibility into transformation mapping behavior in connectors, and debugging often requires jumping between source, destination, and logs. One improvement that I can think of is that more end-to-end lineage visibility and record-level tracing for failed or skipped records will be a better move.
For small setups, Airbyte Cloud UI is straightforward, but as the number of connections grows, managing dozens of sources and destinations becomes cluttered, and it is not always easy to quickly understand pipeline dependencies at a glance. These improvements should take place at a dashboard-style operation level. While job failure notifications exist, they can be improved with limited flexibility in defining alert conditions and not enough customization for severity levels.
Airbyte Cloud's AI-adjacent capabilities, such as assisted setup, schema suggestions, and automation features, are still in a relatively early stage. Governance and security need to be reviewed more from a data platform plus cloud SaaS governance lens rather than full AI governance maturity. From a governance standpoint, Airbyte Cloud provides basic controls, such as workspace-level access control, role-based access to some extent, and separation of sources by configuration boundaries.
For how long have I used the solution?
I have been working in the field of testing for four years where I have explored UI testing, mobile testing, and API testing.
What other advice do I have?
I gave it a seven because I see gaps. Airbyte Cloud's UI and UX becomes harder to manage at scale, and observability is not deep enough for complex debugging. Alerting and testing workflows are not fully mature, and schema evolution and connector consistency can be uneven. Airbyte Cloud is excellent for fast, low-effort data ingestion, but still requires external validation and observability layers for enterprise-grade reliability.
Airbyte Cloud's AI capabilities are still mostly assistive rather than fully autonomous. When we talk about accuracy and reliability of output, it is important to separate two things: the data pipeline output and any AI-assisted suggestion or automation features. Airbyte Cloud is highly accurate and reliable for data ingestion in standard use cases, but its correctness guarantees are limited to pipeline execution. Teams still need external validation layers to ensure end-to-end data integrity. I rated this review a seven overall.
Powerful CDC, Scheduler with Seamless Integration
I also appreciate the idea that we can quickly modify connectors when needed. The source code is easy to navigate and adapt to our needs, which makes it easier to share data between processes and to plug and play. It’s also helpful that, if something fails, we can send alerts to Slack via an incorporated webhook. Deploying it locally was straightforward.
We still need to investigate how user authentication works, so not everyone is able to change connections and so on.
Data workflows have accelerated and now optimize migration and cost management
What is our primary use case?
What is most valuable?
The best feature Airbyte Cloud offers is data transformation.
Airbyte Cloud has positively impacted our organization by being very helpful and speeding up our work environment.
An example of how it has helped speed things up is that we have data here and there, which helps us to organize and speed up our migration.
Data tasks help our workflow and organization because most of the things we do manually, so that helps organize.
What needs improvement?
For how long have I used the solution?
What do I think about the stability of the solution?
How are customer service and support?
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
How was the initial setup?
What was our ROI?
Which other solutions did I evaluate?
What other advice do I have?
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Cloud data workflows have streamlined ETL and now need broader, more customizable connectors
What is our primary use case?
My main use case for Airbyte Cloud is for ETL. I tend to use Airbyte Cloud to extract data from one data source and put it into another data source for VTN. We extracted data from Postgres and dumped it into Redshift, which is an example of a real workflow I have set up.
What is most valuable?
I think the various connectors out of the box are the best features Airbyte Cloud offers. I don't need to create custom code to do this kind of work, so it is easy for me to use out-of-the-box connectors in my day-to-day work.
Airbyte Cloud has positively impacted our organization because we were looking for multiple products and ended up choosing Airbyte because it is easy to use and set up. It reduced the development effort, and we did not have to build anything by ourselves, so it was easy to get into Airbyte and build the workflows.
What needs improvement?
I think Airbyte Cloud can be improved by adding more connectors and more customizable connectors.
For how long have I used the solution?
I have been using Airbyte Cloud for the last one year.
What do I think about the stability of the solution?
Airbyte Cloud is stable.
What do I think about the scalability of the solution?
Airbyte Cloud's scalability is good since it is on the cloud.
How are customer service and support?
Customer support for Airbyte Cloud is all good.
How would you rate customer service and support?
Which solution did I use previously and why did I switch?
We used Fivetran, and it was costly, which is why we switched to Airbyte Cloud.
What was our ROI?
I think it required fewer employees, indicating that I have seen a return on investment.
What's my experience with pricing, setup cost, and licensing?
My experience with pricing, setup cost, and licensing was good.
Which other solutions did I evaluate?
Before choosing Airbyte Cloud, I evaluated other options, specifically Fivetran.
What other advice do I have?
My advice to others looking into using Airbyte Cloud is that since it is reliable and easy to use, you can use it. I would rate Airbyte Cloud overall as seven because it has room for improvement.