Overview
Visual Flow, powered by Apache Spark, is a cloud-native, open-source ETL tool with a Graphical User Interface, built on the Apache Spark unified analytics engine. It combines the best attributes of Kubernetes, Spark, and Argo into a single system with the following features:
- Portability, flexibility and multi-cloud compatibility
- Increased developer productivity
- High availability, performance and fault tolerance
- Cost-effective
- Open-source
Visual Flow provides a Graphical User Interface to create ETL jobs, combine them into Data Processing Pipelines, run or schedule them, and monitor their execution. It is a multi-tenant system: you can create multiple projects in a single cluster. Each project is represented as a namespace, which makes it possible to allocate resources and grant access to users in different roles.
Visual Flow does not require a database. It is possible to create all objects as native Kubernetes resources. Our tool leverages Kubernetes authorization to manage users and their roles within a project.
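To illustrate the idea of a project as a namespace with Kubernetes-managed roles, here is a minimal sketch of the kind of native resources involved. It is not taken from Visual Flow itself: the function name `project_manifests`, the project name, the user name, and the role name `editor` are all hypothetical. Only the `Namespace` and `RoleBinding` resource kinds are standard Kubernetes.

```python
import json


def project_manifests(project: str, user: str, role: str) -> list[dict]:
    """Build a Namespace and a RoleBinding for one project (illustrative only)."""
    # One namespace per project: resources and access are scoped to it.
    namespace = {
        "apiVersion": "v1",
        "kind": "Namespace",
        "metadata": {"name": project},
    }
    # Bind the user to a role inside that namespace only.
    # The role name is an assumption, not a role shipped by Visual Flow.
    role_binding = {
        "apiVersion": "rbac.authorization.k8s.io/v1",
        "kind": "RoleBinding",
        "metadata": {"name": f"{user}-{role}", "namespace": project},
        "subjects": [
            {"kind": "User", "name": user, "apiGroup": "rbac.authorization.k8s.io"}
        ],
        "roleRef": {
            "kind": "ClusterRole",
            "name": role,
            "apiGroup": "rbac.authorization.k8s.io",
        },
    }
    return [namespace, role_binding]


if __name__ == "__main__":
    # Hypothetical project, user, and role names.
    for manifest in project_manifests("demo-project", "alice", "editor"):
        print(json.dumps(manifest, indent=2))
```

Because the state lives in resources like these, the cluster itself acts as the store, which is consistent with the statement that no separate database is required.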
Visual Flow provides the ability to create parameters (e.g., connection info) and reuse them in Jobs.
Why Visual Flow?
- Graphical User Interface speeds up learning and job development
- Good monitoring and alerting capabilities out of the box
- Error handling out of the box
- Organizing jobs into data pipelines
- Unlimited parallelism
- Unlimited scalability
- Easy to customize and extend
- Easy to test automatically
- Can be installed on-premises as well as on the cloud provider of your choice
- Can be packaged and provided as a SaaS solution on your own cloud platform
Highlights
- Unique open-source ETL tool with a Graphical User Interface
- Leverages the best features of Apache Spark, Kubernetes and Argo
- Unlimited parallelism and scalability
- Visual Flow landing page https://visual-flow.com/
- Visual Flow on YouTube https://tinyurl.com/VisualFlow-playlistst
Details
Pricing
Vendor refund policy
Open source software is freely available to use and made by the people, for the people.
Delivery details
Deployment into EKS
- Amazon EKS
Container image
Containers are lightweight, portable execution environments that wrap server application software in a filesystem that includes everything it needs to run. Container applications run on supported container runtimes and orchestration services, such as Amazon Elastic Container Service (Amazon ECS) or Amazon Elastic Kubernetes Service (Amazon EKS). Both eliminate the need for you to install and operate your own container orchestration software by managing and scheduling containers on a scalable cluster of virtual machines.
Version release notes
1.5.0 (2025-03-11)
Improvements and new features:
- Added GenAI feature (new stage).
- Added Incremental load feature (Read stage).
- Added Interactive mode feature (new 'Debugging' status) with the ability to preview metadata and a data sample.
- Added SQL editor with syntax markup.
- Added new connectors (Databricks, Azure Blob Storage, Google Cloud Storage, Kafka).
- Added new types, formats and parsers (Delta, Binary, PDF, DOC/DOCX, etc.).
- Added Sandbox mode.
- Updated Spark version to 3.4.1.
- Added hints for stage helper.
- Changed search criteria for Users/Roles to search by name instead of ID.
- Improved CDC stage: processing null values.
- Added ability to copy/paste stages between jobs within the same project (namespace).
- Changed boolean drop down lists to a new toggle element.
- Improved security to manage connection passwords.
- Improved search placeholders.
- Improved Avro schema.
- Improved UI: empty jobs and pipelines can no longer be run.
- Improved import/export functionality.
- Added new sort option 'By last edit' for Jobs and Pipelines.
- Improved sorting method.
- Added possibility to sort jobs/pipelines in Pipeline Designer.
- Cosmetic updates in JD for drop down lists, scroll bars and multiline fields.
Fixed:
- Date/time stage issue.
- Role-based issues.
- Jobs/Pipelines dependency issue for 'Copy' action.
- Notification parameters issue.
- Fields duplication issue.
- Drop-down list moving while the page is scrolled.
- Logs level list moving while a level is being chosen.
- Incorrect row count in the resulting STDOUT for the Write stage.
- Incorrect data in Dataframe modal window.
- Typo issues in different modes.
- Jobs vulnerabilities (CVE-2022-33891, CVE-2021-25642, CVE-2021-33036, CVE-2021-37404, CVE-2020-36632, CVE-2020-10650, CVE-2020-9480, CVE-2008-1997, CVE-2021-32626, CVE-2021-40531, CVE-2018-7489, CVE-2020-35491, CVE-2020-35490, CVE-2020-10673, CVE-2017-7525, CVE-2022-25168, CVE-2023-25194, CVE-2019-14887, CVE-2023-22946, CVE-2020-8840).
- DBService vulnerabilities (CVE-2024-45772, CVE-2024-1597, CVE-2024-7254, CVE-2024-32888, CVE-2024-22262, CVE-2024-38809, CVE-2024-22243, CVE-2024-38816).
Additional details
Usage instructions
Use the provided Helm chart to deploy the Visual Flow containers into Amazon EKS.
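As a rough sketch of what such a deployment step might look like, the snippet below only assembles a `helm upgrade --install` command line without running it. The release name, chart path, namespace, and values file are hypothetical placeholders, not the actual names shipped with Visual Flow; consult the vendor's chart for the real ones.

```python
import shlex


def helm_install_command(release, chart, namespace, values_file=None):
    """Assemble a Helm command that installs or upgrades a release."""
    cmd = [
        "helm", "upgrade", "--install", release, chart,
        "--namespace", namespace,
        "--create-namespace",  # create the target namespace if it is missing
    ]
    if values_file:
        # Optional overrides for the chart's default values.
        cmd += ["--values", values_file]
    return cmd


if __name__ == "__main__":
    # All four arguments below are assumed placeholders.
    command = helm_install_command(
        "visual-flow", "./charts/visual-flow", "visual-flow", "values.yaml"
    )
    print(shlex.join(command))
```

Building the command as a list keeps each argument separate, so it can be passed to `subprocess.run` without shell quoting issues once `kubectl` access to the EKS cluster is configured.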
Resources
Support
Vendor support
Support options:
- Essential support: EU business hours
- Advanced support: 24x5
- Premium support: 24x7 + Technical Account Manager
Additionally on demand:
- Professional Services
- Architect's consult
- Initial installation
- Feature requests
Contact details for pre-purchase support: info@visual-flow.com
Contact details for post-purchase support: info@visual-flow.com
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.