Overview
An AWS Glue Studio connector for accessing an Exasol database. You can use it to read from or write to an Exasol database from AWS Glue Studio.
Highlights
- Supports reading, writing, column projection and predicate pushdown
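To make column projection and predicate pushdown concrete, here is a minimal sketch of the idea: instead of fetching every row and column and filtering in Spark, the selected columns and filter conditions become part of the SQL that the database runs. The function name `build_pushdown_query`, the filter tuple format, and the example table are hypothetical and for illustration only; this is not the connector's actual implementation.

```python
# Illustrative sketch only: shows how projection and filters can be folded
# into one SQL statement. Names and formats here are assumptions.
ALLOWED_OPS = {"=", "<>", "<", "<=", ">", ">="}


def render_literal(value):
    """Render a Python value as a SQL literal (strings and numbers only)."""
    if isinstance(value, str):
        # Escape embedded single quotes by doubling them.
        return "'" + value.replace("'", "''") + "'"
    if isinstance(value, (int, float)) and not isinstance(value, bool):
        return repr(value)
    raise TypeError(f"unsupported literal type: {type(value).__name__}")


def build_pushdown_query(table, columns=(), filters=()):
    """Build a SELECT that carries the projection and the predicates.

    columns: column names to project; empty means all columns.
    filters: (column, operator, value) tuples combined with AND.
    Identifiers are inserted as-is; this sketch does not quote them.
    """
    select_list = ", ".join(columns) if columns else "*"
    sql = f"SELECT {select_list} FROM {table}"
    clauses = []
    for column, op, value in filters:
        if op not in ALLOWED_OPS:
            raise ValueError(f"unsupported operator: {op}")
        clauses.append(f"{column} {op} {render_literal(value)}")
    if clauses:
        sql += " WHERE " + " AND ".join(clauses)
    return sql


query = build_pushdown_query(
    "SALES.ORDERS",
    columns=["ID", "TOTAL"],
    filters=[("TOTAL", ">", 100), ("COUNTRY", "=", "DE")],
)
print(query)
```

With pushdown, only the two projected columns of the matching rows leave the database, which reduces the data transferred to the Glue job.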
Details
Delivery details
Activate in AWS Glue Studio
- Amazon ECS
Container image
Containers are lightweight, portable execution environments that wrap server application software in a filesystem that includes everything it needs to run. Container applications run on supported container runtimes and orchestration services, such as Amazon Elastic Container Service (Amazon ECS) or Amazon Elastic Kubernetes Service (Amazon EKS). Both eliminate the need for you to install and operate your own container orchestration software by managing and scheduling containers on a scalable cluster of virtual machines.
Version release notes
Exasol AWS Glue Connector 2.0.0, released 2023-03-08
Code name: AWS Glue Version 4.0.0
Summary
In this release we updated the connector to support the latest AWS Glue Studio 4.0.0 release.
AWS Glue 4.0 supports Spark 3.3.0 and Python 3.10. Here are some notable improvements:
- Many Spark functionality upgrades from Spark 3.1 to Spark 3.3
- Log4j 2 migration from Log4j 1.x
- Several Python module updates from AWS Glue 3.0, such as an upgraded version of Boto
- Native support for open-data lake frameworks with Apache Hudi, Delta Lake, and Apache Iceberg
- Native support for the Amazon S3-based Cloud Shuffle Storage Plugin (an Apache Spark plugin) to use Amazon S3 for shuffling and elastic storage capacity
You can read more about the changes in the release notes.
Refactorings
- #66: Updated to AWS Glue version 4.0.0
- #64: Updated dependencies and removed references to maven.exasol.com repository
Dependency Updates
Compile Dependency Updates
- Updated com.exasol:exasol-jdbc:7.1.11 to 7.1.17
- Added com.fasterxml.woodstox:woodstox-core:6.5.0
- Updated software.amazon.awssdk:s3:2.18.4 to 2.20.19
Test Dependency Updates
- Updated com.amazonaws:AWSGlueETL:3.0.0 to 4.0.0
- Updated com.amazonaws:aws-java-sdk-s3:1.12.329 to 1.12.422
- Updated com.exasol:exasol-testcontainers:6.3.0 to 6.5.1
- Updated com.exasol:java-util-logging-testing:2.0.2 to 2.0.3
- Updated com.exasol:test-db-builder-java:3.4.1 to 3.4.2
- Removed log4j:log4j:1.2.17
- Updated nl.jqno.equalsverifier:equalsverifier:3.10.1 to 3.14
- Added org.apache.logging.log4j:log4j-api:2.20.0
- Added org.apache.logging.log4j:log4j-core:2.20.0
- Updated org.junit.jupiter:junit-jupiter-api:5.9.1 to 5.9.2
- Updated org.junit.jupiter:junit-jupiter:5.9.1 to 5.9.2
- Updated org.mockito:mockito-core:4.8.1 to 5.1.1
- Updated org.mockito:mockito-junit-jupiter:4.8.1 to 5.1.1
- Updated org.testcontainers:junit-jupiter:1.17.5 to 1.17.6
- Updated org.testcontainers:localstack:1.17.5 to 1.17.6
Plugin Dependency Updates
- Updated com.exasol:error-code-crawler-maven-plugin:1.1.2 to 1.2.2
- Updated com.exasol:project-keeper-maven-plugin:2.8.0 to 2.9.3
- Updated io.github.zlika:reproducible-build-maven-plugin:0.15 to 0.16
- Updated org.apache.maven.plugins:maven-assembly-plugin:3.3.0 to 3.4.2
- Updated org.apache.maven.plugins:maven-failsafe-plugin:3.0.0-M5 to 3.0.0-M8
- Updated org.apache.maven.plugins:maven-jar-plugin:3.2.2 to 3.3.0
- Updated org.apache.maven.plugins:maven-shade-plugin:3.4.0 to 3.4.1
- Updated org.apache.maven.plugins:maven-surefire-plugin:3.0.0-M5 to 3.0.0-M8
- Updated org.codehaus.mojo:flatten-maven-plugin:1.2.7 to 1.3.0
- Updated org.codehaus.mojo:versions-maven-plugin:2.10.0 to 2.14.2
Additional details
Usage instructions
Please subscribe to the product on AWS Marketplace, then activate the Glue connector from AWS Glue Studio.
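Once the connector is activated and a connection exists, a Glue job script typically passes a set of connection options to the Glue context. The sketch below only builds such an options dictionary. The `connectionName` key and the `marketplace.spark` connection type follow the general AWS Glue pattern for Marketplace connectors; the `table` and `query` keys, the helper name `build_connection_options`, and the connection name `exasol-connection` are assumptions for illustration, so check the connector's user guide for the options it actually accepts.

```python
# Hypothetical helper: option keys other than "connectionName" are assumptions.
def build_connection_options(connection_name, table=None, query=None):
    """Return connection options for reading either a table or a query result."""
    if (table is None) == (query is None):
        raise ValueError("provide exactly one of 'table' or 'query'")
    options = {"connectionName": connection_name}
    if table is not None:
        options["table"] = table  # assumed option key
    else:
        options["query"] = query  # assumed option key
    return options


options = build_connection_options("exasol-connection", table="SALES.ORDERS")
print(options)

# Inside a Glue job, where a GlueContext named glue_context is available,
# the options could then be used roughly like this (not executed here):
#
# frame = glue_context.create_dynamic_frame.from_options(
#     connection_type="marketplace.spark",
#     connection_options=options,
# )
```

Keeping the options in a small helper makes it easy to validate them before the job starts, rather than failing after the Spark session has been created.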
Support
Vendor support
Please let us know by opening a support ticket at https://www.exasol.com/support or by opening a GitHub issue in the project repository.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Customer reviews
Review
The best Analytic Database performance
Best analytics DB
It is also great that it can handle multiple users working on the same dataset without any performance problems.
Great tool
Speed is phenomenal even when running complex queries.
Best choice for on-premise, getting better in the cloud
+ Cost-to-performance ratio is very competitive, you'll need much less hardware;
+ Efficient indexed JOINs, probably best in class;
+ Flexible UDF scripts, can be implemented in any programming language and run in parallel;
- Lack of native partial backup (per-table, per-schema), must use EXPORT instead;
- Detection of data lineage is a bit difficult, unless you implement it yourself externally;
- Cloud capabilities are not fully utilised (yet), but it's not a problem for "on-premise";
A high-performance database for Analytics. Would definitely recommend it.
This extraordinary performance allowed us, over time, to squeeze much more value from the database, while keeping maintenance and development costs low.
For my Analytics department, migrating to Exasol was the single most important step that we took to provide a better service to the business, ensure quick development cycles, and incur minimal admin effort.