Overview
MarkLogic customers now have an easy-to-use option to build and manage their Apache Spark based data pipelines using serverless, fully-managed AWS Glue service. The AWS Glue connector from MarkLogic enables customers to use Apache Spark for data ingestion and data transformation, and load the resulting data into MarkLogic. It also allows customers to read data from MarkLogic using Apache Spark.
The AWS Glue connector from MarkLogic enables customers to use Apache Spark for data ingestion and data transformation, and load the resulting data into MarkLogic Data Hub Service. It also helps the customers to read data from MarkLogic using Apache Spark.
Highlights
- High Performance Data Ingestion: Build rich and scalable Spark based data ingestion pipelines with readily-available connectors to diverse data sources for all data types.
- High Performance Data export: Build rich and scalable Spark based data export pipelines to read data from MarkLogic.
Details
Features and programs
Financing for AWS Marketplace purchases
Pricing
Vendor refund policy
No refunds
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
MarkLogic Connector for AWS Glue
- Amazon ECS
Container image
Containers are lightweight, portable execution environments that wrap server application software in a filesystem that includes everything it needs to run. Container applications run on supported container runtimes and orchestration services, such as Amazon Elastic Container Service (Amazon ECS) or Amazon Elastic Kubernetes Service (Amazon EKS). Both eliminate the need for you to install and operate your own container orchestration software by managing and scheduling containers on a scalable cluster of virtual machines.
Version release notes
Initial release of the MarkLogic connector for Apache Spark 3. The previous MarkLogic connector was designed for Apache Spark 2 and required use of the MarkLogic Data Hub Framework. This connector requires Apache Spark 3 and does not depend on the Data Hub Framework.
This release is compatible with AWS Glue 4.x
Additional details
Usage instructions
Please subscribe to the product from AWS Marketplace and Activate the Glue connector from AWS Glue Studio
Resources
Support
Vendor support
For support, Contact MarkLogic by creating a ticket at https://help.marklogic.com/ or sending an email to cloud-support@marklogic.com . Support is not included in hourly fee. Community-based support is available at http://developer.marklogic.com/qa . Free MarkLogic training is available here https://www.marklogic.com/learn/university/
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products
Customer reviews
Marklogic review by a Senior Consultant
1. Multi media database capabilities- The storage and ability to manage different types of data models in single platform has helped me to run my projects simultaneously handling data types such as JSON, XML
2. Enterprise Search Functionality- The in built search options had helped me in searching across all data types. The feature also helped me to search using full text, range queries etc.
3. Storage and Query capability- This feautre helped me in optimizing the storage infrastructure by placing data in different storage tiers. It helped me to reduce the cost incurred and the data can be accessed frequently .
1. Learning curve- despite the features , marklogic proven to be having a deep learning curve.
2. Cost and Licensing - The total ownership model is having a greated inmpact in taking this software as a package for any development. The cost will be a barrier to any small and medium sized projects. Even though my org has provided access to this platform by to get to that stage we had to go through a lot of to and fro
The project also helped me explore the advanced search capabilities by frequently using this for searching content within XML.
Database, search engine and integration tool rolled into one
MarkLogic- A powerful tool/database for all your NoSQL transactions.
It also provides the tools for data integrations.
It is comparatively fast,economical and easier to manage.
It is very easy to use and the customer support is also very prompt to response.
The features are easy to implement and can be integrfated with other tools as well.
Even an unexperienced person can implement its features thus it is frequently use in our organization.
The spreadsheet capabilities can also be increased.
The amount of data that is being processed is very large and it requires a platform that can caters the ever growing need of the database along with the faster processing and MarkLogic proves to be a perfect tool for it.
Very efficient database platform
Capable & Efficient Product
a. The flexibilty it provides to handle various type of data
b. Its in-build capability to support CICD.
c. Its capability to integrate with Pega
a. Increase its community base as sometimes it becomes tad difficult to find support.
b. Sometimes it crashes so there is a need to improve on the stability/relaibity of the product.I am sure it will be worked upon in future releases