Listing Thumbnail

    MarkLogic Multi-Model Database: Enterprise Edition v. 11

     Info
    Deployed on AWS
    MarkLogic Server Enterprise v. 11 with Semantics, Advanced Security, and Tiered Storage Options
    4.3

    Overview

    MarkLogic Server is the agile, scalable, and secure foundation of the MarkLogic Data Platform. A multi-model database with a wide array of enterprise-level data integration and management features, MarkLogic helps you create value from complex data - faster.

    MarkLogic Server natively stores JSON, XML, text, geospatial, and semantic data in a single, unified data platform. This ability to store and query a variety of data models provides unprecedented flexibility and agility when integrating data from silos. MarkLogic is the best, most comprehensive database to power an enterprise data platform.

    MarkLogic Server is built to securely integrate data, track it through the integration process, and safely share in it in its curated form. Meet business-critical goals and accelerate innovation with MarkLogic.

    Highlights

    • Best-in-class multi-model database: Advanced search, robust metadata management and semantic capabilities.
    • ACID Transactions: 100% ACID compliant, high-performance distributed transactions. Guaranteed strongly consistent read and write operations.
    • Secure and Governed: Granular role-based access controls and advanced security certifications. Includes features like BYOK, data loss prevention, ABAC policies, and more.

    Details

    Delivery method

    Delivery option
    64-bit (x86) Amazon Machine Image (AMI)

    Latest version

    Operating system
    AmazonLinux Amazon Linux 2023

    Deployed on AWS
    New

    Introducing multi-product solutions

    You can now purchase comprehensive solutions tailored to use cases and industries.

    Multi-product solutions

    Features and programs

    Buyer guide

    Gain valuable insights from real users who purchased this product, powered by PeerSpot.
    Buyer guide

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    MarkLogic Multi-Model Database: Enterprise Edition v. 11

     Info
    Pricing is based on actual usage, with charges varying according to how much you consume. Subscriptions have no end date and may be canceled any time. Alternatively, you can pay upfront for a contract, which typically covers your anticipated usage for the contract duration. Any usage beyond contract will incur additional usage-based costs.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    Usage costs (498)

     Info
    • ...
    Dimension
    Cost/hour
    r5.2xlarge
    Recommended
    $4.373
    r7iz.2xlarge
    $4.373
    c3.8xlarge
    $17.49
    x2idn.16xlarge
    $34.98
    c6i.2xlarge
    $4.373
    m8a.24xlarge
    $52.47
    c5ad.12xlarge
    $26.235
    c8i.4xlarge
    $8.745
    g4ad.8xlarge
    $17.49
    m6a.48xlarge
    $104.94

    Vendor refund policy

    We do not currently support refunds, but you can cancel at any time.

    How can we make this page better?

    Tell us how we can improve this page, or report an issue with this product.
    Tell us how we can improve this page, or report an issue with this product.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    64-bit (x86) Amazon Machine Image (AMI)

    Amazon Machine Image (AMI)

    An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.

    Version release notes

    This is the 11.3.4 (AL2023) release of MarkLogic on AWS Marketplace. See http://developer.marklogic.com/products/cloud/aws  for additional details.

    Additional details

    Usage instructions

    This AMI includes a MarkLogic Essential Enterprise Production license. The AMI is configured to store MarkLogic configuration and data on an attached EBS storage. When you launch this AMI via the EC2 Console, the storage will be pre-configured and it must remain on /dev/sdf device. Leave off the 'Delete-on-termination' checkbox, to enable you to keep your data. If you start the EC2 instance without using supplying any configuration data as described in documentation (link below), then the MarkLogic server will initialize the server and create a default administrator account. You can access the Administration portal on port 8001 using username "admin" and the password equal to the EC2 instance ID (e.g. "i-001602692a5d518a4").

    MarkLogic also provides a Cloud Formation template for launching this AMI that provides the easiest way to gain the benefits of high-availability and scalability.

    FOR MORE DETAILED INSTRUCTIONS, SEE http://developer.marklogic.com/products/cloud/aws 

    Support

    Vendor support

    For support, Contact MarkLogic by creating a ticket at https://help.marklogic.com/  or sending an email to cloud-support@marklogic.com . Support is not included in hourly fee. Community-based support is available at http://developer.marklogic.com/qa . Free MarkLogic training is available here https://www.marklogic.com/learn/university/  https://help.marklogic.com 

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

     Info
    Updated weekly

    Accolades

     Info
    Top
    10
    In Data Catalogs
    Top
    25
    In Financial Services, Databases
    Top
    100
    In Databases, Analytic Platforms

    Customer reviews

     Info
    Sentiment is AI generated from actual customer reviews on AWS and G2
    Reviews
    Functionality
    Ease of use
    Customer service
    Cost effectiveness
    Positive reviews
    Mixed reviews
    Negative reviews

    Overview

     Info
    AI generated from product descriptions
    Multi-Model Data Storage
    Natively stores JSON, XML, text, geospatial, and semantic data in a single unified platform
    ACID Transaction Support
    100% ACID compliant with high-performance distributed transactions and strongly consistent read and write operations
    Advanced Search and Metadata Management
    Advanced search capabilities with robust metadata management and semantic query functionality
    Role-Based Access Control
    Granular role-based access controls with attribute-based access control (ABAC) policies for fine-grained authorization
    Tiered Storage Options
    Support for tiered storage configurations to optimize data placement and performance across different storage tiers
    Distributed SQL Database Architecture
    Cloud-native, distributed SQL database designed for high availability and global distribution across multiple regions and availability zones
    High Availability and Fault Tolerance
    Continues serving queries during node failures, availability zone failures, and AWS region failures without service interruption
    PostgreSQL Compatibility
    Postgres-compatible SQL interface enabling seamless integration with existing applications and tools
    ACID Transaction Support
    Supports ACID transactions ensuring data consistency and reliability across distributed deployments
    Multi-Region Data Placement
    Enables single unified database deployment across multiple AWS regions with configurable data locality and low-latency access
    Graph Database Engine
    Cloud-based graph database powered by ArangoDB supporting native graph query processing and relationship traversal for connected data analysis
    Multi-Model Data Support
    Unified platform supporting graph, JSON document, full-text search, and machine learning capabilities through a single query language
    Security and Access Control
    Advanced security features including private endpoints, single sign-on (SSO), and audit logging for access management and compliance
    High Availability and Disaster Recovery
    Data replication with multi-region cloud backups and fully-managed infrastructure ensuring business continuity
    Advanced Analytics and Machine Learning
    Integrated machine learning capabilities enabling predictive analytics, pattern detection, and insights extraction from connected data

    Contract

     Info
    Standard contract
    No
    No

    Customer reviews

    Ratings and reviews

     Info
    4.3
    73 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    53%
    41%
    5%
    0%
    0%
    7 AWS reviews
    |
    66 external reviews
    External reviews are from G2  and PeerSpot .
    Mampi Bhattacharya

    Unified data hub has streamlined multi-format integration and improved data quality

    Reviewed on Apr 04, 2026
    Review provided by PeerSpot

    What is our primary use case?

    Our main use case for MarkLogic  is as a data hub for integrating and managing data from multiple sources. I mostly work on ingesting data from different systems and transforming it and storing it in a structured way so it could be used by downstream applications. For example, in one of our projects, we were pulling data from a couple of legacy systems and normalizing it into a common format using a MarkLogic  pipeline. I also use it for writing queries, mainly XQuery or JavaScript, to fetch and process data based on day-to-day business needs. Sometimes, it is about building APIs on top of MarkLogic to expose that data to other services. Overall, it is a mix of data ingestion, transformation, and querying, making that data usable for applications.

    When considering that use case, the biggest difference I notice with MarkLogic compared to traditional approaches or manual processes is how much it reduces the effort of handling different data formats. Before this, in some projects, we had to write separate pipelines or scripts to process XML, JSON, and other formats. This became quite fragmented. With MarkLogic, we could handle both structured and unstructured data in the same place, which simplified things a lot. For example, in one use case, we were getting data from two systems, one sending XML and another JSON. Instead of building separate transformation layers, we ingested both into MarkLogic and then harmonized them into a common model. That saved a lot of development time and also reduced complexity. Another thing that stands out is how quickly we could query and retrieve data. Since everything is stored in a flexible document model, we did not have to redesign schemas every time whenever requirements changed. That made iterations much faster compared to traditional setups. It is not always simple. There is a learning curve, especially with data modeling and writing efficient queries. Once the team gets comfortable, it becomes a very powerful platform for data integration.

    One thing I did not expect initially was how useful MarkLogic is for tracking data lineage and debugging data issues. When something looked wrong in the final output, we could actually trace back how the data was ingested and transformed step-by-step. Since everything is stored as documents and transformations are applied in stages, it became easier to identify where things went wrong. Regarding one of the cases I remember, a field was getting overwritten during harmonization, and instead of digging through multiple systems, we tracked it directly within MarkLogic and fixed it quickly. The flexibility of the data model helped a lot. When requirements kept changing, it really helped. Instead of redesigning schemas, we could adjust the document structure and update transformations, which made our workflow more adaptable. Overall, it helped with debugging and maintaining data quality, which was a nice advantage.

    What is most valuable?

    A few features really stand out for me, especially when working on data integration projects. One of the most valuable ones is its ability to handle both structured and unstructured data in a single platform. For example, we deal with XML, JSON, and some semi-structured data from different systems. MarkLogic allows us to store and query everything together without needing separate pipelines. That simplified our architecture quite a bit.

    Another strong feature is the flexible document model. Unlike relational databases, we do not have to strictly define schemas upfront. In one project, requirements kept changing and it was dynamic. Instead of redesigning tables, we could adjust document structures and just continue with that. That saved a lot of rework. I also found the search and indexing capabilities very powerful. Even with large datasets, we could retrieve data quickly. We had use cases where we needed to search across multiple kinds of fields and documents, and MarkLogic handled that efficiently without much tuning.

    One feature I personally relied on a lot is the built-in support for XQuery and JavaScript. It gave us flexibility in how we query and process data. For some transformations, XQuery worked really well, while for others we used JavaScript, depending on the complexity. Overall, what stands out is how all these capabilities are available in one platform, which reduces the need for multiple tools.

    The flexible document model probably had the biggest impact on our projects. I can share a good example from a project where we were integrating customer data from multiple systems. Each system had a slightly different structure: some had extra fields, some were missing fields, and formats were not consistent at all. If we used a relational database, we would have had to keep altering schemas or creating multiple tables to handle those variations. But with MarkLogic, we could store each record as a document and gradually harmonize it into a common model. At one point, the business added new attributes to customer data midway through the project. Instead of redesigning anything, we just updated the transformation logic and started storing the new fields in the document. That change was handled pretty smoothly without impacting existing data. That flexibility saved a lot of time and avoided rework. It also made our system more adaptable to changing requirements, which happens quite often in real projects.

    One small but really useful feature is the built-in transaction and consistency handling. In one scenario, we were ingesting data in batches and there was a failure midway due to a data issue. MarkLogic handled it in a way that partial data was not left in an inconsistent state, which saved us from a lot of cleanup work. Additionally, the security and role-based access control is quite granular. We had cases where certain data needed to be restricted to specific roles. We managed that at a document level, which was really very helpful. These are not things you notice immediately, but in real-world usage, they make the system much more reliable and easier to manage.

    MarkLogic has had a strong impact, especially in how we handle and use data across teams. Earlier, data was scattered across different systems, and getting a unified view required a lot of manual effort and coordination. With MarkLogic acting as a central data hub, that process became much more streamlined. Instead of pulling data from multiple sources every time, we had a consolidated dataset available, which made development and reporting faster. It reduced a lot of back and forth between teams and saved a few hours every week during development and testing stages. I think it also improved data quality. Since we had defined ingestion and transformation steps, inconsistencies were caught earlier rather than later in the pipeline. Overall, it helped improve efficiency, reduced duplication of work, and made it easier for teams to rely on consistent data.

    In terms of metrics, we did see measurable improvements over time. Earlier, when we had to pull and reconcile data from multiple systems manually, it used to take a few hours for certain use cases. After moving that into MarkLogic pipelines, the same process became almost near real-time or at least reduced to minutes in most cases. In terms of development effort, we saved around 20 to 30 percent of time on data handling tasks, mainly because we did not have to build separate transformation layers or repeatedly fetch data from different sources. We also saw fewer data-related issues during testing since data was being standardized during ingestion, so inconsistency reduced. Improvements were noticeable in faster data access and also reduced manual effort and better data consistency.

    What needs improvement?

    One area where I feel MarkLogic can improve is the learning curve. When I first started, concepts like data modeling, indexing, or writing efficient XQuery were not very intuitive. It took some time to understand how to structure data properly to get good performance. More beginner-friendly guidance or simplified onboarding would help new users. Another thing is tooling and debugging experience. While it is powerful, debugging queries or transformations can sometimes feel less straightforward compared to more modern developer tools. In a few cases, we spent extra time figuring out why a query was not performing well. Better integration with modern development ecosystems and more lightweight setup options would make it easier for teams to adopt quickly. It is a strong platform, but improving ease of use and developer experience would make a big difference.

    One small thing I have noticed is around monitoring and visibility. When things are running fine, it is great, but when something slows down, such as a query or ingestion job, it is not always very easy to quickly pinpoint the issue. We sometimes had to dig through logs or try different things to understand what was happening. Having more intuitive monitoring dashboards or clearer, cleaner insights into query performance and pipeline stages would be really helpful. Small improvements in documentation with more real-world examples would make a difference, especially for new developers trying to understand best practices. Nothing major, but these small things would make day-to-day usage smoother.

    For how long have I used the solution?

    I have been using MarkLogic for around two years.

    What do I think about the scalability of the solution?

    In my experience, MarkLogic has handled scalability quite well, especially as our data volumes grew. For example, in one project, we started with a relatively small dataset, but over time it scaled to millions of documents. We did not have to redesign the system; we mainly scaled by adding more resources and adjusting indexes. The platform handled that transition smoothly. On the user side, as more services started consuming data from MarkLogic, we did not see major issues in terms of performance. Query response times remained fairly consistent as long as indexing and data modeling were done properly. Overall, it scales well, but getting the best performance depends on how well you design and configure it.

    How are customer service and support?

    Regarding customer support for MarkLogic, in our experience, we did not rely heavily on customer support on a day-to-day basis because once the system was set up, it was quite stable, and most issues could be handled internally. That said, there were occasions, especially early on, where we needed help around configuration and performance tuning. In those cases, the support team was helpful and guided us in the right direction, although it sometimes took back and forth to fully resolve the issue. Overall, the support is considered above average, with good responsiveness and helpful guidance when needed. We mostly relied on internal expertise and documentation and reached out for complex issues.

    Which solution did I use previously and why did I switch?

    Before moving to MarkLogic, we were mainly using a combination of relational databases and some custom ETL scripts for handling data integration. The challenge with that setup was that it became quite complex as the data volume and variety increased. For example, handling both XML and JSON data required separate processing logic, and maintaining those pipelines took a lot of effort. We switched to MarkLogic because it offered a more unified approach, allowing us to handle different data formats in one place and reducing the need for multiple tools and scripts. The flexibility of the document model and built-in capabilities for ingestion and querying made the overall system simpler and easier to manage compared to what we had earlier.

    Which other solutions did I evaluate?

    We evaluated a few other options before choosing MarkLogic. We looked at traditional relational database solutions as well as a couple of NoSQL options like MongoDB. Relational databases were strong for structured data but did not handle changing schemas or mixed data formats well for our use case. MongoDB was closer to what we needed in terms of flexibility, but when it came to advanced search, indexing, and built-in support for both XML and JSON, MarkLogic had an edge.

    What other advice do I have?

    I suggest starting with a clear understanding of your data use case before adopting MarkLogic, especially if you are dealing with multiple data formats or large volumes of data. That is where it really adds value. From our experience, it is important to invest time early in data modeling and indexing. We initially underestimated that, leading to performance issues later. Once we optimized indexes properly, things improved a lot. Also, make sure your team gets comfortable with the basics of XQuery or JavaScript in MarkLogic. There is a learning curve, but once you get past it, development becomes much smoother. I recommend starting with a smaller use case or pilot project, which helped us understand how to structure data and pipelines before scaling it across the system. It is a powerful platform, but you get the best results when you plan the foundation properly and gradually build on it.

    Regarding additional thoughts on MarkLogic, one additional thought I would add is how MarkLogic fits as a long-term solution. From my experience, it is not just a tool you use for a short-term project. Once it is set up properly, it becomes a core part of your data architecture. More teams gradually started relying on it once they saw the value of having centralized and consistent data. It is important to have the right expertise in the team. It is a powerful platform, but to get the best out of it, you need people who understand how to design data models and optimize queries properly. It is a strong platform for enterprise use when you treat it as a long-term investment rather than just a quick solution.

    My overall rating for MarkLogic is 8 out of 10.

    Rituraj NSIT

    Unified search and data management has simplified complex XML and JSON workflows

    Reviewed on Apr 04, 2026
    Review from a verified AWS customer

    What is our primary use case?

    I used MarkLogic  for a total of two years.

    When it comes to main use cases, I used MarkLogic  as a backend service for handling complex structured data such as XML or JSON. I have REST API services and modules using XQuery where the system needed efficient storage, query, and transforms of large volumes of data. Additionally, I worked with real-time ingestion pipelines where data from multiple sources were processed and stored in MarkLogic, enabling real-time access and updates.

    MarkLogic is designed for multi-handle multi-model data, which means it can natively store and query XML documents, JSON documents, and unstructured and semi-structured data. Instead of normal database joins, MarkLogic works by querying inside a document efficiently using indexes. In one of my projects, we used MarkLogic to manage a large-scale document processing system, where we ingested data from multiple upstream systems in XML and JSON format, such as product or property-related data. As soon as the data was ingested, it became immediately searchable due to MarkLogic indexing. MarkLogic handles semi-structured data by storing it as a document, automatically indexing it, and allowing real-time query and updates using XQuery and strong consistency.

    My experience with MarkLogic demonstrates how we leveraged its features beyond just data storage. For example, I worked on optimizing queries written in multiple modules, mostly related to searching with text and applying structured filters, which significantly improved query accuracy and performance. Apart from basic features, I have worked on performance tuning, indexing strategies, and combining full-text search with structured query. I also used MarkLogic as both a database and search engine, which helped to simplify our architecture.

    In our use cases, MarkLogic's universal indexing and clustering have a direct impact on performance and scalability, and it has helped us significantly. In normal databases, we need to define indexes up front, and if a new query comes in, we often need a schema or indexes. In MarkLogic, all data such as XML and JSON were automatically indexed, and we did not need to pre-plan any query patterns. In real time, we had a dynamic search requirement with filters, pricing, location, and keywords, and instead of creating multiple indexes manually, we leveraged our universal index plus range index. For example, when a user searches with multiple filters plus keywords, queries are still fast because MarkLogic uses its internal index instead of scanning documents. Regarding clustering, we have our MarkLogic clustered environment. When multiple nodes work together, horizontal scaling is part of it, as we could add more nodes if data grew, ensuring high availability. For instance, if one node failed, another would handle that traffic. During high traffic, the system stayed stable, and we handled the large data volume without performance degradation. Universal indexing helps us avoid manual indexing management while still providing fast queries for dynamic searches, and clustering allows us to scale horizontally and handle high traffic with no latency and high availability.

    What is most valuable?

    MarkLogic offers several powerful features. First, there is universal indexing, in which it automatically indexes all the stored XML and JSON documents. Second, it can handle both XML and JSON unstructured data in a single database, which makes it flexible for complex and evolving data requirements. The third is the ACID property. Unlike many NoSQL databases, it provides strong consistency with ACID transactions, which is critical for real-time and reliable applications. It also supports horizontal scaling and clustering, which helps in handling large volumes of data and high traffic efficiently.

    One feature that stood out to me in our project is its ability to combine search and database capability on a single platform. The tight integration of full-text search with structured query makes it very powerful for building real-time search applications without relying on any external tools. It simplifies our architecture and reduces system complexity. I also appreciate its flexibility with data models, especially handling both XML and JSON seamlessly, which can be very useful in our use cases with multiple data resources.

    Using MarkLogic has had a significant positive impact on our organization, especially in terms of performance, flexibility, and reliability. With MarkLogic's universal indexing and built-in search, we have seen query response times improve noticeably. Complex searches dropped from a few seconds to sub-second response times in many cases. Users could perform combined keyword plus filter searches in real time, directly improving our application experience. Before implementing MarkLogic, we were using a relational database and NoSQL as separate search engines, requiring Elasticsearch and others. With MarkLogic providing a single platform for both storage and search, we reduced integration overhead, maintenance efforts, and failure points. The schema flexibility for XML and JSON allowed us to onboard new data sources faster. The ACID transactions that MarkLogic provides are crucial and something rarely supported by NoSQL databases. MarkLogic improved our system by enabling faster search, reducing the response time from seconds to sub-seconds, reducing architectural complexity by combining database and search, and improving reliability through ACID transactions and clustering.

    What needs improvement?

    While MarkLogic is powerful, there are areas where I feel it could improve. When I started with MarkLogic, I found that its learning curve and developer experience are not very comfortable for beginners. Technologies such as XQuery are less common compared to Java and Python, so new developers take time to get comfortable with it. Improving documentation and modern tooling would greatly aid onboarding. Second, the cost and licensing can be a concern for smaller teams and startups. MarkLogic's enterprise status makes it less likely to be the first choice for those teams. While it supports deployment in the cloud, the experience could be more seamless compared to fully cloud-native databases. Overall, MarkLogic is excellent for enterprise use cases, especially where search and structured data need to work together, but improving developer experience and ecosystem support would enhance its efficiency.

    A couple of additional areas where MarkLogic could improve are around integration, performance tuning, visibility, and support experiences. While MarkLogic supports REST APIs well, integrating with the modern data ecosystem sometimes requires extra effort compared to other platforms, as out-of-the-box connectors are limited. Although performance is strong, understanding query behavior can be challenging, and debugging slow queries or analyzing indexing usage is not always transparent. Regarding support and documentation, response times can vary depending on the issue or server availability. More real-world examples and troubleshooting guides would enhance developer productivity. Improvements in integration and modern tools in XQuery, along with better observability, are necessary.

    Beyond what I mentioned earlier, there are a few additional areas I can point to. While MarkLogic supports powerful querying via XQuery and JavaScript, many developers are more comfortable with SQL. An intuitive SQL-like query support or a better abstraction layer would enhance adoption across teams. Furthermore, migrating from other databases, whether relational or non-SQL, requires effort in data transformation. Better migration tooling with automated data mapping would also make transitions smoother.

    For how long have I used the solution?

    In my current field, I have been working for the last three years.

    What do I think about the stability of the solution?

    MarkLogic is pretty stable in my experience. It is highly stable and reliable.

    What do I think about the scalability of the solution?

    MarkLogic offers excellent scalability, especially for enterprise-scale applications. In our use case, as data and traffic increased, we were able to scale by adding nodes to the clusters without major changes to the applications, making the scaling very smooth and predictable.

    How are customer service and support?

    MarkLogic has been generally good and reliable in my experience. When I connect with them, it is very responsive. I have gone through support tickets, and proper tracking is available, so overall, it is a good support system, and I would rate it slightly higher than average.

    I would rate MarkLogic's customer support an eight due to its responsiveness, especially for higher priority issues. Support engineers demonstrate good product expertise, and the structure of the ticketing and enterprise support models work well. If someone inquires, I would suggest looking for alternatives if their team is small or they have cost constraints, but if there are no budget issues and their team is large, MarkLogic is reliable and comfortable, providing scalability.

    Which solution did I use previously and why did I switch?

    Before adopting MarkLogic, we were using a combination of traditional relational databases such as Oracle along with a separate search solution, such as Elasticsearch.

    The main reason for switching from Oracle and Elasticsearch to MarkLogic was simplifying our architecture by consolidating database and search into a single platform. With Oracle and Elasticsearch, we had two separate systems, and syncing between them was complex and error-prone. MarkLogic allowed us to manage these components on one platform. Given that our data was semi-structured, managing it in a relational database was tough, but MarkLogic's document model made schema evolution easier without extensive migration.

    How was the initial setup?

    We did not purchase MarkLogic through the AWS  marketplace.

    What was our ROI?

    We saw a clear return on the investment after implementing MarkLogic in terms of saving and personnel efficiency. Since we did not need a separate database and search system, we avoided building and maintaining integrations. This led to roughly a thirty to forty percent reduction in backend development effort. With flexible schema and universal indexing, new features and data sources were onboarded faster, reducing feature delivery time by around forty to fifty percent. In terms of infrastructure and maintenance, we also achieved twenty to thirty percent savings in infrastructure and maintenance overhead.

    What's my experience with pricing, setup cost, and licensing?

    My experience with MarkLogic's pricing and licensing is that it positions itself as an enterprise-grade product. The cost is relatively high compared to open-source alternatives. We use enterprise licensing models, which gives us access to enterprise features and official support. The initial setup cost is moderate to high, mainly due to infrastructure provisioning, licensing costs, and initial configuration and onboarding efforts.

    Which other solutions did I evaluate?

    Before finalizing MarkLogic, we evaluated a few alternatives. We looked at MongoDB, which is good for flexible document storage but required a separate search solution for advanced queries. We also considered using Oracle, which is strong and reliable but less flexible for semi-structured data. Therefore, we selected MarkLogic because it uniquely provides multi-model support along with built-in search and ACID transactions with real-time indexing.

    What other advice do I have?

    My experience with MarkLogic has been very positive. It is a powerful platform, especially for data-driven and search-driven applications where handling complex XML and JSON data and real-time querying is important. The combination of database and search capabilities along with strong consistency and scalability make it an excellent choice for enterprise use cases. However, there are areas such as developer experience, ecosystems, and the learning curve that could be improved to enhance accessibility. I would rate MarkLogic an eight overall.

    Which deployment model are you using for this solution?

    Hybrid Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Islam Md

    Consistent monitoring has ensured long-running batch jobs complete successfully and on time

    Reviewed on Mar 27, 2026
    Review from a verified AWS customer

    What is our primary use case?

    My main use case for MarkLogic  involves running queries to check some of the jobs. I run batch jobs and then I want to check whether the batch jobs are running fine. I check the data on MarkLogic  by running the query on the query logic portal.

    Regarding my main use case with MarkLogic, I find it very handy because every time I run a job, I go and run the query. I go to different databases and then see whether it's running fine. It is enjoyable working with MarkLogic.

    A recent task where MarkLogic was especially helpful involved trying to check the number of batch jobs, DES or PDM jobs, and different jobs. We always check the number and then based on the number, we compare with other tools and then see whether it's matching. It is a comparison with multiple tools. If, for example, PG Admin was not working with PostgreSQL , but MarkLogic was working fine, we were able to fix the issue on the other tools which were not working.

    What is most valuable?

    In my opinion, the best features MarkLogic offers are that it is very easy to use and has a very fast response time.

    The fast response time and ease of use help me in my daily work because it is really helpful since we have to run a lot of jobs at the same time and then we want to make sure everything is running as expected. It always helps us to check whether our jobs are running fine.

    MarkLogic has positively impacted my organization as I think company-wise, it is one of the go-to tools for validating the jobs we are running. It is very helpful for us to deliver our products with quality and on time.

    Using MarkLogic has resulted in specific outcomes, as we run jobs for a long time, sometimes for a couple of days, sometimes for ten hours or twelve hours. It helps us a lot.

    What needs improvement?

    I would say the features can be improved, as maybe the UI could be a little better. I am not sure if there are other options, but the one I am using is from the query console, so maybe I am not aware of other UI dashboards.

    There are ways MarkLogic can be improved. I would like to add that a better UI with more features on it, something user-friendly, would be beneficial.

    I think there is nothing else that could be improved about MarkLogic.

    For how long have I used the solution?

    I have been using MarkLogic for two years.

    What other advice do I have?

    If someone is looking into using MarkLogic, I would say MarkLogic is very helpful for providing the monitoring with detailed features. Running the query is very easy. I rate this product an eight out of ten.

    Which deployment model are you using for this solution?

    Private Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    reviewer2812596

    Handling hierarchical insurance data has improved ETL workflows and still needs better integration

    Reviewed on Mar 27, 2026
    Review from a verified AWS customer

    What is our primary use case?

    MarkLogic 's primary use case in my experience is handling semi-structured and hierarchical data that does not fit neatly into traditional relational tables. In my particular project, I work with NoSQL data, meaning I handle semi-structured and hierarchical data. For example, in one of my insurance projects at ValueMomentum, I worked on an initiative where I had policy and claims data in XML format coming from different legacy systems. I used MarkLogic  to ingest and normalize the data and integrate it with our Hive  data warehouse for reporting and analytics. A specific example I would share is that I created a 360-degree view of customer policies and claims using MarkLogic, which allowed me to merge XML claim documents with relational customer information, perform queries across different nodes, and feed clean aggregated data into the ETL pipeline for downstream analytics.

    This significantly saved time compared to flattening everything manually into SQL tables.

    Working with XML and integrating that with my ETL processes is quite interesting, as MarkLogic makes it far easier to handle hierarchical data instead of attempting to force it all into relational tables. The built-in support for XML and JSON, combined with the indexing and searching capabilities, allows me to query deeply nested structures without writing extensive custom parsing code. The challenging part is mostly around integration with our existing ETL pipelines, ensuring that transformed data flows correctly into Hive  and matches the relational schemas. Sometimes I have to be careful with data types and schema evaluation because MarkLogic is schema-flexible, but the downstream systems are strict. Overall, though, it speeds up handling complex semi-structured data and reduces manual transformation work significantly. I would say it is easier than many other XML handling approaches I have tried.

    I would add that I really appreciate how MarkLogic handles hierarchical relationships naturally, especially in insurance data where nested information such as policies contain multiple coverages, each with different claims and documents. This aspect of the insurance domain is really cumbersome, and MarkLogic allows me to query across these nested structures directly without having to flatten everything at the beginning. Its search and indexing features make it easier to identify anomalies or missing information in the semi-structured data before it reaches the ETL pipelines, saving considerable time during validation and reconciliation. Overall, I find it a very practical tool for bridging legacy data formats with modern data warehouses.

    What is most valuable?

    In my experience, the best features of MarkLogic include its native support for XML and JSON, which makes working with hierarchical or semi-structured data easier than flattening it into relational tables. Additionally, its flexible schema and indexing capabilities allow me to index anything, including nested elements, which speeds up queries and reduces the need for custom code. Other significant aspects include built-in search capabilities that allow for full-text searches and complex queries directly on documents, and data integration and harmonization, which combines multiple sources into one logical view, simplifying manual processes.

    Finally, ACID transactions on semi-structured data give me confidence that updates, merges, and data integrity remain intact even in large or complex datasets.

    The flexible schema, indexing, and search capabilities of MarkLogic are incredibly useful for me. For example, the flexible schema and indexing mean I can load all the XML or JSON policy and claims data without having to predefine rigid table structures, saving considerable time during ingestion, especially since different legacy systems may have slightly different formats. I can simply map the fields I need and let MarkLogic handle the rest. The search capabilities are also very helpful; I can run queries across nested elements to find specific claims, policies, or attachments quickly. Previously, doing this in a relational database would require multiple joins and a great deal of transformation logic.

    MarkLogic has had a tangible positive impact on our organization. Thanks to its flexible schema and indexing capabilities, we ingest semi-structured and hierarchical data from multiple legacy systems 30 to 40 percent faster than before, which leads to faster ETL cycles and quicker delivery of analytics-ready datasets to downstream systems. Using MarkLogic's search and query capabilities, I also reduced the time to reconcile and validate policy and claims data across systems by approximately 35 percent, helping business teams gain insights much faster. This highlights the positive impact MarkLogic has made in our organization.

    What needs improvement?

    There are several things I have observed regarding MarkLogic's improvement areas. One challenge I notice is the learning curve and setup; it can be complex for someone new, especially when integrating with other systems or setting up indexing strategies for large datasets. I occasionally spend extra time fine-tuning indexes or query performance for really large documents. Another observation concerns tooling and ecosystem support, as it does not feel as rich as mainstream databases such as Hive or SQL servers in terms of connectors and integration or community resources.

    Sometimes I need to build custom scripts to bridge these gaps. Finally, monitoring and debugging distributed queries can be tricky; while it has built-in tools, deeper performance profiling or tracing is not always intuitive. Overall, these are not deal-breakers, but improvements in onboarding, ecosystem connectors, and monitoring would enhance the experience.

    For how long have I used the solution?

    I have been using MarkLogic for approximately two to three years, primarily during my enterprise projects where I handle structured or semi-structured data, build integration pipelines, and ensure it works well with other data sources such as Hive and SQL server. I would not call myself a hardcore MarkLogic expert, but I am comfortable navigating its database features, writing queries, and integrating it into larger workflows.

    It has been practical experience rather than just academic work. I have done several things in that area, but I am not very hands-on with it.

    What do I think about the stability of the solution?

    MarkLogic is generally quite good, and in my experience, it is very stable and reliable. I have not faced any major downtime; there have only been occasional minor glitches during cluster configuration or heavy indexing, but nothing that significantly affected production. The built-in replication and failover features also help maintain uptime, ensuring the system stays operational even during maintenance or updates.

    Overall, I find it trustworthy for enterprise workloads, especially regarding the semi-structured and hierarchical data I work with.

    What do I think about the scalability of the solution?

    MarkLogic's scalability is quite solid; scaling up and out as data needs grow does require some planning. I have scaled both vertically by upgrading nodes and horizontally by adding nodes to clusters as my data volume increased. The system handles this increase in XML workloads well, and flexible indexing helps maintain query performance even as datasets expand. However, planning the cluster layout and indexing strategies carefully is crucial. If I just throw the data in without a strategy, performance can degrade, but once set up properly, scaling out is smooth.

    Overall, it is definitely enterprise-ready for growing data needs.

    Which solution did I use previously and why did I switch?

    Before MarkLogic, I primarily relied on relational databases such as SQL servers and some Hive-based data warehouses to manage my data. The reason for switching or adding MarkLogic into the mix is that a great deal of my incoming data is semi-structured, such as XML or JSON from legacy systems, which was cumbersome to flatten into relational tables. It was slow, prone to errors, and complicated ETL pipelines.

    MarkLogic made it much easier to ingest, query, and integrate semi-structured data directly without all that extra transformation overhead. It was not a complete replacement but rather a specialized tool for a specific pain point in my data flows.

    What was our ROI?

    From my perspective, there has definitely been a return on investment with MarkLogic, even if it is not always easy to quantify in exact dollar terms. For example, by using MarkLogic to handle semi-structured data directly, I have reduced ETL prep and transformation time by roughly 30 to 40 percent, freeing up engineers to focus on more value-added tasks instead of manual data cleaning.

    Additionally, faster validation and reconciliation of policy and claims data mean business teams can generate reports and insights 35 percent faster.

    What's my experience with pricing, setup cost, and licensing?

    I do not actually deal with pricing, setup costs, or licensing because I work for an organization, but I believe the pricing and licensing are definitely on the higher side compared to open-source alternatives, based on what I have heard from managers.

    Which other solutions did I evaluate?

    I evaluated multiple options before choosing MarkLogic, including alternatives such as MongoDB, Couchbase, and PostgreSQL  with JSON support. While MongoDB excels in JSON handling and NoSQL flexibility, it lacks ACID compliance across complex transactions. Couchbase is good for key-value and document storage but did not fit my hierarchical XML-heavy use cases. PostgreSQL  with JSON support could handle some semi-structured data, but its performance on large nested XML datasets was not great, and ETL complexity remained high.

    Ultimately, MarkLogic's native XML support, ACID compliance, and built-in search and indexing made it the best fit for my insurance project.

    What other advice do I have?

    MarkLogic is solid for its main features such as handling XML, flexible indexing, search, and ACID compliance on semi-structured data. However, the learning curve is steep, integration with other systems can be tricky, and the community and tooling are not as extensive as mainstream platforms. It is strong, but I would not say it is perfect.

    My advice for others looking into using MarkLogic is three-fold: First, understand your data upfront. If you have a great deal of semi-structured or hierarchical data such as XML or JSON, MarkLogic can save you a great deal of time, but you need to understand your indexing and querying strategies. Second, plan your cluster and indexing strategies carefully as it pays off in performance and scalability later. Third, take full advantage of built-in search and ACID features, which are real-time savers for validation, reconciliation, and governance.

    Additionally, do not underestimate the importance of training and onboarding; there is a learning curve, so make sure your team becomes comfortable with queries, transforms, and integrations before relying on it in production. If used correctly, it is a very strong tool for bridging legacy semi-structured data with modern analytics pipelines. I would rate MarkLogic at a seven on a scale of one to ten.

    Varuns Ug

    Unified search and storage have simplified handling of semi-structured data and complex queries

    Reviewed on Mar 26, 2026
    Review from a verified AWS customer

    What is our primary use case?

    I use MarkLogic  for performing many operations such as handling semi-structured data and building search-driven use cases. For example, I examined how it can handle and store XML documents and leverage its powerful indexing and search capabilities for fast querying. It is particularly useful in scenarios where you need flexible schemas along with advanced search capabilities including filtering, full-text search, and aggregations.

    A task where MarkLogic  was central to my work was a travel search and filtering system, similar to a hotel listing use case. In this project, the data was semi-structured, with elements such as hotel details, amenities, and pricing that vary significantly across different entities. I used MarkLogic to store this data as JSON documents. What made MarkLogic central was its built-in indexing and search capabilities. I configured indexing on fields such as location, price range, and amenities and then used its query capabilities to perform fast filtering and full-text search. Instead of relying on a separate search system, MarkLogic handled both data storage and search efficiently, which simplified my architecture and improved query performance.

    What is most valuable?

    MarkLogic for the hotel listing and search use case compared to other approaches made things easier than a traditional approach. If I compare it with using MySQL  and Elasticsearch, typically you would need to maintain two systems: one for transaction storage and another for search. That introduces challenges regarding data synchronization, consistency, and operational overhead. With MarkLogic, since it natively supports both document storage and advanced search, I could avoid the dual-system complexity. It simplified the architecture because indexing and querying are built-in and tightly integrated. In terms of performance for semi-structured data and search-heavy queries, it was quite efficient because indexes are created automatically and queries are optimized around them. The schema flexibility was a significant advantage and it also helped reduce system complexity, improve development speed, and handle search use cases efficiently.

    One thing I found particularly useful was that MarkLogic handles indexing by default, unlike a traditional system where you have to explicitly define and manage indexes. MarkLogic automatically indexes documents, which made it easier to get started and integrate quickly. Another advantage is that it can handle both structured and unstructured data together, which is very useful in real-world scenarios where travel data has a mix of fixed fields and dynamic attributes. The fact that it supports flexible querying over nested data without needing complex joins made development simpler and queries more intuitive.

    Many features offered by MarkLogic are valuable. One of the standout features is its multi-modal capability. It can handle JSON, XML, and RDF data in a single database. That is particularly useful for applications dealing with diverse and evolving data formats. Another feature is the built-in search and indexing capability, along with schema flexibility since it is document-based. It handles semi-structured and nested data very naturally, which reduces the overhead of schema migration. An important feature is ACID transactions with NoSQL flexibility, so you can get reliability from a traditional database along with the scalability and flexibility of NoSQL.

    MarkLogic's multi-modal capabilities make things easier in scenarios where JSON can be used for application-facing data such as hotel details, XML comes into play with external APIs, and RDF can be used for representing relationships. Instead of converting everything into one rigid format, MarkLogic allows you to store each in its native form and still query and access them. This opens up possibilities such as combining search data with relationships and searching in a single query, which would otherwise require multiple systems or a complex data pipeline. Overall, it reduces data transformation efforts, simplifies architecture, and makes it easier to build richer and more connected database models.

    One thing that surprised me about MarkLogic is how it has so many built-in traditional capabilities. Features such as search, indexing, and even data integration are natively available, so you do not have to rely on multiple external systems. That was unexpectedly useful because it simplified the overall architecture significantly. Another interesting aspect was its flexible querying over deeply nested data. In traditional databases, handling nested or hierarchical data often requires complex joins. I found it interesting to design data platforms rather than just a database, especially with complex capabilities around search and integration.

    MarkLogic has impacted my organization positively. Since my exposure to MarkLogic has been more on the exploration and evaluation side, I did not see full production-scale impact. However, even in the use case I worked on, a few clear benefits stood out. One was simplifying the architecture instead of thinking in terms of separate systems such as MySQL  for storage and Elasticsearch for search. MarkLogic allowed both in a single system, which reduced integration overhead and potential consistency issues. Another benefit was fast deployment and integration because of its schema flexibility and automatic indexing. It was easier to onboard new data fields and quickly test different query patterns without heavy schema changes. I noticed that the search-heavy queries on semi-structured data performed quite well, which really helped in reducing system complexity and speeding up development for search-heavy, semi-structured data use cases.

    What needs improvement?

    Regarding improvement, I have identified a few areas. MarkLogic is quite powerful, but some areas need enhancement. One thing I noticed was the learning curve. Compared to commonly used databases such as MySQL or even MongoDB, MarkLogic requires understanding concepts such as XQuery, server-side JavaScript, and its internal architecture, which can take time for new developers. Another area is community and ecosystem support; it is not as widely adopted as other databases, so finding resources can be challenging. Third-party integration can be relatively harder. Additionally, from what I have observed, cost and licensing can be a consideration, especially for smaller teams or startups compared to open-source alternatives. Finally, while it is very strong for search and document-based use cases, it might feel excessive for simpler CRUD-based operations, where a traditional relational or lightweight NoSQL database would work better.

    Documentation is an area that could improve. Learning resources and documentation could be enhanced, as the official documentation is detailed but can sometimes feel dense for beginners, especially when getting started with concepts such as indexing or writing queries in XQuery. Additionally, debugging and troubleshooting can be slightly challenging compared to more mainstream databases, mainly because the ecosystem is smaller and there are fewer community discussions and examples available. The developer experience could also be improved; setting up, experimenting, and integrating MarkLogic in an existing setup felt less straightforward compared to commonly used databases. I think improving onboarding, simplifying documentation, and expanding community support could make it even more developer-friendly in the future.

    For how long have I used the solution?

    I have been using MarkLogic for approximately half a year.

    What do I think about the stability of the solution?

    In my experience, MarkLogic is stable. It can be used in different environments and is designed for enterprise use cases involving large volumes of data and complex queries.

    What do I think about the scalability of the solution?

    MarkLogic is designed to scale horizontally, which means you can add more nodes to the cluster to handle increased data and query load. It distributes data across units called forests, and these forests can be spread across multiple nodes. This allows both storage and query processing to scale out efficiently.

    How are customer service and support?

    I have faced some situations where I needed help. While I have not interacted directly with MarkLogic support in a production environment, my understanding is based on industry feedback, which suggests it has enterprise-grade support, including ticketing systems and dedicated support channels for customers.

    Which solution did I use previously and why did I switch?

    Before MarkLogic, I used a combination of MySQL for storage and sometimes Elasticsearch for search-heavy use cases. While exploring MarkLogic, the approach was not an immediate switch in production but more of an evaluation to see how it compared to the traditional approach of using MySQL and Elasticsearch.

    What was our ROI?

    Since my experience with MarkLogic is more focused on exploration, I have not seen production-level ROI metrics such as cost and team size reduction. However, even during the evaluation, I could see potential in reduced development effort. The performance of search and filtering queries on semi-structured data felt more efficient compared to a traditional approach using MySQL and Elasticsearch. While I cannot quote exact numbers since it was not in production, it definitely showed potential for reduced development time, simplified architecture, and fast search use cases. Ultimately, it reduced development complexity and effort noticeably, especially by eliminating the need to manage multiple systems.

    Since my experience is more focused on exploration, I have not seen production-level ROI metrics such as cost or team size reduction. However, even during evaluation, I could gauge potential ROI in terms of what it generates and the additional benefits it provides.

    What's my experience with pricing, setup cost, and licensing?

    Regarding pricing and setup cost for licensing, I have not worked on a production setup, so I have not been directly involved in handling pricing and licensing. However, my exploration indicates MarkLogic follows a licensing model that can be relatively higher compared to open-source databases, making cost an important factor for smaller teams.

    Which other solutions did I evaluate?

    Before choosing MarkLogic, I explored some alternatives, primarily comparing it with the combination of MySQL and Elasticsearch, and I also considered MongoDB since it provides document-based storage and schema flexibility. The key difference I found was that while MongoDB handles flexible data effectively, it does not offer the same level of integrated search capability as MarkLogic.

    What other advice do I have?

    I would suggest first clearly evaluating whether your use case truly benefits from MarkLogic's strengths. It works particularly well for search-heavy and semi-structured data use cases where flexible and powerful querying is needed. At the same time, I would recommend comparing it with alternatives such as MySQL, MongoDB, and Elasticsearch for trade-offs. Additionally, it is important to plan for the learning curve, especially around concepts such as indexing and querying.

    Overall, I think MarkLogic is a very powerful platform, especially when involving semi-structured data and advanced searches. I would rate this review an 8 out of 10.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    View all reviews