Sign in Agent Mode
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

Reviews from AWS customer

6 AWS reviews

External reviews

65 reviews
from and

External reviews are not included in the AWS star rating for the product.


    Islam Md

Consistent monitoring has ensured long-running batch jobs complete successfully and on time

  • March 27, 2026
  • Review from a verified AWS customer

What is our primary use case?

My main use case for MarkLogic involves running queries to check some of the jobs. I run batch jobs and then I want to check whether the batch jobs are running fine. I check the data on MarkLogic by running the query on the query logic portal.

Regarding my main use case with MarkLogic, I find it very handy because every time I run a job, I go and run the query. I go to different databases and then see whether it's running fine. It is enjoyable working with MarkLogic.

A recent task where MarkLogic was especially helpful involved trying to check the number of batch jobs, DES or PDM jobs, and different jobs. We always check the number and then based on the number, we compare with other tools and then see whether it's matching. It is a comparison with multiple tools. If, for example, PG Admin was not working with PostgreSQL, but MarkLogic was working fine, we were able to fix the issue on the other tools which were not working.

What is most valuable?

In my opinion, the best features MarkLogic offers are that it is very easy to use and has a very fast response time.

The fast response time and ease of use help me in my daily work because it is really helpful since we have to run a lot of jobs at the same time and then we want to make sure everything is running as expected. It always helps us to check whether our jobs are running fine.

MarkLogic has positively impacted my organization as I think company-wise, it is one of the go-to tools for validating the jobs we are running. It is very helpful for us to deliver our products with quality and on time.

Using MarkLogic has resulted in specific outcomes, as we run jobs for a long time, sometimes for a couple of days, sometimes for ten hours or twelve hours. It helps us a lot.

What needs improvement?

I would say the features can be improved, as maybe the UI could be a little better. I am not sure if there are other options, but the one I am using is from the query console, so maybe I am not aware of other UI dashboards.

There are ways MarkLogic can be improved. I would like to add that a better UI with more features on it, something user-friendly, would be beneficial.

I think there is nothing else that could be improved about MarkLogic.

For how long have I used the solution?

I have been using MarkLogic for two years.

What other advice do I have?

If someone is looking into using MarkLogic, I would say MarkLogic is very helpful for providing the monitoring with detailed features. Running the query is very easy. I rate this product an eight out of ten.

Which deployment model are you using for this solution?

Private Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?


    reviewer2812596

Handling hierarchical insurance data has improved ETL workflows and still needs better integration

  • March 27, 2026
  • Review from a verified AWS customer

What is our primary use case?

MarkLogic's primary use case in my experience is handling semi-structured and hierarchical data that does not fit neatly into traditional relational tables. In my particular project, I work with NoSQL data, meaning I handle semi-structured and hierarchical data. For example, in one of my insurance projects at ValueMomentum, I worked on an initiative where I had policy and claims data in XML format coming from different legacy systems. I used MarkLogic to ingest and normalize the data and integrate it with our Hive data warehouse for reporting and analytics. A specific example I would share is that I created a 360-degree view of customer policies and claims using MarkLogic, which allowed me to merge XML claim documents with relational customer information, perform queries across different nodes, and feed clean aggregated data into the ETL pipeline for downstream analytics.

This significantly saved time compared to flattening everything manually into SQL tables.

Working with XML and integrating that with my ETL processes is quite interesting, as MarkLogic makes it far easier to handle hierarchical data instead of attempting to force it all into relational tables. The built-in support for XML and JSON, combined with the indexing and searching capabilities, allows me to query deeply nested structures without writing extensive custom parsing code. The challenging part is mostly around integration with our existing ETL pipelines, ensuring that transformed data flows correctly into Hive and matches the relational schemas. Sometimes I have to be careful with data types and schema evaluation because MarkLogic is schema-flexible, but the downstream systems are strict. Overall, though, it speeds up handling complex semi-structured data and reduces manual transformation work significantly. I would say it is easier than many other XML handling approaches I have tried.

I would add that I really appreciate how MarkLogic handles hierarchical relationships naturally, especially in insurance data where nested information such as policies contain multiple coverages, each with different claims and documents. This aspect of the insurance domain is really cumbersome, and MarkLogic allows me to query across these nested structures directly without having to flatten everything at the beginning. Its search and indexing features make it easier to identify anomalies or missing information in the semi-structured data before it reaches the ETL pipelines, saving considerable time during validation and reconciliation. Overall, I find it a very practical tool for bridging legacy data formats with modern data warehouses.

What is most valuable?

In my experience, the best features of MarkLogic include its native support for XML and JSON, which makes working with hierarchical or semi-structured data easier than flattening it into relational tables. Additionally, its flexible schema and indexing capabilities allow me to index anything, including nested elements, which speeds up queries and reduces the need for custom code. Other significant aspects include built-in search capabilities that allow for full-text searches and complex queries directly on documents, and data integration and harmonization, which combines multiple sources into one logical view, simplifying manual processes.

Finally, ACID transactions on semi-structured data give me confidence that updates, merges, and data integrity remain intact even in large or complex datasets.

The flexible schema, indexing, and search capabilities of MarkLogic are incredibly useful for me. For example, the flexible schema and indexing mean I can load all the XML or JSON policy and claims data without having to predefine rigid table structures, saving considerable time during ingestion, especially since different legacy systems may have slightly different formats. I can simply map the fields I need and let MarkLogic handle the rest. The search capabilities are also very helpful; I can run queries across nested elements to find specific claims, policies, or attachments quickly. Previously, doing this in a relational database would require multiple joins and a great deal of transformation logic.

MarkLogic has had a tangible positive impact on our organization. Thanks to its flexible schema and indexing capabilities, we ingest semi-structured and hierarchical data from multiple legacy systems 30 to 40 percent faster than before, which leads to faster ETL cycles and quicker delivery of analytics-ready datasets to downstream systems. Using MarkLogic's search and query capabilities, I also reduced the time to reconcile and validate policy and claims data across systems by approximately 35 percent, helping business teams gain insights much faster. This highlights the positive impact MarkLogic has made in our organization.

What needs improvement?

There are several things I have observed regarding MarkLogic's improvement areas. One challenge I notice is the learning curve and setup; it can be complex for someone new, especially when integrating with other systems or setting up indexing strategies for large datasets. I occasionally spend extra time fine-tuning indexes or query performance for really large documents. Another observation concerns tooling and ecosystem support, as it does not feel as rich as mainstream databases such as Hive or SQL servers in terms of connectors and integration or community resources.

Sometimes I need to build custom scripts to bridge these gaps. Finally, monitoring and debugging distributed queries can be tricky; while it has built-in tools, deeper performance profiling or tracing is not always intuitive. Overall, these are not deal-breakers, but improvements in onboarding, ecosystem connectors, and monitoring would enhance the experience.

For how long have I used the solution?

I have been using MarkLogic for approximately two to three years, primarily during my enterprise projects where I handle structured or semi-structured data, build integration pipelines, and ensure it works well with other data sources such as Hive and SQL server. I would not call myself a hardcore MarkLogic expert, but I am comfortable navigating its database features, writing queries, and integrating it into larger workflows.

It has been practical experience rather than just academic work. I have done several things in that area, but I am not very hands-on with it.

What do I think about the stability of the solution?

MarkLogic is generally quite good, and in my experience, it is very stable and reliable. I have not faced any major downtime; there have only been occasional minor glitches during cluster configuration or heavy indexing, but nothing that significantly affected production. The built-in replication and failover features also help maintain uptime, ensuring the system stays operational even during maintenance or updates.

Overall, I find it trustworthy for enterprise workloads, especially regarding the semi-structured and hierarchical data I work with.

What do I think about the scalability of the solution?

MarkLogic's scalability is quite solid; scaling up and out as data needs grow does require some planning. I have scaled both vertically by upgrading nodes and horizontally by adding nodes to clusters as my data volume increased. The system handles this increase in XML workloads well, and flexible indexing helps maintain query performance even as datasets expand. However, planning the cluster layout and indexing strategies carefully is crucial. If I just throw the data in without a strategy, performance can degrade, but once set up properly, scaling out is smooth.

Overall, it is definitely enterprise-ready for growing data needs.

Which solution did I use previously and why did I switch?

Before MarkLogic, I primarily relied on relational databases such as SQL servers and some Hive-based data warehouses to manage my data. The reason for switching or adding MarkLogic into the mix is that a great deal of my incoming data is semi-structured, such as XML or JSON from legacy systems, which was cumbersome to flatten into relational tables. It was slow, prone to errors, and complicated ETL pipelines.

MarkLogic made it much easier to ingest, query, and integrate semi-structured data directly without all that extra transformation overhead. It was not a complete replacement but rather a specialized tool for a specific pain point in my data flows.

What was our ROI?

From my perspective, there has definitely been a return on investment with MarkLogic, even if it is not always easy to quantify in exact dollar terms. For example, by using MarkLogic to handle semi-structured data directly, I have reduced ETL prep and transformation time by roughly 30 to 40 percent, freeing up engineers to focus on more value-added tasks instead of manual data cleaning.

Additionally, faster validation and reconciliation of policy and claims data mean business teams can generate reports and insights 35 percent faster.

What's my experience with pricing, setup cost, and licensing?

I do not actually deal with pricing, setup costs, or licensing because I work for an organization, but I believe the pricing and licensing are definitely on the higher side compared to open-source alternatives, based on what I have heard from managers.

Which other solutions did I evaluate?

I evaluated multiple options before choosing MarkLogic, including alternatives such as MongoDB, Couchbase, and PostgreSQL with JSON support. While MongoDB excels in JSON handling and NoSQL flexibility, it lacks ACID compliance across complex transactions. Couchbase is good for key-value and document storage but did not fit my hierarchical XML-heavy use cases. PostgreSQL with JSON support could handle some semi-structured data, but its performance on large nested XML datasets was not great, and ETL complexity remained high.

Ultimately, MarkLogic's native XML support, ACID compliance, and built-in search and indexing made it the best fit for my insurance project.

What other advice do I have?

MarkLogic is solid for its main features such as handling XML, flexible indexing, search, and ACID compliance on semi-structured data. However, the learning curve is steep, integration with other systems can be tricky, and the community and tooling are not as extensive as mainstream platforms. It is strong, but I would not say it is perfect.

My advice for others looking into using MarkLogic is three-fold: First, understand your data upfront. If you have a great deal of semi-structured or hierarchical data such as XML or JSON, MarkLogic can save you a great deal of time, but you need to understand your indexing and querying strategies. Second, plan your cluster and indexing strategies carefully as it pays off in performance and scalability later. Third, take full advantage of built-in search and ACID features, which are real-time savers for validation, reconciliation, and governance.

Additionally, do not underestimate the importance of training and onboarding; there is a learning curve, so make sure your team becomes comfortable with queries, transforms, and integrations before relying on it in production. If used correctly, it is a very strong tool for bridging legacy semi-structured data with modern analytics pipelines. I would rate MarkLogic at a seven on a scale of one to ten.


    Varuns Ug

Unified search and storage have simplified handling of semi-structured data and complex queries

  • March 26, 2026
  • Review from a verified AWS customer

What is our primary use case?

I use MarkLogic for performing many operations such as handling semi-structured data and building search-driven use cases. For example, I examined how it can handle and store XML documents and leverage its powerful indexing and search capabilities for fast querying. It is particularly useful in scenarios where you need flexible schemas along with advanced search capabilities including filtering, full-text search, and aggregations.

A task where MarkLogic was central to my work was a travel search and filtering system, similar to a hotel listing use case. In this project, the data was semi-structured, with elements such as hotel details, amenities, and pricing that vary significantly across different entities. I used MarkLogic to store this data as JSON documents. What made MarkLogic central was its built-in indexing and search capabilities. I configured indexing on fields such as location, price range, and amenities and then used its query capabilities to perform fast filtering and full-text search. Instead of relying on a separate search system, MarkLogic handled both data storage and search efficiently, which simplified my architecture and improved query performance.

What is most valuable?

MarkLogic for the hotel listing and search use case compared to other approaches made things easier than a traditional approach. If I compare it with using MySQL and Elasticsearch, typically you would need to maintain two systems: one for transaction storage and another for search. That introduces challenges regarding data synchronization, consistency, and operational overhead. With MarkLogic, since it natively supports both document storage and advanced search, I could avoid the dual-system complexity. It simplified the architecture because indexing and querying are built-in and tightly integrated. In terms of performance for semi-structured data and search-heavy queries, it was quite efficient because indexes are created automatically and queries are optimized around them. The schema flexibility was a significant advantage and it also helped reduce system complexity, improve development speed, and handle search use cases efficiently.

One thing I found particularly useful was that MarkLogic handles indexing by default, unlike a traditional system where you have to explicitly define and manage indexes. MarkLogic automatically indexes documents, which made it easier to get started and integrate quickly. Another advantage is that it can handle both structured and unstructured data together, which is very useful in real-world scenarios where travel data has a mix of fixed fields and dynamic attributes. The fact that it supports flexible querying over nested data without needing complex joins made development simpler and queries more intuitive.

Many features offered by MarkLogic are valuable. One of the standout features is its multi-modal capability. It can handle JSON, XML, and RDF data in a single database. That is particularly useful for applications dealing with diverse and evolving data formats. Another feature is the built-in search and indexing capability, along with schema flexibility since it is document-based. It handles semi-structured and nested data very naturally, which reduces the overhead of schema migration. An important feature is ACID transactions with NoSQL flexibility, so you can get reliability from a traditional database along with the scalability and flexibility of NoSQL.

MarkLogic's multi-modal capabilities make things easier in scenarios where JSON can be used for application-facing data such as hotel details, XML comes into play with external APIs, and RDF can be used for representing relationships. Instead of converting everything into one rigid format, MarkLogic allows you to store each in its native form and still query and access them. This opens up possibilities such as combining search data with relationships and searching in a single query, which would otherwise require multiple systems or a complex data pipeline. Overall, it reduces data transformation efforts, simplifies architecture, and makes it easier to build richer and more connected database models.

One thing that surprised me about MarkLogic is how it has so many built-in traditional capabilities. Features such as search, indexing, and even data integration are natively available, so you do not have to rely on multiple external systems. That was unexpectedly useful because it simplified the overall architecture significantly. Another interesting aspect was its flexible querying over deeply nested data. In traditional databases, handling nested or hierarchical data often requires complex joins. I found it interesting to design data platforms rather than just a database, especially with complex capabilities around search and integration.

MarkLogic has impacted my organization positively. Since my exposure to MarkLogic has been more on the exploration and evaluation side, I did not see full production-scale impact. However, even in the use case I worked on, a few clear benefits stood out. One was simplifying the architecture instead of thinking in terms of separate systems such as MySQL for storage and Elasticsearch for search. MarkLogic allowed both in a single system, which reduced integration overhead and potential consistency issues. Another benefit was fast deployment and integration because of its schema flexibility and automatic indexing. It was easier to onboard new data fields and quickly test different query patterns without heavy schema changes. I noticed that the search-heavy queries on semi-structured data performed quite well, which really helped in reducing system complexity and speeding up development for search-heavy, semi-structured data use cases.

What needs improvement?

Regarding improvement, I have identified a few areas. MarkLogic is quite powerful, but some areas need enhancement. One thing I noticed was the learning curve. Compared to commonly used databases such as MySQL or even MongoDB, MarkLogic requires understanding concepts such as XQuery, server-side JavaScript, and its internal architecture, which can take time for new developers. Another area is community and ecosystem support; it is not as widely adopted as other databases, so finding resources can be challenging. Third-party integration can be relatively harder. Additionally, from what I have observed, cost and licensing can be a consideration, especially for smaller teams or startups compared to open-source alternatives. Finally, while it is very strong for search and document-based use cases, it might feel excessive for simpler CRUD-based operations, where a traditional relational or lightweight NoSQL database would work better.

Documentation is an area that could improve. Learning resources and documentation could be enhanced, as the official documentation is detailed but can sometimes feel dense for beginners, especially when getting started with concepts such as indexing or writing queries in XQuery. Additionally, debugging and troubleshooting can be slightly challenging compared to more mainstream databases, mainly because the ecosystem is smaller and there are fewer community discussions and examples available. The developer experience could also be improved; setting up, experimenting, and integrating MarkLogic in an existing setup felt less straightforward compared to commonly used databases. I think improving onboarding, simplifying documentation, and expanding community support could make it even more developer-friendly in the future.

For how long have I used the solution?

I have been using MarkLogic for approximately half a year.

What do I think about the stability of the solution?

In my experience, MarkLogic is stable. It can be used in different environments and is designed for enterprise use cases involving large volumes of data and complex queries.

What do I think about the scalability of the solution?

MarkLogic is designed to scale horizontally, which means you can add more nodes to the cluster to handle increased data and query load. It distributes data across units called forests, and these forests can be spread across multiple nodes. This allows both storage and query processing to scale out efficiently.

How are customer service and support?

I have faced some situations where I needed help. While I have not interacted directly with MarkLogic support in a production environment, my understanding is based on industry feedback, which suggests it has enterprise-grade support, including ticketing systems and dedicated support channels for customers.

Which solution did I use previously and why did I switch?

Before MarkLogic, I used a combination of MySQL for storage and sometimes Elasticsearch for search-heavy use cases. While exploring MarkLogic, the approach was not an immediate switch in production but more of an evaluation to see how it compared to the traditional approach of using MySQL and Elasticsearch.

What was our ROI?

Since my experience with MarkLogic is more focused on exploration, I have not seen production-level ROI metrics such as cost and team size reduction. However, even during the evaluation, I could see potential in reduced development effort. The performance of search and filtering queries on semi-structured data felt more efficient compared to a traditional approach using MySQL and Elasticsearch. While I cannot quote exact numbers since it was not in production, it definitely showed potential for reduced development time, simplified architecture, and fast search use cases. Ultimately, it reduced development complexity and effort noticeably, especially by eliminating the need to manage multiple systems.

Since my experience is more focused on exploration, I have not seen production-level ROI metrics such as cost or team size reduction. However, even during evaluation, I could gauge potential ROI in terms of what it generates and the additional benefits it provides.

What's my experience with pricing, setup cost, and licensing?

Regarding pricing and setup cost for licensing, I have not worked on a production setup, so I have not been directly involved in handling pricing and licensing. However, my exploration indicates MarkLogic follows a licensing model that can be relatively higher compared to open-source databases, making cost an important factor for smaller teams.

Which other solutions did I evaluate?

Before choosing MarkLogic, I explored some alternatives, primarily comparing it with the combination of MySQL and Elasticsearch, and I also considered MongoDB since it provides document-based storage and schema flexibility. The key difference I found was that while MongoDB handles flexible data effectively, it does not offer the same level of integrated search capability as MarkLogic.

What other advice do I have?

I would suggest first clearly evaluating whether your use case truly benefits from MarkLogic's strengths. It works particularly well for search-heavy and semi-structured data use cases where flexible and powerful querying is needed. At the same time, I would recommend comparing it with alternatives such as MySQL, MongoDB, and Elasticsearch for trade-offs. Additionally, it is important to plan for the learning curve, especially around concepts such as indexing and querying.

Overall, I think MarkLogic is a very powerful platform, especially when involving semi-structured data and advanced searches. I would rate this review an 8 out of 10.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?


    reviewer2811600

Advanced indexing has delivered faster flexible searches on evolving customer policy data

  • March 25, 2026
  • Review from a verified AWS customer

What is our primary use case?

My main use case for MarkLogic is for a specific client where I handle XML data in the backend. I use XQuery, and we utilize MarkLogic mostly for querying the data in the backend.

In one of my use cases, we stored customer policy records as an XML document in MarkLogic. Each document contained details such as policy number, customer name, and other metadata. A common requirement was that users often did not remember the full policy number, so they searched using partial input, such as part of the policy number or their name. I implemented XQuery-based search logic along with proper indexing, so MarkLogic could efficiently return the matching document even with incomplete input. This significantly improved search performance and user experience.

Apart from the search cases, one key advantage we leveraged with MarkLogic's schema flexibility was that since our data was stored in XML, we could easily accommodate changes in the structure without major migration. I also worked on optimizing query performance by configuring indexes properly, which reduced query response time significantly. Additionally, we used MarkLogic as a central data store, integrated with the backend service through APIs, ensuring fast and reliable data access. We also ensured that the queries were written efficiently and aligned with the index configurations to avoid full document scans, which is critical for performance in MarkLogic.

What is most valuable?

The best features of MarkLogic are its powerful search capabilities, flexible schema, built-in indexing, and high performance for XML or JSON data. MarkLogic provides a Google search-like capability, including full-text search, partial matching, and relevance scoring. Another feature is schema flexibility; since it is a document-based database, we can store XML or JSON without a strict schema constraint, which makes it easy to evolve data structures.

The third feature is built-in indexing, as MarkLogic automatically maintains indexes, and we can configure the range indexes to specifically improve query performance. MarkLogic's XQuery support, which is native to the platform, allows efficient querying and transformation of XML data, while it even supports ACID properties. Unlike other NoSQL options, MarkLogic supports full ACID property compliance, ensuring data integrity and consistency.

MarkLogic's built-in indexing allows queries to run directly on indexes instead of scanning documents, which significantly improves performance. MarkLogic uses a universal index that automatically indexes all the content in the database, both structured and text, without requiring manual indexing as a traditional database would. We can configure range indexes for specific fields, such as policy number or customer name, allowing faster filtering and sorting of the results. In my workflow, this has helped tremendously because queries execute directly instead of scanning the XML data. Search performance improved significantly for partial and filtered searches, and it also reduced response time for user queries, even with a large database.

MarkLogic has improved our system performance, enabled flexibility in data handling, and specifically enhanced search efficiency. It improved search performance, provided flexibility in data modeling, and since it supports XML and JSON without a strict schema, we could easily adapt to changes in business requirements without any major database alterations. It even reduced development effort, as features such as built-in indexing and search reduce the need for external search systems, simplifying our architecture. It resulted in a better user experience with faster query responses and flexible searches. For example, earlier search operations were slow and less flexible, but after using MarkLogic, we delivered near real-time results, improving both system efficiency and user satisfaction.

What needs improvement?

While MarkLogic itself is powerful, it can be improved in terms of ease of usage, cost, and the learning curve. MarkLogic is a very strong enterprise-level database, but there are areas for improvement. There is a steep learning curve for this technology; XQuery and internal concepts such as indexing and CTS queries take time to learn compared to more common databases such as MongoDB. It is also relatively expensive compared to open-source alternatives such as MongoDB, which can be a concern for small organizations. Compared to databases such as MongoDB or MySQL, the community is smaller for finding resources or solutions, which can sometimes make it harder. Even debugging and development tools could be more user-friendly and modern.

Documentation and learning resources could definitely be improved to make onboarding easier. While MarkLogic does have official documentation, it can sometimes be harder to navigate and not very user-friendly, particularly for developers new to concepts such as XQuery and CTS queries. In my experience, it sometimes took extra time to find the right examples or best practices; even practical real-world scenarios were limited. Compared to more popular databases, community-driven tutorials are very few. Better documentation along with improved tooling would make MarkLogic even more developer-friendly without compromising its powerful capabilities.

Our setup was managed by AWS infrastructure, and my main focus was on development and working with MarkLogic from the application perspective. We saw a positive return on investment through improved performance, reduced system complexity, and better user efficiency.

For how long have I used the solution?

I have been using MarkLogic for the last two years.

What do I think about the stability of the solution?

MarkLogic is very stable and reliable for enterprise applications. It supports ACID transactions, which ensure data consistency and reliability. Its clustered architecture ensures high availability.

What do I think about the scalability of the solution?

MarkLogic is highly scalable and supports horizontal scaling through its clustered architecture. We can add more nodes to the cluster, and data is distributed across them using Forest, which helps in handling increasing data volume and traffic. Since we are utilizing AWS, we can scale resources up or down based on the load and manage large datasets efficiently without significant degradation in performance.

How are customer service and support?

Customer support for MarkLogic provides strong enterprise-level assistance through direct interactions. It is usually handled by a specific team that is very responsive to a variety of issues we encounter.

Which solution did I use previously and why did I switch?

I started only with MarkLogic. While I was aware of MySQL, I began working with MarkLogic when I joined the organization, so I did not use any different solutions.

How was the initial setup?

MarkLogic is deployed as a cluster environment on the server, enabling scalability, high availability, and fault tolerance. My organization has MarkLogic deployed in a cluster setup, which helps with scalability and high availability.

What was our ROI?

We saw a positive return on investment through improved performance, reduced system complexity, and better user efficiency.

Which other solutions did I evaluate?

We evaluated other options such as MongoDB and Elasticsearch before choosing MarkLogic.

What other advice do I have?

MarkLogic's built-in indexing allows queries to run directly on indexes instead of scanning documents, which significantly improves performance. MarkLogic uses a universal index that automatically indexes all the content in the database, both structured and text, without requiring manual indexing as a traditional database would. We can configure range indexes for specific fields, such as policy number or customer name, allowing faster filtering and sorting of the results. In my workflow, this has helped tremendously because queries execute directly instead of scanning the XML data. Search performance improved significantly for partial and filtered searches, and it also reduced response time for user queries, even with a large database.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)


    reviewer2811294

Flexible data hub has improved complex data storage and enables rapid semantic search

  • March 24, 2026
  • Review from a verified AWS customer

What is our primary use case?

My main use case for MarkLogic is for storing data. A quick specific example of the kind of data I store in MarkLogic is that we have a data warehouse, and we use it as a NoSQL database to store, manage and search complex heterogeneous data.

How has it helped my organization?

MarkLogic has positively impacted my organization by making our job easier, although we have yet to notice the full details. It made my job easier because we can store a large number of data, and the built-in search feature is great, including semantic data management.

What is most valuable?

The best features MarkLogic offers include multi-model flexibility, built-in search, data hub platform integration, and semantic data management. I personally appreciate the built-in search feature because it indexes all data immediately upon ingestion for rapid searching, so we can perform full-text, phrase, or geospatial searches.

MarkLogic has positively impacted my organization by making our job easier, although we have yet to notice the full details. It made my job easier because we can store a large number of data, and the built-in search feature is excellent for semantic data management.

What needs improvement?

I would rate this a nine because I think MarkLogic can incorporate some AI features that are emerging in other databases.

For how long have I used the solution?

I have been using MarkLogic for five years.

What do I think about the stability of the solution?

MarkLogic is stable.

What do I think about the scalability of the solution?

MarkLogic's scalability is impressive as it is scalable, and we can scale it to an unlimited number of data, allowing it to store unlimited data.

How are customer service and support?

The customer support is good.

Which solution did I use previously and why did I switch?

I did not previously use a different solution.

How was the initial setup?

My experience with pricing, setup cost, and licensing was good.

What about the implementation team?

I purchased MarkLogic through the AWS Marketplace.

What was our ROI?

I do not have metrics to share about return on investment as I am not the right person for this question; that is calculated by our Chief Financial Officer.

What's my experience with pricing, setup cost, and licensing?

My experience with pricing, setup cost, and licensing was good.

Which other solutions did I evaluate?

Before choosing MarkLogic, we were evaluating other options in addition to Cassandra DB. MarkLogic compared to Cassandra is preferred because of the features and pricing.

What other advice do I have?

MarkLogic is great, and my advice for others looking into using MarkLogic is that it is a great database with awesome features that you should consider. I would rate this product a nine out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?


    Dixit Singla

Search workflows have become faster for complex data while cloud-native flexibility still needs work

  • January 25, 2026
  • Review from a verified AWS customer

What is our primary use case?

My main use case for MarkLogic is primarily for health and insurance companies, and secondarily for publishers who maintain a large set of books online and any company that deals with big data. We keep high volume of data in MarkLogic because it provides a good search engine to query data in large datasets.

A specific example of how a publisher company would use MarkLogic in their workflow is maintaining data of a book, including who published that book, what the author name is, and how many editions exist. You can imagine that there are billions of books across the world, and we want to keep that digitized information in MarkLogic. If you want to search for a particular author, find how many books are published by a particular publisher, or discover editions of a particular book, you can get that information very quickly.

What is most valuable?

The best features MarkLogic offers include the ability to store unstructured data. You really don't need any particular schema, although it does support storing documents according to a particular schema, but that is not mandatory. You can store your data in XML and JSON format and you can store even binaries, JPEGs, PDFs and that kind of content in MarkLogic. The second feature is that it has a very rich search and cts APIs to build search engines on large datasets. The third is the spell suggestion APIs where, as you type, you will get suggestions of what you are searching for, which helps users understand what they are really searching for. Fourth is the redaction APIs. If you have a large set of data in a live environment and want redacted data in lower environments, the redaction APIs are very effective. From a user perspective, you can keep your data in redacted format in any environment, even on production.

The other valuable feature is the CPF, which is the Content Processing Framework. That is a very powerful tool. You can orchestrate how your document is processed before insertion or after insertion. A big example is that as soon as you have loaded a PDF into MarkLogic and want to extract the data from that PDF and store it as XML in MarkLogic for searching, that is where CPF is very useful.

MarkLogic has impacted my organization positively. We used to have one fetch API which was built using .NET API and the backend was SQL. We got to know from customers that the API was very slow and they were getting impacted. We moved to MarkLogic and created the API using JavaScript server-side language, and we saw almost 60% improvement in the speed of the search. The fetch results were 60% faster after that.

What needs improvement?

I think MarkLogic can be improved by providing good cloud infrastructure. Since we are in a very tech era, MarkLogic should provide good cloud infrastructure. If you look at other databases or systems like Kafka and MongoDB, they have cloud infrastructure. You do not need to worry about maintaining your own servers or provisioning your own servers. You simply log in and tell MarkLogic you want a certain number of clusters or nodes in a cluster and what cloud provider you want to use, then click okay, and they will build it for you. That is a headache with MarkLogic.

Apart from that, I don't see any issues. It is a really great product.

For how long have I used the solution?

I have been using MarkLogic for 11 years.

What do I think about the stability of the solution?

MarkLogic is not entirely stable. We used to have production issues, mainly due to MarkLogic failures. The node goes down sometimes or some memory errors occur. It is stable, but if I rate it out of 100%, it is 80-85% stable. Every year we used to have four to five issues.

What do I think about the scalability of the solution?

MarkLogic's scalability is very bad. In production, when you get to know that your data is increasing and you need to add one more node, that is not easy and not straightforward. You really need to do a lot of work and a lot of planning before you will be able to scale. But in today's world, if I talk about other systems like MarkLogic, if you have to scale, it is just a five-minute job. With MarkLogic, that is not the case.

How are customer service and support?

I find customer support to be good.

How was the initial setup?

MarkLogic has really good deployment processes. If you have a big MarkLogic architecture with hundreds of nodes in the cluster, many databases, forests, and many app servers, their deployment features like ML-Gradle can automate everything. You can create a new cluster within minutes through that Gradle. That is a very good feature they have provided.

What's my experience with pricing, setup cost, and licensing?

I'm not much aware of pricing, setup costs, and licensing because that is another team that deals with it. But if I am correct, a few times I heard them saying that MarkLogic is quite costly, and they are looking to move away in the longer run for that reason.

Which other solutions did I evaluate?

Before choosing MarkLogic, we analyzed SQL and other solutions, but after analyzing and doing some POCs, we found that in our case, we had to store a large amount of binary data and SQL was not that good at storing binary data, especially when your need is to perform searches inside the binaries. That was something we did not find in SQL, and we did not prefer SQL for that reason.

What other advice do I have?

My advice to others looking into using MarkLogic is that they really need to evaluate their use case. First, they need to evaluate what MarkLogic provides and what the features of MarkLogic are. Then they need to evaluate their use cases and check whether they are able to use the core features of MarkLogic. If not, then there is no point in moving to MarkLogic. Since it has a costly license, they really need to evaluate whether the features MarkLogic is providing would allow them to utilize most of them or not. I would love to see MarkLogic in more cloud-native architecture. I am giving this review a rating of 7.


    Ansh Ramsani

Banking data workflows have become faster and now support rich PDF and media management

  • January 06, 2026
  • Review provided by PeerSpot

What is our primary use case?

My main use case for MarkLogic is data development. We have a banking type of data, so we used to convert it to PDF or use some for applications we were running.

A quick, specific example of how I use MarkLogic with my banking data involves creating PDFs based on XML files and developing subscription kind of applications. In that, we used to send scheduled PDFs and some scheduled emails generated from that application using MarkLogic.

In addition to my main use case, I also perform development and some MarkLogic admin work as well.

What is most valuable?

In my experience, the best features MarkLogic offers include indexing, which is a quick and efficient way to get the files and I really appreciate that.

The indexing feature of MarkLogic has helped my work by being quick, faster, and very helpful for the team to get the code quicker and to ensure everything is fast.

MarkLogic has positively impacted my organization by making everything quick and fast, and I believe that is a major change we have seen here. In fact, the development process is much quicker than other applications we were using previously.

What needs improvement?

As for areas where MarkLogic can be improved, nothing else comes to mind. I have been using MarkLogic for seven years, and many improvements have already been addressed. I do not recall any additional needed improvements.

For how long have I used the solution?

I have been using MarkLogic for around seven years.

Which solution did I use previously and why did I switch?

Previously, we were using SQL, but to save the PDFs and everything was not possible in SQL. Right now, it is possible in MarkLogic to save all the images, videos, and everything, which is very useful, and that is why we switched from SQL to MarkLogic.

What was our ROI?

I have seen a return on investment as time is really being saved using MarkLogic.

What's my experience with pricing, setup cost, and licensing?

Regarding my experience with pricing, setup cost, and licensing, I find it quite high compared to other things, but I believe it is justified.

Which other solutions did I evaluate?

My advice for others looking into using MarkLogic is that it is a better way than SQL and while some main things can be improved, it is the best way right now in the market.

What other advice do I have?

I chose a nine for my rating because it is faster, and I deducted one point because of the price of MarkLogic. My overall review rating is 9.


    Babu S.

Marklogic review by a Senior Consultant

  • May 23, 2024
  • Review provided by G2

What do you like best about the product?
Below are some of the features that I like about Marklogic
1. Multi media database capabilities- The storage and ability to manage different types of data models in single platform has helped me to run my projects simultaneously handling data types such as JSON, XML
2. Enterprise Search Functionality- The in built search options had helped me in searching across all data types. The feature also helped me to search using full text, range queries etc.
3. Storage and Query capability- This feautre helped me in optimizing the storage infrastructure by placing data in different storage tiers. It helped me to reduce the cost incurred and the data can be accessed frequently .
What do you dislike about the product?
Below are some of the dislikes of Marklogic
1. Learning curve- despite the features , marklogic proven to be having a deep learning curve.
2. Cost and Licensing - The total ownership model is having a greated inmpact in taking this software as a package for any development. The cost will be a barrier to any small and medium sized projects. Even though my org has provided access to this platform by to get to that stage we had to go through a lot of to and fro
What problems is the product solving and how is that benefiting you?
The software helped me in storing and managing documents which is in xml format. The software helped us in preventing the doc structure which was crucial for my project.
The project also helped me explore the advanced search capabilities by frequently using this for searching content within XML.


    Jithin J.

Database, search engine and integration tool rolled into one

  • May 19, 2024
  • Review provided by G2

What do you like best about the product?
It's multimodel structure of combining different files is really helpful. It has capability to handle both structured and unstructured data is great and its build in AI capability is greats.
What do you dislike about the product?
Building predefined models on large datasets takes a lot of time. Documentation could have been better.
What problems is the product solving and how is that benefiting you?
We use it as a database and ETL tool capable of building models quickly.


    Rajat J.

MarkLogic- A powerful tool/database for all your NoSQL transactions.

  • May 18, 2024
  • Review provided by G2

What do you like best about the product?
The best thing about the Mark Logic is that it is a single platform that combines the capability of being a database,a search engine.
It also provides the tools for data integrations.
It is comparatively fast,economical and easier to manage.
It is very easy to use and the customer support is also very prompt to response.
The features are easy to implement and can be integrfated with other tools as well.
Even an unexperienced person can implement its features thus it is frequently use in our organization.
What do you dislike about the product?
The User interface could be improved to be more user friendly.
The spreadsheet capabilities can also be increased.
What problems is the product solving and how is that benefiting you?
We are using it to cater the data integration capabilities for one of our client working in the pharma sector.
The amount of data that is being processed is very large and it requires a platform that can caters the ever growing need of the database along with the faster processing and MarkLogic proves to be a perfect tool for it.