Listing Thumbnail

    Onehouse Managed Lakehouse

     Info
    Sold by: Onehouse 
    Onehouse Managed Lakehouse is a cloud-native SaaS product that replaces painful, inefficient DIY data lake management around file sizing, masking, deletion, clustering, access control, caching, etc. with foundational data infrastructure as a service, to ingest, store, optimize and transform your data on industry-leading open data formats. For more information, please visit www.onehouse.ai.
    4

    Overview

    Onehouse Managed Lakehouse solution is designed to help customers build and implement a modern data lakehouse in a fast and cost effective way. It turns data lakes into fully managed lakehouses by bringing data warehouse functionality to the data lake as well as automating data management services like ingestion, performance optimization, indexing, observability or data quality management. Apache Hudi is the ground-breaking open-source Lakehouse technology that powers Onehouse.

    Onehouse is a truly open solution that allows customers to access data from any query engine such as Presto, Amazon Athena/Redshift, GCP BigQuery, Snowflake or Spark. It can help organizations build data lakes in days, not months, realize large cost savings but still own their data in open formats.

    Onehouse Product Features

    1. Managed Ingestion 1.1) Incremental Ingestion from popular RDBMS, NoSQL, Kafka, S3 files 1.2) Autocapture data streams using regex patterns 1.3) Multi-regional Kafka ingestion to merge data that is geo-replicated 1.4) Advanced Multiplexed streaming ingestion

    2. ETL Transformations 2.1) Low/No-code incremental ETL pipelines 2.2) Ability to plug in custom code transformations 2.3) Data deduplication

    3. Data Quality Quarantine 3.1) Schema validation 3.2) Timestamp validation 3.3) Quarantine bad data

    4. Catalog Syncing 4.1) AWS Glue Catalog 4.2) Hive Metastore 4.3) GCP DataProc Metastore 4.4) GCP BigQuery + BigLake 4.5) DataHub 4.6) Snowflake and Databricks via Onetable

    5. Table Services 5.1) Table bookkeeping and metadata management 5.2) Time travel and data versioning 5.3) Auto-Savepoints and recovery

    6. Management Plane 6.1) Automated Monitoring/Alerting 6.2) Metrics 6.3) Access control & permissions

    We offer one month free trial to approved customers. Reach out to gtm@onehouse.ai  if you are interested.

    For custom pricing, EULA, or a private contract, please contact gtm@onehouse.ai , for a private offer.

    Highlights

    • Continuous Data Ingestion - Effortlessly ingest data from your databases, event streams, cloud storage and other services at low latency. Built on industry leading change data capture technology for the lakehouse.
    • Automated Data Management - Onehouse eliminates tedious data chores by managing all of your table services that perform file-sizing, partitioning, cleaning, clustering, Z-order/Hilbert-Curves, compaction, masking, encryption, and more.
    • Low-Code Incremental Pipelines - Create declarative templates for low-latency incremental ingestion and transformation pipelines. Forget about operational burdens of scheduling, monitoring, and data quality management.

    Details

    Sold by

    Delivery method

    Deployed on AWS
    New

    Introducing multi-product solutions

    You can now purchase comprehensive solutions tailored to use cases and industries.

    Multi-product solutions

    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Onehouse Managed Lakehouse

     Info
    Pricing is based on the duration and terms of your contract with the vendor, and additional usage. You pay upfront or in installments according to your contract terms with the vendor. This entitles you to a specified quantity of use for the contract duration. Usage-based pricing is in effect for overages or additional usage not covered in the contract. These charges are applied on top of the contract price. If you choose not to renew or replace your contract before the contract end date, access to your entitlements will expire.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    1-month contract (1)

     Info
    Dimension
    Description
    Cost/month
    Managed Lakehouse
    Onehouse Managed Lakehouse solution. Contact gtm@onehouse.ai.
    $0.00

    Additional usage costs (1)

     Info

    The following dimensions are not included in the contract terms, which will be charged based on your usage.

    Dimension
    Cost/unit
    Onehouse Consumption Unit
    $0.01

    Vendor refund policy

    We offer refunds on a case-by-case basis. Please contact us at support@onehouse.ai  if you believe you should be eligible.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Resources

    Vendor resources

    Support

    Vendor support

    Onehouse offers multiple tiers of support. Please contact gtm@onehouse.ai  about detailed pricing for each tier. support@onehouse.ai 

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

     Info
    Updated weekly

    Accolades

     Info
    Top
    100
    In Analytic Platforms
    Top
    25
    In Data Analysis
    Top
    10
    In Streaming solutions, ELT/ETL

    Overview

     Info
    AI generated from product descriptions
    Change Data Capture and Incremental Ingestion
    Continuous data ingestion from RDBMS, NoSQL, Kafka, and S3 files using industry-leading change data capture technology with support for multi-regional Kafka ingestion and multiplexed streaming.
    Automated Table Services and Data Management
    Automated management of file-sizing, partitioning, clustering, Z-order/Hilbert-Curves, compaction, masking, encryption, and data deduplication operations.
    Data Quality Validation and Quarantine
    Schema validation, timestamp validation, and automatic quarantine of data that fails quality checks.
    Multi-Catalog Metadata Synchronization
    Integration with AWS Glue Catalog, Hive Metastore, GCP DataProc Metastore, GCP BigQuery with BigLake, DataHub, Snowflake, and Databricks via Onetable.
    Time Travel and Data Versioning
    Table bookkeeping with metadata management, time travel capabilities, auto-savepoints, and recovery functionality for data versioning.
    Unified Lakehouse Architecture
    Fully managed lakehouse platform integrating data storage, analytics, and AI workflows on a single unified platform
    Incremental Compute Engine
    Real-time incremental compute capabilities enabling faster model iteration and scalable experimentation
    Open Standards Support
    Built on open-source technologies and industry-leading open formats with native support for data lake standards
    Multi-Workload Integration
    Seamless integration of batch, streaming, and interactive workloads eliminating traditional data silos
    Unified Governance and Security
    Centralized governance and security controls across all data and AI use cases with vendor-agnostic architecture
    SQL-Based Pipeline Development
    Cloud-native platform enabling pipeline development, testing, and deployment using SQL without requiring complex code, with automated orchestration and scheduling.
    Streaming and Batch Data Ingestion
    Continuous ingestion and replication of data from multiple sources including PostgreSQL, SQLServer, Kinesis, and file systems to Iceberg lakehouse and data warehouse systems.
    Iceberg Live Tables with Automated Optimization
    Managed transformation layer that automates task orchestration, compute scaling, data quality validation, schema evolution, and file system optimization for Iceberg tables.
    Adaptive Iceberg Optimizer
    Optimization engine that profiles data files and write patterns to automatically determine optimization techniques, file selection, and frequency to improve query performance and reduce storage costs.
    Data Observability and Monitoring
    Built-in monitoring and observability capabilities for detecting operational and data-related issues including volume changes, data value drift, and schema evolution with alerting functionality.

    Contract

     Info
    Standard contract
    No

    Customer reviews

    Ratings and reviews

     Info
    4
    3 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    33%
    67%
    0%
    0%
    0%
    0 AWS reviews
    |
    3 external reviews
    External reviews are from G2 .
    Ali M.

    Personal information is properly encrypted

    Reviewed on Oct 19, 2024
    Review provided by G2
    What do you like best about the product?
    Onehouse has been helpful for us managing data lake on cloud, for our case it has been a great tool. It has the advantage of simplifying storage, sorting, and searching data that the professionals involved in data analysis require in their work.
    What do you dislike about the product?
    Onehouse however does not offer easily retrievable information on such integration. Much time and effort is spent trying to locate information regarding supported tools or functionalities, mostly from web searches, or calls to the IT helpdesk.
    What problems is the product solving and how is that benefiting you?
    Onehouse simplifies managing data lakes in the cloud environment to a single house. It also proved useful in creating a unified place for data storage and retrieval and how this information can be effectively parsed through our workforce.
    Mahesh B.

    Onehouse - A secured data lake to use in cloud

    Reviewed on Jul 23, 2024
    Review provided by G2
    What do you like best about the product?
    It has a user friendly interface
    It helps in data management , analytics and reporting
    We can integrate this with various tools
    It solves the scalability issues and also it is cost effective.
    What do you dislike about the product?
    The setup process is complex and time consuming
    There is a support delay
    We have to pay more for advanced features
    What problems is the product solving and how is that benefiting you?
    The data is centralized. Means we can have all the teams or departments or applications data in one platform
    Reporting and analysis is another big problem solved by onehouse
    Market Research

    Exploring the Enigmatic: A Deep Dive into Onehouse

    Reviewed on May 29, 2024
    Review provided by G2
    What do you like best about the product?
    Ease of Implementation,Ease of Use,Number of Features
    What do you dislike about the product?
    Their is nothing to dislike. everything is good.
    What problems is the product solving and how is that benefiting you?
    It helps to provide insight from tha large data base in a minut also support all query engines and prvoide real time analytics.
    View all reviews