Overview
MDACA Big Data Virtualization (BDV) is an enterprise-grade MPP (massively parallel processing) Federated SQL query engine that is fast and scalable. BDV is designed for speed, supporting enterprise and government organizations in their big data query needs. BDV provides organizations with a logical data layer that integrates enterprise data across disparate systems and manages the unified data for centralized access.
BDV supports data queries across systems without data copy and replication, thereby bolstering master data management, Analytics, Insights, AI/ML, and legacy data migration initiatives while reducing cost. It is operationally optimized for Amazon Web Services (AWS) and offers a comprehensive data abstraction, federation, integration, and transformation layer. Easily integrate MDACA BDV with AWS core service offerings such as Redshift, Aurora, AWS RDS, AWS Lake Formation, Glue, S3, Amazon Quick Sights, EMR, and SageMaker. Additionally, easily integrate with MDACA enterprise tools through single sign-on (sso) with MDACA products such as Data Explorer, Cloud Storage Explorer, Data Lake and Synthetic Data Engine - all designed to provide advanced enterprise capabilities for big data solutions. MDACA is powered by SpinSys, an AWS Advanced Tier partner offering Professional Services support through the AWS Marketplace.
MDACA BDV is designed to support enterprise and government big data initiatives, adding integration, optimization, security enhancement, and add-ons to industry leading best practice approaches and standards. BDV supports enterprises with advanced DoD-grade security features, connectors, performance optimizations, and fully integrated tools within the enterprise MDACA big data platform.
Key features/benefits of MDACA BDV include:
-
Logical Data Layer - BDV provides a virtual approach to accessing, managing, and delivering data. Execute high performance data queries across multiple environments with robust Massively Parallel Processing (MPP). Legacy Data Migration Support - BDV reduce risk of system modernization by updating business applications while replacing legacy systems.
-
Query Federation - BDV enables access to data from multiple systems within a single query. Supporting integration with enterprise Extract Transform Load (ETL) tools, BI tools, AI/ML engines as well as meeting the needs for enterprise big data analytics
-
De-identified Data Support- BDV provides out-of the box data de-identification support that is managed through user access control groups. This provides a major functionality for data to be accessible by multiple teams while adhering to data privacy rules in systems, supporting a wide variety of business needs such as healthcare, financial, logistics, sales and legal.
-
Synthetic Data Support - Fully integrated with the MDACA Synthetic Data Engine, BDV provides organizations and enterprises with an approach to leveraging synthetic data for enterprise software development, AI/ML, and other initiatives. Eliminate Data Silos - BDV delivers integrated information while reducing data silos, allowing data to remain in source systems and reducing number of data copies.
-
Data Management Support- BDV provides a centralized, secure Role Based Access Control (RBAC) layer to catalog, search, discover, and govern unified data and its relationships.
-
Advanced Query, Insights, AI/ML Support - BDV supports American National Standards Institute (ANSI) Structured Query Language (SQL) semantics, including complex queries, aggregations, and sub-queries.
-
Data Integration - BDV integrates data across enterprise systems, supporting a wide range of data formats and sources. Scaled to Support Business and Security Needs - BDV scales easily to run large queries and on-demand clusters coupled with fine grained security and privacy controls.
-
High Performance Queries - BDV supports highly parallel and distributed queries built from the ground up for efficient, low latency analytics.
-
Cost Management - Cost management is a key component of managing the success of efforts that need to execute within given budgets. BDV provides the ability to securely execute queries across the enterprise without the need to copy and duplicate data, provides options for project delivery efficiency, and plays a major role in cost management and savings on a given effort.
BDV is a core component of the MDACA DIGIN offering, providing a logical data layer and tools that integrate enterprise data across disparate systems. It manages the unified data for centralized access, allowing you to scale your on-prem workloads to the cloud at the speed of the cloud regardless of where your data is deployed.
Highlights
- Delivers integrated information while reducing data silos, allowing data to remain in source systems and reducing number of data copies
- Provides a single view of enterprise data - while concealing the technical complexities of database types, data locations, and data transformations - making it easy for business owners to understand
- Optimized for deployment on AWS (both Commercial and GovCloud) with advanced security integration using Keycloak and Apache Ranger
Details
Typical total price
$4.368/hour
Pricing
Instance type | Product cost/hour | EC2 cost/hour | Total/hour |
---|---|---|---|
m4.2xlarge | $1.80 | $0.40 | $2.20 |
m4.4xlarge | $3.60 | $0.80 | $4.40 |
m4.10xlarge | $4.80 | $2.00 | $6.80 |
m5.2xlarge | $1.80 | $0.384 | $2.184 |
m5.4xlarge Recommended | $3.60 | $0.768 | $4.368 |
m5.8xlarge | $4.80 | $1.536 | $6.336 |
Additional AWS infrastructure costs
Type | Cost |
---|---|
EBS General Purpose SSD (gp3) volumes | $0.08/per GB/month of provisioned storage |
Vendor refund policy
no refunds
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
64-bit (x86) Amazon Machine Image (AMI)
Amazon Machine Image (AMI)
An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.
Version release notes
Additional details
Usage instructions
Please view the MDACA Big Data Virtualization Launch Guide: https://mdaca.io/support/documentation/big-data-virtualization-documentation/launch-guide/
Resources
Support
Vendor support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.