Overview
Full-fidelity metadata, generated without sampling, provides a blueprint of your source data: comprehensive, deep, and exact insights for AI, ML, data science, analytics, governance, test data, and more. The product combines automated discovery and analysis with a processing engine in one solution to help control escalating compute costs. Metadata discovery, analysis, and quality metrics are available through nLite, a no-code, drill-down GUI. If you wish to develop custom solutions using our products, we offer the f2md API Toolkit for programming.
The product is data-store agnostic and hybrid, working directly on files in object stores, data lakes, or databases, without data movement. Its AI-driven metadata engine, f2mdbX, helps you discover schema and quality anomalies, even on unknown or legacy datasets resident in the cloud. It learns your source data in production pipelines so it can identify and inform you of changes in schema, data types, values, or statistics for proactive data monitoring. Metadata on source data extends beyond profiling to rules, referential integrity, accuracy, and privacy, including the identification of sensitive data values.
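To make the idea of detecting schema and data type changes concrete, here is a minimal sketch that compares two schema snapshots. It is an illustration only, not the f2mdbX engine or the f2md API Toolkit: the function name `diff_schema`, the snapshot format (column name mapped to a type label), and the sample columns are all hypothetical.

```python
def diff_schema(baseline: dict[str, str], current: dict[str, str]) -> dict[str, list]:
    """Compare two {column: type} snapshots and report what changed.

    Hypothetical helper for illustration; not part of any product API.
    """
    # Columns that appear only in the newer snapshot.
    added = sorted(set(current) - set(baseline))
    # Columns that disappeared since the baseline.
    removed = sorted(set(baseline) - set(current))
    # Columns present in both snapshots whose type label differs.
    retyped = sorted(
        (name, baseline[name], current[name])
        for name in set(baseline) & set(current)
        if baseline[name] != current[name]
    )
    return {"added": added, "removed": removed, "retyped": retyped}


# Made-up snapshots: yesterday's schema versus today's.
baseline = {"id": "int", "amount": "float", "region": "string"}
current = {"id": "int", "amount": "string", "country": "string"}

drift = diff_schema(baseline, current)
print(drift)
```

A real monitor would also track value distributions and statistics, as the paragraph above describes; this sketch covers only column names and types.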
Available as a self-hosted solution that runs on single-node or multi-node clusters, the product is scalable to suit your data needs and is optimized to work well on a few large files from data warehouses or on thousands of small files from IoT.
Highlights
- Discovery: Hyper-efficient processing of every value in a dataset to generate comprehensive and deep metadata. A bottom-up approach to let data tell its own story without bias or ambiguity.
- Analysis: Insights derived from full-fidelity metadata to identify data issues that are not easily found by sample-based metadata or by running a plethora of SQL queries on source data.
- Performance: Runs as a distributed metadata engine on AWS clusters, taking advantage of both compute and data parallelism, to deliver high performance for both discovery and analysis.
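The difference between full-fidelity and sample-based metadata can be sketched with a toy column profiler that reads every value. This is an assumption-laden illustration, not the product's engine: `infer_type`, `profile_column`, and the sample data are hypothetical names and values chosen for the example.

```python
from collections import Counter


def infer_type(value):
    """Classify one raw value as 'null', 'int', 'float', or 'string'."""
    if value is None or value.strip() == "":
        return "null"
    try:
        int(value)
        return "int"
    except ValueError:
        pass
    try:
        float(value)
        return "float"
    except ValueError:
        return "string"


def profile_column(values):
    """Scan every value (no sampling) and return simple column metadata."""
    type_counts = Counter()
    distinct = set()
    for value in values:
        type_counts[infer_type(value)] += 1
        distinct.add(value)
    return {
        "rows": sum(type_counts.values()),
        "types": dict(type_counts),
        "distinct": len(distinct),
    }


# Made-up column: 9,999 clean integers and a single stray "N/A".
column = ["1"] * 9_999 + ["N/A"]
profile = profile_column(column)
print(profile)
```

Because the scan visits all 10,000 values, the single non-numeric entry shows up in the type counts. A small random sample of the same column would most likely contain only integers and report the column as clean.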
Details
Typical total price
$8.896/hour
Pricing
Instance type | Product cost/hour | EC2 cost/hour | Total/hour |
---|---|---|---|
m5.4xlarge | $3.68 | $0.768 | $4.448 |
m5.8xlarge | $7.36 | $1.536 | $8.896 |
m6i.4xlarge | $3.68 | $0.768 | $4.448 |
m6i.8xlarge (Recommended) | $7.36 | $1.536 | $8.896 |
m6i.12xlarge | $11.04 | $2.304 | $13.344 |
m6i.16xlarge | $14.72 | $3.072 | $17.792 |
c6i.8xlarge | $7.36 | $1.36 | $8.72 |
c6i.12xlarge | $11.04 | $2.04 | $13.08 |
c6i.16xlarge | $14.72 | $2.72 | $17.44 |
Additional AWS infrastructure costs
Type | Cost |
---|---|
EBS General Purpose SSD (gp2) volumes | $0.10 per GB/month of provisioned storage |
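The tables above can be turned into a rough cost estimate. The sketch below copies the hourly rates and the gp2 storage rate from the tables; the function names are hypothetical, and multiplying the per-instance rate by the number of nodes is an assumption about how multi-node clusters are billed, so check it against your actual AWS bill.

```python
from decimal import Decimal

# (product cost/hour, EC2 cost/hour) in USD, copied from the pricing table.
HOURLY_RATES = {
    "m5.4xlarge": (Decimal("3.68"), Decimal("0.768")),
    "m5.8xlarge": (Decimal("7.36"), Decimal("1.536")),
    "m6i.4xlarge": (Decimal("3.68"), Decimal("0.768")),
    "m6i.8xlarge": (Decimal("7.36"), Decimal("1.536")),
    "m6i.12xlarge": (Decimal("11.04"), Decimal("2.304")),
    "m6i.16xlarge": (Decimal("14.72"), Decimal("3.072")),
    "c6i.8xlarge": (Decimal("7.36"), Decimal("1.36")),
    "c6i.12xlarge": (Decimal("11.04"), Decimal("2.04")),
    "c6i.16xlarge": (Decimal("14.72"), Decimal("2.72")),
}

# EBS gp2 rate in USD per GB per month of provisioned storage.
EBS_GP2_PER_GB_MONTH = Decimal("0.10")


def total_per_hour(instance_type):
    """Total/hour = product cost/hour + EC2 cost/hour."""
    product, ec2 = HOURLY_RATES[instance_type]
    return product + ec2


def compute_cost(instance_type, hours, nodes=1):
    """Estimated compute cost; assumes each node is billed at the same rate."""
    return total_per_hour(instance_type) * Decimal(str(hours)) * nodes


def ebs_cost(gb, months=1):
    """Estimated gp2 storage cost for provisioned capacity."""
    return EBS_GP2_PER_GB_MONTH * Decimal(str(gb)) * months


print(total_per_hour("m6i.8xlarge"))             # matches the table's Total/hour
print(compute_cost("m6i.8xlarge", 10, nodes=2))  # 10 hours on a 2-node cluster
print(ebs_cost(500))                             # 500 GB provisioned for 1 month
```

`Decimal` is used instead of `float` so that sums such as 7.36 + 1.536 match the table exactly.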
Vendor refund policy
We do not currently support refunds, but you can cancel at any time.
Delivery details
64-bit (x86) Amazon Machine Image (AMI)
Amazon Machine Image (AMI)
An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.
Version release notes
Update
Support
Vendor support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.