AI Ready Data

The AI Ready Data dataset encompasses a comprehensive array of textual content across Energy publications produced by in-house editorial and research teams, including market reports, news articles, rationales, commentaries, fundamentals analyses, outlooks, and more - all in an LLM-friendly format prepared for seamless integration with AI systems.

View purchase options

Overview

Try agent mode

Create proposal

Ask question

The offer comprises a sample data set containing approximately 100 dated records

Customers can effortlessly leverage AI Ready Data for their Retrieval-Augmented Generation (RAG) solutions, enhancing their analytical capabilities and driving informed decision-making. This dataset removes restrictions as you integrate your choice of large language models (LLMs), to uncover patterns, correlations, and insights across commodities. Our flexibility aids processing and understanding data to suit your organizations, and you can utilize the provided data embeddings or set your own as per your preference. Additionally, you can integrate with your own vector database and leverage various internal and external data sources to enrich the dataset.

This dataset includes:

Unstructured data in an AI-ready format broken down into documents and segments with LLM-friendly metadata
Flexible data delivery
Easy customization of your own search and relevancy-boosting algorithms
Ease of discovery of relevant content for your end users

Sample Fields:

DOCUMENT_METADATA
PUBLISHED
UPDATED
FILETYPE
FILESIZE
SOURCEURL
REPORTINGFREQUENCY
PRIMARYENTITYTYPE
PRIMARYENTITYNAME
DOCUMENT_PRIMARY_ENTITY_IDF
OTHERDOCUMENTMETADATA

SEGMENT_METADATA
DOCUMENTID
SEGMENTATIONSTRATEGY
SEGMENTID
SEGMENTTYPE
SEGMENTLOCATION
RAWSEGMENTCONTENT
PROCESSEDSEGMENTCONTENT
LANGUAGE
SEGMENTOVERLAP
OTHERSEGMENTMETADATA
SEGMENTEMBEDDINGS
SEGMENTORDER

Tables:

DOCUMENT_METADATA
Contains metadata about various documents such as id, name, file type, size, sourceURL, and reportingFrequency. Additionally, it includes related tags like primary entity, commodity, geography, and any additional metadata that helps in identifying the document.

SEGMENT_METADATA
Contains chunked segments from documents along with metadata such as related document id, segment id, type, location, along with the processed and raw content of the segment. Additionally, it contains information on the segmentation strategy used to chunk the data and the embedding ids for each segment.

Highlights

Machine Learning Leverage AI Ready data from Energy as a RAG solution.
Pricing Analysis Uncover insights into pricing assessments using information from assessment summaries, market commentaries, and rationales.
Sentiment Analysis Perform sentiment analysis on News articles leveraging dedicated libraries.

Details

Sold by

S&P Global Energy

Introducing multi-product solutions

You can now purchase comprehensive solutions tailored to use cases and industries.

Learn more

Explore multi-product solutions

Features and programs

Financing for AWS Marketplace purchases

AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

View financing details

Pricing

AI Ready Data

Info

View purchase options

This product is available free of charge. Free subscriptions have no end date and may be canceled any time.

Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator to estimate your infrastructure costs.

Vendor refund policy

Refunds are not offered for this product.

How can we make this page better?

Tell us how we can improve this page, or report an issue with this product.

Legal

Vendor terms and conditions

Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

Content disclaimer

Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

Usage information

Info

Delivery details

AWS Data Exchange (ADX)

AWS Data Exchange is a service that helps AWS easily share and manage data entitlements from other organizations at scale.

Additional details

Data sets (1)

Info

You will receive access to the following data sets.

Data set name	Type	Historical revisions	Future revisions	Sensitive information	Data dictionaries	Data samples
ai-ready-data-cet		All historical revisions	All future revisions		Not included	Not included

Resources

Vendor resources

Support contact URL

Similar products

CirrusHQ: AI-Ready Data Lakehouse & Vector Store on AWS

By CirrusHQ Ltd

CirrusHQ builds the governed AWS data lakehouse, catalog and vector store that gets your data ready for AI, analytics and BI. Built on Amazon Redshift, AWS Glue, AWS Lake Formation, Amazon Athena and Amazon OpenSearch Serverless, delivered by a CirrusHQ builds the governed AWS data lakehouse, catalog and vector store that gets your data ready for AI, analytics and BI.

View product

IBM watsonx.data as a Service - GenAI Ready Data Lakehouse for AWS

By IBM Software

IBM watsonx.data is an open, hybrid data lakehouse with built-in data fabric and multi-engine optimization to prepare structured and unstructured data for AI.

View product

Build AI ready Data Architecture with Data Lakes

By Applify, Inc.

Empower your business with Applify's AI-ready data architecture services. Our solutions leverage AWS analytics to transform data into actionable insights, fostering growth and efficiency.

View product

Lakestack: AI Ready Data Lakehouse for Automotive & Mobility

By Applify, Inc.

Lakestack is an AWS native SaaS data platform for the automotive and mobility sector. It consolidates vehicle telemetry, ERP, CRM, and supply chain data into a governed, AI ready foundation, enabling predictive maintenance, warranty analytics, and customer experience insights in under 4 weeks.

View product

Lakestack: No Code, AI Ready Data Platform for Retail

By Applify, Inc.

Lakestack is an AWS native SaaS data platform for retailers. It unifies POS, ERP, CRM, and loyalty data into a governed, AI ready foundation, enabling real time dashboards, predictive analytics, and improved customer insights in under 4 weeks.

View product