Brain Language Metrics on Company Filings (History Trial)

The Brain Language Metrics on Company Filings (BLMCF) dataset has the objective of monitoring several language metrics on 10-Ks and 10-Qs company reports for approximately 6000+ US stocks. Example of metrics are financial sentiment, percentage of specific language type in the document (e.g. litigious language) and similarity among documents. This extended version provides additional language metrics and an analysis of the whole report together with specific report sections (e.g. Risk Factors).

View purchase options

Overview

Try agent mode

Create proposal

Ask question

Brain Language Metrics on Company Filings

Overview

The Brain Language Metrics on Company Filings (BLMCF) dataset has the objective of monitoring several language metrics on 10-Ks and 10-Qs company reports for approximately 6000+ US stocks.

Recent literature works claim inefficiencies in the market response to company filings information due to the increased complexity and length of such reports, see for example "Lazy Prices" Cohen et al. 2018 or "The Positive Similarity of Company Filings and the Cross-Section of Stock Returns", M. Padysak 2020 .

Some literature works claim inefficiencies in the market response to company filings information due to the increased complexity and length of such reports; over the last 20 years, the length of the average 10-K has in fact increased dramatically.

Our dataset is made of two parts; the first one includes the language metrics of the most recent 10-K or 10-Q report for each firm, namely:

Financial sentiment
Percentage of words belonging to financial domain classified by language types: constraining, litigious, uncertainty and interesting language.
Readability score
Lexical metrics such as lexical density and richness
Text statistics such as the report length and the average sentence length

The second part includes the differences between the two most recent 10-Ks or 10-Qs reports of the same period for each company, namely:

Difference of the various language metrics (e.g. delta sentiment, delta readability score delta, delta percentage of a specific language type etc.)
Similarity metrics between documents, also with respect to a specific language type (for example similarity with respect to “litigious” language or “uncertainty” language)

Our dataset includes the metrics and related differences both for the whole report and for specific sections (Risk Factors and Management Discussion and Analysis)

Feed Details

The dataset is updated with a daily frequency since new 10-Ks and 10-Qs reports are released every day for some of the universe companies. Clearly the largest update will be around February, April, August and November when the largest number of reports is released. The historical dataset is available from year 2010.

Data Dictionary

Factsheet

Historical Trial

The dataset contains historical data from January 2010 that can be freely accessed for 2 months. For a live feed please contact us at support@braincompany.co and we will make accessible a customized version of the product on AWS Data Exchange according to Client requirements.

Disclaimer

The content of this dataset is not to be intended as investment advice. The material is provided for informational purposes only and does not constitute an offer to sell, a solicitation to buy, or a recommendation or endorsement for any security or strategy, nor does it constitute an offer to provvaluee investment advisory or other services by Brain. Brain makes no guarantees regarding the accuracy and completeness of the information expressed in the dataset.

Details

Sold by

Brain

Introducing multi-product solutions

You can now purchase comprehensive solutions tailored to use cases and industries.

Learn more

Explore multi-product solutions

Features and programs

Financing for AWS Marketplace purchases

AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

View financing details

Pricing

Brain Language Metrics on Company Filings (History Trial)

Info

View purchase options

This product is available free of charge. Free subscriptions have no end date and may be canceled any time.

Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator to estimate your infrastructure costs.

Vendor refund policy

No refunds are offered for this product, for more information please contact support@braincompany.co

How can we make this page better?

We'd like to hear your feedback and ideas on how to improve this page.

Legal

Vendor terms and conditions

Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

Content disclaimer

Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

Usage information

Info

Delivery details

AWS Data Exchange (ADX)

AWS Data Exchange is a service that helps AWS easily share and manage data entitlements from other organizations at scale.

Additional details

Data sets (1)

Info

You will receive access to the following data sets.

Data set name	Type	Historical revisions	Future revisions	Sensitive information	Data dictionaries	Data samples
Brain Language Metrics on Company Filings - Extended Version (Historical Trial)		All historical revisions	All future revisions		Not included	Not included

Resources

Vendor resources

Support contact URL

Similar products

Brain Language Metrics on Earnings Calls Transcripts (History Trial)

By Brain

The Brain Language Metrics on Earnings Calls Transcripts (BLMECT) dataset has the objective of monitoring several language metrics for the quarterly earnings call transcripts of 4500+ US stocks. With this dataset we aim at providing additional building blocks to asset managers to build investment strategies based on alternative data.

View product

Metastatic Brain Tumor Disease State

By Perception Health

Identify risk of being diagnosed with a Metastatic Brain Tumor

View product

Clinical Brain unlimited

By MedicineOne

We are going to use an extra brain to help combat clinical error! MedicineOne has created Clinical Brain, a clinical intelligence platform to support medical decisions and help reduce clinical error.

View product

BYOB - Build Your Own Brain

By Akaike

BYOB - Build Your Own Brain can seamlessly connect to different modalities data from multiple sources, understand and analyze, monitor your data real time, converse with it, and support decision-making. We are centralizing data to decentralize wisdom so that stakeholders can spend more of their time in second-order decision making.

View product

DeepBrain AI - AI Human - SDK (Software Development Kit)

By DeepBrain AI

DeepBrain AI creates AI technologies such as video and speech synthesis, live chatbots, and more. Create your digital human today to elevate your customer experience and engagement with the power of conversational AI.

View product

Amdocs brAIn Platform

By Amdocs

Are you looking for a GenAI solution that speeds time to market, reduces costs, and improves product quality in your testing practice? The brAIn by the Amdocs QE Studio transforms your quality engineering operations through AI-driven automation, predictive analytics, and cognitive intelligence, providing a comprehensive testing solution that streamlines operations, identifies defects early, and optimizes workflows.

View product