Listing Thumbnail

    Brain Language Metrics on Company Filings (History Trial)

     Info
    Sold by: Brain 
    Deployed on AWS
    The Brain Language Metrics on Company Filings (BLMCF) dataset has the objective of monitoring several language metrics on 10-Ks and 10-Qs company reports for approximately 6000+ US stocks. Example of metrics are financial sentiment, percentage of specific language type in the document (e.g. litigious language) and similarity among documents. This extended version provides additional language metrics and an analysis of the whole report together with specific report sections (e.g. Risk Factors).

    Overview

    Brain Language Metrics on Company Filings


    Overview

    The Brain Language Metrics on Company Filings (BLMCF) dataset has the objective of monitoring several language metrics on 10-Ks and 10-Qs company reports for approximately 6000+ US stocks.

    Recent literature works claim inefficiencies in the market response to company filings information due to the increased complexity and length of such reports, see for example "Lazy Prices" Cohen et al. 2018  or "The Positive Similarity of Company Filings and the Cross-Section of Stock Returns", M. Padysak 2020 .

    Some literature works claim inefficiencies in the market response to company filings information due to the increased complexity and length of such reports; over the last 20 years, the length of the average 10-K has in fact increased dramatically.

    Our dataset is made of two parts; the first one includes the language metrics of the most recent 10-K or 10-Q report for each firm, namely:

    • Financial sentiment

    • Percentage of words belonging to financial domain classified by language types: constraining, litigious, uncertainty and interesting language.

    • Readability score

    • Lexical metrics such as lexical density and richness

    • Text statistics such as the report length and the average sentence length

    The second part includes the differences between the two most recent 10-Ks or 10-Qs reports of the same period for each company, namely:

    • Difference of the various language metrics (e.g. delta sentiment, delta readability score delta, delta percentage of a specific language type etc.)

    • Similarity metrics between documents, also with respect to a specific language type (for example similarity with respect to “litigious” language or “uncertainty” language)

    Our dataset includes the metrics and related differences both for the whole report and for specific sections (Risk Factors and Management Discussion and Analysis)


    Feed Details

    The dataset is updated with a daily frequency since new 10-Ks and 10-Qs reports are released every day for some of the universe companies. Clearly the largest update will be around February, April, August and November when the largest number of reports is released. The historical dataset is available from year 2010.

    Data Dictionary 

    Factsheet 


    Historical Trial

    The dataset contains historical data from January 2010 that can be freely accessed for 2 months. For a live feed please contact us at support@braincompany.co  and we will make accessible a customized version of the product on AWS Data Exchange according to Client requirements.


    Disclaimer

    The content of this dataset is not to be intended as investment advice. The material is provided for informational purposes only and does not constitute an offer to sell, a solicitation to buy, or a recommendation or endorsement for any security or strategy, nor does it constitute an offer to provvaluee investment advisory or other services by Brain. Brain makes no guarantees regarding the accuracy and completeness of the information expressed in the dataset.

    Details

    Sold by

    Delivery method

    Deployed on AWS
    New

    Introducing multi-product solutions

    You can now purchase comprehensive solutions tailored to use cases and industries.

    Multi-product solutions

    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Brain Language Metrics on Company Filings (History Trial)

     Info
    This product is available free of charge. Free subscriptions have no end date and may be canceled any time.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    Vendor refund policy

    No refunds are offered for this product, for more information please contact support@braincompany.co 

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    AWS Data Exchange (ADX)

    AWS Data Exchange is a service that helps AWS easily share and manage data entitlements from other organizations at scale.

    Additional details

    Data sets (1)

     Info

    You will receive access to the following data sets.

    Data set name
    Type
    Historical revisions
    Future revisions
    Sensitive information
    Data dictionaries
    Data samples
    Brain Language Metrics on Company Filings - Extended Version (Historical Trial)
    All historical revisions
    All future revisions
    Not included
    Not included

    Resources

    Vendor resources

    Similar products