Machine-readable data from certain electronic 990 forms filed with the IRS from 2011 to present are available for anyone to use via Amazon S3.

Form 990 is the form used by the United States Internal Revenue Service to gather financial information about nonprofit organizations. Data for each 990 filing is provided in an XML file that contains structured information that represents the main 990 form, any filed forms and schedules, and other control information describing how the document was filed. Some non-disclosable information is not included in the files.
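As a rough illustration of working with such a file, the sketch below counts how often each element name occurs in an XML document using Python's standard library. The sample document, its element names, and its namespace are invented placeholders; real filings follow the IRS schemas and are much larger.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Made-up miniature document for illustration only. The element names and
# the namespace are placeholders, not taken from the IRS schemas.
SAMPLE_XML = """<Return xmlns="http://example.invalid/ns">
  <ReturnHeader>
    <Filer><Name>Example Org</Name></Filer>
  </ReturnHeader>
  <ReturnData>
    <Schedule/>
    <Schedule/>
  </ReturnData>
</Return>"""


def local_name(tag: str) -> str:
    """Strip the '{namespace}' prefix ElementTree puts on qualified tags."""
    return tag.rsplit("}", 1)[-1]


def count_elements(xml_text: str) -> Counter:
    """Count how often each element name occurs anywhere in the document."""
    root = ET.fromstring(xml_text)
    return Counter(local_name(element.tag) for element in root.iter())


counts = count_elements(SAMPLE_XML)
for name, count in sorted(counts.items()):
    print(name, count)
```

A summary like this can be a quick first look at which forms and schedules a given filing contains before writing schema-specific parsing code.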

This data set includes Forms 990, 990-EZ, and 990-PF that have been electronically filed with the IRS. It is provided in XML format and is updated regularly. The data can be used to research and analyze organizations that have electronically filed these forms. Forms 990-N (e-Postcard) are not available within this data set; they can be viewed and downloaded from the IRS website.

Each electronic 990 filing is available as a unique XML file in the "irs-form-990" S3 bucket in the US East (N. Virginia) region. Schemas for electronic 990 filings are available on the IRS website. Each filing is named based on the year it was filed and a unique identifier. For example, we can tell that the filing named "201541349349307794_public.xml" was filed in 2015 because the file name starts with "2015"; the remaining digits, "41349349307794", are the unique identifier of the filing.
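The naming convention can be sketched in Python as follows. The exact pattern (four-digit year, a run of digits, then the "_public.xml" suffix) is inferred from the single example above and is an assumption; the function name parse_filing_name is hypothetical.

```python
import re

# Pattern assumed from the one example in the text: a four-digit year,
# then the identifier digits, then the "_public.xml" suffix.
_FILING_NAME = re.compile(r"^(\d{4})(\d+)_public\.xml$")


def parse_filing_name(name: str) -> tuple[int, str]:
    """Split a filing file name into (filing year, unique identifier)."""
    match = _FILING_NAME.match(name)
    if match is None:
        raise ValueError(f"unexpected filing name: {name!r}")
    return int(match.group(1)), match.group(2)


year, identifier = parse_filing_name("201541349349307794_public.xml")
print(year, identifier)  # 2015 41349349307794
```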

All of the data is publicly accessible via the S3 bucket's HTTPS endpoint at https://s3.amazonaws.com/irs-form-990. No authentication is required to download data over HTTPS. For example, the example filing mentioned above can be accessed at https://s3.amazonaws.com/irs-form-990/201541349349307794_public.xml.
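A minimal sketch of an unauthenticated download using only Python's standard library is shown below. The helper names filing_url and download_filing are hypothetical, and the sketch assumes the bucket is served at the HTTPS endpoint given above.

```python
from urllib.parse import quote
from urllib.request import urlopen

# HTTPS endpoint of the bucket, as given in the text.
BASE_URL = "https://s3.amazonaws.com/irs-form-990"


def filing_url(file_name: str) -> str:
    """Build the public HTTPS URL for one filing."""
    return f"{BASE_URL}/{quote(file_name)}"


def download_filing(file_name: str, timeout: float = 30.0) -> bytes:
    """Fetch the raw XML bytes of a filing; no credentials are sent."""
    with urlopen(filing_url(file_name), timeout=timeout) as response:
        return response.read()


print(filing_url("201541349349307794_public.xml"))
```

Calling download_filing("201541349349307794_public.xml") would perform the actual network request and return the file contents as bytes.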

Index listings of available filings are provided as JSON and CSV files, organized by the year the filings were made. Index files exist for each year going back to 2011 and are named based on their year and file type. For example, the CSV index for 2011 is available at https://s3.amazonaws.com/irs-form-990/index_2011.csv, and the JSON index file for 2015 is available at https://s3.amazonaws.com/irs-form-990/index_2015.json.
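The index naming scheme lends itself to a small helper. The following Python sketch builds an index URL from a year and a file type; the function name index_url and its validation rules are illustrative assumptions based on the description above.

```python
BASE_URL = "https://s3.amazonaws.com/irs-form-990"
INDEX_FORMATS = ("csv", "json")
FIRST_INDEX_YEAR = 2011  # earliest year mentioned in the text


def index_url(year: int, file_format: str) -> str:
    """Return the URL of the index file for a filing year and file type."""
    fmt = file_format.lower()
    if fmt not in INDEX_FORMATS:
        raise ValueError(f"file_format must be one of {INDEX_FORMATS}")
    if year < FIRST_INDEX_YEAR:
        raise ValueError(f"no index files before {FIRST_INDEX_YEAR}")
    return f"{BASE_URL}/index_{year}.{fmt}"


print(index_url(2011, "csv"))
print(index_url(2015, "json"))
```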

These index files include basic information about each filing, including the name of the filer, the Employer Identification Number (EIN) of the filer, the date of the filing, and the unique identifier for the filing.
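One way to use a CSV index is to look up every filing for a given EIN. The sketch below does this with Python's csv module on a few invented rows. The column names (EIN, TAXPAYER_NAME, SUB_DATE, OBJECT_ID) and all values are assumptions made for illustration; check the header row of a real index file before relying on them.

```python
import csv
import io

# Invented rows for illustration only. Column names and values are
# assumptions, not copied from a real index file.
SAMPLE_INDEX = """EIN,TAXPAYER_NAME,SUB_DATE,OBJECT_ID
000000001,EXAMPLE FOUNDATION,2015-01-15,201500000000000001
000000002,SAMPLE CHARITY INC,2015-02-20,201500000000000002
000000001,EXAMPLE FOUNDATION,2015-11-03,201500000000000003
"""


def filings_for_ein(index_csv: str, ein: str,
                    ein_column: str = "EIN") -> list[dict[str, str]]:
    """Return the index rows whose EIN column equals the given EIN."""
    reader = csv.DictReader(io.StringIO(index_csv))
    return [row for row in reader if row[ein_column] == ein]


matches = filings_for_ein(SAMPLE_INDEX, "000000001")
for row in matches:
    print(row["OBJECT_ID"], row["TAXPAYER_NAME"], row["SUB_DATE"])
```

EINs are compared as strings rather than integers so that leading zeros are preserved.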

If you use the AWS Command Line Interface, you can list the index files and calculate the total size of the files with the following "ls" command:

aws s3 ls s3://irs-form-990/index --human-readable --summarize

Source U.S. Internal Revenue Service
Category Regulatory
Format xml, json, csv
License None
Storage Service Amazon S3
Location s3://irs-form-990 in US East (N. Virginia) Region
Update Frequency New filings are added regularly


Educators, researchers and students can also apply for free credits to take advantage of the utility computing platform offered by AWS, along with Public Datasets such as IRS 990 Filings on AWS. If you have a research project that could take advantage of IRS 990 Filings on AWS, you can apply for AWS Cloud Credits for Research.