
Overview
This dataset provides how-to articles from wikihow.com and their summaries, written as a coherent paragraph. The dataset itself is available at wikisum.zip , and contains the article, the summary, the wikihow url, and an official fold (train, val, or test). In addition, human evaluation results are available at wikisum-human-eval.zip . It consists of human evaluation of the summary of the Pegasus system, annotators response regarding the difficulty of the task, and words they marked as unknown.
Features and programs
Open Data Sponsorship Program
Pricing
This is a publicly available data set. No subscription is required.
How can we make this page better?
Legal
Content disclaimer
Delivery details
AWS Data Exchange (ADX)
AWS Data Exchange is a service that helps AWS easily share and manage data entitlements from other organizations at scale.
Open data resources
Available with or without an AWS account.
- How to use
- To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI). Learn more
- Description
- WikiSum Dataset
- Resource type
- S3 bucket
- Amazon Resource Name (ARN)
- arn:aws:s3:::wikisum
- AWS region
- us-east-1
- AWS CLI access (No AWS account required)
- aws s3 ls --no-sign-request s3://wikisum/
Resources
Vendor resources
Support
Contact
Managed By
How to cite
WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation was accessed on DATE from https://registry.opendata.aws/wikisum .
License
Dataset is published under CC-NC-SA-3.0 . Human evaluation is published under CC-SA-4.0 .