
Overview
VOiCES is a speech corpus recorded in acoustically challenging settings, using distant microphone recording. Speech was recorded in real rooms with various acoustic features (reverb, echo, HVAC systems, outside noise, etc.). Adversarial noise, either television, music, or babble, was concurrently played with clean speech. Data was recorded using multiple microphones strategically placed throughout the room. The corpus includes audio recordings, orthographic transcriptions, and speaker labels.
Features and programs
Open Data Sponsorship Program
Pricing
This is a publicly available data set. No subscription is required.
How can we make this page better?
Legal
Content disclaimer
Delivery details
AWS Data Exchange (ADX)
AWS Data Exchange is a service that helps AWS easily share and manage data entitlements from other organizations at scale.
Open data resources
Available with or without an AWS account.
- How to use
- To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI). Learn more
- Description
- wav audio files, orthographic transcriptions, and speaker ID
- Resource type
- S3 bucket
- Amazon Resource Name (ARN)
- arn:aws:s3:::lab41openaudiocorpus
- AWS region
- us-east-1
- AWS CLI access (No AWS account required)
- aws s3 ls --no-sign-request s3://lab41openaudiocorpus/
Resources
Vendor resources
Support
Managed By
How to cite
Voices Obscured in Complex Environmental Settings (VOiCES) was accessed on DATE from https://registry.opendata.aws/lab41-sri-voices .
License
Creative Commons BY 4.0 (see here for more details)