Reuters News Archive: Chinese Document Alignment (30 Days) - SAMPLE

Provided By: Reuters

Reuters’ Mandarin Chinese Document Alignment Package pairs each English news story with the Mandarin Chinese translation of that story that Reuters published. The content is primarily financial and general news. Because the documents are pre-aligned, the dataset is well suited to training and evaluating machine translation models.

Product offers

The following offers are available for this product. Choose an offer to view the pricing and access duration options for the offer. Select an offer and continue to subscribe. Your subscription begins on the date that your request is approved by the provider. Additional taxes or fees might apply.

Public offer

Payment schedule: Upfront payment | Offer auto-renewal: Supported
$0 for 1 month

Overview

ABOUT THIS DATASET

A data extract from our plain-text Mandarin Chinese news archive, covering 30 days of content structured in our standard NewsML-G2 XML format.

Content Type: Text (txt)

Language: Mandarin (Zh)

Date Range: September 1st - September 30th of 2019

Format: NewsML-G2

Encoding: UTF-8

Catalog Reference: http://www.iptc.org/std/catalog/catalog.IPTC-G2-Standards_3.xml 

Basic Metadata:

Description | Field
Unique Story ID | transmitID, guid
Publication Date | firstCreated
Copyright Holder | rightsInfo
Filename | fileName
Content Type | channel, signal qcode="prodId:TXT", itemClass
Language | language
Title | title
Urgency | urgency
Located (Country/Province or State) | located
Categorization, Topic and Region Codes | subject qcode="N2:XXX"
Creator | creator
Slug Line | slugline
Headline | headline
Dateline | dateline
Author | by
Credit Line | creditline
Description | description
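
As a rough illustration of how a few of the fields listed above might be read from a NewsML-G2 item, here is a minimal Python sketch using only the standard library. The embedded sample document is hand-written and hypothetical, not taken from the Reuters archive, and the element positions it assumes (guid as an attribute of newsItem, firstCreated under itemMeta, the rest under contentMeta) follow the general IPTC NewsML-G2 layout. The actual files in this dataset may be wrapped or arranged differently, so treat the paths and the helper names (text_of, split_qcode, extract_metadata) as assumptions to check against a real file.

```python
import xml.etree.ElementTree as ET

# Namespace used by IPTC NewsML-G2 documents.
NS = {"nar": "http://iptc.org/std/nar/2006-10-01/"}

# Hypothetical, hand-written sample. NOT an actual Reuters item.
SAMPLE = """<?xml version="1.0" encoding="UTF-8"?>
<newsItem xmlns="http://iptc.org/std/nar/2006-10-01/"
          guid="tag:example.com,2019:newsml_SAMPLE0001" version="1">
  <itemMeta>
    <firstCreated>2019-09-01T08:00:00Z</firstCreated>
  </itemMeta>
  <contentMeta>
    <urgency>3</urgency>
    <language tag="zh"/>
    <subject qcode="N2:XXX"/>
    <slugline>SAMPLE-SLUG</slugline>
    <headline>示例标题</headline>
  </contentMeta>
</newsItem>
"""


def text_of(parent, path):
    """Return the stripped text of the first match for path, or None."""
    node = parent.find(path, NS)
    return node.text.strip() if node is not None and node.text else None


def split_qcode(qcode):
    """Split a qcode such as 'N2:XXX' into (scheme alias, code)."""
    scheme, _, code = qcode.partition(":")
    return scheme, code


def extract_metadata(xml_text):
    """Collect a handful of metadata fields from one newsItem document."""
    # Parse bytes so the XML declaration's encoding is honoured.
    root = ET.fromstring(xml_text.encode("utf-8"))
    language = root.find("nar:contentMeta/nar:language", NS)
    subjects = root.findall("nar:contentMeta/nar:subject", NS)
    return {
        "guid": root.get("guid"),
        "firstCreated": text_of(root, "nar:itemMeta/nar:firstCreated"),
        "urgency": text_of(root, "nar:contentMeta/nar:urgency"),
        "language": language.get("tag") if language is not None else None,
        "subjects": [split_qcode(s.get("qcode", "")) for s in subjects],
        "slugline": text_of(root, "nar:contentMeta/nar:slugline"),
        "headline": text_of(root, "nar:contentMeta/nar:headline"),
    }


if __name__ == "__main__":
    for field, value in extract_metadata(SAMPLE).items():
        print(f"{field}: {value}")
```

Fields that are absent from a given item come back as None (or an empty list for subjects), which keeps the sketch tolerant of items that omit optional metadata.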

   

ABOUT REUTERS ## button:Learn More 

As the world’s largest news agency, Reuters continuously produces substantial multimedia content that you can use to build and thoroughly test your AI. Our large body of trusted news data grows daily, with 200 transcribed videos and over 1,500 images with intelligent metadata added per day, and 2.2 million translated text articles added every year.

Our news data is professionally produced and fully-licensed, allowing you to reach insights with greater speed and effectiveness:

  • Rights: Reuters has the proprietary rights to our data corpus and visual assets

  • Trust & Accuracy: Over 2,000 media companies rely on Reuters news to make editorial and business decisions every day. Guided by the Reuters Trust Principles, our news preserves integrity, independence and freedom from bias

  • Diversity: Broad coverage of major topics from over 200 global locations and 16 languages, including business, finance, politics, sports, entertainment, technology, and much more

  • Metadata: Our advanced metadata contains regional and category-specific codes, allowing for intelligent grouping

 

NEED ASSISTANCE

If you have any questions or concerns regarding this dataset, please contact Reuters Support Services.

Provided By: Reuters
Fulfillment Method: AWS Data Exchange

Data sets (1)

You will receive access to the following data sets

Revision access rules: All historical revisions | All future revisions
Name: Reuters News Archive: Pharmaceutical (30 Days) - SAMPLE
Type:
Data dictionary: Not included
AWS Region: US East (N. Virginia)

Usage information

By subscribing to this product, you agree that your use of this product is subject to the provider's offer terms, including pricing information and the Data Subscription Agreement. Your use of AWS services remains subject to the AWS Customer Agreement or other agreement with AWS governing your use of such services.

Support information

Support contact email address
Support contact URL
-
Refund policy: No Refunds Offered