Reuters News Archive: Chinese Document Alignment (30 Days) - SAMPLE

Reuters’ Mandarin Chinese Document Alignment Package provides paired news stories in English and the translated Mandarin Chinese equivalent story that Reuters published. The content itself is primarily related to financial and general news stories. These pre-aligned documents make this dataset ideal for any machine learning translation model.

View purchase options

Overview

Try agent mode

Create proposal

Ask question

ABOUT THIS DATASET

A data extract from our pure text Mandarin Chinese language news archive which includes 30 days of content structured in our standard newsML-G2 XML format.

Content Type: Text (txt)

Language: Mandarine (Zh)

Date Range: September 1st - September 30th of 2019

Format: NewsML G2

Encoding: UTF-8

Catalog Reference: http://www.iptc.org/std/catalog/catalog.IPTC-G2-Standards_3.xml

Basic Metadata:

Description	Field
Unique Story ID	transmitID, guid
Publication Data	firstCreated
Copyright Holder	rightsInfo
Filename	fileName
Content Type	channel, signal qcode="prodId:TXT, itemClass
Language	language
Title	title
Urgency	urgency
Located (Country/Province or State)	located
Categorization, Topic and Region Codes	subject qcode="N2:XXX"
Creator	creator
Slug Line	slugline
Headline	headline
DateLine	dateline
Author	by
Credit Line	creditline
Description	description

ABOUT REUTERS ## button:Learn More

As the world’s largest news agency, Reuters continuously produces substantial multimedia content, enabling you to thoroughly test and build your AI. Our large body of trusted news data continues to grow on a daily basis with 200 transcripted videos added per day, over 1,500 images with intelligent metadata added per day, and 2.2 million translated text articles added every year.

Our news data is professionally produced and fully-licensed, allowing you to reach insights with greater speed and effectiveness:

Rights: Reuters has the proprietary rights to our data corpus and visual assets
Trust & Accuracy: Over 2000 media companies rely on Reuters news to make editorial and business decisions every day. Guided by Reuters Trust principles, our news preserves integrity, independence and freedom from bias
Diversity: Broad coverage of major topics from over 200 global locations and 16 languages, including business, finance, politics, sports, entertainment, technology, and much more
Metadata: Our advanced metadata contains regional and category-specific codes, allowing for intelligent grouping

NEED ASSISTANCE

If you have any questions or concerns regarding this dataset, please contact Reuters Support Services.

button:Contact Reuters Support Services

Details

Sold by

Reuters

Introducing multi-product solutions

You can now purchase comprehensive solutions tailored to use cases and industries.

Learn more

Explore multi-product solutions

Features and programs

Financing for AWS Marketplace purchases

AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

View financing details

Pricing

Reuters News Archive: Chinese Document Alignment (30 Days) - SAMPLE

Info

View purchase options

This product is available free of charge. Free subscriptions have no end date and may be canceled any time.

Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator to estimate your infrastructure costs.

Vendor refund policy

No Refunds Offered

How can we make this page better?

Tell us how we can improve this page, or report an issue with this product.

Legal

Vendor terms and conditions

Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

Content disclaimer

Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

Usage information

Info

Delivery details

AWS Data Exchange (ADX)

AWS Data Exchange is a service that helps AWS easily share and manage data entitlements from other organizations at scale.

Additional details

Data sets (1)

Info

You will receive access to the following data sets.

Data set name	Type	Historical revisions	Future revisions	Sensitive information	Data dictionaries	Data samples
Reuters News Archive: Pharmaceutical (30 Days) - SAMPLE		All historical revisions	All future revisions		Not included	Not included

Similar products

Thomson Reuters ONESOURCE Global Trade

By Thomson Reuters Corporation

Duty Optimization, Risk Reduction, Compliance and Content & Connectivity

View product

Thomson Reuters ONESOURCE Indirect Tax Determination

By Thomson Reuters Corporation

Indirect Tax Calculation, Reporting, Analytics, and Exemption Certificate Management Software for ERP, Financial Systems, POS, and eCommerce

View product

Imagen

By Reuters Imagen

The cloud-first Media Asset Management service of choice

View product

Reuters News Archive: Automotive (1 Year)

By Reuters

Reuters’ Automotive Package provides automobile related articles that Reuters has published. This will include news about automobile manufacturers (cars, light trucks, and motorcycles) as well as related vehicle parts. These articles make this dataset ideal for any natural language processing (NLP) algorithms or ML applications specializing in this space.

View product

Reuters News Archive: Pharmaceutical (1 Year)

By Reuters

Reuters’ Pharmaceutical Package provides pharmaceutical-related articles that Reuters has published. This will include news about pharmaceutical manufacturers (generic and specialty drugs), researchers and developers of new drugs, as well as medical products and procedures. These articles make this dataset ideal for any natural language processing (NLP) algorithms or ML applications specializing in this space.

View product