Profile PicturePolyglot Technology LLC

English-Spanish CDC COVID-19 Website Corpus v1

$0+
0 ratings

Parallel Corpus of the Centers for Disease Control and Prevention (CDC) website on COVID-19 captured on June 24, 2020. The download contains the corpus in two formats:

  • TMX including non-deduplicated segments in origin order with meta-data on source documents and alignment quality
  • TSV: UTF-8 encoded text file containing tab-separated segments deduplicated in randomized order

License

This translation memory of CDC COVID-19 translations by Polyglot Technology LLC is made available under the Open Data Commons Attribution License: http://opendatacommons.org/licenses/by/1.0. Individual contents of the database are in the public domain.

Source: CDC; Reference to specific commercial products, manufacturers, companies, or trademarks does not constitute its endorsement or recommendation by the U.S. Government, Department of Health and Human Services, or Centers for Disease Control and Prevention; The material is available on the agency website https://www.cdc.gov/ for no charge.

This product is not currently for sale.
$
English source words
538,842
English source words (deduplicated)
248,780
Size
5.26 MB
Copy product URL
$0+

English-Spanish CDC COVID-19 Website Corpus v1

0 ratings