European Commission’s Translation Memory ready to DOWNLOAD (updated 01.12.2022)

 


Since November 2007, the European Commission’s Directorate-General for Translation (DGT) has made its multilingual Translation Memory for the Acquis Communautaire, DGT-TM, publicly accessible in order to support the European Commission’s efforts in promoting multilingualism, language diversity, and the re-use of Commission information.

This resource contains translations of the entire body of European legislation, known as the Acquis Communautaire, into the 24 official languages of the European Union. These translations are in the form of parallel texts, also known as bi-texts, which are texts and their corresponding translations. Translation memories, which consist of small segments of text and their translations, can be used to support translators by ensuring that pieces of text do not need to be translated multiple times.

Purposes of Translation Memories and Parallel Texts

Translation memories and parallel texts can be used for various purposes, such as:

  • Training automatic systems for statistical machine translation
  • Producing monolingual or multilingual lexical and semantic resources such as dictionaries and ontologies
  • Training and testing multilingual information extraction software
  • Checking translation consistency automatically
  • Testing and benchmarking alignment software (for sentences, words, etc.)
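As an illustration of the consistency-checking use case, here is a minimal Python sketch. It assumes the translation memory has already been reduced to a list of (source, target) segment pairs; the function name `find_inconsistencies` and the sample data are hypothetical, not part of any DGT tool.

```python
from collections import defaultdict


def find_inconsistencies(pairs):
    """Return source segments that have more than one distinct translation."""
    translations = defaultdict(set)
    for source, target in pairs:
        translations[source.strip()].add(target.strip())
    # Keep only the sources whose set of translations has several members.
    return {src: sorted(tgts) for src, tgts in translations.items() if len(tgts) > 1}


# Hypothetical English-to-German sample segments.
sample = [
    ("Article 1", "Artikel 1"),
    ("Article 1", "Artikel 1"),
    ("This Regulation shall enter into force.", "Diese Verordnung tritt in Kraft."),
    ("Article 1", "Art. 1"),
]

inconsistent = find_inconsistencies(sample)
print(inconsistent)
```

A real check would probably also normalise case, punctuation, and whitespace before comparing, since exact string matching flags purely cosmetic differences as well.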

The DGT-TM: The Largest Parallel Corpus Available

The DGT-TM is the largest parallel corpus available, with a wide range of rare language pairs. It is released annually. The alignments between text segments and their translations were verified manually for the 2007 release; releases from 2011 onward were aligned automatically. The data is available for download and can be used to produce bilingual aligned corpora for any of the 276 language pairs (24 × 23 / 2) or 552 language pair directions (24 × 23).
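The pair counts follow directly from the 24 official languages. The short Python sketch below reproduces the arithmetic; the two-letter codes are the usual ISO 639-1 codes for the official EU languages and are used here only for illustration.

```python
from itertools import combinations, permutations

# ISO 639-1 codes of the 24 official EU languages.
LANGUAGES = [
    "bg", "cs", "da", "de", "el", "en", "es", "et", "fi", "fr", "ga", "hr",
    "hu", "it", "lt", "lv", "mt", "nl", "pl", "pt", "ro", "sk", "sl", "sv",
]

# Unordered pairs: German-French and French-German count once.
pairs = list(combinations(LANGUAGES, 2))
# Ordered pairs: each translation direction counts separately.
directions = list(permutations(LANGUAGES, 2))

print(len(LANGUAGES), len(pairs), len(directions))  # prints: 24 276 552
```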

Conditions for Use

The DGT-TM is available for free and can be used for non-commercial purposes, provided that the conditions of use are followed.

About this Translation Memory

It is a very large multilingual translation memory from the European Commission’s Directorate-General for Translation. The TM contains the majority of the documents that make up the Acquis Communautaire, which is the entire body of EU legislation (i.e. regulations, treaties and directives adopted by the EU), as well as documents that are not part of the Acquis.

The DGT Translation Memory is currently available in 24 languages and has accumulated 2,032,636,854 words since 2007. That gives 276 language pairs and 552 language pair directions.

Although the DGT-TM database is the exclusive property of the European Commission, it can be downloaded for free under specific Conditions for Use. The translation segments can be extracted in the Translation Memory eXchange (TMX) format. The TM is distributed as a collection of 75 zip files, none larger than 100 MB, containing the tmx files.
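To give an idea of what a tmx file looks like and how a language pair can be read from it, here is a minimal Python sketch using only the standard library. The sample document is invented for illustration; it follows the general TMX layout (translation units `tu`, each holding one `tuv` per language with a `seg` inside). The helper names are hypothetical, and the sketch assumes the language may be given either as `xml:lang` or as a plain `lang` attribute, since the exact attribute used in the DGT files is not stated here.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under this qualified name.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Invented sample in the general TMX layout (not taken from the DGT-TM).
SAMPLE_TMX = """<tmx version="1.4">
  <header creationtool="example" creationtoolversion="1" segtype="sentence"
          o-tmf="none" adminlang="en" srclang="en" datatype="plaintext"/>
  <body>
    <tu>
      <tuv xml:lang="EN-GB"><seg>This Regulation shall enter into force.</seg></tuv>
      <tuv xml:lang="DE-DE"><seg>Diese Verordnung tritt in Kraft.</seg></tuv>
      <tuv xml:lang="FR-FR"><seg>Le présent règlement entre en vigueur.</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="EN-GB"><seg>Article 1</seg></tuv>
      <tuv xml:lang="FR-FR"><seg>Article premier</seg></tuv>
    </tu>
  </body>
</tmx>"""


def tuv_language(tuv):
    """Return the lower-case primary language code of a <tuv> element."""
    code = tuv.get(XML_LANG) or tuv.get("lang") or ""
    return code.split("-")[0].lower()


def extract_pairs(root, source, target):
    """Collect (source, target) segments from units that contain both languages."""
    pairs = []
    for tu in root.iter("tu"):
        segments = {}
        for tuv in tu.iter("tuv"):
            seg = tuv.find("seg")
            if seg is not None:
                segments[tuv_language(tuv)] = "".join(seg.itertext()).strip()
        if source in segments and target in segments:
            pairs.append((segments[source], segments[target]))
    return pairs


root = ET.fromstring(SAMPLE_TMX)
de_fr = extract_pairs(root, "de", "fr")
print(de_fr)
```

Only the first unit carries both German and French, so the second unit is skipped. This is how a single multilingual memory can yield a bilingual corpus for any chosen pair.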

INSTRUCTIONS ON HOW TO USE IT:

1  –  Download the TMs: we divided the content into 5 parts, which you can download here:

Click here to DOWNLOAD the memory

2  –  Unzip the files

3 – Download the translation memory extractor HERE

4 –  Open the translation memory extractor by double clicking on the TMXtract.jar file

  • Note: if the extractor does not open when you double-click it, it is because you have a newer Java version. Please refer to this link to solve the problem: https://bit.ly/2PAN0iV

5  –  Select Input files (Volume_1.zip, etc.; multiple selection is possible)

6 –  Specify Output file (the result is always 1 file)

7 – Choose Source and Target language

8 – Click on Start
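For readers who prefer scripting, the same workflow (several zip volumes in, one bilingual file out) can be approximated in Python. This is a rough sketch under stated assumptions, not a description of how TMXtract itself works: the function names are hypothetical, the output is a tab-separated file rather than a tmx file, and the language code is read from either an `xml:lang` or a `lang` attribute on each `tuv` element.

```python
import csv
import xml.etree.ElementTree as ET
import zipfile

XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def language_of(tuv):
    """Return the lower-case primary language code of a <tuv> element."""
    code = tuv.get(XML_LANG) or tuv.get("lang") or ""
    return code.split("-")[0].lower()


def pairs_in_tmx(fileobj, source, target):
    """Stream one tmx file and yield (source, target) segment pairs."""
    for _event, element in ET.iterparse(fileobj):  # fires when an element ends
        if element.tag != "tu":
            continue
        segments = {}
        for tuv in element.iter("tuv"):
            seg = tuv.find("seg")
            if seg is not None:
                segments[language_of(tuv)] = "".join(seg.itertext()).strip()
        if source in segments and target in segments:
            yield segments[source], segments[target]
        element.clear()  # release the finished unit to keep memory use low


def extract_volumes(zip_paths, source, target, output_path):
    """Write every matching pair from all volumes to one TSV file; return the count."""
    count = 0
    with open(output_path, "w", encoding="utf-8", newline="") as out:
        writer = csv.writer(out, delimiter="\t")
        for zip_path in zip_paths:
            with zipfile.ZipFile(zip_path) as volume:
                for name in volume.namelist():
                    if not name.lower().endswith(".tmx"):
                        continue
                    with volume.open(name) as tmx_file:
                        for pair in pairs_in_tmx(tmx_file, source, target):
                            writer.writerow(pair)
                            count += 1
    return count
```

A call such as `extract_volumes(["Volume_1.zip"], "de", "fr", "de-fr.tsv")` would then mirror steps 5 to 8: input volumes, one output file, a source and a target language.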

 

Some language combination examples that you can extract

German to Dutch | Dutch to German

German to French | French to German

German to Italian | Italian to German

German to Polish | Polish to German

German to Spanish | Spanish to German

German to Portuguese | Portuguese to German

From/To French Translation Service Memories to DOWNLOAD

German to French | French to German

French to Spanish | Spanish to French

French to Portuguese | Portuguese to French

French to Italian | Italian to French

French to Polish | Polish to French

French to Dutch | Dutch to French

From/To Spanish Translation Service Memories to DOWNLOAD

German to Spanish | Spanish to German

French to Spanish | Spanish to French

Spanish to Portuguese | Portuguese to Spanish

Spanish to Italian | Italian to Spanish

From/To Portuguese Translation Service Memories to DOWNLOAD

German to Portuguese | Portuguese to German

French to Portuguese | Portuguese to French

Spanish to Portuguese | Portuguese to Spanish

Italian to Portuguese | Portuguese to Italian