European Commission’s Translation Memory ready to DOWNLOAD (updated 01.12.2022)
About this Translation Memory
It is a huge multilingual translation memory (TM) from the European Commission’s Directorate-General for Translation (DGT). The TM consists mostly of the documents that make up the Acquis Communautaire, which is the entire body of EU legislation, i.e. the regulations, treaties and directives adopted by the EU. It also includes documents that are not part of the Acquis.
The DGT Translation Memory is currently available in 24 languages and has accumulated 2.032.636.854 words since 2007. That makes 276 language pairs and 552 language pair directions!
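The pair and direction counts follow directly from the number of languages. A quick check, as a minimal sketch that only assumes every language can be paired with every other one:

```python
from math import comb

languages = 24

# Unordered pairs: choose 2 languages out of 24.
pairs = comb(languages, 2)

# Each pair can be used in two directions (A -> B and B -> A).
directions = pairs * 2

print(pairs, directions)  # 276 552
```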
Although the DGT-TM database is the exclusive property of the European Commission, it can be downloaded for free under specific Conditions for Use. The translation segments can be extracted in the Translation Memory eXchange (TMX) format. The TM is available as a collection of 75 zip files, no bigger than 100 MB each, containing the tmx files.
INSTRUCTIONS ON HOW TO USE IT:
1 – Download the TMs: we divided the content into 5 parts, which you can download here:
2 – Unzip the files
3 – Download the translation memory extractor HERE
4 – Open the translation memory extractor by double-clicking on the TMXtract.jar file
- Note that if the extractor does not open when you double-click on it, it is because you have a newer Java version. Please refer to this link to solve the problem: https://bit.ly/2PAN0iV
5 – Select Input files (Volume_1.zip, etc.; multiple selection is possible)
6 – Specify Output file (the result is always 1 file)
7 – Choose Source and Target language
8 – Click on Start
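If you prefer a script to the graphical extractor, the same idea can be sketched in Python. This is not how TMXtract works internally; it is only a rough equivalent built on assumptions: that each zip contains .tmx files, that every translation unit is a `tu` element holding one `tuv` per language with a `seg` child, and that the language is stored in a `lang` or `xml:lang` attribute such as `EN-GB`. The function names and the inline sample are hypothetical.

```python
import zipfile
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under this qualified name.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def extract_pairs(tmx_bytes, source, target):
    """Yield (source_text, target_text) for units that contain both languages."""
    root = ET.fromstring(tmx_bytes)
    for tu in root.iter("tu"):
        segments = {}
        for tuv in tu.iter("tuv"):
            # Assumption: codes look like "EN-GB"; keep only the first two letters.
            lang = tuv.get(XML_LANG) or tuv.get("lang") or ""
            seg = tuv.find("seg")
            if seg is not None:
                segments[lang.upper()[:2]] = "".join(seg.itertext())
        if source in segments and target in segments:
            yield segments[source], segments[target]


def extract_from_zips(zip_paths, source, target):
    """Run extract_pairs over every .tmx file found in the given zip archives."""
    for path in zip_paths:
        with zipfile.ZipFile(path) as archive:
            for name in archive.namelist():
                if name.lower().endswith(".tmx"):
                    yield from extract_pairs(archive.read(name), source, target)


# Tiny hand-written sample, used here instead of a real download.
SAMPLE = b"""<tmx version="1.4"><header/><body>
<tu><tuv lang="EN-GB"><seg>Hello</seg></tuv><tuv lang="FR-FR"><seg>Bonjour</seg></tuv></tu>
<tu><tuv lang="EN-GB"><seg>Only English</seg></tuv></tu>
</body></tmx>"""

print(list(extract_pairs(SAMPLE, "EN", "FR")))  # [('Hello', 'Bonjour')]
```

With the downloaded archives in place, a call such as `extract_from_zips(["Volume_1.zip"], "EN", "FR")` would yield the English-French segment pairs one by one. Each file is parsed fully in memory, which is simple but may be slow on large volumes.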
Some language combination examples that you can extract