The EC leads on PSI
ePSIplus, 18th January 2008
European Commission makes computer-assisted translation easier and more accessible.
The European Commission announced in Press Release IP/08/60 that:
"The European Commission is going a step further in its efforts to foster multilingualism as a key part of European unity in diversity. The Commission's collection of about 1 million sentences and their high quality translations in 22 of the 23 official EU languages — including those of the new Member States — is the biggest ever collection in so many languages and is now freely available. This kind of data is highly sought after by developers of machine translation systems in which automatic translation software "learns" from manually translated texts how words and phrases are correctly and contextually translated.
The data can also help the development of other linguistic software tools such as grammar and spell checkers, online dictionaries and multilingual text classification systems."
The release goes on to state:
“This release of language data is a good example of the Commission's open policy of re-use of its information resources and follows the opening of the EU's documentary and terminological databases Eur-Lex and IATE.”
The DGT Multilingual Translation Memory of the Acquis Communautaire: DGT-TM portal states:
As of November 2007, the European Commission's Directorate General for Translation (DGT) made publicly accessible its multilingual Translation Memory for the Acquis Communautaire - a collection of parallel texts (texts and their translation, also referred to as bi-texts) in 22 languages. On this page, you will find a summary of this unique resource and instructions on where to download it and how to produce bilingual aligned corpora for any of the 231 language pair combinations (462 language pair directions).