Comment fun at Slashdot:
compuglot writes “Google gave journalists a glimpse of its next generation machine translation system at a May 19th Google Factory Tour. “Google Blogoscoped” offers an excellent overview of the presentation. The system has been trained using the United Nations Documents as a corpus. This corpus is some 20 billion words worth of content. It uses existing source and target language translations (done by human translators at the U.N.) to find patterns it then uses to build rules for translating between those languages. Apparently it was successful where the current version had failed in translating certain phrases. If anyone were capable of making a serious go of MT, that would have to be Google.”
So, basically, Google is launching a new translator which has learned, itself, how to translate (rather than being taught), using UN Documents, got it? So, the comments:
So when you go to translate.google.com and translate something, the result will be legal-eze in the resulting languages.
Spanish: “Que pasa?”<br/> English translation: “With regards to the current situation, how is the day progressing?”
No, it actually translates “que pasa” into “We hereby condemn these actions taken by the Israeli government.”