Many SMT systems are trained on the proceedings of the European Parliament (the Europarl corpus), a huge collection of parallel multilingual documents: the same content, professionally translated into each language.
This is a large factor in why non-European languages typically don't fare that well in statistical machine translation systems: the corpora (that's the plural of "corpus") simply aren't as large for other languages.
Subject breadth is another problem. Early information retrieval systems were trained (and many still are) on the Wall Street Journal as the corpus, which means they work great for searches about big business, but not so great for finding apple pie recipes, as the WSJ doesn't talk much about that ;)
There are some research projects (in rather early stages) developing SMT techniques for translating into languages without big parallel corpora, essentially by bootstrapping such corpora with the help of active learning. This could be of particular importance in keeping smaller languages from disappearing, since otherwise fewer and fewer works will be available in those languages (yes, I'm aware that many people consider language death a good thing).
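For the curious, the bootstrapping loop looks roughly like this. This is just a minimal sketch, not the method of any particular project: the "model" (a source-side vocabulary), the out-of-vocabulary uncertainty measure, and all the function names are made up for illustration.

```python
import random

def train_smt_model(parallel_corpus):
    """Hypothetical stand-in for training an SMT model on (source, target) pairs.

    Here the "model" is just the set of source words it has seen."""
    return {w for src, tgt in parallel_corpus for w in src.split()}

def uncertainty(model, sentence):
    """Crude proxy for model uncertainty: fraction of out-of-vocabulary words."""
    words = sentence.split()
    oov = sum(1 for w in words if w not in model)
    return oov / max(len(words), 1)

def human_translate(sentence):
    """Placeholder for the expensive human-in-the-loop translation step."""
    return "<translation of: %s>" % sentence

seed_corpus = [("hello world", "hallo welt")]   # tiny seed of sentence pairs
unlabeled = ["hello there", "big business news", "apple pie recipe"]

BUDGET_PER_ROUND = 1   # sentences a human translator handles per round

for round_no in range(3):
    model = train_smt_model(seed_corpus)
    # Active learning: ask the human about the sentences the current
    # model is least sure about, so each translation helps the most.
    unlabeled.sort(key=lambda s: uncertainty(model, s), reverse=True)
    to_translate = unlabeled[:BUDGET_PER_ROUND]
    unlabeled = unlabeled[BUDGET_PER_ROUND:]
    for src in to_translate:
        seed_corpus.append((src, human_translate(src)))
    print("round %d: corpus size %d" % (round_no, len(seed_corpus)))
```

The point of the active-learning step is economics: human translation is the bottleneck, so you spend that budget on the sentences that teach the model the most, instead of translating at random.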