The project in a nutshell
Machine translation is an increasingly vital technology for a global and interconnected world
It works very well in situations where there are millions of translated sentences for training models. For low-resourced language pairs, however, the quality of translation is barely, if at all, usable. Our project is focussed on both collecting and creating low-resource language data, and also pushing forward the latest research in machine learning to be able to make the best use of the little data we have.
The media industry is one of the pillars of a functioning democracy and it is increasingly under a range of threats such as political populism and social media content aggregators. Our project will help the media industry to thrive by allowing them to reach a bigger audience with less effort. Our translation models will allow journalists to understand a broad spread of news from countries of interest, and also to produce content faster in local languages by leveraging the output of machine translation models.