This session will offer several quick and dynamic presentations covering topics and ideas too interesting to ignore:
Bitextor: Get the Translation Data You Need from the Web – Miguel Esplá-Gomis (Universitat d’Alacant)
In this session we will present a free, open source tool to harvest parallel data from the internet, which can be used as a translation memory or to train machine translation systems. This tool is being developed as part of the CEF Project. We will briefly cover how the tool works, and how it may be adapted to different use scenarios.
Takeaways: Attendees will get an awareness about the usefulness of web data for the translation industry and knowledge about an industry-mature, freely-available tool to crawl parallel data from the internet.