Customizing the Watson Language Translation Service

With the Watson Language Translation service in Bluemix you can translate from and to English, French, Portuguese, Arabic, Spanish and Egyptian. To optimize the translations you can customize the service for your own domains. Below is a simple example.

Let’s say you want to translate news about your products. You can use the out of the box translation capabilities based on the pre-defined news corpus for most text. However for your product names you often need to customize the service to teach it with the right translations or don’t translate product names at all. For example according to the Bluemix catalog the Watson service names “Concept Insights” and “Visual Recognition” are not translated in Spanish.

So when you use the out of the box translation of the Watson service or other non customized translation services you’d get something like this:

translation2

To improve this you can create a simple glossary file and train the Watson service.

<?xml version="1.0" encoding="UTF-8"?>
<tmx version="1.4">
  <header creationtool="" creationtoolversion=""
	segtype="sentence" o-tmf="" adminlang="EN"
	srclang="en" datatype="rtf" o-encoding="UTF-8" />
  <body>
    <tu>
      <tuv xml:lang="en">
        <seg>Concept Insights</seg>
      </tuv>
      <tuv xml:lang="es">
        <seg>Concept Insights</seg>
      </tuv>
    </tu>
	<tu>
      <tuv xml:lang="en">
        <seg>Visual Recognition</seg>
      </tuv>
      <tuv xml:lang="es">
        <seg>Visual Recognition</seg>
      </tuv>
    </tu>
  </body>
</tmx>

As a result the Watson service provides now a better translation (or in this case doesn’t translate the names at all).

translation1

To find out more read the documentation and the article “Customization and languages expand in Watson Language Translation” or watch the video “Getting started: IBM Watson Language Translation customization“.

The Watson translation service is also used by the Globalization Pipeline service so that you can globalize your Bluemix Applications without recompiling them.