Recognizing Text in Images with Watson's Visual Recognition API

The Watson Visual Recognition service is now generally available and a lot of new functionality has been added. The service combines the old beta capabilities plus the Alchemy functionality and the previous Visual Insights service.

With the new service you can receive classifications of images, faces can be detected and even text is recognized. All of this can be done by simply invoking APIs without requiring deep learning skills.

With the new service you can now define and train multiple custom visual classifiers at once. Additionally it’s not longer necessary (but still possible) to upload a set of negative images for the training.

The screenshot shows a sample for the new functionality (beta) to recognize text, in this case the Bluemix slogan.

Try the API for your own images online.

There are REST APIs and language bindings for Node, Java and Python. Here are the curl commands for testing the API.

curl -X POST -F "images_file=@jax-16-7.jpg" "https://gateway-a.watsonplatform.net/visual-recognition/api/v3/classify?api_key={api-key}&version=2016-05-20"
curl -X POST -F "images_file=@jax-16-7.jpg" "https://gateway-a.watsonplatform.net/visual-recognition/api/v3/classify?api_key={api-key}&version=2016-05-20"
curl -X POST -F "images_file=@jax-16-7.jpg" "https://gateway-a.watsonplatform.net/visual-recognition/api/v3/detect_faces?api_key={api-key}&version=2016-05-20"