The proposal of the project was to implement an end-to-end text recognition pipeline. In order to achieve that we extend the existing holistic text recognizer implemented last year and implement a deep text detector, which can filter the image and pass only text bounding boxes to the recognizer. Thus a complete deep pipeline for end-to-end text recognition can be achieved. This will also facilitate to implement an end to end pipeline of combined model that can predict bounding boxes over an image and also provide transcriptions for each bounding box. The proposed text detector implements the algorithm of following article
Liao, Minghui; Shi, Baoguang; Bai, Xiang; Wang, Xinggang; Liu, WenyuTextBoxes: A Fast Text Detector with a Single Deep Neural Network in Arxiv https://arxiv.org/pdf/1611.06779.pdf
- The existing text recognition module uses caffe as a backend for deep neural network forward pass calculation. Another backend based on