Suman Ghosh sghoshcvc

## GSOC-2017.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              1 star
            
          
                sghoshcvc
                / GSOC-2017.md
            
            
              Last active
              February 16, 2019 08:10
            
              
                GSOC-2017-End to End text detection and recognition
              
          
    GSOC-2017-End to End text detection and recognition

The proposal of the project was to implement an end-to-end text recognition pipeline. In order to achieve that we extend the existing holistic text recognizer implemented last year and implement a deep text detector, which can filter the image and pass only text bounding boxes to the recognizer. Thus a complete deep pipeline for end-to-end text recognition can be achieved. This will also facilitate to implement an end to end pipeline of combined model that can predict bounding boxes over an image and also provide transcriptions for each bounding box. The proposed text detector implements the algorithm of following article
Liao, Minghui; Shi, Baoguang; Bai, Xiang; Wang, Xinggang; Liu, WenyuTextBoxes: A Fast Text Detector with a Single Deep Neural Network in Arxiv https://arxiv.org/pdf/1611.06779.pdf
Achieved target


The existing text recognition module uses caffe as a backend for deep neural network forward pass calculation. Another backend based on