Suman Ghosh sghoshcvc

sghoshcvc / GSOC-2017.md
Last active February 16, 2019 08:10
GSOC-2017-End to End text detection and recognition

The project proposed an end-to-end text recognition pipeline. To achieve this, we extend the holistic text recognizer implemented last year and add a deep text detector that filters the image and passes only text bounding boxes to the recognizer, giving a complete deep pipeline for end-to-end text recognition. This also paves the way for a combined model that predicts bounding boxes over an image and provides a transcription for each box. The proposed text detector implements the algorithm from the following article:

Liao, Minghui; Shi, Baoguang; Bai, Xiang; Wang, Xinggang; Liu, Wenyu. TextBoxes: A Fast Text Detector with a Single Deep Neural Network. arXiv:1611.06779. https://arxiv.org/pdf/1611.06779.pdf
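The detect-then-recognize flow described above can be sketched roughly as follows. This is only an illustration of the data flow, not the project's actual API: `crop`, `end_to_end`, the `min_score` threshold, and the toy `toy_detect` / `toy_recognize` stand-ins are all hypothetical names introduced here, and the image is a plain nested list rather than an OpenCV matrix.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]          # (x, y, width, height) in pixels
Image = Sequence[Sequence[int]]          # row-major grid of pixel values
Detector = Callable[[Image], List[Tuple[Box, float]]]   # boxes with scores
Recognizer = Callable[[List[List[int]]], str]           # patch -> text


def crop(image: Image, box: Box) -> List[List[int]]:
    """Cut the rectangular patch described by `box` out of `image`."""
    x, y, w, h = box
    return [list(row[x:x + w]) for row in image[y:y + h]]


def end_to_end(image: Image,
               detect: Detector,
               recognize: Recognizer,
               min_score: float = 0.5) -> List[Tuple[Box, str]]:
    """Run the detector, then transcribe only the boxes it is confident about.

    `min_score` is an assumed confidence threshold, not a value from the project.
    """
    results = []
    for box, score in detect(image):
        if score < min_score:
            continue  # the detector acts as a filter in front of the recognizer
        results.append((box, recognize(crop(image, box))))
    return results


# Toy stand-ins so the sketch runs without any trained model (hypothetical).
def toy_detect(image: Image) -> List[Tuple[Box, float]]:
    return [((0, 0, 2, 1), 0.9), ((1, 1, 2, 1), 0.2)]


def toy_recognize(patch: List[List[int]]) -> str:
    # Pretend each pixel value is a character code.
    return "".join(chr(value) for row in patch for value in row)


if __name__ == "__main__":
    image = [[72, 105, 0],
             [0, 0, 0]]
    print(end_to_end(image, toy_detect, toy_recognize))
```

In a real pipeline the two callables would wrap the deep text detector and the holistic recognizer; keeping them as plain callables means either stage can be swapped without touching the loop.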

Achieved target

  • The existing text recognition module uses Caffe as the backend for the deep neural network forward pass. Another backend based on