GSoC Project Proposal: Improve the OCR Subsystem
Email: saurabhshah.0410@gmail.com
The goal of my project was to make hard subtitle extraction user-friendly by making the subsystem independent of arbitrary user input parameters such as sub_color, conf_thresh, luminance, and whiteness. This would also extend CCExtractor's usage to extracting burned-in subtitles from video files containing multi-color captions. The idea was to implement Neumann and Matas' text detection algorithm, which would meet the above objectives while keeping time complexity and memory requirements reasonable.