@saurabhshri
Last active August 23, 2017 19:07
This gist contains a detailed timeline and milestones, in the form of a checklist, for my GSoC 2017 project, CCAligner, with CCExtractor Development. https://saurabhshri.github.io/gsoc/

Legend:

  • Work To Be Done.
  • Work In Progress.
  • Work Completed.

1. Community Bonding Period (6th May - 27th May) :

2. 28th May - 31st May :

  • Design basic workflow.
  • Create basic design scheme and testing plan.
  • Begin coding officially.
  • Post the update on blog.
    (Covered in the previous blog post)

3. Week 1 (1st June - 7th June) :

4. Week 2 (8th June - 15th June) :

The workload has been kept light for the first two weeks so that I can practise other crucial skills, such as adding test cases and understanding the mentors' code review workflow.

5. Week 3 (16th June - 23rd June) :

6. Week 4 (24th June - 29th June) :

PHASE 1 DELIVERABLE :

  • Tool for subtitle processing and basic testing architecture.
  • Sample repository.
  • Algorithmic and probability-based word-audio matching.
  • VAD implementation.
  • Post the update on blog.

7. Week 5 & 6 (30th June - 12th July) :

  • Test and experiment with different ASR systems.
  • Start full-blown ASR work.
  • Begin word detection.
  • Create and fine-tune acoustic models.
  • Continue with the remaining work on word detection.
  • Create and fine-tune language models and dictionaries based on subtitles.
  • Create FSGs (finite state grammars) to restrict ASR recognition to specific words.
  • Post the update on blog.
    (https://saurabhshri.github.io/2017/07/gsoc/google-summer-of-code-week-5-6-what-d-you-say)
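
The FSG item above can be illustrated with a small sketch. It assumes the grammar is written in JSGF, a textual grammar format that toolkits such as CMU Sphinx can convert into a finite state grammar. The function name `subtitle_to_jsgf` and the rule name `<line>` are hypothetical and are not taken from CCAligner.

```python
import re


def subtitle_to_jsgf(text, grammar_name="subtitle"):
    """Build a JSGF grammar that accepts only the words of one subtitle line, in order.

    Hypothetical helper for illustration; not part of CCAligner.
    """
    # Lowercase the line and keep only runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        raise ValueError("subtitle line contains no recognisable words")
    rule = " ".join(words)
    return (
        "#JSGF V1.0;\n"
        f"grammar {grammar_name};\n"
        f"public <line> = {rule} ;\n"
    )


if __name__ == "__main__":
    print(subtitle_to_jsgf("Hello, world!"))
```

Restricting the recogniser to the words that are known to be in the subtitle shrinks the search space, which is the motivation behind the FSG item in the list.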

8. Week 7 (13th July - 19th July) :

  • Write code for intelligently assigning timestamps on the basis of frames and probability.
  • Try different settings and combinations to achieve maximum accuracy.
  • Create fuzzy-search logic that looks for words approximately ahead of and behind the set domain.
  • Create custom dictionaries on the fly.
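
The fuzzy-search item above could look roughly like the following minimal sketch. It assumes the recogniser output is a plain list of words and uses Python's standard `difflib.SequenceMatcher` for similarity; the function name `find_near`, the window size, and the 0.8 threshold are illustrative assumptions, not CCAligner's actual logic.

```python
from difflib import SequenceMatcher


def find_near(recognised, target, expected, window=3, min_ratio=0.8):
    """Return the index of the best fuzzy match for `target` within
    `window` positions of `expected`, or None if nothing is similar enough.

    Hypothetical helper for illustration only.
    """
    lo = max(0, expected - window)
    hi = min(len(recognised), expected + window + 1)
    best = None  # (similarity, negated distance from expected, index)
    for i in range(lo, hi):
        score = SequenceMatcher(None, recognised[i].lower(), target.lower()).ratio()
        if score < min_ratio:
            continue
        # Higher similarity wins; ties go to the position closest to `expected`.
        candidate = (score, -abs(i - expected), i)
        if best is None or candidate > best:
            best = candidate
    return None if best is None else best[2]


if __name__ == "__main__":
    words = ["the", "quick", "brwn", "fox", "jumps"]
    print(find_near(words, "brown", expected=1))
```

Searching only a few positions ahead of and behind the expected index keeps a misrecognised word from being matched to a similar word elsewhere in the audio.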

9. Week 8 (20th July - 26th July) :

PHASE 2 DELIVERABLE :

10. Week 9 & 10 (1st August - 14th August) :

  • Begin phoneme detection.
  • Create and fine-tune phonetic language models and dictionaries based on subtitles.
  • Write code for intelligently assigning timestamps on the basis of frames and probability.
  • Post the update on blog.
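
As a rough sketch of assigning timestamps from frames and probability, the example below converts frame indices to milliseconds and drops low-confidence words. The rate of 100 frames per second, the confidence threshold, and the `(word, start_frame, end_frame, confidence)` tuple layout are all assumptions made for illustration; the real recogniser output may differ.

```python
def frames_to_ms(frame, frame_rate=100):
    """Convert a frame index to milliseconds (frame_rate is an assumed default)."""
    return round(frame * 1000 / frame_rate)


def assign_timestamps(segments, min_confidence=0.5, frame_rate=100):
    """Turn (word, start_frame, end_frame, confidence) tuples into timed words.

    Words whose confidence falls below `min_confidence` are skipped.
    Hypothetical helper for illustration only.
    """
    timed = []
    for word, start, end, confidence in segments:
        if confidence < min_confidence:
            continue
        timed.append({
            "word": word,
            "start_ms": frames_to_ms(start, frame_rate),
            "end_ms": frames_to_ms(end, frame_rate),
        })
    return timed


if __name__ == "__main__":
    print(assign_timestamps([("hello", 12, 45, 0.9), ("uh", 46, 50, 0.2)]))
```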

11. Week 11 & 12 (15th August - 28th August) :

  • Work on various output formats, such as XML, JSON, and plain text, for storing and dumping data.
  • Clean up the code; fix bugs and memory leaks.
  • Logging and error handling.
  • Work on missing documentation and missed milestones.
  • Prepare submission reports.
  • Post the update on blog.
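
To make the output-format item concrete, here is a minimal sketch that dumps word timings as JSON and XML using only Python's standard library. The element and key names (`subtitle`, `word`, `start_ms`, `end_ms`) are hypothetical and do not describe CCAligner's actual output schema.

```python
import json
import xml.etree.ElementTree as ET


def to_json(words):
    """Serialise a list of timed-word dicts to a JSON string."""
    return json.dumps({"words": words}, indent=2)


def to_xml(words):
    """Serialise the same list to XML, one <word> element per entry."""
    root = ET.Element("subtitle")
    for w in words:
        element = ET.SubElement(
            root, "word", start=str(w["start_ms"]), end=str(w["end_ms"])
        )
        element.text = w["word"]
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    sample = [{"word": "hello", "start_ms": 120, "end_ms": 450}]
    print(to_json(sample))
    print(to_xml(sample))
```

Keeping the timed words in one in-memory structure and writing a small serialiser per format makes it straightforward to add further formats later.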

PHASE 3 DELIVERABLE :

  • Tool capable of word-by-word audio-subtitle synchronisation.
  • API for word-by-word audio-subtitle synchronisation.
  • Complete project documentation.
  • Finalisation and completion of project.
  • Project submission.
  • Post the update on blog.