Last active August 25, 2019 08:57
Final Report for the project Tool for Extracting Quantitative Text Profiles for GSOC 2019

Tool for Extracting Quantitative Text Profiles GSOC 2019

This is the final report on the work done for the project Tool for Extracting Quantitative Text Profiles as part of Google Summer of Code 2019 (GSoC 2019).

Project Description

In this project, I developed a user-friendly desktop GUI that extracts various linguistic features from texts using existing NLP packages. The application was built with Electron, React, the Material-UI component framework, and MongoDB as the database.

The application's main target groups are students and researchers in computational linguistics who lack programming skills and need an easy-to-use tool for their analyses. Within the application, users can import texts, select the indices they want to calculate, and export the generated results. The application is also modular: users can add custom scripts to be executed on the selected texts.

For more information about the project, the technologies used, and instructions on how to install and operate the tool, visit the project wiki.


All of my work can be found at the project's repository, along with the code of the tool. My commits are here.

Project Progress

Since this project was developed under the GSoC 2019 program, I kept weekly reports on my progress, which can be found on the relevant wiki page.

Future Work

The current version of the tool should be considered an alpha release. It is functional, but it contains many bugs and has many areas for improvement. Planned and suggested future work can be found here.
