Contributor: Aditya Venkatesh
Mentors: Dr. Sanju Tiwari, Debarghya Datta, Dr. Ronak Panchal
Description: This project took place over the summer of 2025 as part of Google Summer of Code under DBpedia. The aim of this project was to evaluate and enhance various stages of the existing information extraction pipeline from Hindi text. The goals of this project were multi-fold:
- Streamline the existing pipeline and make it easy to run
- Evaluate the performance of the existing pipeline
- Experiment and implement new triplet extraction methods using Small Language Models (SLM)