- Student's Name : Jigyasa Gupta
- Mentors : Guangchuang Yu , Augustin Luna
clusterProfiler provides an impressive universal interface for gene functional annotation and can access data to generate enrichment results and provide effective data interpretation.
Pathway Commons is an aggregated database of molecular interaction pathways , collected from approximately 20 databases. At present, there exists no direct way to analyze data from Pathway Commons leveraging all the functionality of clusterProfiler. The data will be fetched just by specifying the URL of the database with the help of R scripts. It is proposed to enable support for Pathway Commons database in clusterProfiler to yield advanced and more accurate enrichment results and visualizations of the data.
Single sample gene set enrichment analysis is an effective method of data interpretation that uses permutations of gene sets to calculate enrichment scores. It has use cases like detecting outliers or tumors in the medical context. This is a variation of GSEA, however, clusterprofiler currently supports only GSEA. To integrate the functionality of ssGSEA, we can use GSVA package with GenePatterns to analyze gene sets in various permutations and get more indicative enrichment scores using clusterProfiler.
https://github.com/YuLab-SMU/clusterProfiler
Supporting Pathway Commons and ssGSEA in clusterProfiler
https://summerofcode.withgoogle.com/programs/2023/projects/GYJ9YvJD
- Creating support for Pathway Commons in clusterProfiler : PR
https://github.com/YuLab-SMU/clusterProfiler/blob/devel/R/pathwayCommons.R - Supporting ssGSEA in clusterProfiler : PR
https://github.com/Jigyasa-G/clusterProfiler-JG/blob/master/R/ssgsea.R
Github Links: https://github.com/Jigyasa-G/clusterProfiler-JG
- Getting the GMT file and Source :
https://medium.com/@jigyasatata/gsoc23-nrnb-week-1-experience-6cafe95dde8f - Getting the data get_pc_data :
https://medium.com/@jigyasatata/gsoc23-nrnb-week-2-experience-cfcc8e4eefaa - Data Extraction :
https://medium.com/@jigyasatata/gsoc23-nrnb-week-3-data-extraction-eb60c465b26d - Data Preparation :
https://medium.com/@jigyasatata/gsoc23-nrnb-week-4-data-extraction-and-preparation-3113156a9f0a - Insights with Over-Representation Analysis :
https://medium.com/@jigyasatata/gsoc23-nrnb-week-5-data-extraction-13f3284e228c - Exploring Pathways through Gene Set Enrichment Analysis : gsePC
https://medium.com/@jigyasatata/gsoc23-nrnb-week-6-gene-set-enrichment-analysis-98ca8eb09dcd - ssGSEA implementation using GSVA :
https://medium.com/@jigyasatata/gsoc23-nrnb-ssgsea-in-clusterprofiler-1e199240d542