Skip to content

Instantly share code, notes, and snippets.

View BREN1234's full-sized avatar

birendra rai BREN1234

  • DEFTeam Solution
  • India
View GitHub Profile
Term Frequencies:
[('performance', 25),
('employee', 20),
('appraisal', 12),
('employees', 11),
('goals', 10),
('process', 8),
('feedback', 8),
('every', 7),
('management', 7),
@BREN1234
BREN1234 / blog.py
Last active May 11, 2020 13:11
This program finds ten most frequent words in corpus after removing the punctuations and stopwords. I have used the Upraizal.com as my corpus. I have used the https://www.upraizal.com/top-7-elements-ideal-employee-performance-appraisal.
import requests
import string
import nltk
import re
import pandas as pd
from bs4 import BeautifulSoup
from collections import Counter
from nltk.corpus import stopwords
from sklearn.feature_extraction.text import CountVectorizer,TfidfVectorizer