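import re
from collections import Counter

from nltk.corpus import stopwords  # needs the NLTK "stopwords" corpus: nltk.download('stopwords')

# Count the most frequent words in URL paths; `df` is assumed to be a pandas DataFrame with a `path` column.
cnt = Counter()
english_stopwords = set(stopwords.words('english'))
for path in df.path:
    words = re.split("[-/]", path)
    for word in words:
        # Skip empty fragments, stopwords, and purely numeric tokens.
        if len(word) > 0 and word not in english_stopwords and not word.isdigit():
            cnt[word] += 1
cnt.most_common(25)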
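
A minimal, self-contained sketch of the same counting step, runnable without the original data. The `paths` list and the tiny `sample_stopwords` set are made up for illustration (assumptions, not from the original); they stand in for `df.path` and the NLTK English stopword list.

```python
import re
from collections import Counter

# Hypothetical sample paths; the real `df.path` column is not shown in the original.
paths = ["/the-quick-fox/2019", "/quick-tips/for-python", "/python/2020-review"]
# Stand-in stopword set (assumption) so the sketch runs without downloading NLTK data.
sample_stopwords = {"the", "for"}

cnt = Counter()
for path in paths:
    # Split on "-" and "/"; a leading "/" produces an empty first fragment.
    for word in re.split("[-/]", path):
        # Keep only non-empty, non-stopword, non-numeric tokens.
        if word and word not in sample_stopwords and not word.isdigit():
            cnt[word] += 1

print(cnt.most_common(2))
```

With this sample, "quick" and "python" each appear twice, while "the", "for", and the year fragments are filtered out.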