Skip to content

Instantly share code, notes, and snippets.

View borgesalkan's full-sized avatar

William Borges borgesalkan

  • Rippling
  • Palo Alto, CA
View GitHub Profile
@borgesalkan
borgesalkan / extract_food_items.py
Created October 2, 2020 04:43
Extract Food Items
from textblob import TextBlob
from nltk.stem.porter import PorterStemmer
def _extract_ngrams(data: str, num: int):
n_grams = TextBlob(data).ngrams(num)
return [' '.join(grams).lower() for grams in n_grams]
def _delete_duplicate_food_n_grams(text: str, foods: List[str]) -> List[str]:
foods.sort(key=lambda x: -len(x.split())) # Sort desc by number of words
result_foods = []