@keitazoumana
Created July 24, 2023 03:30
Natural Language Processing Table
| Title | Link | Video |
| --- | --- | --- |
| Do You Want To Cluster Unlabeled Text Data? Try Out Topic Modeling | https://towardsdatascience.com/do-you-want-to-cluster-unlabeled-text-data-try-out-topic-modeling-235795ae7cb7 | |
| Financial Text Classification With Deep Learning Using FinBERT | https://medium.com/mlearning-ai/financial-text-classification-with-deep-learning-using-finbert-51a73b393a08 | |
| Named Entity Recognition with Spacy and the Mighty roBERTa | https://towardsdatascience.com/named-entity-recognition-with-spacy-and-the-mighty-roberta-97d879f981 | https://youtu.be/OLwYBSE64m4 |
| Scientific Documents Similarity Search With Deep Learning Using Transformers (SciBERT) | https://towardsdatascience.com/scientific-documents-similarity-search-with-deep-learning-using-transformers-scibert-d47c4e501590 | |
| Meet BERTopic— BERT’s Cousin For Advanced Topic Modeling | https://towardsdatascience.com/meet-bertopic-berts-cousin-for-advanced-topic-modeling-ea5bf0b7faa3 | https://youtu.be/sZcGuYHWN_w |
| Unsupervised Multilingual Text Classification With Zero-Shot Approach | https://towardsdatascience.com/unsupervised-multilingual-text-classification-with-zero-shot-approach-e65bd79b5d0 | |
| Semantic Keywords And Keyphrases Extraction With KeyBERT | https://towardsdatascience.com/semantic-keywords-and-keyphrases-extraction-with-keybert-999234cab7f | |
| 4 NLP Libraries for Automatic Language Identification of Text Data In Python | https://towardsdatascience.com/4-nlp-libraries-for-automatic-language-identification-of-text-data-in-python-cbc6bf664774 | |
| Data Augmentation in NLP Using Back Translation With MarianMT | https://towardsdatascience.com/data-augmentation-in-nlp-using-back-translation-with-marianmt-a8939dfea50a | https://youtu.be/HoqlUu_8fWE |
| Social Media Sentiment Analysis In Python With VADER — No Training Required! | https://towardsdatascience.com/social-media-sentiment-analysis-in-python-with-vader-no-training-required-4bc6a21e87b8 | https://youtu.be/fdzQLzpBqVM |
| Stemming and Lemmatization— Which One is Worth Going For? | https://towardsdatascience.com/stemming-lemmatization-which-one-is-worth-going-for-77e6ec01ad9c | |
| VADER Vs. TextBlob — Which One Is Better For Social Media Sentiment Analysis? | https://medium.com/geekculture/vader-vs-textblob-which-one-is-better-for-social-media-sentiment-analysis-aecb15968b93 | |
| Most Common Text Processing Tasks In Natural Language Processing | https://towardsdatascience.com/most-common-text-processing-tasks-in-natural-language-processing-d8cd7a076b79 | https://youtu.be/169Jw3j-5pE |
| How to Perform Speech-to-Text and Translate Any Speech to English With OpenAI’s Whisper | https://towardsdatascience.com/how-to-perform-speech-to-text-and-translate-any-speech-to-english-with-openais-whisper-50e3a366cbca | https://youtu.be/wbYGdCslFVg |
| Plagiarism Detection Using Transformers | https://www.pinecone.io/learn/plagiarism-detection/ | https://youtu.be/_PqHRH55hV0 |
| Text-to-Image and Image-to-image search Using CLIP | https://www.pinecone.io/learn/clip-image-search/ | |
| A Step-by-step Guide to Solving 4 Real-life Problems With Transformers and Hugging Face | https://towardsdatascience.com/4-real-life-problems-solved-using-transformers-and-hugging-face-a-complete-guide-e45fe698cc4d | https://github.com/keitazoumana/keitazoumana/blob/main/link |
| Text data representation with one-hot encoding Tf-Idf Count Vectors Co-occurrence Vectors and Word2Vec | https://towardsdatascience.com/text-data-representation-with-one-hot-encoding-tf-idf-count-vectors-co-occurrence-vectors-and-f1bccbd98bef | |
| Fine-Tuning GPT-3 Using the OpenAI API and Python | https://www.datacamp.com/tutorial/fine-tuning-gpt-3-using-the-open-ai-api-and-python | |