GarrettMooney/whatlies_distilbert.md

## whatlies_distilbert.md

      
    %pip install "whatlies[sentence_tfm]"  # quotes for my fellow zsh users

    import numpy as np
    from whatlies.language import SentenceTFMLanguage
    from sklearn.pipeline import Pipeline
    from sklearn.linear_model import LogisticRegression

    # Embed each sentence with DistilBERT, then classify the embeddings.
    pipe = Pipeline([
        ("embed", SentenceTFMLanguage("distilbert-base-nli-stsb-mean-tokens")),
        ("model", LogisticRegression()),
    ])

    # Toy sentiment data: three positive and three negative sentences.
    X = [
        "i really like this post",
        "thanks for that comment",
        "i enjoy this friendly forum",
        "this is a bad post",
        "i dislike this article",
        "this is not well written",
    ]

    # Labels aligned with X: 1 = positive, 0 = negative.
    y = np.array([1, 1, 1, 0, 0, 0])

    pipe.fit(X, y)
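
Once fitted, a pipeline of this shape can score unseen text through the usual scikit-learn `predict` and `predict_proba` calls. The sketch below is not part of the original snippet. To run without downloading a model, it assumes `CountVectorizer` as a lightweight stand-in for the `SentenceTFMLanguage` step. The names `demo_pipe` and `new_texts` are hypothetical, and the same calls should apply to `pipe` once it has been fitted.

```python
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

# Assumption: CountVectorizer only stands in for the SentenceTFMLanguage
# embedding step so that this sketch runs offline.
demo_pipe = Pipeline([
    ("embed", CountVectorizer()),
    ("model", LogisticRegression()),
])

X = [
    "i really like this post",
    "thanks for that comment",
    "i enjoy this friendly forum",
    "this is a bad post",
    "i dislike this article",
    "this is not well written",
]
y = np.array([1, 1, 1, 0, 0, 0])
demo_pipe.fit(X, y)

# Hypothetical unseen sentences to score.
new_texts = ["i like this forum", "this article is bad"]

labels = demo_pipe.predict(new_texts)       # one 0/1 label per text
probs = demo_pipe.predict_proba(new_texts)  # columns follow demo_pipe.classes_

for text, label, p in zip(new_texts, labels, probs):
    print(f"{text!r}: label={label}, p(positive)={p[1]:.2f}")
```

With only six training sentences the predictions are illustrative at best. The point is the calling convention, not the accuracy.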