Skip to content

Instantly share code, notes, and snippets.

@jaganadhg
Created January 18, 2013 12:28
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save jaganadhg/4564287 to your computer and use it in GitHub Desktop.
Save jaganadhg/4564287 to your computer and use it in GitHub Desktop.
Sklearn saved classifier slow
from sklearn.datasets import fetch_20newsgroups
twenty = fetch_20newsgroups()
from sklearn.pipeline import Pipeline
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.feature_extraction.text import TfidfTransformer
X = CountVectorizer().fit_transform(twenty.data)
X_tfidf = TfidfTransformer().fit_transform(X)
from sklearn.naive_bayes import MultinomialNB
from sklearn.naive_bayes import MultinomialNB
classifier = Pipeline([('vect',CountVectorizer()),('tfidf',TfidfTransformer()),('clf',MultinomialNB()),])
_ = classifier.fit(twenty.data,twenty.target)
from sklearn.externals import joblib
_ = joblib.dump(classifier,"test_speed.model",compress=9)
clf = joblib.load("test_speed.model")
clf.predict(["this is a good sentence for debugging this code I think. what do you think"])
#Wall time: 0.83 s
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment