Skip to content

Instantly share code, notes, and snippets.

View moritzkoerber's full-sized avatar

Moritz Körber moritzkoerber

View GitHub Profile
@moritzkoerber
moritzkoerber / model_training_for_text_analysis.py
Last active March 8, 2021 08:30
Trains a model to analyze text messages.
import argparse
import pickle
import string
import sys
import nltk
import pandas as pd
from nltk.corpus import stopwords
from nltk.stem.wordnet import WordNetLemmatizer
from nltk.tokenize import word_tokenize

Keybase proof

I hereby claim:

  • I am moritzkoerber on github.
  • I am moritzkoerber (https://keybase.io/moritzkoerber) on keybase.
  • I have a public key ASBagXNuNawc5COk1wSUH57zvRWiy4bM8o7ZeCxKWVv06Ao

To claim this, I am signing this object:

import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, RepeatedStratifiedKFold
from sklearn.pipeline import Pipeline
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import OneHotEncoder, StandardScaler
from sklearn.metrics import f1_score, classification_report
from sklearn.impute import SimpleImputer
from sklearn.model_selection import train_test_split
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report
from sklearn.model_selection import GridSearchCV, RepeatedStratifiedKFold
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler
titanic = pd.read_csv('./titanic.csv')