Skip to content

Instantly share code, notes, and snippets.

@grahamharrison68
Created November 24, 2021 07:40
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save grahamharrison68/28617a467e7d074b1ad732639a2186ab to your computer and use it in GitHub Desktop.
Save grahamharrison68/28617a467e7d074b1ad732639a2186ab to your computer and use it in GitHub Desktop.
import pandas as pd
from sklearn.tree import DecisionTreeClassifier
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import KFold, cross_val_score
df_drug = pd.read_csv("drug200.csv")
label_encoder = LabelEncoder()
categorical_features = [feature for feature in df_drug.columns if df_drug[feature].dtypes == 'O']
for feature in categorical_features:
df_drug[feature]=label_encoder.fit_transform(df_drug[feature])
X = df_drug.drop("Drug", axis=1)
y = df_drug["Drug"]
model = DecisionTreeClassifier(criterion="entropy")
model.fit(X, y)
kfold = KFold(random_state=42, shuffle=True)
cv_results = cross_val_score(model, X, y, cv=kfold, scoring="accuracy")
print(cv_results.mean(), cv_results.std())
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment