Skip to content

Instantly share code, notes, and snippets.

@parvathysarat
Last active September 18, 2017 03:12
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save parvathysarat/e04f518acd7ca6bc7b57a85eb38bd336 to your computer and use it in GitHub Desktop.
Save parvathysarat/e04f518acd7ca6bc7b57a85eb38bd336 to your computer and use it in GitHub Desktop.
Spot checking Decision Tree Classifier for firewall logs. Original data (18 columns) trimmed to 10.
# CART Classification
import pandas as pd
from sklearn import model_selection
from sklearn.tree import DecisionTreeClassifier
dataframe = pd.read_csv("data.csv", names=['ID', 'No.', 'Smth', 'Number', 'Count', 'Count2', 'UDP/TCP', 'RandomNo',
'IP', 'AUDIT/ALLOW/BLOCK'])
array = dataframe.values
X = array[:,0:9]
Y = array[:,9]
seed = 7
kfold = model_selection.KFold(n_splits=10, random_state=seed)
model = DecisionTreeClassifier()
results = model_selection.cross_val_score(model, X, Y, cv=kfold)
print(results.mean())
#0.731300116075
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment