Calculate most common combinations using Pandas
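import pandas as pd

# Count each (source IP, source port, destination IP, destination port)
# combination and write the 20 most frequent ones to a CSV file.
(
    pd.read_csv('Dataset-Unicauca-Version2-87Atts.csv', header=0)
    .groupby(['Source.IP', 'Source.Port', 'Destination.IP', 'Destination.Port'])
    .size()
    .sort_values(ascending=False)[:20]
    .to_csv('processed_data_python.csv')
)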
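
The same group, count, and sort pattern can be tried without the full dataset. The following is a minimal sketch on a small in-memory DataFrame; the helper name `top_combinations`, the sample rows, and the reduced column list are assumptions made for illustration and are not part of the original gist.

```python
import pandas as pd


def top_combinations(df, columns, n=20):
    """Return the n most frequent value combinations of `columns` as a Series."""
    return (
        df.groupby(columns)
        .size()                        # number of rows per combination
        .sort_values(ascending=False)  # most frequent first
        .head(n)
    )


# Hypothetical sample data standing in for the real CSV.
sample = pd.DataFrame({
    "Source.IP": ["10.0.0.1", "10.0.0.1", "10.0.0.2", "10.0.0.1"],
    "Destination.IP": ["8.8.8.8", "8.8.8.8", "1.1.1.1", "1.1.1.1"],
})

top = top_combinations(sample, ["Source.IP", "Destination.IP"], n=2)
print(top)
```

The result is a Series indexed by the grouped columns, so `to_csv` writes one row per combination with its count. On pandas 1.1 or later, `df.value_counts(subset=columns)` should give an equivalent count sorted in descending order.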