@yuriybash
Created January 24, 2019 21:14
sample config
models:
  1:
    vectorizer:
      title:
        name: 'tfidf'
        parameters:
          - ngram_range: [[1, 1], [1, 2]]
            max_features: [500]
      url:
        name: 'tfidf'
        parameters:
          - ngram_range: [[1, 1]]
            max_features: [500]
    estimator:
      name: 'SVC'
      parameters:
        - kernel: ['rbf']
          gamma: [1.0e-3, 1.0e-4]
          C: [1, 10, 100, 1000]
        - kernel: ['linear']
          C: [1, 10, 100, 1000]
  2:
    vectorizer:
      title:
        name: 'tfidf'
        parameters:
          - ngram_range: [[1, 1]]
            max_features: [500]
      url:
        name: 'tfidf'
        parameters:
          - ngram_range: [[1, 1]]
            max_features: [500]
    estimator:
      name: 'MultinomialNB'
      parameters:
        - alpha: [1.0, 2.0, 5.0, 10.0, 20.0, 25.0]
  3:
    vectorizer:
      title:
        name: 'tfidf'
        parameters:
          - ngram_range: [[1, 1]]
            max_features: [500]
      url:
        name: 'tfidf'
        parameters:
          - ngram_range: [[1, 1]]
            max_features: [500]
    estimator:
      name: 'LogisticRegression'
      parameters:
        - class_weight:
            - 0: 0.8
              1: 0.2
            - 0: 0.85
              1: 0.15
            - 0: 0.9
              1: 0.1
  4:
    vectorizer:
      title:
        name: 'tfidf'
        parameters:
          - ngram_range: [[1, 1]]
            max_features: [500]
      url:
        name: 'tfidf'
        parameters:
          - ngram_range: [[1, 1]]
            max_features: [500]
    estimator:
      name: 'Perceptron'
      parameters:
        - alpha: [1.0e-4, 1.0e-3]
          max_iter: [5]
  5:
    vectorizer:
      title:
        name: 'tfidf'
        parameters:
          - ngram_range: [[1, 1]]
            max_features: [500]
      url:
        name: 'tfidf'
        parameters:
          - ngram_range: [[1, 1]]
            max_features: [500]
    estimator:
      name: 'MLPClassifier'
      parameters:
        - activation: ['relu', 'logistic', 'tanh']
          max_iter: [200]
cross_validation:
  train_test_split: [75, 25] # these need to sum to 100
  n_folds: 5
test:
  scores: ['precision', 'recall', 'accuracy']
data: 'data/data.csv'
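
Each `parameters` entry in the config is a list of sub-grids whose values are lists of candidates, which resembles the `param_grid` format scikit-learn's grid search accepts. The gist does not include the code that consumes this file, so the following is only a minimal sketch of how such a grid could be expanded and how the "sum to 100" rule on `train_test_split` could be enforced. The function names `expand_grid` and `check_split` are hypothetical, and the grid is written as a Python literal on the assumption that a YAML loader would produce an equivalent structure.

```python
from itertools import product

# Estimator grid for model 1 (SVC), written as the Python structure a YAML
# loader is assumed to produce for its `estimator.parameters` list.
svc_grid = [
    {"kernel": ["rbf"], "gamma": [1.0e-3, 1.0e-4], "C": [1, 10, 100, 1000]},
    {"kernel": ["linear"], "C": [1, 10, 100, 1000]},
]


def expand_grid(grid):
    """Yield one dict per parameter combination (hypothetical helper).

    Each list item is an independent sub-grid. Within a sub-grid, every
    combination of the listed values is produced (Cartesian product).
    """
    for sub_grid in grid:
        keys = sorted(sub_grid)
        for values in product(*(sub_grid[k] for k in keys)):
            yield dict(zip(keys, values))


def check_split(split):
    """Enforce the config comment: the two percentages must sum to 100.

    Returns the split as (train_fraction, test_fraction).
    """
    if len(split) != 2 or sum(split) != 100:
        raise ValueError(
            f"train_test_split must be two values summing to 100, got {split}"
        )
    train_pct, test_pct = split
    return train_pct / 100, test_pct / 100


combos = list(expand_grid(svc_grid))
print(len(combos))            # 8 rbf combinations + 4 linear combinations
print(combos[0])
print(check_split([75, 25]))
```

Under this reading, model 1 contributes 12 SVC candidates per vectorizer setting: the `rbf` sub-grid gives 2 × 4 combinations and the `linear` sub-grid gives 4, since `gamma` is only listed for the `rbf` kernel.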