Skip to content

Instantly share code, notes, and snippets.

@jrzaurin
Last active June 13, 2021 09:18
Show Gist options
  • Save jrzaurin/95ab8203ebb29c43003222d46ece97b2 to your computer and use it in GitHub Desktop.
Save jrzaurin/95ab8203ebb29c43003222d46ece97b2 to your computer and use it in GitHub Desktop.
Results for the NYC Taxi rode duration dataset with Tabnet
n_steps step_dim attn_dim ghost_bn virtual_batch_size momentum gamma dropout embed_dropout lr batch_size weight_decay lambda_sparse optimizer lr_scheduler base_lr max_lr div_factor final_div_factor n_cycles val_loss_or_metric
5 8 8 False 128 0.75 1.5 0.0 0.0 0.01 1024 0.0 0.0001 Adam ReduceLROnPlateau 0.001 0.01 25 10000.0 5 144819.1190
5 8 8 False 128 0.98 1.5 0.0 0.0 0.01 1024 0.0 0.0001 Adam ReduceLROnPlateau 0.001 0.01 25 10000.0 5 146057.8078
5 8 8 False 128 0.5 1.5 0.0 0.0 0.01 1024 0.0 0.0001 Adam ReduceLROnPlateau 0.001 0.01 25 10000.0 5 146201.3771
5 16 16 False 128 0.98 1.5 0.0 0.0 0.01 1024 0.0 0.0001 Adam ReduceLROnPlateau 0.001 0.01 25 10000.0 5 146461.7343
5 8 8 False 128 0.25 1.5 0.0 0.0 0.01 1024 0.0 0.0001 Adam ReduceLROnPlateau 0.001 0.01 25 10000.0 5 148636.8888
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment