@taylanbil
Created September 27, 2019 21:55
[fseq][transformer] warmed up run
Epoch 1 begin 21:38:10
training/ 21:39:31, device xla:1, step 1, Rate=19.64, GlobalRate=19.64
training/ 21:39:31, device xla:2, step 1, Rate=19.52, GlobalRate=19.52
training/ 21:39:31, device xla:5, step 1, Rate=19.37, GlobalRate=19.37
training/ 21:39:31, device xla:8, step 1, Rate=38.76, GlobalRate=38.76
training/ 21:39:31, device xla:4, step 1, Rate=19.31, GlobalRate=19.31
training/ 21:39:31, device xla:6, step 1, Rate=38.53, GlobalRate=38.53
training/ 21:39:31, device xla:7, step 1, Rate=76.98, GlobalRate=76.98
training/ 21:39:31, device xla:3, step 1, Rate=38.31, GlobalRate=38.31
training/ 21:39:51, device xla:8, step 2, Rate=45.96, GlobalRate=46.01
training/ 21:39:51, device xla:5, step 2, Rate=22.96, GlobalRate=22.99
training/ 21:39:51, device xla:1, step 2, Rate=37.88, GlobalRate=38.21
training/ 21:39:51, device xla:6, step 2, Rate=30.67, GlobalRate=30.64
training/ 21:39:51, device xla:7, step 2, Rate=46.06, GlobalRate=45.97
training/ 21:39:51, device xla:3, step 2, Rate=30.61, GlobalRate=30.60
training/ 21:39:51, device xla:4, step 2, Rate=22.93, GlobalRate=22.95
training/ 21:39:51, device xla:2, step 2, Rate=22.81, GlobalRate=22.87
training/ 21:40:13, device xla:6, step 3, Rate=26.08, GlobalRate=27.59
training/ 21:40:13, device xla:1, step 3, Rate=42.77, GlobalRate=41.33
training/ 21:40:13, device xla:8, step 3, Rate=32.17, GlobalRate=36.79
training/ 21:40:14, device xla:4, step 3, Rate=15.96, GlobalRate=18.26
training/ 21:40:14, device xla:5, step 3, Rate=15.97, GlobalRate=18.27
training/ 21:40:14, device xla:7, step 3, Rate=25.21, GlobalRate=31.97
training/ 21:40:14, device xla:3, step 3, Rate=19.01, GlobalRate=22.80
training/ 21:40:14, device xla:2, step 3, Rate=15.93, GlobalRate=18.24
training/ 21:40:32, device xla:2, step 4, Rate=14.62, GlobalRate=17.12
training/ 21:40:32, device xla:4, step 4, Rate=14.60, GlobalRate=17.12
training/ 21:40:32, device xla:5, step 4, Rate=39.21, GlobalRate=27.39
training/ 21:40:32, device xla:3, step 4, Rate=24.06, GlobalRate=23.95
training/ 21:40:33, device xla:1, step 4, Rate=25.02, GlobalRate=34.06
training/ 21:40:33, device xla:8, step 4, Rate=20.77, GlobalRate=30.68
training/ 21:40:33, device xla:7, step 4, Rate=26.20, GlobalRate=30.67
training/ 21:40:33, device xla:6, step 4, Rate=18.20, GlobalRate=23.76
training/ 21:40:53, device xla:3, step 5, Rate=16.90, GlobalRate=21.35
training/ 21:40:53, device xla:1, step 5, Rate=24.76, GlobalRate=32.00
training/ 21:40:53, device xla:8, step 5, Rate=23.09, GlobalRate=29.37
training/ 21:40:53, device xla:6, step 5, Rate=14.78, GlobalRate=21.35
training/ 21:40:53, device xla:5, step 5, Rate=44.74, GlobalRate=32.03
training/ 21:40:53, device xla:7, step 5, Rate=17.87, GlobalRate=26.69
training/ 21:40:53, device xla:2, step 5, Rate=13.10, GlobalRate=16.01
training/ 21:40:54, device xla:4, step 5, Rate=20.05, GlobalRate=18.59
training/ 21:41:13, device xla:6, step 6, Rate=21.70, GlobalRate=22.19
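
The log lines above follow a regular `training/ <time>, device <id>, step <n>, Rate=<x>, GlobalRate=<y>` layout, so they can be summarized per step with a short script. This is a minimal sketch, not part of the original run: `parse_line` and `mean_rate_by_step` are hypothetical helper names, and the script treats `Rate` and `GlobalRate` as plain numbers without assuming their units.

```python
import re
from collections import defaultdict

# Pattern matching the log layout shown above (an assumption based only on
# the lines in this gist; other runs may format their output differently).
LINE_RE = re.compile(
    r"training/ (?P<time>\d{2}:\d{2}:\d{2}), device (?P<device>xla:\d+), "
    r"step (?P<step>\d+), Rate=(?P<rate>[\d.]+), GlobalRate=(?P<global_rate>[\d.]+)"
)


def parse_line(line):
    """Return a dict for a training line, or None for any other line."""
    m = LINE_RE.search(line)
    if m is None:
        return None
    return {
        "time": m["time"],
        "device": m["device"],
        "step": int(m["step"]),
        "rate": float(m["rate"]),
        "global_rate": float(m["global_rate"]),
    }


def mean_rate_by_step(lines):
    """Average the per-device Rate values reported at each step."""
    rates = defaultdict(list)
    for line in lines:
        rec = parse_line(line)
        if rec is not None:
            rates[rec["step"]].append(rec["rate"])
    return {step: sum(vals) / len(vals) for step, vals in sorted(rates.items())}


if __name__ == "__main__":
    sample = [
        "Epoch 1 begin 21:38:10",
        "training/ 21:39:31, device xla:1, step 1, Rate=19.64, GlobalRate=19.64",
        "training/ 21:39:31, device xla:2, step 1, Rate=19.52, GlobalRate=19.52",
    ]
    for step, mean in mean_rate_by_step(sample).items():
        print(f"step {step}: mean Rate {mean:.2f}")
```

Grouping by step rather than by device makes it easier to see how much the per-device rates spread within a single step, which is the main pattern visible in this log.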