Skip to content

Instantly share code, notes, and snippets.

@stas-sl
Last active June 21, 2016 20:47
Show Gist options
  • Save stas-sl/182d9999cbf8f67a791ec205623dbba7 to your computer and use it in GitHub Desktop.
Save stas-sl/182d9999cbf8f67a791ec205623dbba7 to your computer and use it in GitHub Desktop.
This file has been truncated, but you can view the full file.
I0612 14:48:17.543987 6181 solver.cpp:280] Solving mixed_lstm
I0612 14:48:17.543999 6181 solver.cpp:281] Learning Rate Policy: fixed
I0612 14:48:17.554961 6181 solver.cpp:338] Iteration 0, Testing net (#0)
I0612 14:49:17.784380 6181 solver.cpp:393] Test loss: 2.74935
I0612 14:49:17.784834 6181 solver.cpp:406] Test net output #0: loss1/accuracy = 0.62328
I0612 14:49:17.784857 6181 solver.cpp:406] Test net output #1: loss1/accuracy01 = 0.772
I0612 14:49:17.784870 6181 solver.cpp:406] Test net output #2: loss1/accuracy02 = 0.65
I0612 14:49:17.784883 6181 solver.cpp:406] Test net output #3: loss1/accuracy03 = 0.535
I0612 14:49:17.784894 6181 solver.cpp:406] Test net output #4: loss1/accuracy04 = 0.498
I0612 14:49:17.784906 6181 solver.cpp:406] Test net output #5: loss1/accuracy05 = 0.509
I0612 14:49:17.784920 6181 solver.cpp:406] Test net output #6: loss1/accuracy06 = 0.714
I0612 14:49:17.784932 6181 solver.cpp:406] Test net output #7: loss1/accuracy07 = 0.853
I0612 14:49:17.784945 6181 solver.cpp:406] Test net output #8: loss1/accuracy08 = 0.925
I0612 14:49:17.784957 6181 solver.cpp:406] Test net output #9: loss1/accuracy09 = 0.967
I0612 14:49:17.784968 6181 solver.cpp:406] Test net output #10: loss1/accuracy10 = 0.985
I0612 14:49:17.784981 6181 solver.cpp:406] Test net output #11: loss1/accuracy11 = 0.998
I0612 14:49:17.784991 6181 solver.cpp:406] Test net output #12: loss1/accuracy12 = 1
I0612 14:49:17.785003 6181 solver.cpp:406] Test net output #13: loss1/accuracy13 = 1
I0612 14:49:17.785014 6181 solver.cpp:406] Test net output #14: loss1/accuracy14 = 1
I0612 14:49:17.785025 6181 solver.cpp:406] Test net output #15: loss1/accuracy15 = 1
I0612 14:49:17.785037 6181 solver.cpp:406] Test net output #16: loss1/accuracy16 = 1
I0612 14:49:17.785048 6181 solver.cpp:406] Test net output #17: loss1/accuracy17 = 1
I0612 14:49:17.785059 6181 solver.cpp:406] Test net output #18: loss1/accuracy18 = 1
I0612 14:49:17.785071 6181 solver.cpp:406] Test net output #19: loss1/accuracy19 = 1
I0612 14:49:17.785082 6181 solver.cpp:406] Test net output #20: loss1/accuracy20 = 1
I0612 14:49:17.785094 6181 solver.cpp:406] Test net output #21: loss1/accuracy21 = 1
I0612 14:49:17.785105 6181 solver.cpp:406] Test net output #22: loss1/accuracy22 = 1
I0612 14:49:17.785117 6181 solver.cpp:406] Test net output #23: loss1/accuracy_incl_empty = 0.890411
I0612 14:49:17.785130 6181 solver.cpp:406] Test net output #24: loss1/accuracy_top3 = 0.832317
I0612 14:49:17.785145 6181 solver.cpp:406] Test net output #25: loss1/cross_entropy_loss = 1.45759 (* 0.3 = 0.437278 loss)
I0612 14:49:17.785159 6181 solver.cpp:406] Test net output #26: loss1/cross_entropy_loss_incl_empty = 0.426676 (* 0.3 = 0.128003 loss)
I0612 14:49:17.785174 6181 solver.cpp:406] Test net output #27: loss1/loss01 = 0.964696 (* 0.0272727 = 0.0263099 loss)
I0612 14:49:17.785188 6181 solver.cpp:406] Test net output #28: loss1/loss02 = 1.35698 (* 0.0272727 = 0.0370084 loss)
I0612 14:49:17.785202 6181 solver.cpp:406] Test net output #29: loss1/loss03 = 1.72278 (* 0.0272727 = 0.0469849 loss)
I0612 14:49:17.785215 6181 solver.cpp:406] Test net output #30: loss1/loss04 = 1.8751 (* 0.0272727 = 0.051139 loss)
I0612 14:49:17.785229 6181 solver.cpp:406] Test net output #31: loss1/loss05 = 1.67692 (* 0.0272727 = 0.0457341 loss)
I0612 14:49:17.785243 6181 solver.cpp:406] Test net output #32: loss1/loss06 = 1.04425 (* 0.0272727 = 0.0284796 loss)
I0612 14:49:17.785256 6181 solver.cpp:406] Test net output #33: loss1/loss07 = 0.565391 (* 0.0272727 = 0.0154198 loss)
I0612 14:49:17.785270 6181 solver.cpp:406] Test net output #34: loss1/loss08 = 0.283869 (* 0.0272727 = 0.00774188 loss)
I0612 14:49:17.785284 6181 solver.cpp:406] Test net output #35: loss1/loss09 = 0.141395 (* 0.0272727 = 0.00385623 loss)
I0612 14:49:17.785298 6181 solver.cpp:406] Test net output #36: loss1/loss10 = 0.0745112 (* 0.0272727 = 0.00203212 loss)
I0612 14:49:17.785312 6181 solver.cpp:406] Test net output #37: loss1/loss11 = 0.0178277 (* 0.0272727 = 0.00048621 loss)
I0612 14:49:17.785343 6181 solver.cpp:406] Test net output #38: loss1/loss12 = 0.0102666 (* 0.0272727 = 0.000279998 loss)
I0612 14:49:17.785361 6181 solver.cpp:406] Test net output #39: loss1/loss13 = 0.00613571 (* 0.0272727 = 0.000167338 loss)
I0612 14:49:17.785389 6181 solver.cpp:406] Test net output #40: loss1/loss14 = 0.00383044 (* 0.0272727 = 0.000104467 loss)
I0612 14:49:17.785405 6181 solver.cpp:406] Test net output #41: loss1/loss15 = 0.00271295 (* 0.0272727 = 7.39895e-05 loss)
I0612 14:49:17.785419 6181 solver.cpp:406] Test net output #42: loss1/loss16 = 0.00185899 (* 0.0272727 = 5.06998e-05 loss)
I0612 14:49:17.785434 6181 solver.cpp:406] Test net output #43: loss1/loss17 = 0.00150055 (* 0.0272727 = 4.09241e-05 loss)
I0612 14:49:17.785447 6181 solver.cpp:406] Test net output #44: loss1/loss18 = 0.00119731 (* 0.0272727 = 3.26539e-05 loss)
I0612 14:49:17.785461 6181 solver.cpp:406] Test net output #45: loss1/loss19 = 0.00111577 (* 0.0272727 = 3.04301e-05 loss)
I0612 14:49:17.785475 6181 solver.cpp:406] Test net output #46: loss1/loss20 = 0.000914118 (* 0.0272727 = 2.49305e-05 loss)
I0612 14:49:17.785490 6181 solver.cpp:406] Test net output #47: loss1/loss21 = 0.000783918 (* 0.0272727 = 2.13796e-05 loss)
I0612 14:49:17.785503 6181 solver.cpp:406] Test net output #48: loss1/loss22 = 0.000455443 (* 0.0272727 = 1.24212e-05 loss)
I0612 14:49:17.785516 6181 solver.cpp:406] Test net output #49: loss2/accuracy = 0.777339
I0612 14:49:17.785528 6181 solver.cpp:406] Test net output #50: loss2/accuracy01 = 0.864
I0612 14:49:17.785539 6181 solver.cpp:406] Test net output #51: loss2/accuracy02 = 0.828
I0612 14:49:17.785552 6181 solver.cpp:406] Test net output #52: loss2/accuracy03 = 0.751
I0612 14:49:17.785562 6181 solver.cpp:406] Test net output #53: loss2/accuracy04 = 0.673
I0612 14:49:17.785574 6181 solver.cpp:406] Test net output #54: loss2/accuracy05 = 0.66
I0612 14:49:17.785586 6181 solver.cpp:406] Test net output #55: loss2/accuracy06 = 0.799
I0612 14:49:17.785598 6181 solver.cpp:406] Test net output #56: loss2/accuracy07 = 0.88
I0612 14:49:17.785609 6181 solver.cpp:406] Test net output #57: loss2/accuracy08 = 0.935
I0612 14:49:17.785620 6181 solver.cpp:406] Test net output #58: loss2/accuracy09 = 0.966
I0612 14:49:17.785631 6181 solver.cpp:406] Test net output #59: loss2/accuracy10 = 0.984
I0612 14:49:17.785643 6181 solver.cpp:406] Test net output #60: loss2/accuracy11 = 0.995
I0612 14:49:17.785655 6181 solver.cpp:406] Test net output #61: loss2/accuracy12 = 0.999
I0612 14:49:17.785666 6181 solver.cpp:406] Test net output #62: loss2/accuracy13 = 0.999
I0612 14:49:17.785677 6181 solver.cpp:406] Test net output #63: loss2/accuracy14 = 1
I0612 14:49:17.785688 6181 solver.cpp:406] Test net output #64: loss2/accuracy15 = 1
I0612 14:49:17.785699 6181 solver.cpp:406] Test net output #65: loss2/accuracy16 = 1
I0612 14:49:17.785712 6181 solver.cpp:406] Test net output #66: loss2/accuracy17 = 1
I0612 14:49:17.785723 6181 solver.cpp:406] Test net output #67: loss2/accuracy18 = 1
I0612 14:49:17.785734 6181 solver.cpp:406] Test net output #68: loss2/accuracy19 = 1
I0612 14:49:17.785745 6181 solver.cpp:406] Test net output #69: loss2/accuracy20 = 1
I0612 14:49:17.785756 6181 solver.cpp:406] Test net output #70: loss2/accuracy21 = 1
I0612 14:49:17.785768 6181 solver.cpp:406] Test net output #71: loss2/accuracy22 = 1
I0612 14:49:17.785778 6181 solver.cpp:406] Test net output #72: loss2/accuracy_incl_empty = 0.932956
I0612 14:49:17.785790 6181 solver.cpp:406] Test net output #73: loss2/accuracy_top3 = 0.899304
I0612 14:49:17.785804 6181 solver.cpp:406] Test net output #74: loss2/cross_entropy_loss = 0.985322 (* 0.3 = 0.295597 loss)
I0612 14:49:17.785817 6181 solver.cpp:406] Test net output #75: loss2/cross_entropy_loss_incl_empty = 0.292092 (* 0.3 = 0.0876276 loss)
I0612 14:49:17.785831 6181 solver.cpp:406] Test net output #76: loss2/loss01 = 0.686494 (* 0.0272727 = 0.0187226 loss)
I0612 14:49:17.785845 6181 solver.cpp:406] Test net output #77: loss2/loss02 = 0.849009 (* 0.0272727 = 0.0231548 loss)
I0612 14:49:17.785873 6181 solver.cpp:406] Test net output #78: loss2/loss03 = 1.04144 (* 0.0272727 = 0.0284028 loss)
I0612 14:49:17.785888 6181 solver.cpp:406] Test net output #79: loss2/loss04 = 1.26467 (* 0.0272727 = 0.0344909 loss)
I0612 14:49:17.785902 6181 solver.cpp:406] Test net output #80: loss2/loss05 = 1.17103 (* 0.0272727 = 0.0319373 loss)
I0612 14:49:17.785915 6181 solver.cpp:406] Test net output #81: loss2/loss06 = 0.780123 (* 0.0272727 = 0.0212761 loss)
I0612 14:49:17.785929 6181 solver.cpp:406] Test net output #82: loss2/loss07 = 0.461953 (* 0.0272727 = 0.0125987 loss)
I0612 14:49:17.785943 6181 solver.cpp:406] Test net output #83: loss2/loss08 = 0.253267 (* 0.0272727 = 0.00690727 loss)
I0612 14:49:17.785956 6181 solver.cpp:406] Test net output #84: loss2/loss09 = 0.125268 (* 0.0272727 = 0.00341641 loss)
I0612 14:49:17.785974 6181 solver.cpp:406] Test net output #85: loss2/loss10 = 0.0717455 (* 0.0272727 = 0.0019567 loss)
I0612 14:49:17.785987 6181 solver.cpp:406] Test net output #86: loss2/loss11 = 0.0167397 (* 0.0272727 = 0.000456538 loss)
I0612 14:49:17.786001 6181 solver.cpp:406] Test net output #87: loss2/loss12 = 0.00967639 (* 0.0272727 = 0.000263902 loss)
I0612 14:49:17.786015 6181 solver.cpp:406] Test net output #88: loss2/loss13 = 0.00630112 (* 0.0272727 = 0.000171849 loss)
I0612 14:49:17.786029 6181 solver.cpp:406] Test net output #89: loss2/loss14 = 0.00451248 (* 0.0272727 = 0.000123068 loss)
I0612 14:49:17.786042 6181 solver.cpp:406] Test net output #90: loss2/loss15 = 0.00345135 (* 0.0272727 = 9.41277e-05 loss)
I0612 14:49:17.786056 6181 solver.cpp:406] Test net output #91: loss2/loss16 = 0.0026692 (* 0.0272727 = 7.27965e-05 loss)
I0612 14:49:17.786070 6181 solver.cpp:406] Test net output #92: loss2/loss17 = 0.00236119 (* 0.0272727 = 6.4396e-05 loss)
I0612 14:49:17.786084 6181 solver.cpp:406] Test net output #93: loss2/loss18 = 0.00210323 (* 0.0272727 = 5.73609e-05 loss)
I0612 14:49:17.786098 6181 solver.cpp:406] Test net output #94: loss2/loss19 = 0.00171325 (* 0.0272727 = 4.6725e-05 loss)
I0612 14:49:17.786111 6181 solver.cpp:406] Test net output #95: loss2/loss20 = 0.00148562 (* 0.0272727 = 4.05168e-05 loss)
I0612 14:49:17.786125 6181 solver.cpp:406] Test net output #96: loss2/loss21 = 0.00100667 (* 0.0272727 = 2.74545e-05 loss)
I0612 14:49:17.786139 6181 solver.cpp:406] Test net output #97: loss2/loss22 = 0.000551716 (* 0.0272727 = 1.50468e-05 loss)
I0612 14:49:17.786150 6181 solver.cpp:406] Test net output #98: loss3/accuracy = 0.851664
I0612 14:49:17.786162 6181 solver.cpp:406] Test net output #99: loss3/accuracy01 = 0.874
I0612 14:49:17.786173 6181 solver.cpp:406] Test net output #100: loss3/accuracy02 = 0.864
I0612 14:49:17.786185 6181 solver.cpp:406] Test net output #101: loss3/accuracy03 = 0.85
I0612 14:49:17.786196 6181 solver.cpp:406] Test net output #102: loss3/accuracy04 = 0.843
I0612 14:49:17.786207 6181 solver.cpp:406] Test net output #103: loss3/accuracy05 = 0.843
I0612 14:49:17.786219 6181 solver.cpp:406] Test net output #104: loss3/accuracy06 = 0.891
I0612 14:49:17.786231 6181 solver.cpp:406] Test net output #105: loss3/accuracy07 = 0.922
I0612 14:49:17.786242 6181 solver.cpp:406] Test net output #106: loss3/accuracy08 = 0.952
I0612 14:49:17.786252 6181 solver.cpp:406] Test net output #107: loss3/accuracy09 = 0.969
I0612 14:49:17.786264 6181 solver.cpp:406] Test net output #108: loss3/accuracy10 = 0.986
I0612 14:49:17.786275 6181 solver.cpp:406] Test net output #109: loss3/accuracy11 = 0.995
I0612 14:49:17.786288 6181 solver.cpp:406] Test net output #110: loss3/accuracy12 = 0.996
I0612 14:49:17.786298 6181 solver.cpp:406] Test net output #111: loss3/accuracy13 = 0.997
I0612 14:49:17.786309 6181 solver.cpp:406] Test net output #112: loss3/accuracy14 = 0.999
I0612 14:49:17.786321 6181 solver.cpp:406] Test net output #113: loss3/accuracy15 = 0.999
I0612 14:49:17.786332 6181 solver.cpp:406] Test net output #114: loss3/accuracy16 = 0.999
I0612 14:49:17.786353 6181 solver.cpp:406] Test net output #115: loss3/accuracy17 = 1
I0612 14:49:17.786366 6181 solver.cpp:406] Test net output #116: loss3/accuracy18 = 1
I0612 14:49:17.786377 6181 solver.cpp:406] Test net output #117: loss3/accuracy19 = 1
I0612 14:49:17.786389 6181 solver.cpp:406] Test net output #118: loss3/accuracy20 = 0.999
I0612 14:49:17.786401 6181 solver.cpp:406] Test net output #119: loss3/accuracy21 = 1
I0612 14:49:17.786412 6181 solver.cpp:406] Test net output #120: loss3/accuracy22 = 1
I0612 14:49:17.786422 6181 solver.cpp:406] Test net output #121: loss3/accuracy_incl_empty = 0.952318
I0612 14:49:17.786434 6181 solver.cpp:406] Test net output #122: loss3/accuracy_top3 = 0.920128
I0612 14:49:17.786448 6181 solver.cpp:406] Test net output #123: loss3/cross_entropy_loss = 0.696208 (* 1 = 0.696208 loss)
I0612 14:49:17.786461 6181 solver.cpp:406] Test net output #124: loss3/cross_entropy_loss_incl_empty = 0.221218 (* 1 = 0.221218 loss)
I0612 14:49:17.786475 6181 solver.cpp:406] Test net output #125: loss3/loss01 = 0.572295 (* 0.0909091 = 0.0520268 loss)
I0612 14:49:17.786489 6181 solver.cpp:406] Test net output #126: loss3/loss02 = 0.659684 (* 0.0909091 = 0.0599713 loss)
I0612 14:49:17.786502 6181 solver.cpp:406] Test net output #127: loss3/loss03 = 0.753102 (* 0.0909091 = 0.0684639 loss)
I0612 14:49:17.786516 6181 solver.cpp:406] Test net output #128: loss3/loss04 = 0.74539 (* 0.0909091 = 0.0677627 loss)
I0612 14:49:17.786530 6181 solver.cpp:406] Test net output #129: loss3/loss05 = 0.729268 (* 0.0909091 = 0.0662971 loss)
I0612 14:49:17.786543 6181 solver.cpp:406] Test net output #130: loss3/loss06 = 0.525001 (* 0.0909091 = 0.0477274 loss)
I0612 14:49:17.786556 6181 solver.cpp:406] Test net output #131: loss3/loss07 = 0.341916 (* 0.0909091 = 0.0310833 loss)
I0612 14:49:17.786571 6181 solver.cpp:406] Test net output #132: loss3/loss08 = 0.209225 (* 0.0909091 = 0.0190204 loss)
I0612 14:49:17.786583 6181 solver.cpp:406] Test net output #133: loss3/loss09 = 0.107202 (* 0.0909091 = 0.00974567 loss)
I0612 14:49:17.786597 6181 solver.cpp:406] Test net output #134: loss3/loss10 = 0.0587676 (* 0.0909091 = 0.00534251 loss)
I0612 14:49:17.786612 6181 solver.cpp:406] Test net output #135: loss3/loss11 = 0.0194487 (* 0.0909091 = 0.00176807 loss)
I0612 14:49:17.786625 6181 solver.cpp:406] Test net output #136: loss3/loss12 = 0.012872 (* 0.0909091 = 0.00117018 loss)
I0612 14:49:17.786638 6181 solver.cpp:406] Test net output #137: loss3/loss13 = 0.00781656 (* 0.0909091 = 0.000710596 loss)
I0612 14:49:17.786653 6181 solver.cpp:406] Test net output #138: loss3/loss14 = 0.00450606 (* 0.0909091 = 0.000409642 loss)
I0612 14:49:17.786666 6181 solver.cpp:406] Test net output #139: loss3/loss15 = 0.00358207 (* 0.0909091 = 0.000325642 loss)
I0612 14:49:17.786679 6181 solver.cpp:406] Test net output #140: loss3/loss16 = 0.00278909 (* 0.0909091 = 0.000253554 loss)
I0612 14:49:17.786694 6181 solver.cpp:406] Test net output #141: loss3/loss17 = 0.00248164 (* 0.0909091 = 0.000225604 loss)
I0612 14:49:17.786707 6181 solver.cpp:406] Test net output #142: loss3/loss18 = 0.00242382 (* 0.0909091 = 0.000220347 loss)
I0612 14:49:17.786721 6181 solver.cpp:406] Test net output #143: loss3/loss19 = 0.0023187 (* 0.0909091 = 0.000210791 loss)
I0612 14:49:17.786734 6181 solver.cpp:406] Test net output #144: loss3/loss20 = 0.00193507 (* 0.0909091 = 0.000175915 loss)
I0612 14:49:17.786748 6181 solver.cpp:406] Test net output #145: loss3/loss21 = 0.00122293 (* 0.0909091 = 0.000111175 loss)
I0612 14:49:17.786762 6181 solver.cpp:406] Test net output #146: loss3/loss22 = 0.000767555 (* 0.0909091 = 6.97777e-05 loss)
I0612 14:49:17.786774 6181 solver.cpp:406] Test net output #147: total_accuracy = 0.607
I0612 14:49:17.786785 6181 solver.cpp:406] Test net output #148: total_accuracy_not_rec = 0.585
I0612 14:49:17.786797 6181 solver.cpp:406] Test net output #149: total_confidence = 0.583791
I0612 14:49:17.786818 6181 solver.cpp:406] Test net output #150: total_confidence_not_rec = 0.533188
I0612 14:49:17.786831 6181 solver.cpp:338] Iteration 0, Testing net (#1)
I0612 14:50:17.771250 6181 solver.cpp:393] Test loss: 3.78327
I0612 14:50:17.771370 6181 solver.cpp:406] Test net output #0: loss1/accuracy = 0.588038
I0612 14:50:17.771389 6181 solver.cpp:406] Test net output #1: loss1/accuracy01 = 0.782
I0612 14:50:17.771402 6181 solver.cpp:406] Test net output #2: loss1/accuracy02 = 0.664
I0612 14:50:17.771415 6181 solver.cpp:406] Test net output #3: loss1/accuracy03 = 0.528
I0612 14:50:17.771427 6181 solver.cpp:406] Test net output #4: loss1/accuracy04 = 0.489
I0612 14:50:17.771440 6181 solver.cpp:406] Test net output #5: loss1/accuracy05 = 0.498
I0612 14:50:17.771451 6181 solver.cpp:406] Test net output #6: loss1/accuracy06 = 0.66
I0612 14:50:17.771463 6181 solver.cpp:406] Test net output #7: loss1/accuracy07 = 0.76
I0612 14:50:17.771474 6181 solver.cpp:406] Test net output #8: loss1/accuracy08 = 0.818
I0612 14:50:17.771486 6181 solver.cpp:406] Test net output #9: loss1/accuracy09 = 0.841
I0612 14:50:17.771498 6181 solver.cpp:406] Test net output #10: loss1/accuracy10 = 0.865
I0612 14:50:17.771510 6181 solver.cpp:406] Test net output #11: loss1/accuracy11 = 0.886
I0612 14:50:17.771522 6181 solver.cpp:406] Test net output #12: loss1/accuracy12 = 0.901
I0612 14:50:17.771533 6181 solver.cpp:406] Test net output #13: loss1/accuracy13 = 0.913
I0612 14:50:17.771545 6181 solver.cpp:406] Test net output #14: loss1/accuracy14 = 0.936
I0612 14:50:17.771558 6181 solver.cpp:406] Test net output #15: loss1/accuracy15 = 0.953
I0612 14:50:17.771569 6181 solver.cpp:406] Test net output #16: loss1/accuracy16 = 0.966
I0612 14:50:17.771580 6181 solver.cpp:406] Test net output #17: loss1/accuracy17 = 0.981
I0612 14:50:17.771591 6181 solver.cpp:406] Test net output #18: loss1/accuracy18 = 0.983
I0612 14:50:17.771603 6181 solver.cpp:406] Test net output #19: loss1/accuracy19 = 0.984
I0612 14:50:17.771615 6181 solver.cpp:406] Test net output #20: loss1/accuracy20 = 0.991
I0612 14:50:17.771626 6181 solver.cpp:406] Test net output #21: loss1/accuracy21 = 0.997
I0612 14:50:17.771638 6181 solver.cpp:406] Test net output #22: loss1/accuracy22 = 0.998
I0612 14:50:17.771651 6181 solver.cpp:406] Test net output #23: loss1/accuracy_incl_empty = 0.845411
I0612 14:50:17.771661 6181 solver.cpp:406] Test net output #24: loss1/accuracy_top3 = 0.798567
I0612 14:50:17.771677 6181 solver.cpp:406] Test net output #25: loss1/cross_entropy_loss = 1.6139 (* 0.3 = 0.484171 loss)
I0612 14:50:17.771692 6181 solver.cpp:406] Test net output #26: loss1/cross_entropy_loss_incl_empty = 0.619286 (* 0.3 = 0.185786 loss)
I0612 14:50:17.771708 6181 solver.cpp:406] Test net output #27: loss1/loss01 = 1.02833 (* 0.0272727 = 0.0280453 loss)
I0612 14:50:17.771720 6181 solver.cpp:406] Test net output #28: loss1/loss02 = 1.33908 (* 0.0272727 = 0.0365205 loss)
I0612 14:50:17.771734 6181 solver.cpp:406] Test net output #29: loss1/loss03 = 1.73826 (* 0.0272727 = 0.0474072 loss)
I0612 14:50:17.771747 6181 solver.cpp:406] Test net output #30: loss1/loss04 = 1.85847 (* 0.0272727 = 0.0506856 loss)
I0612 14:50:17.771761 6181 solver.cpp:406] Test net output #31: loss1/loss05 = 1.77693 (* 0.0272727 = 0.0484619 loss)
I0612 14:50:17.771775 6181 solver.cpp:406] Test net output #32: loss1/loss06 = 1.27717 (* 0.0272727 = 0.034832 loss)
I0612 14:50:17.771788 6181 solver.cpp:406] Test net output #33: loss1/loss07 = 0.865291 (* 0.0272727 = 0.0235988 loss)
I0612 14:50:17.771801 6181 solver.cpp:406] Test net output #34: loss1/loss08 = 0.720738 (* 0.0272727 = 0.0196565 loss)
I0612 14:50:17.771816 6181 solver.cpp:406] Test net output #35: loss1/loss09 = 0.622503 (* 0.0272727 = 0.0169774 loss)
I0612 14:50:17.771829 6181 solver.cpp:406] Test net output #36: loss1/loss10 = 0.545107 (* 0.0272727 = 0.0148666 loss)
I0612 14:50:17.771843 6181 solver.cpp:406] Test net output #37: loss1/loss11 = 0.475284 (* 0.0272727 = 0.0129623 loss)
I0612 14:50:17.771857 6181 solver.cpp:406] Test net output #38: loss1/loss12 = 0.399788 (* 0.0272727 = 0.0109033 loss)
I0612 14:50:17.771889 6181 solver.cpp:406] Test net output #39: loss1/loss13 = 0.35727 (* 0.0272727 = 0.00974374 loss)
I0612 14:50:17.771908 6181 solver.cpp:406] Test net output #40: loss1/loss14 = 0.271699 (* 0.0272727 = 0.00740997 loss)
I0612 14:50:17.771924 6181 solver.cpp:406] Test net output #41: loss1/loss15 = 0.231175 (* 0.0272727 = 0.00630478 loss)
I0612 14:50:17.771936 6181 solver.cpp:406] Test net output #42: loss1/loss16 = 0.189187 (* 0.0272727 = 0.00515964 loss)
I0612 14:50:17.771950 6181 solver.cpp:406] Test net output #43: loss1/loss17 = 0.129896 (* 0.0272727 = 0.00354261 loss)
I0612 14:50:17.771965 6181 solver.cpp:406] Test net output #44: loss1/loss18 = 0.118001 (* 0.0272727 = 0.00321821 loss)
I0612 14:50:17.771978 6181 solver.cpp:406] Test net output #45: loss1/loss19 = 0.105157 (* 0.0272727 = 0.00286792 loss)
I0612 14:50:17.771992 6181 solver.cpp:406] Test net output #46: loss1/loss20 = 0.0580764 (* 0.0272727 = 0.0015839 loss)
I0612 14:50:17.772007 6181 solver.cpp:406] Test net output #47: loss1/loss21 = 0.0213607 (* 0.0272727 = 0.000582564 loss)
I0612 14:50:17.772022 6181 solver.cpp:406] Test net output #48: loss1/loss22 = 0.0164944 (* 0.0272727 = 0.000449848 loss)
I0612 14:50:17.772033 6181 solver.cpp:406] Test net output #49: loss2/accuracy = 0.720619
I0612 14:50:17.772045 6181 solver.cpp:406] Test net output #50: loss2/accuracy01 = 0.856
I0612 14:50:17.772058 6181 solver.cpp:406] Test net output #51: loss2/accuracy02 = 0.821
I0612 14:50:17.772068 6181 solver.cpp:406] Test net output #52: loss2/accuracy03 = 0.739
I0612 14:50:17.772080 6181 solver.cpp:406] Test net output #53: loss2/accuracy04 = 0.649
I0612 14:50:17.772091 6181 solver.cpp:406] Test net output #54: loss2/accuracy05 = 0.647
I0612 14:50:17.772104 6181 solver.cpp:406] Test net output #55: loss2/accuracy06 = 0.733
I0612 14:50:17.772114 6181 solver.cpp:406] Test net output #56: loss2/accuracy07 = 0.802
I0612 14:50:17.772126 6181 solver.cpp:406] Test net output #57: loss2/accuracy08 = 0.829
I0612 14:50:17.772138 6181 solver.cpp:406] Test net output #58: loss2/accuracy09 = 0.863
I0612 14:50:17.772150 6181 solver.cpp:406] Test net output #59: loss2/accuracy10 = 0.876
I0612 14:50:17.772161 6181 solver.cpp:406] Test net output #60: loss2/accuracy11 = 0.897
I0612 14:50:17.772172 6181 solver.cpp:406] Test net output #61: loss2/accuracy12 = 0.91
I0612 14:50:17.772184 6181 solver.cpp:406] Test net output #62: loss2/accuracy13 = 0.917
I0612 14:50:17.772195 6181 solver.cpp:406] Test net output #63: loss2/accuracy14 = 0.939
I0612 14:50:17.772207 6181 solver.cpp:406] Test net output #64: loss2/accuracy15 = 0.951
I0612 14:50:17.772218 6181 solver.cpp:406] Test net output #65: loss2/accuracy16 = 0.966
I0612 14:50:17.772230 6181 solver.cpp:406] Test net output #66: loss2/accuracy17 = 0.981
I0612 14:50:17.772241 6181 solver.cpp:406] Test net output #67: loss2/accuracy18 = 0.983
I0612 14:50:17.772253 6181 solver.cpp:406] Test net output #68: loss2/accuracy19 = 0.984
I0612 14:50:17.772264 6181 solver.cpp:406] Test net output #69: loss2/accuracy20 = 0.991
I0612 14:50:17.772276 6181 solver.cpp:406] Test net output #70: loss2/accuracy21 = 0.997
I0612 14:50:17.772287 6181 solver.cpp:406] Test net output #71: loss2/accuracy22 = 0.998
I0612 14:50:17.772300 6181 solver.cpp:406] Test net output #72: loss2/accuracy_incl_empty = 0.886001
I0612 14:50:17.772310 6181 solver.cpp:406] Test net output #73: loss2/accuracy_top3 = 0.870916
I0612 14:50:17.772325 6181 solver.cpp:406] Test net output #74: loss2/cross_entropy_loss = 1.17583 (* 0.3 = 0.352748 loss)
I0612 14:50:17.772337 6181 solver.cpp:406] Test net output #75: loss2/cross_entropy_loss_incl_empty = 0.486378 (* 0.3 = 0.145913 loss)
I0612 14:50:17.772351 6181 solver.cpp:406] Test net output #76: loss2/loss01 = 0.716366 (* 0.0272727 = 0.0195373 loss)
I0612 14:50:17.772366 6181 solver.cpp:406] Test net output #77: loss2/loss02 = 0.819748 (* 0.0272727 = 0.0223568 loss)
I0612 14:50:17.772393 6181 solver.cpp:406] Test net output #78: loss2/loss03 = 1.07238 (* 0.0272727 = 0.0292467 loss)
I0612 14:50:17.772410 6181 solver.cpp:406] Test net output #79: loss2/loss04 = 1.2792 (* 0.0272727 = 0.0348872 loss)
I0612 14:50:17.772420 6181 solver.cpp:406] Test net output #80: loss2/loss05 = 1.32347 (* 0.0272727 = 0.0360947 loss)
I0612 14:50:17.772434 6181 solver.cpp:406] Test net output #81: loss2/loss06 = 1.00801 (* 0.0272727 = 0.027491 loss)
I0612 14:50:17.772447 6181 solver.cpp:406] Test net output #82: loss2/loss07 = 0.768978 (* 0.0272727 = 0.0209721 loss)
I0612 14:50:17.772461 6181 solver.cpp:406] Test net output #83: loss2/loss08 = 0.655402 (* 0.0272727 = 0.0178746 loss)
I0612 14:50:17.772475 6181 solver.cpp:406] Test net output #84: loss2/loss09 = 0.567552 (* 0.0272727 = 0.0154787 loss)
I0612 14:50:17.772488 6181 solver.cpp:406] Test net output #85: loss2/loss10 = 0.496858 (* 0.0272727 = 0.0135507 loss)
I0612 14:50:17.772502 6181 solver.cpp:406] Test net output #86: loss2/loss11 = 0.439308 (* 0.0272727 = 0.0119811 loss)
I0612 14:50:17.772516 6181 solver.cpp:406] Test net output #87: loss2/loss12 = 0.356541 (* 0.0272727 = 0.00972383 loss)
I0612 14:50:17.772541 6181 solver.cpp:406] Test net output #88: loss2/loss13 = 0.33254 (* 0.0272727 = 0.00906928 loss)
I0612 14:50:17.772567 6181 solver.cpp:406] Test net output #89: loss2/loss14 = 0.257567 (* 0.0272727 = 0.00702455 loss)
I0612 14:50:17.772584 6181 solver.cpp:406] Test net output #90: loss2/loss15 = 0.22671 (* 0.0272727 = 0.006183 loss)
I0612 14:50:17.772598 6181 solver.cpp:406] Test net output #91: loss2/loss16 = 0.188465 (* 0.0272727 = 0.00513996 loss)
I0612 14:50:17.772613 6181 solver.cpp:406] Test net output #92: loss2/loss17 = 0.133975 (* 0.0272727 = 0.00365386 loss)
I0612 14:50:17.772627 6181 solver.cpp:406] Test net output #93: loss2/loss18 = 0.120179 (* 0.0272727 = 0.00327762 loss)
I0612 14:50:17.772640 6181 solver.cpp:406] Test net output #94: loss2/loss19 = 0.118485 (* 0.0272727 = 0.00323141 loss)
I0612 14:50:17.772655 6181 solver.cpp:406] Test net output #95: loss2/loss20 = 0.059882 (* 0.0272727 = 0.00163315 loss)
I0612 14:50:17.772668 6181 solver.cpp:406] Test net output #96: loss2/loss21 = 0.0235672 (* 0.0272727 = 0.000642742 loss)
I0612 14:50:17.772682 6181 solver.cpp:406] Test net output #97: loss2/loss22 = 0.0181145 (* 0.0272727 = 0.000494032 loss)
I0612 14:50:17.772694 6181 solver.cpp:406] Test net output #98: loss3/accuracy = 0.813624
I0612 14:50:17.772706 6181 solver.cpp:406] Test net output #99: loss3/accuracy01 = 0.875
I0612 14:50:17.772718 6181 solver.cpp:406] Test net output #100: loss3/accuracy02 = 0.869
I0612 14:50:17.772729 6181 solver.cpp:406] Test net output #101: loss3/accuracy03 = 0.834
I0612 14:50:17.772742 6181 solver.cpp:406] Test net output #102: loss3/accuracy04 = 0.817
I0612 14:50:17.772753 6181 solver.cpp:406] Test net output #103: loss3/accuracy05 = 0.8
I0612 14:50:17.772764 6181 solver.cpp:406] Test net output #104: loss3/accuracy06 = 0.845
I0612 14:50:17.772775 6181 solver.cpp:406] Test net output #105: loss3/accuracy07 = 0.88
I0612 14:50:17.772786 6181 solver.cpp:406] Test net output #106: loss3/accuracy08 = 0.888
I0612 14:50:17.772799 6181 solver.cpp:406] Test net output #107: loss3/accuracy09 = 0.903
I0612 14:50:17.772809 6181 solver.cpp:406] Test net output #108: loss3/accuracy10 = 0.906
I0612 14:50:17.772821 6181 solver.cpp:406] Test net output #109: loss3/accuracy11 = 0.913
I0612 14:50:17.772832 6181 solver.cpp:406] Test net output #110: loss3/accuracy12 = 0.933
I0612 14:50:17.772845 6181 solver.cpp:406] Test net output #111: loss3/accuracy13 = 0.938
I0612 14:50:17.772855 6181 solver.cpp:406] Test net output #112: loss3/accuracy14 = 0.948
I0612 14:50:17.772866 6181 solver.cpp:406] Test net output #113: loss3/accuracy15 = 0.962
I0612 14:50:17.772877 6181 solver.cpp:406] Test net output #114: loss3/accuracy16 = 0.972
I0612 14:50:17.772900 6181 solver.cpp:406] Test net output #115: loss3/accuracy17 = 0.983
I0612 14:50:17.772913 6181 solver.cpp:406] Test net output #116: loss3/accuracy18 = 0.988
I0612 14:50:17.772925 6181 solver.cpp:406] Test net output #117: loss3/accuracy19 = 0.986
I0612 14:50:17.772936 6181 solver.cpp:406] Test net output #118: loss3/accuracy20 = 0.993
I0612 14:50:17.772950 6181 solver.cpp:406] Test net output #119: loss3/accuracy21 = 0.997
I0612 14:50:17.772964 6181 solver.cpp:406] Test net output #120: loss3/accuracy22 = 0.998
I0612 14:50:17.772975 6181 solver.cpp:406] Test net output #121: loss3/accuracy_incl_empty = 0.920091
I0612 14:50:17.772987 6181 solver.cpp:406] Test net output #122: loss3/accuracy_top3 = 0.908872
I0612 14:50:17.773001 6181 solver.cpp:406] Test net output #123: loss3/cross_entropy_loss = 0.840179 (* 1 = 0.840179 loss)
I0612 14:50:17.773015 6181 solver.cpp:406] Test net output #124: loss3/cross_entropy_loss_incl_empty = 0.356115 (* 1 = 0.356115 loss)
I0612 14:50:17.773030 6181 solver.cpp:406] Test net output #125: loss3/loss01 = 0.61886 (* 0.0909091 = 0.05626 loss)
I0612 14:50:17.773043 6181 solver.cpp:406] Test net output #126: loss3/loss02 = 0.649985 (* 0.0909091 = 0.0590895 loss)
I0612 14:50:17.773057 6181 solver.cpp:406] Test net output #127: loss3/loss03 = 0.781287 (* 0.0909091 = 0.0710261 loss)
I0612 14:50:17.773071 6181 solver.cpp:406] Test net output #128: loss3/loss04 = 0.807413 (* 0.0909091 = 0.0734012 loss)
I0612 14:50:17.773084 6181 solver.cpp:406] Test net output #129: loss3/loss05 = 0.918247 (* 0.0909091 = 0.083477 loss)
I0612 14:50:17.773097 6181 solver.cpp:406] Test net output #130: loss3/loss06 = 0.688939 (* 0.0909091 = 0.0626308 loss)
I0612 14:50:17.773111 6181 solver.cpp:406] Test net output #131: loss3/loss07 = 0.528585 (* 0.0909091 = 0.0480532 loss)
I0612 14:50:17.773129 6181 solver.cpp:406] Test net output #132: loss3/loss08 = 0.465387 (* 0.0909091 = 0.042308 loss)
I0612 14:50:17.773144 6181 solver.cpp:406] Test net output #133: loss3/loss09 = 0.412274 (* 0.0909091 = 0.0374795 loss)
I0612 14:50:17.773157 6181 solver.cpp:406] Test net output #134: loss3/loss10 = 0.388283 (* 0.0909091 = 0.0352984 loss)
I0612 14:50:17.773170 6181 solver.cpp:406] Test net output #135: loss3/loss11 = 0.348303 (* 0.0909091 = 0.0316639 loss)
I0612 14:50:17.773185 6181 solver.cpp:406] Test net output #136: loss3/loss12 = 0.286135 (* 0.0909091 = 0.0260123 loss)
I0612 14:50:17.773197 6181 solver.cpp:406] Test net output #137: loss3/loss13 = 0.248734 (* 0.0909091 = 0.0226122 loss)
I0612 14:50:17.773211 6181 solver.cpp:406] Test net output #138: loss3/loss14 = 0.206961 (* 0.0909091 = 0.0188146 loss)
I0612 14:50:17.773226 6181 solver.cpp:406] Test net output #139: loss3/loss15 = 0.181133 (* 0.0909091 = 0.0164666 loss)
I0612 14:50:17.773239 6181 solver.cpp:406] Test net output #140: loss3/loss16 = 0.149935 (* 0.0909091 = 0.0136304 loss)
I0612 14:50:17.773252 6181 solver.cpp:406] Test net output #141: loss3/loss17 = 0.112422 (* 0.0909091 = 0.0102202 loss)
I0612 14:50:17.773267 6181 solver.cpp:406] Test net output #142: loss3/loss18 = 0.0971579 (* 0.0909091 = 0.00883253 loss)
I0612 14:50:17.773280 6181 solver.cpp:406] Test net output #143: loss3/loss19 = 0.0965335 (* 0.0909091 = 0.00877577 loss)
I0612 14:50:17.773294 6181 solver.cpp:406] Test net output #144: loss3/loss20 = 0.0457746 (* 0.0909091 = 0.00416132 loss)
I0612 14:50:17.773308 6181 solver.cpp:406] Test net output #145: loss3/loss21 = 0.0177006 (* 0.0909091 = 0.00160914 loss)
I0612 14:50:17.773335 6181 solver.cpp:406] Test net output #146: loss3/loss22 = 0.0133515 (* 0.0909091 = 0.00121377 loss)
I0612 14:50:17.773350 6181 solver.cpp:406] Test net output #147: total_accuracy = 0.53
I0612 14:50:17.773362 6181 solver.cpp:406] Test net output #148: total_accuracy_not_rec = 0.488
I0612 14:50:17.773375 6181 solver.cpp:406] Test net output #149: total_confidence = 0.497304
I0612 14:50:17.773397 6181 solver.cpp:406] Test net output #150: total_confidence_not_rec = 0.457334
I0612 14:50:18.257367 6181 solver.cpp:229] Iteration 0, loss = 3.70895
I0612 14:50:18.257418 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.510204
I0612 14:50:18.257436 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 14:50:18.257450 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.375
I0612 14:50:18.257462 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.125
I0612 14:50:18.257474 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.375
I0612 14:50:18.257486 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.25
I0612 14:50:18.257499 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.5
I0612 14:50:18.257511 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 14:50:18.257524 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 14:50:18.257536 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 1
I0612 14:50:18.257550 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 1
I0612 14:50:18.257561 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 1
I0612 14:50:18.257572 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 14:50:18.257583 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 14:50:18.257596 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 14:50:18.257607 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 14:50:18.257618 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 14:50:18.257630 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 14:50:18.257642 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 14:50:18.257653 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 14:50:18.257664 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 14:50:18.257676 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 14:50:18.257688 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 14:50:18.257699 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.857955
I0612 14:50:18.257714 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.795918
I0612 14:50:18.257732 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.61438 (* 0.3 = 0.484314 loss)
I0612 14:50:18.257748 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.470885 (* 0.3 = 0.141265 loss)
I0612 14:50:18.257763 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.91043 (* 0.0272727 = 0.0248299 loss)
I0612 14:50:18.257777 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 2.03182 (* 0.0272727 = 0.0554134 loss)
I0612 14:50:18.257792 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.37584 (* 0.0272727 = 0.0647956 loss)
I0612 14:50:18.257807 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.91855 (* 0.0272727 = 0.052324 loss)
I0612 14:50:18.257820 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.69094 (* 0.0272727 = 0.0461166 loss)
I0612 14:50:18.257834 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.7272 (* 0.0272727 = 0.0471054 loss)
I0612 14:50:18.257848 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.03377 (* 0.0272727 = 0.0281939 loss)
I0612 14:50:18.257861 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.21682 (* 0.0272727 = 0.033186 loss)
I0612 14:50:18.257875 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.0483508 (* 0.0272727 = 0.00131866 loss)
I0612 14:50:18.257890 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.00881474 (* 0.0272727 = 0.000240402 loss)
I0612 14:50:18.257905 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.000630973 (* 0.0272727 = 1.72083e-05 loss)
I0612 14:50:18.257946 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.000101779 (* 0.0272727 = 2.77578e-06 loss)
I0612 14:50:18.257961 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 3.45867e-05 (* 0.0272727 = 9.43274e-07 loss)
I0612 14:50:18.257977 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 1.34709e-05 (* 0.0272727 = 3.67389e-07 loss)
I0612 14:50:18.257990 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 5.3489e-05 (* 0.0272727 = 1.45879e-06 loss)
I0612 14:50:18.258004 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 3.35684e-05 (* 0.0272727 = 9.15502e-07 loss)
I0612 14:50:18.258018 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 7.21788e-05 (* 0.0272727 = 1.96851e-06 loss)
I0612 14:50:18.258031 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000199045 (* 0.0272727 = 5.4285e-06 loss)
I0612 14:50:18.258045 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000142394 (* 0.0272727 = 3.88348e-06 loss)
I0612 14:50:18.258060 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00011169 (* 0.0272727 = 3.04608e-06 loss)
I0612 14:50:18.258074 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000215703 (* 0.0272727 = 5.8828e-06 loss)
I0612 14:50:18.258088 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 6.50133e-05 (* 0.0272727 = 1.77309e-06 loss)
I0612 14:50:18.258100 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.693878
I0612 14:50:18.258113 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 14:50:18.258129 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 14:50:18.258141 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.625
I0612 14:50:18.258153 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.5
I0612 14:50:18.258165 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.625
I0612 14:50:18.258177 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.625
I0612 14:50:18.258189 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 14:50:18.258201 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 14:50:18.258213 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 1
I0612 14:50:18.258224 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 1
I0612 14:50:18.258235 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 1
I0612 14:50:18.258246 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 14:50:18.258258 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 14:50:18.258270 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 14:50:18.258281 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 14:50:18.258292 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 14:50:18.258304 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 14:50:18.258316 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 14:50:18.258327 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 14:50:18.258338 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 14:50:18.258349 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 14:50:18.258361 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 14:50:18.258373 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.903409
I0612 14:50:18.258386 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.877551
I0612 14:50:18.258396 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.30027 (* 0.3 = 0.390082 loss)
I0612 14:50:18.258411 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.379204 (* 0.3 = 0.113761 loss)
I0612 14:50:18.258437 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.614588 (* 0.0272727 = 0.0167615 loss)
I0612 14:50:18.258452 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.13148 (* 0.0272727 = 0.0308585 loss)
I0612 14:50:18.258466 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.32182 (* 0.0272727 = 0.0360496 loss)
I0612 14:50:18.258481 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.84798 (* 0.0272727 = 0.0503994 loss)
I0612 14:50:18.258494 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.10038 (* 0.0272727 = 0.0300104 loss)
I0612 14:50:18.258508 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.0877 (* 0.0272727 = 0.0296645 loss)
I0612 14:50:18.258522 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.39086 (* 0.0272727 = 0.0379325 loss)
I0612 14:50:18.258536 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 1.83131 (* 0.0272727 = 0.0499449 loss)
I0612 14:50:18.258550 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.00336621 (* 0.0272727 = 9.18057e-05 loss)
I0612 14:50:18.258564 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.000117593 (* 0.0272727 = 3.20708e-06 loss)
I0612 14:50:18.258579 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 4.17183e-05 (* 0.0272727 = 1.13777e-06 loss)
I0612 14:50:18.258592 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 1.20999e-05 (* 0.0272727 = 3.29998e-07 loss)
I0612 14:50:18.258606 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 4.06804e-06 (* 0.0272727 = 1.10946e-07 loss)
I0612 14:50:18.258620 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 4.12766e-06 (* 0.0272727 = 1.12572e-07 loss)
I0612 14:50:18.258635 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 6.13938e-06 (* 0.0272727 = 1.67438e-07 loss)
I0612 14:50:18.258648 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 2.47361e-06 (* 0.0272727 = 6.7462e-08 loss)
I0612 14:50:18.258662 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 5.94568e-06 (* 0.0272727 = 1.62155e-07 loss)
I0612 14:50:18.258677 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 4.29158e-06 (* 0.0272727 = 1.17043e-07 loss)
I0612 14:50:18.258690 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 1.77325e-06 (* 0.0272727 = 4.83613e-08 loss)
I0612 14:50:18.258704 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 1.33074e-05 (* 0.0272727 = 3.6293e-07 loss)
I0612 14:50:18.258718 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 4.82806e-06 (* 0.0272727 = 1.31674e-07 loss)
I0612 14:50:18.258733 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 5.60295e-06 (* 0.0272727 = 1.52808e-07 loss)
I0612 14:50:18.258744 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.795918
I0612 14:50:18.258755 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 14:50:18.258769 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.75
I0612 14:50:18.258782 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 14:50:18.258795 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.75
I0612 14:50:18.258805 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 14:50:18.258817 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 14:50:18.258828 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 14:50:18.258841 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 14:50:18.258852 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 1
I0612 14:50:18.258868 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 14:50:18.258891 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 14:50:18.258911 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 14:50:18.258924 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 14:50:18.258947 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 14:50:18.258960 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 14:50:18.258971 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 14:50:18.258983 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 14:50:18.258994 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 14:50:18.259006 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 14:50:18.259018 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 14:50:18.259029 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 14:50:18.259042 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 14:50:18.259053 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.943182
I0612 14:50:18.259065 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.857143
I0612 14:50:18.259079 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.971063 (* 1 = 0.971063 loss)
I0612 14:50:18.259093 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.27544 (* 1 = 0.27544 loss)
I0612 14:50:18.259109 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.250536 (* 0.0909091 = 0.022776 loss)
I0612 14:50:18.259125 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.913931 (* 0.0909091 = 0.0830847 loss)
I0612 14:50:18.259155 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.977636 (* 0.0909091 = 0.088876 loss)
I0612 14:50:18.259179 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.2124 (* 0.0909091 = 0.110218 loss)
I0612 14:50:18.259194 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.75597 (* 0.0909091 = 0.0687246 loss)
I0612 14:50:18.259208 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.65929 (* 0.0909091 = 0.0599355 loss)
I0612 14:50:18.259222 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.08972 (* 0.0909091 = 0.0990653 loss)
I0612 14:50:18.259237 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 1.8124 (* 0.0909091 = 0.164763 loss)
I0612 14:50:18.259250 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.00244996 (* 0.0909091 = 0.000222723 loss)
I0612 14:50:18.259265 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.000466268 (* 0.0909091 = 4.2388e-05 loss)
I0612 14:50:18.259279 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.000109443 (* 0.0909091 = 9.94932e-06 loss)
I0612 14:50:18.259294 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 6.01226e-05 (* 0.0909091 = 5.46569e-06 loss)
I0612 14:50:18.259307 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 3.21136e-05 (* 0.0909091 = 2.91942e-06 loss)
I0612 14:50:18.259321 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 1.24427e-05 (* 0.0909091 = 1.13116e-06 loss)
I0612 14:50:18.259335 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 5.37935e-06 (* 0.0909091 = 4.89032e-07 loss)
I0612 14:50:18.259349 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 4.35116e-06 (* 0.0909091 = 3.9556e-07 loss)
I0612 14:50:18.259364 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 7.22713e-06 (* 0.0909091 = 6.57012e-07 loss)
I0612 14:50:18.259378 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 7.80827e-06 (* 0.0909091 = 7.09843e-07 loss)
I0612 14:50:18.259392 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 7.13772e-06 (* 0.0909091 = 6.48883e-07 loss)
I0612 14:50:18.259407 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 8.34474e-06 (* 0.0909091 = 7.58613e-07 loss)
I0612 14:50:18.259420 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 1.39179e-05 (* 0.0909091 = 1.26527e-06 loss)
I0612 14:50:18.259434 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 1.31579e-05 (* 0.0909091 = 1.19618e-06 loss)
I0612 14:50:18.259459 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.625
I0612 14:50:18.259472 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.375
I0612 14:50:18.259485 6181 solver.cpp:245] Train net output #149: total_confidence = 0.474049
I0612 14:50:18.259495 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.449427
I0612 14:50:18.259516 6181 sgd_solver.cpp:106] Iteration 0, lr = 0.001
I0612 14:50:32.973526 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.4476 > 30) by scale factor 0.780282
I0612 14:51:49.825567 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 39.6626 > 30) by scale factor 0.75638
I0612 14:52:23.229444 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 41.385 > 30) by scale factor 0.7249
I0612 14:54:42.035481 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.6393 > 30) by scale factor 0.776411
I0612 14:56:53.056725 6181 solver.cpp:229] Iteration 500, loss = 4.57109
I0612 14:56:53.056845 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.229508
I0612 14:56:53.056867 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.25
I0612 14:56:53.056880 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.5
I0612 14:56:53.056893 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.375
I0612 14:56:53.056905 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.375
I0612 14:56:53.056916 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.25
I0612 14:56:53.056929 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.625
I0612 14:56:53.056942 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 14:56:53.056954 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 14:56:53.056967 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.625
I0612 14:56:53.056978 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 14:56:53.056990 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.75
I0612 14:56:53.057003 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.625
I0612 14:56:53.057014 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 14:56:53.057026 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 14:56:53.057039 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 14:56:53.057050 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 14:56:53.057061 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 14:56:53.057073 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 14:56:53.057085 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 14:56:53.057096 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 14:56:53.057108 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 14:56:53.057121 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 14:56:53.057132 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.704545
I0612 14:56:53.057144 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.47541
I0612 14:56:53.057160 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.5409 (* 0.3 = 0.76227 loss)
I0612 14:56:53.057175 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 1.09832 (* 0.3 = 0.329496 loss)
I0612 14:56:53.057189 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 2.31463 (* 0.0272727 = 0.0631262 loss)
I0612 14:56:53.057204 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.8994 (* 0.0272727 = 0.0518018 loss)
I0612 14:56:53.057219 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.68004 (* 0.0272727 = 0.0730921 loss)
I0612 14:56:53.057235 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.53122 (* 0.0272727 = 0.0690333 loss)
I0612 14:56:53.057250 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 2.40306 (* 0.0272727 = 0.0655379 loss)
I0612 14:56:53.057265 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.16453 (* 0.0272727 = 0.0590325 loss)
I0612 14:56:53.057278 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.14665 (* 0.0272727 = 0.0312723 loss)
I0612 14:56:53.057292 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.769102 (* 0.0272727 = 0.0209755 loss)
I0612 14:56:53.057307 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 1.15887 (* 0.0272727 = 0.0316056 loss)
I0612 14:56:53.057332 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 1.00142 (* 0.0272727 = 0.0273116 loss)
I0612 14:56:53.057350 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 1.43449 (* 0.0272727 = 0.0391224 loss)
I0612 14:56:53.057364 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 1.25697 (* 0.0272727 = 0.0342811 loss)
I0612 14:56:53.057397 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.7081 (* 0.0272727 = 0.0193118 loss)
I0612 14:56:53.057412 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.409831 (* 0.0272727 = 0.0111772 loss)
I0612 14:56:53.057426 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.579898 (* 0.0272727 = 0.0158154 loss)
I0612 14:56:53.057441 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.318686 (* 0.0272727 = 0.00869143 loss)
I0612 14:56:53.057456 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.143555 (* 0.0272727 = 0.00391514 loss)
I0612 14:56:53.057469 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.122646 (* 0.0272727 = 0.00334488 loss)
I0612 14:56:53.057483 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.10867 (* 0.0272727 = 0.00296374 loss)
I0612 14:56:53.057498 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.11029 (* 0.0272727 = 0.0030079 loss)
I0612 14:56:53.057512 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.101986 (* 0.0272727 = 0.00278143 loss)
I0612 14:56:53.057526 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.100556 (* 0.0272727 = 0.00274242 loss)
I0612 14:56:53.057538 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.459016
I0612 14:56:53.057551 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 14:56:53.057564 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.5
I0612 14:56:53.057575 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.5
I0612 14:56:53.057587 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.5
I0612 14:56:53.057600 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 14:56:53.057611 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.625
I0612 14:56:53.057623 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 14:56:53.057634 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 14:56:53.057646 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 14:56:53.057658 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.625
I0612 14:56:53.057669 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.625
I0612 14:56:53.057682 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.625
I0612 14:56:53.057693 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 14:56:53.057705 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 14:56:53.057718 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 14:56:53.057729 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 14:56:53.057740 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 14:56:53.057752 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 14:56:53.057765 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 14:56:53.057775 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 14:56:53.057787 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 14:56:53.057798 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 14:56:53.057811 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.761364
I0612 14:56:53.057822 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.688525
I0612 14:56:53.057837 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.98547 (* 0.3 = 0.59564 loss)
I0612 14:56:53.057850 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.908645 (* 0.3 = 0.272593 loss)
I0612 14:56:53.057868 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.32659 (* 0.0272727 = 0.0361798 loss)
I0612 14:56:53.057883 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.86683 (* 0.0272727 = 0.0509135 loss)
I0612 14:56:53.057909 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.65537 (* 0.0272727 = 0.0451464 loss)
I0612 14:56:53.057924 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.70937 (* 0.0272727 = 0.0466191 loss)
I0612 14:56:53.057940 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.6447 (* 0.0272727 = 0.0448556 loss)
I0612 14:56:53.057953 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.22629 (* 0.0272727 = 0.0334443 loss)
I0612 14:56:53.057967 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.08971 (* 0.0272727 = 0.0297195 loss)
I0612 14:56:53.057977 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 1.09829 (* 0.0272727 = 0.0299532 loss)
I0612 14:56:53.057987 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.827486 (* 0.0272727 = 0.0225678 loss)
I0612 14:56:53.058001 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 1.09533 (* 0.0272727 = 0.0298726 loss)
I0612 14:56:53.058015 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 1.52025 (* 0.0272727 = 0.0414613 loss)
I0612 14:56:53.058029 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 1.38591 (* 0.0272727 = 0.0377975 loss)
I0612 14:56:53.058043 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.694017 (* 0.0272727 = 0.0189277 loss)
I0612 14:56:53.058056 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.39376 (* 0.0272727 = 0.0107389 loss)
I0612 14:56:53.058070 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.314162 (* 0.0272727 = 0.00856806 loss)
I0612 14:56:53.058084 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.153577 (* 0.0272727 = 0.00418845 loss)
I0612 14:56:53.058099 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.046419 (* 0.0272727 = 0.00126597 loss)
I0612 14:56:53.058112 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.0377787 (* 0.0272727 = 0.00103033 loss)
I0612 14:56:53.058126 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.0257 (* 0.0272727 = 0.00070091 loss)
I0612 14:56:53.058140 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.0319632 (* 0.0272727 = 0.000871724 loss)
I0612 14:56:53.058154 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.0330371 (* 0.0272727 = 0.000901011 loss)
I0612 14:56:53.058169 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.0145214 (* 0.0272727 = 0.000396039 loss)
I0612 14:56:53.058182 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.672131
I0612 14:56:53.058193 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.75
I0612 14:56:53.058205 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 14:56:53.058217 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 14:56:53.058228 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 14:56:53.058239 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 14:56:53.058251 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.625
I0612 14:56:53.058264 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 14:56:53.058277 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 14:56:53.058290 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 14:56:53.058301 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.625
I0612 14:56:53.058312 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.75
I0612 14:56:53.058325 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.625
I0612 14:56:53.058336 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 14:56:53.058347 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.875
I0612 14:56:53.058359 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 14:56:53.058370 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 14:56:53.058392 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 14:56:53.058404 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 14:56:53.058415 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 14:56:53.058428 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 14:56:53.058439 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 14:56:53.058449 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 14:56:53.058461 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.846591
I0612 14:56:53.058473 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.836066
I0612 14:56:53.058487 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.39613 (* 1 = 1.39613 loss)
I0612 14:56:53.058501 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.674735 (* 1 = 0.674735 loss)
I0612 14:56:53.058516 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 1.15903 (* 0.0909091 = 0.105367 loss)
I0612 14:56:53.058529 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.929295 (* 0.0909091 = 0.0844814 loss)
I0612 14:56:53.058543 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.758092 (* 0.0909091 = 0.0689174 loss)
I0612 14:56:53.058557 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.885346 (* 0.0909091 = 0.080486 loss)
I0612 14:56:53.058571 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.01858 (* 0.0909091 = 0.0925985 loss)
I0612 14:56:53.058584 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.901627 (* 0.0909091 = 0.0819661 loss)
I0612 14:56:53.058599 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.686476 (* 0.0909091 = 0.0624069 loss)
I0612 14:56:53.058614 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.876767 (* 0.0909091 = 0.0797061 loss)
I0612 14:56:53.058627 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.799787 (* 0.0909091 = 0.072708 loss)
I0612 14:56:53.058641 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 1.1384 (* 0.0909091 = 0.103491 loss)
I0612 14:56:53.058655 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 1.51713 (* 0.0909091 = 0.137921 loss)
I0612 14:56:53.058670 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 1.43275 (* 0.0909091 = 0.13025 loss)
I0612 14:56:53.058682 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.768177 (* 0.0909091 = 0.0698343 loss)
I0612 14:56:53.058696 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.59866 (* 0.0909091 = 0.0544236 loss)
I0612 14:56:53.058711 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.413551 (* 0.0909091 = 0.0375955 loss)
I0612 14:56:53.058723 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.162959 (* 0.0909091 = 0.0148145 loss)
I0612 14:56:53.058737 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.103232 (* 0.0909091 = 0.00938477 loss)
I0612 14:56:53.058753 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.11875 (* 0.0909091 = 0.0107955 loss)
I0612 14:56:53.058766 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.0787526 (* 0.0909091 = 0.00715933 loss)
I0612 14:56:53.058780 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.118086 (* 0.0909091 = 0.0107351 loss)
I0612 14:56:53.058794 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.113354 (* 0.0909091 = 0.0103049 loss)
I0612 14:56:53.058809 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.0783891 (* 0.0909091 = 0.00712629 loss)
I0612 14:56:53.058820 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.125
I0612 14:56:53.058832 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.125
I0612 14:56:53.058845 6181 solver.cpp:245] Train net output #149: total_confidence = 0.199184
I0612 14:56:53.058864 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.16984
I0612 14:56:53.058878 6181 sgd_solver.cpp:106] Iteration 500, lr = 0.001
I0612 14:57:45.363608 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 40.015 > 30) by scale factor 0.749719
I0612 14:59:51.718389 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 41.243 > 30) by scale factor 0.727396
I0612 15:02:18.152859 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.4719 > 30) by scale factor 0.984513
I0612 15:02:27.566977 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.3714 > 30) by scale factor 0.926744
I0612 15:03:25.082783 6181 solver.cpp:229] Iteration 1000, loss = 4.46925
I0612 15:03:25.082867 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.442857
I0612 15:03:25.082887 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 15:03:25.082901 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 1
I0612 15:03:25.082913 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.375
I0612 15:03:25.082926 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 15:03:25.082937 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.625
I0612 15:03:25.082949 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.375
I0612 15:03:25.082962 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 15:03:25.082974 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.75
I0612 15:03:25.082986 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.75
I0612 15:03:25.082998 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.625
I0612 15:03:25.083010 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.625
I0612 15:03:25.083022 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.625
I0612 15:03:25.083034 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.625
I0612 15:03:25.083047 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 15:03:25.083060 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 15:03:25.083071 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 15:03:25.083083 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 15:03:25.083096 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 15:03:25.083107 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 15:03:25.083118 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 15:03:25.083130 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 15:03:25.083142 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 15:03:25.083158 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.778409
I0612 15:03:25.083170 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.728571
I0612 15:03:25.083187 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.71449 (* 0.3 = 0.514346 loss)
I0612 15:03:25.083202 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.717587 (* 0.3 = 0.215276 loss)
I0612 15:03:25.083217 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.733491 (* 0.0272727 = 0.0200043 loss)
I0612 15:03:25.083231 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 0.511005 (* 0.0272727 = 0.0139365 loss)
I0612 15:03:25.083245 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.75948 (* 0.0272727 = 0.0479858 loss)
I0612 15:03:25.083259 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.16382 (* 0.0272727 = 0.0317407 loss)
I0612 15:03:25.083273 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.37197 (* 0.0272727 = 0.0374174 loss)
I0612 15:03:25.083287 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.66075 (* 0.0272727 = 0.0452932 loss)
I0612 15:03:25.083302 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.24033 (* 0.0272727 = 0.0338273 loss)
I0612 15:03:25.083315 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.01088 (* 0.0272727 = 0.0275694 loss)
I0612 15:03:25.083329 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.990588 (* 0.0272727 = 0.027016 loss)
I0612 15:03:25.083343 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 1.39087 (* 0.0272727 = 0.0379329 loss)
I0612 15:03:25.083356 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 1.07813 (* 0.0272727 = 0.0294036 loss)
I0612 15:03:25.083370 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 1.26862 (* 0.0272727 = 0.0345987 loss)
I0612 15:03:25.083400 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 1.37527 (* 0.0272727 = 0.0375073 loss)
I0612 15:03:25.083415 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.601375 (* 0.0272727 = 0.0164011 loss)
I0612 15:03:25.083431 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.341626 (* 0.0272727 = 0.00931707 loss)
I0612 15:03:25.083444 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.174725 (* 0.0272727 = 0.00476523 loss)
I0612 15:03:25.083458 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.0238907 (* 0.0272727 = 0.000651563 loss)
I0612 15:03:25.083472 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.00399693 (* 0.0272727 = 0.000109007 loss)
I0612 15:03:25.083487 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00137555 (* 0.0272727 = 3.75151e-05 loss)
I0612 15:03:25.083501 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00089107 (* 0.0272727 = 2.43019e-05 loss)
I0612 15:03:25.083515 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000430051 (* 0.0272727 = 1.17287e-05 loss)
I0612 15:03:25.083529 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000272536 (* 0.0272727 = 7.4328e-06 loss)
I0612 15:03:25.083542 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.614286
I0612 15:03:25.083554 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 1
I0612 15:03:25.083566 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 1
I0612 15:03:25.083578 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.875
I0612 15:03:25.083590 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.75
I0612 15:03:25.083602 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.625
I0612 15:03:25.083614 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.5
I0612 15:03:25.083626 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 15:03:25.083637 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 15:03:25.083649 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 15:03:25.083662 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.625
I0612 15:03:25.083673 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.625
I0612 15:03:25.083684 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.625
I0612 15:03:25.083696 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.75
I0612 15:03:25.083709 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 15:03:25.083720 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 15:03:25.083732 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 15:03:25.083745 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 15:03:25.083766 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 15:03:25.083791 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 15:03:25.083811 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 15:03:25.083823 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 15:03:25.083834 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 15:03:25.083847 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.835227
I0612 15:03:25.083858 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.785714
I0612 15:03:25.083873 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.40474 (* 0.3 = 0.421423 loss)
I0612 15:03:25.083887 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.586717 (* 0.3 = 0.176015 loss)
I0612 15:03:25.083901 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.178957 (* 0.0272727 = 0.00488066 loss)
I0612 15:03:25.083916 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.205534 (* 0.0272727 = 0.00560547 loss)
I0612 15:03:25.083943 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.821837 (* 0.0272727 = 0.0224137 loss)
I0612 15:03:25.083958 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 0.961742 (* 0.0272727 = 0.0262293 loss)
I0612 15:03:25.083972 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.06285 (* 0.0272727 = 0.0289868 loss)
I0612 15:03:25.083986 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.3973 (* 0.0272727 = 0.0381082 loss)
I0612 15:03:25.084000 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.910118 (* 0.0272727 = 0.0248214 loss)
I0612 15:03:25.084014 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.775956 (* 0.0272727 = 0.0211624 loss)
I0612 15:03:25.084028 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.878747 (* 0.0272727 = 0.0239658 loss)
I0612 15:03:25.084043 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.912472 (* 0.0272727 = 0.0248856 loss)
I0612 15:03:25.084056 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.941507 (* 0.0272727 = 0.0256775 loss)
I0612 15:03:25.084070 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 1.57357 (* 0.0272727 = 0.0429155 loss)
I0612 15:03:25.084084 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.786967 (* 0.0272727 = 0.0214627 loss)
I0612 15:03:25.084098 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.428889 (* 0.0272727 = 0.011697 loss)
I0612 15:03:25.084108 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.371521 (* 0.0272727 = 0.0101324 loss)
I0612 15:03:25.084123 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.128642 (* 0.0272727 = 0.00350841 loss)
I0612 15:03:25.084137 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.0221043 (* 0.0272727 = 0.000602844 loss)
I0612 15:03:25.084151 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.00313041 (* 0.0272727 = 8.53749e-05 loss)
I0612 15:03:25.084167 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00157395 (* 0.0272727 = 4.29258e-05 loss)
I0612 15:03:25.084180 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00071821 (* 0.0272727 = 1.95875e-05 loss)
I0612 15:03:25.084193 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000109839 (* 0.0272727 = 2.99561e-06 loss)
I0612 15:03:25.084210 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 1.87766e-05 (* 0.0272727 = 5.12089e-07 loss)
I0612 15:03:25.084223 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.742857
I0612 15:03:25.084236 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 1
I0612 15:03:25.084249 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 1
I0612 15:03:25.084259 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 15:03:25.084271 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 1
I0612 15:03:25.084282 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 15:03:25.084295 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 1
I0612 15:03:25.084306 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 15:03:25.084317 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 15:03:25.084329 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.75
I0612 15:03:25.084341 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.75
I0612 15:03:25.084352 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.625
I0612 15:03:25.084363 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.75
I0612 15:03:25.084374 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.75
I0612 15:03:25.084386 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 15:03:25.084398 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 15:03:25.084410 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 15:03:25.084431 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 15:03:25.084444 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 15:03:25.084456 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 15:03:25.084467 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 15:03:25.084480 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 15:03:25.084491 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 15:03:25.084501 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.880682
I0612 15:03:25.084513 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.928571
I0612 15:03:25.084527 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.750252 (* 1 = 0.750252 loss)
I0612 15:03:25.084542 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.347068 (* 1 = 0.347068 loss)
I0612 15:03:25.084556 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.0489971 (* 0.0909091 = 0.00445428 loss)
I0612 15:03:25.084570 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.0826722 (* 0.0909091 = 0.00751565 loss)
I0612 15:03:25.084584 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.243225 (* 0.0909091 = 0.0221113 loss)
I0612 15:03:25.084599 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.131859 (* 0.0909091 = 0.0119872 loss)
I0612 15:03:25.084612 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.454878 (* 0.0909091 = 0.0413526 loss)
I0612 15:03:25.084626 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.183835 (* 0.0909091 = 0.0167123 loss)
I0612 15:03:25.084640 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.487568 (* 0.0909091 = 0.0443244 loss)
I0612 15:03:25.084653 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.781002 (* 0.0909091 = 0.0710002 loss)
I0612 15:03:25.084667 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.472543 (* 0.0909091 = 0.0429584 loss)
I0612 15:03:25.084681 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.977864 (* 0.0909091 = 0.0888967 loss)
I0612 15:03:25.084695 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.754309 (* 0.0909091 = 0.0685736 loss)
I0612 15:03:25.084709 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 1.57888 (* 0.0909091 = 0.143535 loss)
I0612 15:03:25.084723 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.797962 (* 0.0909091 = 0.072542 loss)
I0612 15:03:25.084738 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.204278 (* 0.0909091 = 0.0185707 loss)
I0612 15:03:25.084751 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.264643 (* 0.0909091 = 0.0240584 loss)
I0612 15:03:25.084765 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.113527 (* 0.0909091 = 0.0103207 loss)
I0612 15:03:25.084779 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0126609 (* 0.0909091 = 0.00115099 loss)
I0612 15:03:25.084794 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00497748 (* 0.0909091 = 0.000452498 loss)
I0612 15:03:25.084807 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00100097 (* 0.0909091 = 9.09977e-05 loss)
I0612 15:03:25.084821 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000265911 (* 0.0909091 = 2.41738e-05 loss)
I0612 15:03:25.084836 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000242237 (* 0.0909091 = 2.20215e-05 loss)
I0612 15:03:25.084851 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 8.25968e-05 (* 0.0909091 = 7.5088e-06 loss)
I0612 15:03:25.084867 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.625
I0612 15:03:25.084879 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 15:03:25.084900 6181 solver.cpp:245] Train net output #149: total_confidence = 0.3101
I0612 15:03:25.084913 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.312959
I0612 15:03:25.084926 6181 sgd_solver.cpp:106] Iteration 1000, lr = 0.001
I0612 15:03:35.630329 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.9701 > 30) by scale factor 0.938377
I0612 15:03:49.749222 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.6222 > 30) by scale factor 0.892268
I0612 15:05:46.115312 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.5141 > 30) by scale factor 0.983151
I0612 15:05:55.485934 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.5674 > 30) by scale factor 0.777859
I0612 15:05:58.693562 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.1112 > 30) by scale factor 0.830768
I0612 15:06:26.080013 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 41.4342 > 30) by scale factor 0.724039
I0612 15:06:39.358856 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.9777 > 30) by scale factor 0.938153
I0612 15:08:24.789535 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.0277 > 30) by scale factor 0.966879
I0612 15:09:55.702971 6181 solver.cpp:229] Iteration 1500, loss = 4.3029
I0612 15:09:55.703104 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.361702
I0612 15:09:55.703125 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.625
I0612 15:09:55.703137 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.5
I0612 15:09:55.703150 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.125
I0612 15:09:55.703162 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 15:09:55.703174 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.375
I0612 15:09:55.703187 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.375
I0612 15:09:55.703199 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.75
I0612 15:09:55.703212 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 1
I0612 15:09:55.703225 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 15:09:55.703238 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 1
I0612 15:09:55.703251 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 1
I0612 15:09:55.703263 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 15:09:55.703274 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 15:09:55.703286 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 15:09:55.703299 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 15:09:55.703310 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 15:09:55.703322 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 15:09:55.703335 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 15:09:55.703346 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 15:09:55.703358 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 15:09:55.703369 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 15:09:55.703382 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 15:09:55.703393 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.818182
I0612 15:09:55.703407 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.617021
I0612 15:09:55.703423 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.23073 (* 0.3 = 0.669218 loss)
I0612 15:09:55.703436 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.668839 (* 0.3 = 0.200652 loss)
I0612 15:09:55.703451 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.34993 (* 0.0272727 = 0.0368162 loss)
I0612 15:09:55.703465 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.87938 (* 0.0272727 = 0.0512558 loss)
I0612 15:09:55.703480 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.88848 (* 0.0272727 = 0.0787768 loss)
I0612 15:09:55.703493 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.70074 (* 0.0272727 = 0.0463838 loss)
I0612 15:09:55.703507 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.76696 (* 0.0272727 = 0.0481898 loss)
I0612 15:09:55.703521 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.20178 (* 0.0272727 = 0.0600486 loss)
I0612 15:09:55.703536 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 0.897642 (* 0.0272727 = 0.0244811 loss)
I0612 15:09:55.703549 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.440464 (* 0.0272727 = 0.0120127 loss)
I0612 15:09:55.703564 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.513439 (* 0.0272727 = 0.0140029 loss)
I0612 15:09:55.703578 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.0487451 (* 0.0272727 = 0.00132941 loss)
I0612 15:09:55.703593 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.0125719 (* 0.0272727 = 0.000342869 loss)
I0612 15:09:55.703608 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.00945975 (* 0.0272727 = 0.000257993 loss)
I0612 15:09:55.703621 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.00461094 (* 0.0272727 = 0.000125753 loss)
I0612 15:09:55.703655 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.00326995 (* 0.0272727 = 8.91804e-05 loss)
I0612 15:09:55.703670 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.00105992 (* 0.0272727 = 2.89069e-05 loss)
I0612 15:09:55.703685 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.000814587 (* 0.0272727 = 2.2216e-05 loss)
I0612 15:09:55.703699 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00043615 (* 0.0272727 = 1.1895e-05 loss)
I0612 15:09:55.703713 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000460261 (* 0.0272727 = 1.25526e-05 loss)
I0612 15:09:55.703727 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000255355 (* 0.0272727 = 6.96422e-06 loss)
I0612 15:09:55.703742 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000275757 (* 0.0272727 = 7.52063e-06 loss)
I0612 15:09:55.703757 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000191192 (* 0.0272727 = 5.21434e-06 loss)
I0612 15:09:55.703770 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 6.79418e-05 (* 0.0272727 = 1.85296e-06 loss)
I0612 15:09:55.703783 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.425532
I0612 15:09:55.703795 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.25
I0612 15:09:55.703809 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.625
I0612 15:09:55.703820 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.375
I0612 15:09:55.703832 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.5
I0612 15:09:55.703845 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.25
I0612 15:09:55.703856 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.5
I0612 15:09:55.703868 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 1
I0612 15:09:55.703879 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 15:09:55.703891 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 15:09:55.703903 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 1
I0612 15:09:55.703914 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 1
I0612 15:09:55.703927 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 15:09:55.703938 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 15:09:55.703949 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 15:09:55.703961 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 15:09:55.703974 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 15:09:55.703984 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 15:09:55.703995 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 15:09:55.704007 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 15:09:55.704020 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 15:09:55.704031 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 15:09:55.704043 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 15:09:55.704056 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.829545
I0612 15:09:55.704067 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.808511
I0612 15:09:55.704082 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.779 (* 0.3 = 0.533699 loss)
I0612 15:09:55.704097 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.542652 (* 0.3 = 0.162796 loss)
I0612 15:09:55.704111 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.67104 (* 0.0272727 = 0.0455738 loss)
I0612 15:09:55.704129 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.56727 (* 0.0272727 = 0.0427437 loss)
I0612 15:09:55.704151 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 2.02307 (* 0.0272727 = 0.0551745 loss)
I0612 15:09:55.704166 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.7068 (* 0.0272727 = 0.0465491 loss)
I0612 15:09:55.704181 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.74441 (* 0.0272727 = 0.0475747 loss)
I0612 15:09:55.704195 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.14949 (* 0.0272727 = 0.0313498 loss)
I0612 15:09:55.704210 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.206261 (* 0.0272727 = 0.00562529 loss)
I0612 15:09:55.704223 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.316759 (* 0.0272727 = 0.00863887 loss)
I0612 15:09:55.704237 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.413505 (* 0.0272727 = 0.0112774 loss)
I0612 15:09:55.704252 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.0163748 (* 0.0272727 = 0.000446587 loss)
I0612 15:09:55.704265 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.00541959 (* 0.0272727 = 0.000147807 loss)
I0612 15:09:55.704282 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.00177234 (* 0.0272727 = 4.83366e-05 loss)
I0612 15:09:55.704296 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.000527612 (* 0.0272727 = 1.43894e-05 loss)
I0612 15:09:55.704310 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.000339788 (* 0.0272727 = 9.26693e-06 loss)
I0612 15:09:55.704324 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.000169376 (* 0.0272727 = 4.61933e-06 loss)
I0612 15:09:55.704339 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.000171447 (* 0.0272727 = 4.67582e-06 loss)
I0612 15:09:55.704352 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.000159662 (* 0.0272727 = 4.35442e-06 loss)
I0612 15:09:55.704366 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 8.60715e-05 (* 0.0272727 = 2.3474e-06 loss)
I0612 15:09:55.704380 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000124692 (* 0.0272727 = 3.4007e-06 loss)
I0612 15:09:55.704393 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 6.04222e-05 (* 0.0272727 = 1.64788e-06 loss)
I0612 15:09:55.704407 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000124122 (* 0.0272727 = 3.38515e-06 loss)
I0612 15:09:55.704421 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000241195 (* 0.0272727 = 6.57806e-06 loss)
I0612 15:09:55.704433 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.659574
I0612 15:09:55.704445 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 15:09:55.704457 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.625
I0612 15:09:55.704469 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.5
I0612 15:09:55.704481 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.625
I0612 15:09:55.704493 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.625
I0612 15:09:55.704504 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.625
I0612 15:09:55.704516 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 1
I0612 15:09:55.704527 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 15:09:55.704540 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 15:09:55.704551 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 15:09:55.704562 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 15:09:55.704574 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 15:09:55.704586 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 15:09:55.704597 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 15:09:55.704608 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 15:09:55.704630 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 15:09:55.704643 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 15:09:55.704655 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 15:09:55.704666 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 15:09:55.704679 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 15:09:55.704690 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 15:09:55.704701 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 15:09:55.704713 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.886364
I0612 15:09:55.704726 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.808511
I0612 15:09:55.704741 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.44541 (* 1 = 1.44541 loss)
I0612 15:09:55.704754 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.44774 (* 1 = 0.44774 loss)
I0612 15:09:55.704768 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.623847 (* 0.0909091 = 0.0567133 loss)
I0612 15:09:55.704783 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 1.22262 (* 0.0909091 = 0.111147 loss)
I0612 15:09:55.704797 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 2.24502 (* 0.0909091 = 0.204093 loss)
I0612 15:09:55.704810 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.42029 (* 0.0909091 = 0.129117 loss)
I0612 15:09:55.704824 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.56974 (* 0.0909091 = 0.142704 loss)
I0612 15:09:55.704838 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.92829 (* 0.0909091 = 0.08439 loss)
I0612 15:09:55.704852 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.128498 (* 0.0909091 = 0.0116816 loss)
I0612 15:09:55.704866 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.289837 (* 0.0909091 = 0.0263488 loss)
I0612 15:09:55.704880 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.225388 (* 0.0909091 = 0.0204898 loss)
I0612 15:09:55.704895 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.0168886 (* 0.0909091 = 0.00153533 loss)
I0612 15:09:55.704908 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.00381256 (* 0.0909091 = 0.000346596 loss)
I0612 15:09:55.704922 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.00102045 (* 0.0909091 = 9.27682e-05 loss)
I0612 15:09:55.704936 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.000429465 (* 0.0909091 = 3.90423e-05 loss)
I0612 15:09:55.704951 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.000251882 (* 0.0909091 = 2.28984e-05 loss)
I0612 15:09:55.704964 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.000169786 (* 0.0909091 = 1.54351e-05 loss)
I0612 15:09:55.704978 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.000292328 (* 0.0909091 = 2.65752e-05 loss)
I0612 15:09:55.704993 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.000322026 (* 0.0909091 = 2.92751e-05 loss)
I0612 15:09:55.705006 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.000170191 (* 0.0909091 = 1.54719e-05 loss)
I0612 15:09:55.705020 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.000270982 (* 0.0909091 = 2.46347e-05 loss)
I0612 15:09:55.705034 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000250265 (* 0.0909091 = 2.27513e-05 loss)
I0612 15:09:55.705049 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000368045 (* 0.0909091 = 3.34586e-05 loss)
I0612 15:09:55.705063 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000134261 (* 0.0909091 = 1.22055e-05 loss)
I0612 15:09:55.705075 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.25
I0612 15:09:55.705087 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.125
I0612 15:09:55.705108 6181 solver.cpp:245] Train net output #149: total_confidence = 0.124231
I0612 15:09:55.705122 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.128759
I0612 15:09:55.705134 6181 sgd_solver.cpp:106] Iteration 1500, lr = 0.001
I0612 15:11:14.127490 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.7138 > 30) by scale factor 0.917045
I0612 15:11:16.475309 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.8563 > 30) by scale factor 0.836674
I0612 15:11:22.699947 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.2287 > 30) by scale factor 0.851579
I0612 15:11:26.598889 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.4083 > 30) by scale factor 0.897981
I0612 15:11:57.811267 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.0262 > 30) by scale factor 0.999127
I0612 15:12:10.257869 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.1422 > 30) by scale factor 0.933353
I0612 15:12:16.492213 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 41.3068 > 30) by scale factor 0.726272
I0612 15:12:21.942994 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.6624 > 30) by scale factor 0.918488
I0612 15:13:06.311255 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.6387 > 30) by scale factor 0.948205
I0612 15:13:25.824493 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.5653 > 30) by scale factor 0.82045
I0612 15:14:18.018822 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.2221 > 30) by scale factor 0.851737
I0612 15:14:24.238826 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.1344 > 30) by scale factor 0.963565
I0612 15:16:25.306710 6181 solver.cpp:229] Iteration 2000, loss = 4.40169
I0612 15:16:25.306816 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.571429
I0612 15:16:25.306836 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.625
I0612 15:16:25.306849 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.75
I0612 15:16:25.306861 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.375
I0612 15:16:25.306874 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 15:16:25.306886 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.625
I0612 15:16:25.306898 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.625
I0612 15:16:25.306910 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.5
I0612 15:16:25.306921 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 15:16:25.306933 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 15:16:25.306946 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 15:16:25.306962 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 1
I0612 15:16:25.306974 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 15:16:25.306987 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 15:16:25.306998 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 15:16:25.307010 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 15:16:25.307023 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 15:16:25.307034 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 15:16:25.307045 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 15:16:25.307056 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 15:16:25.307068 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 15:16:25.307080 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 15:16:25.307092 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 15:16:25.307104 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.863636
I0612 15:16:25.307117 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.693878
I0612 15:16:25.307134 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.77472 (* 0.3 = 0.532417 loss)
I0612 15:16:25.307152 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.554957 (* 0.3 = 0.166487 loss)
I0612 15:16:25.307166 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.98606 (* 0.0272727 = 0.0268925 loss)
I0612 15:16:25.307181 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.64397 (* 0.0272727 = 0.0448355 loss)
I0612 15:16:25.307195 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.03348 (* 0.0272727 = 0.0554587 loss)
I0612 15:16:25.307209 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.91157 (* 0.0272727 = 0.0521336 loss)
I0612 15:16:25.307224 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 2.15452 (* 0.0272727 = 0.0587597 loss)
I0612 15:16:25.307237 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.29759 (* 0.0272727 = 0.0353889 loss)
I0612 15:16:25.307251 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.13609 (* 0.0272727 = 0.0309843 loss)
I0612 15:16:25.307265 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.407074 (* 0.0272727 = 0.011102 loss)
I0612 15:16:25.307279 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.740269 (* 0.0272727 = 0.0201892 loss)
I0612 15:16:25.307294 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 1.06088 (* 0.0272727 = 0.028933 loss)
I0612 15:16:25.307308 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.0324827 (* 0.0272727 = 0.00088589 loss)
I0612 15:16:25.307322 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.0186239 (* 0.0272727 = 0.000507923 loss)
I0612 15:16:25.307337 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.0148393 (* 0.0272727 = 0.000404708 loss)
I0612 15:16:25.307368 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.010648 (* 0.0272727 = 0.000290399 loss)
I0612 15:16:25.307384 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.00943751 (* 0.0272727 = 0.000257387 loss)
I0612 15:16:25.307399 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.00481714 (* 0.0272727 = 0.000131377 loss)
I0612 15:16:25.307411 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00644829 (* 0.0272727 = 0.000175862 loss)
I0612 15:16:25.307425 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.00313959 (* 0.0272727 = 8.56253e-05 loss)
I0612 15:16:25.307440 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00481295 (* 0.0272727 = 0.000131262 loss)
I0612 15:16:25.307453 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00434534 (* 0.0272727 = 0.000118509 loss)
I0612 15:16:25.307467 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00364369 (* 0.0272727 = 9.93733e-05 loss)
I0612 15:16:25.307482 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00276496 (* 0.0272727 = 7.54079e-05 loss)
I0612 15:16:25.307493 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.653061
I0612 15:16:25.307505 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 15:16:25.307518 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 15:16:25.307529 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.875
I0612 15:16:25.307540 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.25
I0612 15:16:25.307552 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.75
I0612 15:16:25.307564 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.625
I0612 15:16:25.307575 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 15:16:25.307587 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 1
I0612 15:16:25.307598 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 15:16:25.307610 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 15:16:25.307621 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 1
I0612 15:16:25.307633 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 15:16:25.307644 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 15:16:25.307656 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 15:16:25.307667 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 15:16:25.307678 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 15:16:25.307689 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 15:16:25.307701 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 15:16:25.307713 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 15:16:25.307724 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 15:16:25.307735 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 15:16:25.307746 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 15:16:25.307757 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.892045
I0612 15:16:25.307770 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.795918
I0612 15:16:25.307783 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.37697 (* 0.3 = 0.41309 loss)
I0612 15:16:25.307798 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.445622 (* 0.3 = 0.133687 loss)
I0612 15:16:25.307813 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.715027 (* 0.0272727 = 0.0195007 loss)
I0612 15:16:25.307827 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.54179 (* 0.0272727 = 0.0420487 loss)
I0612 15:16:25.307852 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.809383 (* 0.0272727 = 0.0220741 loss)
I0612 15:16:25.307868 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.62707 (* 0.0272727 = 0.0443747 loss)
I0612 15:16:25.307881 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.83018 (* 0.0272727 = 0.049914 loss)
I0612 15:16:25.307894 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.94938 (* 0.0272727 = 0.0531649 loss)
I0612 15:16:25.307909 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.956764 (* 0.0272727 = 0.0260936 loss)
I0612 15:16:25.307922 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.108282 (* 0.0272727 = 0.00295314 loss)
I0612 15:16:25.307936 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.184318 (* 0.0272727 = 0.00502686 loss)
I0612 15:16:25.307950 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.336467 (* 0.0272727 = 0.00917638 loss)
I0612 15:16:25.307965 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.0526262 (* 0.0272727 = 0.00143526 loss)
I0612 15:16:25.307978 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.00665196 (* 0.0272727 = 0.000181417 loss)
I0612 15:16:25.307992 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.00245609 (* 0.0272727 = 6.69843e-05 loss)
I0612 15:16:25.308010 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.000756311 (* 0.0272727 = 2.06267e-05 loss)
I0612 15:16:25.308025 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.000666315 (* 0.0272727 = 1.81722e-05 loss)
I0612 15:16:25.308039 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.000638208 (* 0.0272727 = 1.74057e-05 loss)
I0612 15:16:25.308053 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.00098719 (* 0.0272727 = 2.69234e-05 loss)
I0612 15:16:25.308068 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.00140932 (* 0.0272727 = 3.8436e-05 loss)
I0612 15:16:25.308081 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00105352 (* 0.0272727 = 2.87323e-05 loss)
I0612 15:16:25.308095 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00159465 (* 0.0272727 = 4.34904e-05 loss)
I0612 15:16:25.308109 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.00135191 (* 0.0272727 = 3.68701e-05 loss)
I0612 15:16:25.308122 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.00135579 (* 0.0272727 = 3.69761e-05 loss)
I0612 15:16:25.308135 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.77551
I0612 15:16:25.308146 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 15:16:25.308158 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.75
I0612 15:16:25.308169 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 15:16:25.308182 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.75
I0612 15:16:25.308192 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.625
I0612 15:16:25.308207 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 15:16:25.308218 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.625
I0612 15:16:25.308229 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 15:16:25.308240 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 15:16:25.308253 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 15:16:25.308264 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 15:16:25.308274 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 15:16:25.308285 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 15:16:25.308297 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 15:16:25.308308 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 15:16:25.308329 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 15:16:25.308342 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 15:16:25.308357 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 15:16:25.308382 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 15:16:25.308401 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 15:16:25.308414 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 15:16:25.308426 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 15:16:25.308439 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.9375
I0612 15:16:25.308450 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.877551
I0612 15:16:25.308465 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.996081 (* 1 = 0.996081 loss)
I0612 15:16:25.308478 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.317386 (* 1 = 0.317386 loss)
I0612 15:16:25.308493 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.700404 (* 0.0909091 = 0.0636731 loss)
I0612 15:16:25.308507 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 1.04489 (* 0.0909091 = 0.0949898 loss)
I0612 15:16:25.308521 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.691043 (* 0.0909091 = 0.0628221 loss)
I0612 15:16:25.308536 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.702893 (* 0.0909091 = 0.0638993 loss)
I0612 15:16:25.308549 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.18798 (* 0.0909091 = 0.107999 loss)
I0612 15:16:25.308563 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 1.76896 (* 0.0909091 = 0.160814 loss)
I0612 15:16:25.308578 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.696159 (* 0.0909091 = 0.0632871 loss)
I0612 15:16:25.308603 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.3647 (* 0.0909091 = 0.0331546 loss)
I0612 15:16:25.308624 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.339695 (* 0.0909091 = 0.0308814 loss)
I0612 15:16:25.308640 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.163263 (* 0.0909091 = 0.0148421 loss)
I0612 15:16:25.308655 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.0351316 (* 0.0909091 = 0.00319379 loss)
I0612 15:16:25.308668 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.0107756 (* 0.0909091 = 0.000979605 loss)
I0612 15:16:25.308682 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.00575242 (* 0.0909091 = 0.000522947 loss)
I0612 15:16:25.308696 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.00781245 (* 0.0909091 = 0.000710223 loss)
I0612 15:16:25.308711 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.00237075 (* 0.0909091 = 0.000215523 loss)
I0612 15:16:25.308724 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.00194506 (* 0.0909091 = 0.000176824 loss)
I0612 15:16:25.308738 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.00117664 (* 0.0909091 = 0.000106968 loss)
I0612 15:16:25.308753 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00125719 (* 0.0909091 = 0.00011429 loss)
I0612 15:16:25.308766 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00116034 (* 0.0909091 = 0.000105485 loss)
I0612 15:16:25.308780 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000999954 (* 0.0909091 = 9.09049e-05 loss)
I0612 15:16:25.308794 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.0010744 (* 0.0909091 = 9.76731e-05 loss)
I0612 15:16:25.308809 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.00100704 (* 0.0909091 = 9.15487e-05 loss)
I0612 15:16:25.308821 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.375
I0612 15:16:25.308833 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.25
I0612 15:16:25.308856 6181 solver.cpp:245] Train net output #149: total_confidence = 0.319088
I0612 15:16:25.308869 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.294336
I0612 15:16:25.308881 6181 sgd_solver.cpp:106] Iteration 2000, lr = 0.001
I0612 15:16:43.576545 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 46.4551 > 30) by scale factor 0.645785
I0612 15:17:52.040665 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.295 > 30) by scale factor 0.874764
I0612 15:18:08.384460 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.5183 > 30) by scale factor 0.983018
I0612 15:18:13.812207 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.429 > 30) by scale factor 0.780661
I0612 15:19:58.098815 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.3742 > 30) by scale factor 0.824761
I0612 15:19:58.890815 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.1997 > 30) by scale factor 0.993386
I0612 15:21:00.321562 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.5187 > 30) by scale factor 0.895024
I0612 15:21:08.869856 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 42.145 > 30) by scale factor 0.711829
I0612 15:21:19.747089 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.1683 > 30) by scale factor 0.932596
I0612 15:22:04.816174 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.1809 > 30) by scale factor 0.877684
I0612 15:22:43.679147 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.2129 > 30) by scale factor 0.876861
I0612 15:22:54.224483 6181 solver.cpp:229] Iteration 2500, loss = 4.30137
I0612 15:22:54.224566 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.454545
I0612 15:22:54.224586 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.625
I0612 15:22:54.224599 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.25
I0612 15:22:54.224612 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.375
I0612 15:22:54.224624 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.75
I0612 15:22:54.224637 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 15:22:54.224649 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.625
I0612 15:22:54.224663 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 15:22:54.224675 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 15:22:54.224692 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.75
I0612 15:22:54.224705 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 1
I0612 15:22:54.224719 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 15:22:54.224730 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 15:22:54.224742 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 15:22:54.224756 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 15:22:54.224769 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 15:22:54.224781 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 15:22:54.224793 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 15:22:54.224805 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 15:22:54.224817 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 15:22:54.224829 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 15:22:54.224841 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 15:22:54.224853 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 15:22:54.224866 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.823864
I0612 15:22:54.224879 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.745455
I0612 15:22:54.224896 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.83497 (* 0.3 = 0.550492 loss)
I0612 15:22:54.224911 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.620582 (* 0.3 = 0.186175 loss)
I0612 15:22:54.224925 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.78694 (* 0.0272727 = 0.0487346 loss)
I0612 15:22:54.224941 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 2.24098 (* 0.0272727 = 0.0611177 loss)
I0612 15:22:54.224954 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.81654 (* 0.0272727 = 0.049542 loss)
I0612 15:22:54.224968 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.53227 (* 0.0272727 = 0.0417891 loss)
I0612 15:22:54.224983 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.83909 (* 0.0272727 = 0.0501571 loss)
I0612 15:22:54.224997 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.14037 (* 0.0272727 = 0.0311011 loss)
I0612 15:22:54.225011 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.88698 (* 0.0272727 = 0.0514631 loss)
I0612 15:22:54.225025 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.428575 (* 0.0272727 = 0.0116884 loss)
I0612 15:22:54.225040 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.74191 (* 0.0272727 = 0.0202339 loss)
I0612 15:22:54.225055 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.135339 (* 0.0272727 = 0.00369106 loss)
I0612 15:22:54.225069 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.517989 (* 0.0272727 = 0.014127 loss)
I0612 15:22:54.225085 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.194949 (* 0.0272727 = 0.00531679 loss)
I0612 15:22:54.225136 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.195766 (* 0.0272727 = 0.00533906 loss)
I0612 15:22:54.225152 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.221569 (* 0.0272727 = 0.00604279 loss)
I0612 15:22:54.225169 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.45332 (* 0.0272727 = 0.0123633 loss)
I0612 15:22:54.225184 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.0540389 (* 0.0272727 = 0.00147379 loss)
I0612 15:22:54.225199 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00108714 (* 0.0272727 = 2.96494e-05 loss)
I0612 15:22:54.225214 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000921324 (* 0.0272727 = 2.5127e-05 loss)
I0612 15:22:54.225229 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000603264 (* 0.0272727 = 1.64527e-05 loss)
I0612 15:22:54.225244 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00130218 (* 0.0272727 = 3.55141e-05 loss)
I0612 15:22:54.225258 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000841683 (* 0.0272727 = 2.2955e-05 loss)
I0612 15:22:54.225273 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00344134 (* 0.0272727 = 9.38548e-05 loss)
I0612 15:22:54.225286 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.6
I0612 15:22:54.225299 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 15:22:54.225311 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.5
I0612 15:22:54.225339 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.75
I0612 15:22:54.225354 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.625
I0612 15:22:54.225368 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.75
I0612 15:22:54.225380 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.75
I0612 15:22:54.225392 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 15:22:54.225404 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 1
I0612 15:22:54.225417 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 15:22:54.225430 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 1
I0612 15:22:54.225441 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 15:22:54.225453 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 15:22:54.225466 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 15:22:54.225478 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 15:22:54.225491 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 15:22:54.225503 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 15:22:54.225515 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 15:22:54.225528 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 15:22:54.225539 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 15:22:54.225550 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 15:22:54.225563 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 15:22:54.225574 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 15:22:54.225586 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.857955
I0612 15:22:54.225600 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.854545
I0612 15:22:54.225613 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.36236 (* 0.3 = 0.408708 loss)
I0612 15:22:54.225627 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.476767 (* 0.3 = 0.14303 loss)
I0612 15:22:54.225642 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.862247 (* 0.0272727 = 0.0235158 loss)
I0612 15:22:54.225669 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.73508 (* 0.0272727 = 0.0473204 loss)
I0612 15:22:54.225684 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.06796 (* 0.0272727 = 0.0291261 loss)
I0612 15:22:54.225699 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 0.920558 (* 0.0272727 = 0.0251061 loss)
I0612 15:22:54.225713 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.25551 (* 0.0272727 = 0.0342411 loss)
I0612 15:22:54.225728 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 0.885435 (* 0.0272727 = 0.0241482 loss)
I0612 15:22:54.225747 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.808712 (* 0.0272727 = 0.0220558 loss)
I0612 15:22:54.225762 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.367156 (* 0.0272727 = 0.0100133 loss)
I0612 15:22:54.225776 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.979766 (* 0.0272727 = 0.0267209 loss)
I0612 15:22:54.225791 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.133267 (* 0.0272727 = 0.00363456 loss)
I0612 15:22:54.225806 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.507287 (* 0.0272727 = 0.0138351 loss)
I0612 15:22:54.225821 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.258933 (* 0.0272727 = 0.00706181 loss)
I0612 15:22:54.225834 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.264042 (* 0.0272727 = 0.00720113 loss)
I0612 15:22:54.225848 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.204663 (* 0.0272727 = 0.00558171 loss)
I0612 15:22:54.225863 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.424777 (* 0.0272727 = 0.0115848 loss)
I0612 15:22:54.225878 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.0445245 (* 0.0272727 = 0.0012143 loss)
I0612 15:22:54.225893 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.000827055 (* 0.0272727 = 2.25561e-05 loss)
I0612 15:22:54.225906 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.000298485 (* 0.0272727 = 8.14051e-06 loss)
I0612 15:22:54.225921 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000108542 (* 0.0272727 = 2.96023e-06 loss)
I0612 15:22:54.225935 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.000106511 (* 0.0272727 = 2.90484e-06 loss)
I0612 15:22:54.225950 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000104757 (* 0.0272727 = 2.857e-06 loss)
I0612 15:22:54.225965 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 9.10144e-05 (* 0.0272727 = 2.48221e-06 loss)
I0612 15:22:54.225978 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.745455
I0612 15:22:54.225991 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 15:22:54.226003 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.75
I0612 15:22:54.226016 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 15:22:54.226027 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.75
I0612 15:22:54.226039 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 15:22:54.226052 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 1
I0612 15:22:54.226063 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 15:22:54.226076 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.75
I0612 15:22:54.226089 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.75
I0612 15:22:54.226100 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 15:22:54.226111 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 15:22:54.226125 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 15:22:54.226136 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 15:22:54.226150 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.875
I0612 15:22:54.226161 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 15:22:54.226183 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 15:22:54.226197 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 15:22:54.226212 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 15:22:54.226224 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 15:22:54.226236 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 15:22:54.226248 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 15:22:54.226260 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 15:22:54.226272 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.914773
I0612 15:22:54.226285 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.909091
I0612 15:22:54.226300 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.955348 (* 1 = 0.955348 loss)
I0612 15:22:54.226315 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.346081 (* 1 = 0.346081 loss)
I0612 15:22:54.226330 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.475634 (* 0.0909091 = 0.0432394 loss)
I0612 15:22:54.226343 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.759653 (* 0.0909091 = 0.0690593 loss)
I0612 15:22:54.226357 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.966033 (* 0.0909091 = 0.0878212 loss)
I0612 15:22:54.226372 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.724751 (* 0.0909091 = 0.0658864 loss)
I0612 15:22:54.226385 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.909357 (* 0.0909091 = 0.0826689 loss)
I0612 15:22:54.226399 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.430929 (* 0.0909091 = 0.0391754 loss)
I0612 15:22:54.226413 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.958456 (* 0.0909091 = 0.0871323 loss)
I0612 15:22:54.226428 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.613061 (* 0.0909091 = 0.0557328 loss)
I0612 15:22:54.226443 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.435543 (* 0.0909091 = 0.0395948 loss)
I0612 15:22:54.226456 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.112985 (* 0.0909091 = 0.0102713 loss)
I0612 15:22:54.226470 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.565428 (* 0.0909091 = 0.0514026 loss)
I0612 15:22:54.226485 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.108197 (* 0.0909091 = 0.00983611 loss)
I0612 15:22:54.226500 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.286301 (* 0.0909091 = 0.0260274 loss)
I0612 15:22:54.226513 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.24084 (* 0.0909091 = 0.0218946 loss)
I0612 15:22:54.226527 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.508668 (* 0.0909091 = 0.0462426 loss)
I0612 15:22:54.226542 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.090348 (* 0.0909091 = 0.00821345 loss)
I0612 15:22:54.226557 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0129293 (* 0.0909091 = 0.00117539 loss)
I0612 15:22:54.226572 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00546521 (* 0.0909091 = 0.000496837 loss)
I0612 15:22:54.226585 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00374937 (* 0.0909091 = 0.000340851 loss)
I0612 15:22:54.226599 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00174805 (* 0.0909091 = 0.000158914 loss)
I0612 15:22:54.226613 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00117182 (* 0.0909091 = 0.000106529 loss)
I0612 15:22:54.226627 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000489735 (* 0.0909091 = 4.45214e-05 loss)
I0612 15:22:54.226640 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 15:22:54.226665 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.375
I0612 15:22:54.226675 6181 solver.cpp:245] Train net output #149: total_confidence = 0.363324
I0612 15:22:54.226687 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.349721
I0612 15:22:54.226701 6181 sgd_solver.cpp:106] Iteration 2500, lr = 0.001
I0612 15:23:11.721032 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.9527 > 30) by scale factor 0.938887
I0612 15:24:13.883236 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 41.1355 > 30) by scale factor 0.729297
I0612 15:24:27.097232 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.0248 > 30) by scale factor 0.999173
I0612 15:24:31.758752 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.0303 > 30) by scale factor 0.936615
I0612 15:24:44.187541 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 42.9014 > 30) by scale factor 0.699278
I0612 15:24:48.111938 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.9566 > 30) by scale factor 0.969099
I0612 15:24:49.672183 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.8658 > 30) by scale factor 0.836452
I0612 15:25:19.215917 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.0958 > 30) by scale factor 0.906461
I0612 15:25:23.883162 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.0931 > 30) by scale factor 0.964844
I0612 15:25:44.872499 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.3123 > 30) by scale factor 0.989699
I0612 15:28:31.824604 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.3064 > 30) by scale factor 0.900729
I0612 15:28:54.354938 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.0145 > 30) by scale factor 0.856787
I0612 15:29:06.019487 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.9581 > 30) by scale factor 0.790345
I0612 15:29:19.966864 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.443 > 30) by scale factor 0.801219
I0612 15:29:22.707744 6181 solver.cpp:229] Iteration 3000, loss = 4.14428
I0612 15:29:22.707808 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.444444
I0612 15:29:22.707826 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 15:29:22.707839 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.5
I0612 15:29:22.707851 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.625
I0612 15:29:22.707864 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 15:29:22.707876 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 15:29:22.707888 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.625
I0612 15:29:22.707901 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 15:29:22.707912 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 15:29:22.707924 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 15:29:22.707937 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 1
I0612 15:29:22.707948 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 1
I0612 15:29:22.707960 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 15:29:22.707973 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 15:29:22.707984 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 15:29:22.707996 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 15:29:22.708009 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 15:29:22.708020 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 15:29:22.708032 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 15:29:22.708045 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 15:29:22.708056 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 15:29:22.708068 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 15:29:22.708081 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 15:29:22.708099 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.846591
I0612 15:29:22.708115 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.755556
I0612 15:29:22.708132 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.85751 (* 0.3 = 0.557253 loss)
I0612 15:29:22.708148 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.522326 (* 0.3 = 0.156698 loss)
I0612 15:29:22.708163 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.2163 (* 0.0272727 = 0.0331719 loss)
I0612 15:29:22.708176 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.24917 (* 0.0272727 = 0.0340681 loss)
I0612 15:29:22.708190 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.26399 (* 0.0272727 = 0.0344726 loss)
I0612 15:29:22.708204 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.53777 (* 0.0272727 = 0.041939 loss)
I0612 15:29:22.708220 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.53851 (* 0.0272727 = 0.0419594 loss)
I0612 15:29:22.708233 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.70216 (* 0.0272727 = 0.0464224 loss)
I0612 15:29:22.708247 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.96925 (* 0.0272727 = 0.0537069 loss)
I0612 15:29:22.708261 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.57292 (* 0.0272727 = 0.0428979 loss)
I0612 15:29:22.708274 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 1.06429 (* 0.0272727 = 0.0290262 loss)
I0612 15:29:22.708289 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.0147468 (* 0.0272727 = 0.000402184 loss)
I0612 15:29:22.708303 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.0103932 (* 0.0272727 = 0.000283451 loss)
I0612 15:29:22.708348 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.0109774 (* 0.0272727 = 0.000299383 loss)
I0612 15:29:22.708364 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.0111415 (* 0.0272727 = 0.00030386 loss)
I0612 15:29:22.708379 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.0102032 (* 0.0272727 = 0.000278269 loss)
I0612 15:29:22.708392 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.0109645 (* 0.0272727 = 0.000299031 loss)
I0612 15:29:22.708406 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.00406908 (* 0.0272727 = 0.000110975 loss)
I0612 15:29:22.708420 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.0066463 (* 0.0272727 = 0.000181263 loss)
I0612 15:29:22.708434 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.00399069 (* 0.0272727 = 0.000108837 loss)
I0612 15:29:22.708448 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00398262 (* 0.0272727 = 0.000108617 loss)
I0612 15:29:22.708463 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00353318 (* 0.0272727 = 9.63595e-05 loss)
I0612 15:29:22.708477 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00562796 (* 0.0272727 = 0.00015349 loss)
I0612 15:29:22.708492 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00335732 (* 0.0272727 = 9.15634e-05 loss)
I0612 15:29:22.708504 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.555556
I0612 15:29:22.708516 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 15:29:22.708529 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.875
I0612 15:29:22.708541 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.75
I0612 15:29:22.708552 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.75
I0612 15:29:22.708564 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 15:29:22.708576 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.625
I0612 15:29:22.708588 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 15:29:22.708600 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 15:29:22.708612 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 15:29:22.708624 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 1
I0612 15:29:22.708636 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 1
I0612 15:29:22.708647 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 15:29:22.708662 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 15:29:22.708674 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 15:29:22.708686 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 15:29:22.708698 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 15:29:22.708709 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 15:29:22.708721 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 15:29:22.708732 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 15:29:22.708745 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 15:29:22.708755 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 15:29:22.708767 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 15:29:22.708778 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.880682
I0612 15:29:22.708791 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.822222
I0612 15:29:22.708806 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.36653 (* 0.3 = 0.40996 loss)
I0612 15:29:22.708819 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.380695 (* 0.3 = 0.114208 loss)
I0612 15:29:22.708845 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.578169 (* 0.0272727 = 0.0157682 loss)
I0612 15:29:22.708860 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.765003 (* 0.0272727 = 0.0208637 loss)
I0612 15:29:22.708874 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.09661 (* 0.0272727 = 0.0299076 loss)
I0612 15:29:22.708889 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.0471 (* 0.0272727 = 0.0285572 loss)
I0612 15:29:22.708902 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.33338 (* 0.0272727 = 0.036365 loss)
I0612 15:29:22.708916 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.693 (* 0.0272727 = 0.0461727 loss)
I0612 15:29:22.708930 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.59125 (* 0.0272727 = 0.0433978 loss)
I0612 15:29:22.708945 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 1.2085 (* 0.0272727 = 0.0329591 loss)
I0612 15:29:22.708957 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 1.50756 (* 0.0272727 = 0.0411154 loss)
I0612 15:29:22.708971 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.0118576 (* 0.0272727 = 0.000323388 loss)
I0612 15:29:22.708986 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.00535408 (* 0.0272727 = 0.00014602 loss)
I0612 15:29:22.708999 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.00234393 (* 0.0272727 = 6.39253e-05 loss)
I0612 15:29:22.709014 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.00113445 (* 0.0272727 = 3.09397e-05 loss)
I0612 15:29:22.709028 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.00077474 (* 0.0272727 = 2.11293e-05 loss)
I0612 15:29:22.709043 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.00053598 (* 0.0272727 = 1.46176e-05 loss)
I0612 15:29:22.709058 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.000571574 (* 0.0272727 = 1.55884e-05 loss)
I0612 15:29:22.709071 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.000240369 (* 0.0272727 = 6.55553e-06 loss)
I0612 15:29:22.709085 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.000103629 (* 0.0272727 = 2.82626e-06 loss)
I0612 15:29:22.709100 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000244721 (* 0.0272727 = 6.67422e-06 loss)
I0612 15:29:22.709115 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.000111743 (* 0.0272727 = 3.04754e-06 loss)
I0612 15:29:22.709128 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 5.04224e-05 (* 0.0272727 = 1.37516e-06 loss)
I0612 15:29:22.709142 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 3.05725e-05 (* 0.0272727 = 8.33797e-07 loss)
I0612 15:29:22.709156 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.822222
I0612 15:29:22.709169 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 1
I0612 15:29:22.709182 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 1
I0612 15:29:22.709193 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 15:29:22.709205 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 15:29:22.709218 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 15:29:22.709229 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 15:29:22.709241 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 15:29:22.709254 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 15:29:22.709265 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 15:29:22.709277 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 15:29:22.709288 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 15:29:22.709300 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 15:29:22.709311 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 15:29:22.709348 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 15:29:22.709363 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 15:29:22.709375 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 15:29:22.709386 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 15:29:22.709398 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 15:29:22.709410 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 15:29:22.709422 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 15:29:22.709434 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 15:29:22.709445 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 15:29:22.709457 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.954545
I0612 15:29:22.709470 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.911111
I0612 15:29:22.709484 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.936021 (* 1 = 0.936021 loss)
I0612 15:29:22.709498 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.249274 (* 1 = 0.249274 loss)
I0612 15:29:22.709512 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.291184 (* 0.0909091 = 0.0264712 loss)
I0612 15:29:22.709527 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.372285 (* 0.0909091 = 0.0338441 loss)
I0612 15:29:22.709542 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.503208 (* 0.0909091 = 0.0457462 loss)
I0612 15:29:22.709556 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.567789 (* 0.0909091 = 0.0516172 loss)
I0612 15:29:22.709570 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.675513 (* 0.0909091 = 0.0614103 loss)
I0612 15:29:22.709584 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 1.00268 (* 0.0909091 = 0.0911528 loss)
I0612 15:29:22.709599 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.04746 (* 0.0909091 = 0.0952233 loss)
I0612 15:29:22.709612 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 1.24923 (* 0.0909091 = 0.113566 loss)
I0612 15:29:22.709626 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 1.25867 (* 0.0909091 = 0.114424 loss)
I0612 15:29:22.709641 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.00739898 (* 0.0909091 = 0.000672635 loss)
I0612 15:29:22.709656 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.00699268 (* 0.0909091 = 0.000635698 loss)
I0612 15:29:22.709669 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.00174594 (* 0.0909091 = 0.000158722 loss)
I0612 15:29:22.709684 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.000550336 (* 0.0909091 = 5.00306e-05 loss)
I0612 15:29:22.709698 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.000401534 (* 0.0909091 = 3.65031e-05 loss)
I0612 15:29:22.709717 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.000242893 (* 0.0909091 = 2.20812e-05 loss)
I0612 15:29:22.709731 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.000408448 (* 0.0909091 = 3.71316e-05 loss)
I0612 15:29:22.709745 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.000321932 (* 0.0909091 = 2.92666e-05 loss)
I0612 15:29:22.709759 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.000497991 (* 0.0909091 = 4.52719e-05 loss)
I0612 15:29:22.709774 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00022797 (* 0.0909091 = 2.07245e-05 loss)
I0612 15:29:22.709787 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000290193 (* 0.0909091 = 2.63812e-05 loss)
I0612 15:29:22.709801 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000449506 (* 0.0909091 = 4.08642e-05 loss)
I0612 15:29:22.709816 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000125574 (* 0.0909091 = 1.14158e-05 loss)
I0612 15:29:22.709839 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.875
I0612 15:29:22.709852 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.875
I0612 15:29:22.709864 6181 solver.cpp:245] Train net output #149: total_confidence = 0.521796
I0612 15:29:22.709877 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.574054
I0612 15:29:22.709889 6181 sgd_solver.cpp:106] Iteration 3000, lr = 0.001
I0612 15:30:15.854600 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.6201 > 30) by scale factor 0.979749
I0612 15:34:55.211817 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.8851 > 30) by scale factor 0.912267
I0612 15:35:11.551069 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.9998 > 30) by scale factor 0.909096
I0612 15:35:18.522429 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.1566 > 30) by scale factor 0.932935
I0612 15:35:50.675537 6181 solver.cpp:229] Iteration 3500, loss = 4.12911
I0612 15:35:50.675657 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.5
I0612 15:35:50.675675 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.625
I0612 15:35:50.675688 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.75
I0612 15:35:50.675701 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.875
I0612 15:35:50.675714 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.125
I0612 15:35:50.675725 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.375
I0612 15:35:50.675740 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.75
I0612 15:35:50.675750 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 15:35:50.675763 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 15:35:50.675776 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.75
I0612 15:35:50.675787 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 15:35:50.675799 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.75
I0612 15:35:50.675811 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 15:35:50.675822 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 15:35:50.675835 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 15:35:50.675848 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 15:35:50.675859 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 15:35:50.675871 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 15:35:50.675882 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 15:35:50.675894 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 15:35:50.675905 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 15:35:50.675917 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 15:35:50.675930 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 15:35:50.675941 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.823864
I0612 15:35:50.675954 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.75
I0612 15:35:50.675971 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.57623 (* 0.3 = 0.472869 loss)
I0612 15:35:50.675984 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.59449 (* 0.3 = 0.178347 loss)
I0612 15:35:50.675999 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.0627 (* 0.0272727 = 0.0289827 loss)
I0612 15:35:50.676013 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.03819 (* 0.0272727 = 0.0283144 loss)
I0612 15:35:50.676028 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.47994 (* 0.0272727 = 0.0403621 loss)
I0612 15:35:50.676041 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.26079 (* 0.0272727 = 0.0616579 loss)
I0612 15:35:50.676055 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.57763 (* 0.0272727 = 0.0430262 loss)
I0612 15:35:50.676069 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 0.811103 (* 0.0272727 = 0.022121 loss)
I0612 15:35:50.676084 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.00911 (* 0.0272727 = 0.0275213 loss)
I0612 15:35:50.676097 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.695768 (* 0.0272727 = 0.0189755 loss)
I0612 15:35:50.676111 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.675793 (* 0.0272727 = 0.0184307 loss)
I0612 15:35:50.676126 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.945881 (* 0.0272727 = 0.0257967 loss)
I0612 15:35:50.676141 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.943071 (* 0.0272727 = 0.0257201 loss)
I0612 15:35:50.676154 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.781828 (* 0.0272727 = 0.0213226 loss)
I0612 15:35:50.676187 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.355408 (* 0.0272727 = 0.00969295 loss)
I0612 15:35:50.676203 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.232154 (* 0.0272727 = 0.00633148 loss)
I0612 15:35:50.676218 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.334158 (* 0.0272727 = 0.0091134 loss)
I0612 15:35:50.676234 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.0951757 (* 0.0272727 = 0.0025957 loss)
I0612 15:35:50.676249 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.0119923 (* 0.0272727 = 0.000327062 loss)
I0612 15:35:50.676265 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.00351989 (* 0.0272727 = 9.59969e-05 loss)
I0612 15:35:50.676278 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00190091 (* 0.0272727 = 5.18429e-05 loss)
I0612 15:35:50.676293 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00141199 (* 0.0272727 = 3.85087e-05 loss)
I0612 15:35:50.676307 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.0010317 (* 0.0272727 = 2.81372e-05 loss)
I0612 15:35:50.676321 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00130525 (* 0.0272727 = 3.55978e-05 loss)
I0612 15:35:50.676334 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.6
I0612 15:35:50.676347 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 15:35:50.676358 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.875
I0612 15:35:50.676370 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.75
I0612 15:35:50.676383 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.75
I0612 15:35:50.676394 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.625
I0612 15:35:50.676406 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.875
I0612 15:35:50.676419 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 15:35:50.676430 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 15:35:50.676441 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 15:35:50.676453 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.75
I0612 15:35:50.676465 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.75
I0612 15:35:50.676476 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 15:35:50.676488 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 15:35:50.676501 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 15:35:50.676512 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 15:35:50.676523 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 15:35:50.676535 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 15:35:50.676547 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 15:35:50.676558 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 15:35:50.676569 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 15:35:50.676580 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 15:35:50.676591 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 15:35:50.676604 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.852273
I0612 15:35:50.676615 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.8
I0612 15:35:50.676630 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.37147 (* 0.3 = 0.411441 loss)
I0612 15:35:50.676643 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.529445 (* 0.3 = 0.158834 loss)
I0612 15:35:50.676657 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.1603 (* 0.0272727 = 0.0316445 loss)
I0612 15:35:50.676674 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.554164 (* 0.0272727 = 0.0151136 loss)
I0612 15:35:50.676695 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.872056 (* 0.0272727 = 0.0237833 loss)
I0612 15:35:50.676712 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.4502 (* 0.0272727 = 0.0395508 loss)
I0612 15:35:50.676726 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.48847 (* 0.0272727 = 0.0405946 loss)
I0612 15:35:50.676740 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 0.605943 (* 0.0272727 = 0.0165257 loss)
I0612 15:35:50.676754 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.642673 (* 0.0272727 = 0.0175274 loss)
I0612 15:35:50.676769 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.643271 (* 0.0272727 = 0.0175438 loss)
I0612 15:35:50.676782 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.700523 (* 0.0272727 = 0.0191052 loss)
I0612 15:35:50.676796 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.703937 (* 0.0272727 = 0.0191983 loss)
I0612 15:35:50.676810 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 1.21446 (* 0.0272727 = 0.0331216 loss)
I0612 15:35:50.676825 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.635759 (* 0.0272727 = 0.0173389 loss)
I0612 15:35:50.676838 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.393437 (* 0.0272727 = 0.0107301 loss)
I0612 15:35:50.676852 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.369212 (* 0.0272727 = 0.0100694 loss)
I0612 15:35:50.676867 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.199842 (* 0.0272727 = 0.00545024 loss)
I0612 15:35:50.676880 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.044554 (* 0.0272727 = 0.00121511 loss)
I0612 15:35:50.676894 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.0063283 (* 0.0272727 = 0.00017259 loss)
I0612 15:35:50.676908 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.00356167 (* 0.0272727 = 9.71365e-05 loss)
I0612 15:35:50.676923 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00183925 (* 0.0272727 = 5.01614e-05 loss)
I0612 15:35:50.676937 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00105726 (* 0.0272727 = 2.88343e-05 loss)
I0612 15:35:50.676951 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000550313 (* 0.0272727 = 1.50085e-05 loss)
I0612 15:35:50.676965 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000309895 (* 0.0272727 = 8.45169e-06 loss)
I0612 15:35:50.676977 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.716667
I0612 15:35:50.676990 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 15:35:50.677001 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 15:35:50.677013 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 15:35:50.677024 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 15:35:50.677037 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 15:35:50.677047 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 1
I0612 15:35:50.677059 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 1
I0612 15:35:50.677070 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 15:35:50.677083 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 15:35:50.677093 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.75
I0612 15:35:50.677105 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.75
I0612 15:35:50.677116 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.75
I0612 15:35:50.677129 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 15:35:50.677139 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 15:35:50.677151 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 15:35:50.677162 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 15:35:50.677183 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 15:35:50.677196 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 15:35:50.677208 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 15:35:50.677219 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 15:35:50.677232 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 15:35:50.677242 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 15:35:50.677254 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.897727
I0612 15:35:50.677266 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.833333
I0612 15:35:50.677295 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.993854 (* 1 = 0.993854 loss)
I0612 15:35:50.677311 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.364542 (* 1 = 0.364542 loss)
I0612 15:35:50.677326 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.60533 (* 0.0909091 = 0.05503 loss)
I0612 15:35:50.677340 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.373689 (* 0.0909091 = 0.0339717 loss)
I0612 15:35:50.677356 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.558418 (* 0.0909091 = 0.0507653 loss)
I0612 15:35:50.677369 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.649148 (* 0.0909091 = 0.0590135 loss)
I0612 15:35:50.677383 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.844168 (* 0.0909091 = 0.0767425 loss)
I0612 15:35:50.677397 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.142303 (* 0.0909091 = 0.0129366 loss)
I0612 15:35:50.677412 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.256031 (* 0.0909091 = 0.0232756 loss)
I0612 15:35:50.677425 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.428932 (* 0.0909091 = 0.0389938 loss)
I0612 15:35:50.677439 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.484144 (* 0.0909091 = 0.0440131 loss)
I0612 15:35:50.677453 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.574709 (* 0.0909091 = 0.0522463 loss)
I0612 15:35:50.677467 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 1.53594 (* 0.0909091 = 0.139631 loss)
I0612 15:35:50.677481 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.77906 (* 0.0909091 = 0.0708237 loss)
I0612 15:35:50.677495 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.14039 (* 0.0909091 = 0.0127627 loss)
I0612 15:35:50.677510 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.0883172 (* 0.0909091 = 0.00802883 loss)
I0612 15:35:50.677523 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.187594 (* 0.0909091 = 0.017054 loss)
I0612 15:35:50.677538 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.0795274 (* 0.0909091 = 0.00722977 loss)
I0612 15:35:50.677552 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.00790451 (* 0.0909091 = 0.000718592 loss)
I0612 15:35:50.677567 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00579157 (* 0.0909091 = 0.000526507 loss)
I0612 15:35:50.677580 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00232682 (* 0.0909091 = 0.000211529 loss)
I0612 15:35:50.677595 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00161912 (* 0.0909091 = 0.000147193 loss)
I0612 15:35:50.677609 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00103215 (* 0.0909091 = 9.38316e-05 loss)
I0612 15:35:50.677623 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000829427 (* 0.0909091 = 7.54025e-05 loss)
I0612 15:35:50.677635 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 15:35:50.677647 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.625
I0612 15:35:50.677670 6181 solver.cpp:245] Train net output #149: total_confidence = 0.301357
I0612 15:35:50.677683 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.21181
I0612 15:35:50.677696 6181 sgd_solver.cpp:106] Iteration 3500, lr = 0.001
I0612 15:36:08.082625 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.2291 > 30) by scale factor 0.902822
I0612 15:37:48.137213 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.8563 > 30) by scale factor 0.941729
I0612 15:38:40.898841 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 63.5163 > 30) by scale factor 0.47232
I0612 15:39:09.604164 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.0501 > 30) by scale factor 0.907712
I0612 15:40:12.385860 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 41.1746 > 30) by scale factor 0.728604
I0612 15:40:41.094377 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.2396 > 30) by scale factor 0.992075
I0612 15:41:09.051343 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.2083 > 30) by scale factor 0.903389
I0612 15:41:15.242527 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 39.4141 > 30) by scale factor 0.761148
I0612 15:41:43.101796 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 43.9568 > 30) by scale factor 0.682489
I0612 15:42:05.602370 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 50.2693 > 30) by scale factor 0.596785
I0612 15:42:18.405341 6181 solver.cpp:229] Iteration 4000, loss = 4.1526
I0612 15:42:18.405459 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.490566
I0612 15:42:18.405480 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 15:42:18.405493 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 15:42:18.405506 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.625
I0612 15:42:18.405519 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.125
I0612 15:42:18.405531 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 15:42:18.405544 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.625
I0612 15:42:18.405556 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.875
I0612 15:42:18.405568 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 15:42:18.405580 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 15:42:18.405592 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 15:42:18.405604 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 15:42:18.405616 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 15:42:18.405628 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 15:42:18.405642 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 15:42:18.405653 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 15:42:18.405665 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 15:42:18.405676 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 15:42:18.405689 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 15:42:18.405700 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 15:42:18.405711 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 15:42:18.405724 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 15:42:18.405735 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 15:42:18.405747 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.829545
I0612 15:42:18.405761 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.660377
I0612 15:42:18.405778 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.04536 (* 0.3 = 0.613608 loss)
I0612 15:42:18.405792 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.680895 (* 0.3 = 0.204268 loss)
I0612 15:42:18.405812 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.05073 (* 0.0272727 = 0.0286564 loss)
I0612 15:42:18.405836 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.68685 (* 0.0272727 = 0.0460049 loss)
I0612 15:42:18.405853 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.6372 (* 0.0272727 = 0.0446508 loss)
I0612 15:42:18.405866 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.73047 (* 0.0272727 = 0.0744675 loss)
I0612 15:42:18.405881 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.72191 (* 0.0272727 = 0.0469612 loss)
I0612 15:42:18.405895 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.63729 (* 0.0272727 = 0.0446533 loss)
I0612 15:42:18.405910 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 0.394697 (* 0.0272727 = 0.0107645 loss)
I0612 15:42:18.405925 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.241052 (* 0.0272727 = 0.00657414 loss)
I0612 15:42:18.405939 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.695013 (* 0.0272727 = 0.0189549 loss)
I0612 15:42:18.405954 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.504144 (* 0.0272727 = 0.0137494 loss)
I0612 15:42:18.405968 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.393337 (* 0.0272727 = 0.0107274 loss)
I0612 15:42:18.405982 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.281718 (* 0.0272727 = 0.00768323 loss)
I0612 15:42:18.406016 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.245693 (* 0.0272727 = 0.00670072 loss)
I0612 15:42:18.406033 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.238484 (* 0.0272727 = 0.00650411 loss)
I0612 15:42:18.406047 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.329448 (* 0.0272727 = 0.00898494 loss)
I0612 15:42:18.406061 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.120383 (* 0.0272727 = 0.00328317 loss)
I0612 15:42:18.406076 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.0131867 (* 0.0272727 = 0.000359636 loss)
I0612 15:42:18.406091 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.00482231 (* 0.0272727 = 0.000131517 loss)
I0612 15:42:18.406105 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00169398 (* 0.0272727 = 4.61994e-05 loss)
I0612 15:42:18.406119 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00217698 (* 0.0272727 = 5.93723e-05 loss)
I0612 15:42:18.406134 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00232147 (* 0.0272727 = 6.33128e-05 loss)
I0612 15:42:18.406148 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00314629 (* 0.0272727 = 8.58078e-05 loss)
I0612 15:42:18.406162 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.603774
I0612 15:42:18.406173 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 15:42:18.406185 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.625
I0612 15:42:18.406198 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.5
I0612 15:42:18.406209 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.375
I0612 15:42:18.406224 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.625
I0612 15:42:18.406239 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.375
I0612 15:42:18.406250 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 1
I0612 15:42:18.406262 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 15:42:18.406275 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 15:42:18.406286 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 15:42:18.406297 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 15:42:18.406309 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 15:42:18.406322 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 15:42:18.406333 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 15:42:18.406345 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 15:42:18.406358 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 15:42:18.406369 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 15:42:18.406381 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 15:42:18.406394 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 15:42:18.406404 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 15:42:18.406412 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 15:42:18.406420 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 15:42:18.406432 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.869318
I0612 15:42:18.406445 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.716981
I0612 15:42:18.406460 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.75933 (* 0.3 = 0.5278 loss)
I0612 15:42:18.406476 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.599132 (* 0.3 = 0.17974 loss)
I0612 15:42:18.406497 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.634453 (* 0.0272727 = 0.0173033 loss)
I0612 15:42:18.406522 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.47564 (* 0.0272727 = 0.0402448 loss)
I0612 15:42:18.406549 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.68574 (* 0.0272727 = 0.0459747 loss)
I0612 15:42:18.406565 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.61435 (* 0.0272727 = 0.0440278 loss)
I0612 15:42:18.406579 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.54598 (* 0.0272727 = 0.042163 loss)
I0612 15:42:18.406592 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.63341 (* 0.0272727 = 0.0445476 loss)
I0612 15:42:18.406606 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.359778 (* 0.0272727 = 0.00981212 loss)
I0612 15:42:18.406621 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.403503 (* 0.0272727 = 0.0110046 loss)
I0612 15:42:18.406635 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.819177 (* 0.0272727 = 0.0223412 loss)
I0612 15:42:18.406651 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.611262 (* 0.0272727 = 0.0166708 loss)
I0612 15:42:18.406664 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.348279 (* 0.0272727 = 0.00949852 loss)
I0612 15:42:18.406678 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.280996 (* 0.0272727 = 0.00766352 loss)
I0612 15:42:18.406692 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.269272 (* 0.0272727 = 0.00734377 loss)
I0612 15:42:18.406707 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.273532 (* 0.0272727 = 0.00745995 loss)
I0612 15:42:18.406720 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.266745 (* 0.0272727 = 0.00727488 loss)
I0612 15:42:18.406734 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.173647 (* 0.0272727 = 0.00473582 loss)
I0612 15:42:18.406749 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.0270114 (* 0.0272727 = 0.000736674 loss)
I0612 15:42:18.406762 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.0263988 (* 0.0272727 = 0.000719968 loss)
I0612 15:42:18.406776 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.0132306 (* 0.0272727 = 0.000360835 loss)
I0612 15:42:18.406790 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.0201258 (* 0.0272727 = 0.000548886 loss)
I0612 15:42:18.406805 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.0209388 (* 0.0272727 = 0.000571058 loss)
I0612 15:42:18.406818 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.0236827 (* 0.0272727 = 0.000645892 loss)
I0612 15:42:18.406831 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.660377
I0612 15:42:18.406843 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 15:42:18.406855 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.625
I0612 15:42:18.406867 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.625
I0612 15:42:18.406879 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.75
I0612 15:42:18.406890 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 15:42:18.406903 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 15:42:18.406913 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 1
I0612 15:42:18.406925 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 1
I0612 15:42:18.406936 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 15:42:18.406949 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 15:42:18.406960 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 15:42:18.406972 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 15:42:18.406983 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 15:42:18.406996 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.875
I0612 15:42:18.407007 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 15:42:18.407029 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 15:42:18.407042 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 15:42:18.407054 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 15:42:18.407066 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 15:42:18.407078 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 15:42:18.407089 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 15:42:18.407100 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 15:42:18.407112 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.897727
I0612 15:42:18.407125 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.773585
I0612 15:42:18.407140 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.25593 (* 1 = 1.25593 loss)
I0612 15:42:18.407153 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.427704 (* 1 = 0.427704 loss)
I0612 15:42:18.407168 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.467414 (* 0.0909091 = 0.0424922 loss)
I0612 15:42:18.407182 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 1.11007 (* 0.0909091 = 0.100915 loss)
I0612 15:42:18.407196 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 1.26568 (* 0.0909091 = 0.115062 loss)
I0612 15:42:18.407210 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.16265 (* 0.0909091 = 0.105696 loss)
I0612 15:42:18.407224 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.84425 (* 0.0909091 = 0.07675 loss)
I0612 15:42:18.407239 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.811907 (* 0.0909091 = 0.0738098 loss)
I0612 15:42:18.407253 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.108003 (* 0.0909091 = 0.00981843 loss)
I0612 15:42:18.407269 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.0573655 (* 0.0909091 = 0.00521504 loss)
I0612 15:42:18.407285 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.773765 (* 0.0909091 = 0.0703423 loss)
I0612 15:42:18.407300 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.452308 (* 0.0909091 = 0.0411189 loss)
I0612 15:42:18.407315 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.309649 (* 0.0909091 = 0.0281499 loss)
I0612 15:42:18.407328 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.0955846 (* 0.0909091 = 0.00868951 loss)
I0612 15:42:18.407342 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.12971 (* 0.0909091 = 0.0117919 loss)
I0612 15:42:18.407356 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.286306 (* 0.0909091 = 0.0260278 loss)
I0612 15:42:18.407371 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.467603 (* 0.0909091 = 0.0425093 loss)
I0612 15:42:18.407384 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.0433438 (* 0.0909091 = 0.00394034 loss)
I0612 15:42:18.407398 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.00359104 (* 0.0909091 = 0.000326458 loss)
I0612 15:42:18.407413 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00385433 (* 0.0909091 = 0.000350394 loss)
I0612 15:42:18.407426 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00434078 (* 0.0909091 = 0.000394617 loss)
I0612 15:42:18.407440 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00611935 (* 0.0909091 = 0.000556304 loss)
I0612 15:42:18.407455 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00998822 (* 0.0909091 = 0.00090802 loss)
I0612 15:42:18.407469 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.00412498 (* 0.0909091 = 0.000374998 loss)
I0612 15:42:18.407481 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 15:42:18.407492 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 15:42:18.407515 6181 solver.cpp:245] Train net output #149: total_confidence = 0.502074
I0612 15:42:18.407531 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.481675
I0612 15:42:18.407544 6181 sgd_solver.cpp:106] Iteration 4000, lr = 0.001
I0612 15:42:50.558892 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.2332 > 30) by scale factor 0.930718
I0612 15:43:16.127706 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.5187 > 30) by scale factor 0.799601
I0612 15:44:25.102818 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.0066 > 30) by scale factor 0.937306
I0612 15:44:46.017738 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 39.6768 > 30) by scale factor 0.756109
I0612 15:46:02.697361 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.2435 > 30) by scale factor 0.902433
I0612 15:46:49.940757 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.1709 > 30) by scale factor 0.852977
I0612 15:48:34.494513 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 54.3285 > 30) by scale factor 0.552196
I0612 15:48:45.727612 6181 solver.cpp:229] Iteration 4500, loss = 4.20351
I0612 15:48:45.727665 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.483871
I0612 15:48:45.727684 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 1
I0612 15:48:45.727701 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.5
I0612 15:48:45.727715 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.625
I0612 15:48:45.727727 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.25
I0612 15:48:45.727741 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 15:48:45.727752 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.75
I0612 15:48:45.727766 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.5
I0612 15:48:45.727777 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 15:48:45.727789 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 15:48:45.727802 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 15:48:45.727814 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 15:48:45.727826 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 15:48:45.727838 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 15:48:45.727850 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 15:48:45.727862 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 15:48:45.727874 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 15:48:45.727886 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 15:48:45.727898 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 15:48:45.727910 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 15:48:45.727921 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 15:48:45.727933 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 15:48:45.727946 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 15:48:45.727957 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.806818
I0612 15:48:45.727969 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.790323
I0612 15:48:45.727985 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.5229 (* 0.3 = 0.456869 loss)
I0612 15:48:45.728000 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.582889 (* 0.3 = 0.174867 loss)
I0612 15:48:45.728016 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.359513 (* 0.0272727 = 0.00980491 loss)
I0612 15:48:45.728031 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.71376 (* 0.0272727 = 0.046739 loss)
I0612 15:48:45.728045 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 0.917429 (* 0.0272727 = 0.0250208 loss)
I0612 15:48:45.728060 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.1426 (* 0.0272727 = 0.0584346 loss)
I0612 15:48:45.728073 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.2057 (* 0.0272727 = 0.0328828 loss)
I0612 15:48:45.728087 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 0.891937 (* 0.0272727 = 0.0243256 loss)
I0612 15:48:45.728101 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 2.52407 (* 0.0272727 = 0.0688381 loss)
I0612 15:48:45.728116 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.804773 (* 0.0272727 = 0.0219483 loss)
I0612 15:48:45.728130 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.588399 (* 0.0272727 = 0.0160472 loss)
I0612 15:48:45.728144 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.755539 (* 0.0272727 = 0.0206056 loss)
I0612 15:48:45.728158 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.269098 (* 0.0272727 = 0.00733903 loss)
I0612 15:48:45.728175 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.323946 (* 0.0272727 = 0.00883489 loss)
I0612 15:48:45.728219 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.339723 (* 0.0272727 = 0.00926516 loss)
I0612 15:48:45.728235 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.231042 (* 0.0272727 = 0.00630115 loss)
I0612 15:48:45.728248 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.280512 (* 0.0272727 = 0.00765032 loss)
I0612 15:48:45.728262 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.461874 (* 0.0272727 = 0.0125966 loss)
I0612 15:48:45.728277 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00347153 (* 0.0272727 = 9.4678e-05 loss)
I0612 15:48:45.728292 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000387823 (* 0.0272727 = 1.0577e-05 loss)
I0612 15:48:45.728307 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 9.35899e-05 (* 0.0272727 = 2.55245e-06 loss)
I0612 15:48:45.728320 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 4.32945e-05 (* 0.0272727 = 1.18076e-06 loss)
I0612 15:48:45.728335 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 7.23321e-05 (* 0.0272727 = 1.97269e-06 loss)
I0612 15:48:45.728349 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 2.16376e-05 (* 0.0272727 = 5.90117e-07 loss)
I0612 15:48:45.728361 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.580645
I0612 15:48:45.728374 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 15:48:45.728386 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 15:48:45.728399 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.625
I0612 15:48:45.728410 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.375
I0612 15:48:45.728421 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.75
I0612 15:48:45.728433 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.5
I0612 15:48:45.728446 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 15:48:45.728456 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 15:48:45.728468 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 15:48:45.728480 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.75
I0612 15:48:45.728492 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 15:48:45.728504 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 15:48:45.728515 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 15:48:45.728528 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 15:48:45.728539 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 15:48:45.728550 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 15:48:45.728562 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 15:48:45.728574 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 15:48:45.728585 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 15:48:45.728596 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 15:48:45.728607 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 15:48:45.728618 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 15:48:45.728631 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.846591
I0612 15:48:45.728642 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.806452
I0612 15:48:45.728657 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.54727 (* 0.3 = 0.46418 loss)
I0612 15:48:45.728669 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.575391 (* 0.3 = 0.172617 loss)
I0612 15:48:45.728683 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.390284 (* 0.0272727 = 0.0106441 loss)
I0612 15:48:45.728708 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.79853 (* 0.0272727 = 0.0490508 loss)
I0612 15:48:45.728724 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.14256 (* 0.0272727 = 0.0311608 loss)
I0612 15:48:45.728737 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.94787 (* 0.0272727 = 0.0531236 loss)
I0612 15:48:45.728755 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 0.702937 (* 0.0272727 = 0.019171 loss)
I0612 15:48:45.728770 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.12915 (* 0.0272727 = 0.0307951 loss)
I0612 15:48:45.728785 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.29587 (* 0.0272727 = 0.035342 loss)
I0612 15:48:45.728797 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.530105 (* 0.0272727 = 0.0144574 loss)
I0612 15:48:45.728807 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.318169 (* 0.0272727 = 0.00867733 loss)
I0612 15:48:45.728822 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.802218 (* 0.0272727 = 0.0218787 loss)
I0612 15:48:45.728837 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.233112 (* 0.0272727 = 0.00635759 loss)
I0612 15:48:45.728852 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.244631 (* 0.0272727 = 0.00667176 loss)
I0612 15:48:45.728864 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.208161 (* 0.0272727 = 0.00567712 loss)
I0612 15:48:45.728878 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.196812 (* 0.0272727 = 0.00536761 loss)
I0612 15:48:45.728893 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.234271 (* 0.0272727 = 0.00638922 loss)
I0612 15:48:45.728906 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.455364 (* 0.0272727 = 0.012419 loss)
I0612 15:48:45.728921 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.00345357 (* 0.0272727 = 9.41884e-05 loss)
I0612 15:48:45.728935 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.000794148 (* 0.0272727 = 2.16586e-05 loss)
I0612 15:48:45.728950 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000552786 (* 0.0272727 = 1.5076e-05 loss)
I0612 15:48:45.728963 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.0003366 (* 0.0272727 = 9.18e-06 loss)
I0612 15:48:45.728977 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 9.40839e-05 (* 0.0272727 = 2.56592e-06 loss)
I0612 15:48:45.728992 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000343077 (* 0.0272727 = 9.35664e-06 loss)
I0612 15:48:45.729004 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.693548
I0612 15:48:45.729017 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 15:48:45.729027 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.75
I0612 15:48:45.729039 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 15:48:45.729050 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.625
I0612 15:48:45.729063 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 15:48:45.729074 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 15:48:45.729085 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 15:48:45.729096 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 15:48:45.729109 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 15:48:45.729120 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.75
I0612 15:48:45.729130 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 15:48:45.729142 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 15:48:45.729153 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 15:48:45.729164 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.875
I0612 15:48:45.729185 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 15:48:45.729198 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 0.875
I0612 15:48:45.729210 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 15:48:45.729224 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 15:48:45.729235 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 15:48:45.729248 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 15:48:45.729259 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 15:48:45.729269 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 15:48:45.729281 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.875
I0612 15:48:45.729293 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.887097
I0612 15:48:45.729307 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.19105 (* 1 = 1.19105 loss)
I0612 15:48:45.729333 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.469163 (* 1 = 0.469163 loss)
I0612 15:48:45.729351 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.19347 (* 0.0909091 = 0.0175882 loss)
I0612 15:48:45.729365 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 1.11142 (* 0.0909091 = 0.101038 loss)
I0612 15:48:45.729379 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.809255 (* 0.0909091 = 0.0735687 loss)
I0612 15:48:45.729393 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 2.10966 (* 0.0909091 = 0.191787 loss)
I0612 15:48:45.729408 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.848722 (* 0.0909091 = 0.0771565 loss)
I0612 15:48:45.729421 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 1.24916 (* 0.0909091 = 0.11356 loss)
I0612 15:48:45.729435 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.22703 (* 0.0909091 = 0.111548 loss)
I0612 15:48:45.729449 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.221922 (* 0.0909091 = 0.0201747 loss)
I0612 15:48:45.729463 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.435249 (* 0.0909091 = 0.0395681 loss)
I0612 15:48:45.729477 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.749397 (* 0.0909091 = 0.068127 loss)
I0612 15:48:45.729491 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.123265 (* 0.0909091 = 0.0112059 loss)
I0612 15:48:45.729506 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.233741 (* 0.0909091 = 0.0212492 loss)
I0612 15:48:45.729519 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.176927 (* 0.0909091 = 0.0160843 loss)
I0612 15:48:45.729533 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.171233 (* 0.0909091 = 0.0155667 loss)
I0612 15:48:45.729547 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.109661 (* 0.0909091 = 0.00996918 loss)
I0612 15:48:45.729562 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.325988 (* 0.0909091 = 0.0296353 loss)
I0612 15:48:45.729576 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0107028 (* 0.0909091 = 0.000972984 loss)
I0612 15:48:45.729590 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00132284 (* 0.0909091 = 0.000120258 loss)
I0612 15:48:45.729604 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.000390389 (* 0.0909091 = 3.549e-05 loss)
I0612 15:48:45.729619 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000119688 (* 0.0909091 = 1.08807e-05 loss)
I0612 15:48:45.729634 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 9.40461e-05 (* 0.0909091 = 8.54964e-06 loss)
I0612 15:48:45.729647 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 3.86204e-05 (* 0.0909091 = 3.51094e-06 loss)
I0612 15:48:45.729660 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.25
I0612 15:48:45.729682 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.125
I0612 15:48:45.729696 6181 solver.cpp:245] Train net output #149: total_confidence = 0.228186
I0612 15:48:45.729707 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.216533
I0612 15:48:45.729720 6181 sgd_solver.cpp:106] Iteration 4500, lr = 0.001
I0612 15:49:00.005985 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.2241 > 30) by scale factor 0.93098
I0612 15:49:23.286460 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.2367 > 30) by scale factor 0.992173
I0612 15:49:27.929306 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.6767 > 30) by scale factor 0.947069
I0612 15:49:35.681351 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 40.8345 > 30) by scale factor 0.734674
I0612 15:51:00.196864 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.8832 > 30) by scale factor 0.885395
I0612 15:51:41.204962 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 53.3357 > 30) by scale factor 0.562475
I0612 15:53:17.202457 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 41.3955 > 30) by scale factor 0.724716
I0612 15:53:31.126236 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.4513 > 30) by scale factor 0.80104
I0612 15:53:35.767758 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.2371 > 30) by scale factor 0.930606
I0612 15:55:12.711760 6181 solver.cpp:338] Iteration 5000, Testing net (#0)
I0612 15:56:10.189697 6181 solver.cpp:393] Test loss: 2.77913
I0612 15:56:10.189806 6181 solver.cpp:406] Test net output #0: loss1/accuracy = 0.61279
I0612 15:56:10.189836 6181 solver.cpp:406] Test net output #1: loss1/accuracy01 = 0.765
I0612 15:56:10.189862 6181 solver.cpp:406] Test net output #2: loss1/accuracy02 = 0.644
I0612 15:56:10.189887 6181 solver.cpp:406] Test net output #3: loss1/accuracy03 = 0.528
I0612 15:56:10.189910 6181 solver.cpp:406] Test net output #4: loss1/accuracy04 = 0.481
I0612 15:56:10.189936 6181 solver.cpp:406] Test net output #5: loss1/accuracy05 = 0.513
I0612 15:56:10.189960 6181 solver.cpp:406] Test net output #6: loss1/accuracy06 = 0.706
I0612 15:56:10.189985 6181 solver.cpp:406] Test net output #7: loss1/accuracy07 = 0.847
I0612 15:56:10.190007 6181 solver.cpp:406] Test net output #8: loss1/accuracy08 = 0.929
I0612 15:56:10.190029 6181 solver.cpp:406] Test net output #9: loss1/accuracy09 = 0.965
I0612 15:56:10.190052 6181 solver.cpp:406] Test net output #10: loss1/accuracy10 = 0.983
I0612 15:56:10.190075 6181 solver.cpp:406] Test net output #11: loss1/accuracy11 = 0.995
I0612 15:56:10.190098 6181 solver.cpp:406] Test net output #12: loss1/accuracy12 = 1
I0612 15:56:10.190120 6181 solver.cpp:406] Test net output #13: loss1/accuracy13 = 1
I0612 15:56:10.190141 6181 solver.cpp:406] Test net output #14: loss1/accuracy14 = 1
I0612 15:56:10.190163 6181 solver.cpp:406] Test net output #15: loss1/accuracy15 = 1
I0612 15:56:10.190186 6181 solver.cpp:406] Test net output #16: loss1/accuracy16 = 1
I0612 15:56:10.190207 6181 solver.cpp:406] Test net output #17: loss1/accuracy17 = 1
I0612 15:56:10.190237 6181 solver.cpp:406] Test net output #18: loss1/accuracy18 = 1
I0612 15:56:10.190261 6181 solver.cpp:406] Test net output #19: loss1/accuracy19 = 1
I0612 15:56:10.190286 6181 solver.cpp:406] Test net output #20: loss1/accuracy20 = 1
I0612 15:56:10.190310 6181 solver.cpp:406] Test net output #21: loss1/accuracy21 = 1
I0612 15:56:10.190331 6181 solver.cpp:406] Test net output #22: loss1/accuracy22 = 1
I0612 15:56:10.190353 6181 solver.cpp:406] Test net output #23: loss1/accuracy_incl_empty = 0.886503
I0612 15:56:10.190376 6181 solver.cpp:406] Test net output #24: loss1/accuracy_top3 = 0.828408
I0612 15:56:10.190405 6181 solver.cpp:406] Test net output #25: loss1/cross_entropy_loss = 1.42253 (* 0.3 = 0.42676 loss)
I0612 15:56:10.190433 6181 solver.cpp:406] Test net output #26: loss1/cross_entropy_loss_incl_empty = 0.429562 (* 0.3 = 0.128868 loss)
I0612 15:56:10.190462 6181 solver.cpp:406] Test net output #27: loss1/loss01 = 0.970657 (* 0.0272727 = 0.0264725 loss)
I0612 15:56:10.190490 6181 solver.cpp:406] Test net output #28: loss1/loss02 = 1.40044 (* 0.0272727 = 0.0381938 loss)
I0612 15:56:10.190516 6181 solver.cpp:406] Test net output #29: loss1/loss03 = 1.74624 (* 0.0272727 = 0.0476248 loss)
I0612 15:56:10.190543 6181 solver.cpp:406] Test net output #30: loss1/loss04 = 1.85905 (* 0.0272727 = 0.0507015 loss)
I0612 15:56:10.190570 6181 solver.cpp:406] Test net output #31: loss1/loss05 = 1.70153 (* 0.0272727 = 0.0464054 loss)
I0612 15:56:10.190598 6181 solver.cpp:406] Test net output #32: loss1/loss06 = 1.07245 (* 0.0272727 = 0.0292487 loss)
I0612 15:56:10.190623 6181 solver.cpp:406] Test net output #33: loss1/loss07 = 0.550973 (* 0.0272727 = 0.0150265 loss)
I0612 15:56:10.190650 6181 solver.cpp:406] Test net output #34: loss1/loss08 = 0.287541 (* 0.0272727 = 0.00784202 loss)
I0612 15:56:10.190677 6181 solver.cpp:406] Test net output #35: loss1/loss09 = 0.16202 (* 0.0272727 = 0.00441873 loss)
I0612 15:56:10.190704 6181 solver.cpp:406] Test net output #36: loss1/loss10 = 0.0927946 (* 0.0272727 = 0.00253076 loss)
I0612 15:56:10.190732 6181 solver.cpp:406] Test net output #37: loss1/loss11 = 0.0279132 (* 0.0272727 = 0.000761269 loss)
I0612 15:56:10.190759 6181 solver.cpp:406] Test net output #38: loss1/loss12 = 0.0169605 (* 0.0272727 = 0.000462558 loss)
I0612 15:56:10.190786 6181 solver.cpp:406] Test net output #39: loss1/loss13 = 0.0117068 (* 0.0272727 = 0.000319276 loss)
I0612 15:56:10.190845 6181 solver.cpp:406] Test net output #40: loss1/loss14 = 0.00816365 (* 0.0272727 = 0.000222645 loss)
I0612 15:56:10.190876 6181 solver.cpp:406] Test net output #41: loss1/loss15 = 0.00655553 (* 0.0272727 = 0.000178787 loss)
I0612 15:56:10.190902 6181 solver.cpp:406] Test net output #42: loss1/loss16 = 0.00521814 (* 0.0272727 = 0.000142313 loss)
I0612 15:56:10.190929 6181 solver.cpp:406] Test net output #43: loss1/loss17 = 0.00479805 (* 0.0272727 = 0.000130856 loss)
I0612 15:56:10.190956 6181 solver.cpp:406] Test net output #44: loss1/loss18 = 0.00425253 (* 0.0272727 = 0.000115978 loss)
I0612 15:56:10.190984 6181 solver.cpp:406] Test net output #45: loss1/loss19 = 0.00406381 (* 0.0272727 = 0.000110831 loss)
I0612 15:56:10.191010 6181 solver.cpp:406] Test net output #46: loss1/loss20 = 0.00375731 (* 0.0272727 = 0.000102472 loss)
I0612 15:56:10.191036 6181 solver.cpp:406] Test net output #47: loss1/loss21 = 0.00369548 (* 0.0272727 = 0.000100786 loss)
I0612 15:56:10.191062 6181 solver.cpp:406] Test net output #48: loss1/loss22 = 0.00315382 (* 0.0272727 = 8.60133e-05 loss)
I0612 15:56:10.191085 6181 solver.cpp:406] Test net output #49: loss2/accuracy = 0.773736
I0612 15:56:10.191108 6181 solver.cpp:406] Test net output #50: loss2/accuracy01 = 0.852
I0612 15:56:10.191131 6181 solver.cpp:406] Test net output #51: loss2/accuracy02 = 0.805
I0612 15:56:10.191154 6181 solver.cpp:406] Test net output #52: loss2/accuracy03 = 0.743
I0612 15:56:10.191179 6181 solver.cpp:406] Test net output #53: loss2/accuracy04 = 0.665
I0612 15:56:10.191201 6181 solver.cpp:406] Test net output #54: loss2/accuracy05 = 0.645
I0612 15:56:10.191225 6181 solver.cpp:406] Test net output #55: loss2/accuracy06 = 0.786
I0612 15:56:10.191246 6181 solver.cpp:406] Test net output #56: loss2/accuracy07 = 0.883
I0612 15:56:10.191267 6181 solver.cpp:406] Test net output #57: loss2/accuracy08 = 0.939
I0612 15:56:10.191294 6181 solver.cpp:406] Test net output #58: loss2/accuracy09 = 0.964
I0612 15:56:10.191318 6181 solver.cpp:406] Test net output #59: loss2/accuracy10 = 0.983
I0612 15:56:10.191340 6181 solver.cpp:406] Test net output #60: loss2/accuracy11 = 0.994
I0612 15:56:10.191362 6181 solver.cpp:406] Test net output #61: loss2/accuracy12 = 0.999
I0612 15:56:10.191385 6181 solver.cpp:406] Test net output #62: loss2/accuracy13 = 0.999
I0612 15:56:10.191406 6181 solver.cpp:406] Test net output #63: loss2/accuracy14 = 1
I0612 15:56:10.191428 6181 solver.cpp:406] Test net output #64: loss2/accuracy15 = 1
I0612 15:56:10.191450 6181 solver.cpp:406] Test net output #65: loss2/accuracy16 = 1
I0612 15:56:10.191471 6181 solver.cpp:406] Test net output #66: loss2/accuracy17 = 1
I0612 15:56:10.191493 6181 solver.cpp:406] Test net output #67: loss2/accuracy18 = 1
I0612 15:56:10.191514 6181 solver.cpp:406] Test net output #68: loss2/accuracy19 = 1
I0612 15:56:10.191536 6181 solver.cpp:406] Test net output #69: loss2/accuracy20 = 1
I0612 15:56:10.191557 6181 solver.cpp:406] Test net output #70: loss2/accuracy21 = 1
I0612 15:56:10.191579 6181 solver.cpp:406] Test net output #71: loss2/accuracy22 = 1
I0612 15:56:10.191601 6181 solver.cpp:406] Test net output #72: loss2/accuracy_incl_empty = 0.931137
I0612 15:56:10.191623 6181 solver.cpp:406] Test net output #73: loss2/accuracy_top3 = 0.897004
I0612 15:56:10.191649 6181 solver.cpp:406] Test net output #74: loss2/cross_entropy_loss = 0.946056 (* 0.3 = 0.283817 loss)
I0612 15:56:10.191676 6181 solver.cpp:406] Test net output #75: loss2/cross_entropy_loss_incl_empty = 0.291842 (* 0.3 = 0.0875526 loss)
I0612 15:56:10.191702 6181 solver.cpp:406] Test net output #76: loss2/loss01 = 0.703218 (* 0.0272727 = 0.0191787 loss)
I0612 15:56:10.191727 6181 solver.cpp:406] Test net output #77: loss2/loss02 = 0.906987 (* 0.0272727 = 0.024736 loss)
I0612 15:56:10.191771 6181 solver.cpp:406] Test net output #78: loss2/loss03 = 1.09056 (* 0.0272727 = 0.0297425 loss)
I0612 15:56:10.191797 6181 solver.cpp:406] Test net output #79: loss2/loss04 = 1.27631 (* 0.0272727 = 0.0348085 loss)
I0612 15:56:10.191824 6181 solver.cpp:406] Test net output #80: loss2/loss05 = 1.22098 (* 0.0272727 = 0.0332993 loss)
I0612 15:56:10.191850 6181 solver.cpp:406] Test net output #81: loss2/loss06 = 0.790008 (* 0.0272727 = 0.0215457 loss)
I0612 15:56:10.191879 6181 solver.cpp:406] Test net output #82: loss2/loss07 = 0.431211 (* 0.0272727 = 0.0117603 loss)
I0612 15:56:10.191907 6181 solver.cpp:406] Test net output #83: loss2/loss08 = 0.238874 (* 0.0272727 = 0.00651474 loss)
I0612 15:56:10.191933 6181 solver.cpp:406] Test net output #84: loss2/loss09 = 0.133597 (* 0.0272727 = 0.00364356 loss)
I0612 15:56:10.191959 6181 solver.cpp:406] Test net output #85: loss2/loss10 = 0.079983 (* 0.0272727 = 0.00218136 loss)
I0612 15:56:10.191987 6181 solver.cpp:406] Test net output #86: loss2/loss11 = 0.0197693 (* 0.0272727 = 0.000539161 loss)
I0612 15:56:10.192013 6181 solver.cpp:406] Test net output #87: loss2/loss12 = 0.0109678 (* 0.0272727 = 0.000299122 loss)
I0612 15:56:10.192037 6181 solver.cpp:406] Test net output #88: loss2/loss13 = 0.00723238 (* 0.0272727 = 0.000197247 loss)
I0612 15:56:10.192065 6181 solver.cpp:406] Test net output #89: loss2/loss14 = 0.00545899 (* 0.0272727 = 0.000148882 loss)
I0612 15:56:10.192090 6181 solver.cpp:406] Test net output #90: loss2/loss15 = 0.00482806 (* 0.0272727 = 0.000131674 loss)
I0612 15:56:10.192116 6181 solver.cpp:406] Test net output #91: loss2/loss16 = 0.00426165 (* 0.0272727 = 0.000116227 loss)
I0612 15:56:10.192142 6181 solver.cpp:406] Test net output #92: loss2/loss17 = 0.00430679 (* 0.0272727 = 0.000117458 loss)
I0612 15:56:10.192169 6181 solver.cpp:406] Test net output #93: loss2/loss18 = 0.00462882 (* 0.0272727 = 0.000126241 loss)
I0612 15:56:10.192194 6181 solver.cpp:406] Test net output #94: loss2/loss19 = 0.00391404 (* 0.0272727 = 0.000106747 loss)
I0612 15:56:10.192220 6181 solver.cpp:406] Test net output #95: loss2/loss20 = 0.00410288 (* 0.0272727 = 0.000111897 loss)
I0612 15:56:10.192246 6181 solver.cpp:406] Test net output #96: loss2/loss21 = 0.00411018 (* 0.0272727 = 0.000112096 loss)
I0612 15:56:10.192272 6181 solver.cpp:406] Test net output #97: loss2/loss22 = 0.00331837 (* 0.0272727 = 9.05009e-05 loss)
I0612 15:56:10.192296 6181 solver.cpp:406] Test net output #98: loss3/accuracy = 0.842862
I0612 15:56:10.192317 6181 solver.cpp:406] Test net output #99: loss3/accuracy01 = 0.875
I0612 15:56:10.192342 6181 solver.cpp:406] Test net output #100: loss3/accuracy02 = 0.859
I0612 15:56:10.192365 6181 solver.cpp:406] Test net output #101: loss3/accuracy03 = 0.829
I0612 15:56:10.192387 6181 solver.cpp:406] Test net output #102: loss3/accuracy04 = 0.837
I0612 15:56:10.192409 6181 solver.cpp:406] Test net output #103: loss3/accuracy05 = 0.83
I0612 15:56:10.192430 6181 solver.cpp:406] Test net output #104: loss3/accuracy06 = 0.882
I0612 15:56:10.192451 6181 solver.cpp:406] Test net output #105: loss3/accuracy07 = 0.914
I0612 15:56:10.192473 6181 solver.cpp:406] Test net output #106: loss3/accuracy08 = 0.952
I0612 15:56:10.192494 6181 solver.cpp:406] Test net output #107: loss3/accuracy09 = 0.966
I0612 15:56:10.192515 6181 solver.cpp:406] Test net output #108: loss3/accuracy10 = 0.985
I0612 15:56:10.192538 6181 solver.cpp:406] Test net output #109: loss3/accuracy11 = 0.995
I0612 15:56:10.192559 6181 solver.cpp:406] Test net output #110: loss3/accuracy12 = 0.996
I0612 15:56:10.192580 6181 solver.cpp:406] Test net output #111: loss3/accuracy13 = 0.999
I0612 15:56:10.192602 6181 solver.cpp:406] Test net output #112: loss3/accuracy14 = 0.999
I0612 15:56:10.192623 6181 solver.cpp:406] Test net output #113: loss3/accuracy15 = 0.999
I0612 15:56:10.192646 6181 solver.cpp:406] Test net output #114: loss3/accuracy16 = 1
I0612 15:56:10.192682 6181 solver.cpp:406] Test net output #115: loss3/accuracy17 = 1
I0612 15:56:10.192705 6181 solver.cpp:406] Test net output #116: loss3/accuracy18 = 1
I0612 15:56:10.192728 6181 solver.cpp:406] Test net output #117: loss3/accuracy19 = 1
I0612 15:56:10.192749 6181 solver.cpp:406] Test net output #118: loss3/accuracy20 = 1
I0612 15:56:10.192770 6181 solver.cpp:406] Test net output #119: loss3/accuracy21 = 1
I0612 15:56:10.192791 6181 solver.cpp:406] Test net output #120: loss3/accuracy22 = 1
I0612 15:56:10.192812 6181 solver.cpp:406] Test net output #121: loss3/accuracy_incl_empty = 0.947
I0612 15:56:10.192833 6181 solver.cpp:406] Test net output #122: loss3/accuracy_top3 = 0.916669
I0612 15:56:10.192859 6181 solver.cpp:406] Test net output #123: loss3/cross_entropy_loss = 0.70639 (* 1 = 0.70639 loss)
I0612 15:56:10.192886 6181 solver.cpp:406] Test net output #124: loss3/cross_entropy_loss_incl_empty = 0.237633 (* 1 = 0.237633 loss)
I0612 15:56:10.192911 6181 solver.cpp:406] Test net output #125: loss3/loss01 = 0.588177 (* 0.0909091 = 0.0534707 loss)
I0612 15:56:10.192941 6181 solver.cpp:406] Test net output #126: loss3/loss02 = 0.677454 (* 0.0909091 = 0.0615867 loss)
I0612 15:56:10.192970 6181 solver.cpp:406] Test net output #127: loss3/loss03 = 0.777615 (* 0.0909091 = 0.0706922 loss)
I0612 15:56:10.192994 6181 solver.cpp:406] Test net output #128: loss3/loss04 = 0.75766 (* 0.0909091 = 0.0688782 loss)
I0612 15:56:10.193020 6181 solver.cpp:406] Test net output #129: loss3/loss05 = 0.775806 (* 0.0909091 = 0.0705278 loss)
I0612 15:56:10.193047 6181 solver.cpp:406] Test net output #130: loss3/loss06 = 0.533875 (* 0.0909091 = 0.0485341 loss)
I0612 15:56:10.193071 6181 solver.cpp:406] Test net output #131: loss3/loss07 = 0.337374 (* 0.0909091 = 0.0306704 loss)
I0612 15:56:10.193096 6181 solver.cpp:406] Test net output #132: loss3/loss08 = 0.214912 (* 0.0909091 = 0.0195374 loss)
I0612 15:56:10.193123 6181 solver.cpp:406] Test net output #133: loss3/loss09 = 0.124529 (* 0.0909091 = 0.0113208 loss)
I0612 15:56:10.193150 6181 solver.cpp:406] Test net output #134: loss3/loss10 = 0.0689842 (* 0.0909091 = 0.00627129 loss)
I0612 15:56:10.193174 6181 solver.cpp:406] Test net output #135: loss3/loss11 = 0.0227967 (* 0.0909091 = 0.00207243 loss)
I0612 15:56:10.193200 6181 solver.cpp:406] Test net output #136: loss3/loss12 = 0.0143388 (* 0.0909091 = 0.00130353 loss)
I0612 15:56:10.193225 6181 solver.cpp:406] Test net output #137: loss3/loss13 = 0.00773135 (* 0.0909091 = 0.00070285 loss)
I0612 15:56:10.193250 6181 solver.cpp:406] Test net output #138: loss3/loss14 = 0.00425286 (* 0.0909091 = 0.000386623 loss)
I0612 15:56:10.193276 6181 solver.cpp:406] Test net output #139: loss3/loss15 = 0.00352137 (* 0.0909091 = 0.000320124 loss)
I0612 15:56:10.193303 6181 solver.cpp:406] Test net output #140: loss3/loss16 = 0.00232334 (* 0.0909091 = 0.000211212 loss)
I0612 15:56:10.193351 6181 solver.cpp:406] Test net output #141: loss3/loss17 = 0.00206039 (* 0.0909091 = 0.000187308 loss)
I0612 15:56:10.193384 6181 solver.cpp:406] Test net output #142: loss3/loss18 = 0.00191709 (* 0.0909091 = 0.000174281 loss)
I0612 15:56:10.193411 6181 solver.cpp:406] Test net output #143: loss3/loss19 = 0.0017751 (* 0.0909091 = 0.000161373 loss)
I0612 15:56:10.193437 6181 solver.cpp:406] Test net output #144: loss3/loss20 = 0.00174329 (* 0.0909091 = 0.000158481 loss)
I0612 15:56:10.193464 6181 solver.cpp:406] Test net output #145: loss3/loss21 = 0.00158349 (* 0.0909091 = 0.000143953 loss)
I0612 15:56:10.193490 6181 solver.cpp:406] Test net output #146: loss3/loss22 = 0.000984662 (* 0.0909091 = 8.95147e-05 loss)
I0612 15:56:10.193512 6181 solver.cpp:406] Test net output #147: total_accuracy = 0.544
I0612 15:56:10.193536 6181 solver.cpp:406] Test net output #148: total_accuracy_not_rec = 0.55
I0612 15:56:10.193557 6181 solver.cpp:406] Test net output #149: total_confidence = 0.45841
I0612 15:56:10.193596 6181 solver.cpp:406] Test net output #150: total_confidence_not_rec = 0.429381
I0612 15:56:10.193620 6181 solver.cpp:338] Iteration 5000, Testing net (#1)
I0612 15:57:07.763401 6181 solver.cpp:393] Test loss: 3.67271
I0612 15:57:07.763520 6181 solver.cpp:406] Test net output #0: loss1/accuracy = 0.573268
I0612 15:57:07.763540 6181 solver.cpp:406] Test net output #1: loss1/accuracy01 = 0.754
I0612 15:57:07.763552 6181 solver.cpp:406] Test net output #2: loss1/accuracy02 = 0.634
I0612 15:57:07.763566 6181 solver.cpp:406] Test net output #3: loss1/accuracy03 = 0.535
I0612 15:57:07.763577 6181 solver.cpp:406] Test net output #4: loss1/accuracy04 = 0.469
I0612 15:57:07.763589 6181 solver.cpp:406] Test net output #5: loss1/accuracy05 = 0.497
I0612 15:57:07.763602 6181 solver.cpp:406] Test net output #6: loss1/accuracy06 = 0.65
I0612 15:57:07.763614 6181 solver.cpp:406] Test net output #7: loss1/accuracy07 = 0.761
I0612 15:57:07.763627 6181 solver.cpp:406] Test net output #8: loss1/accuracy08 = 0.815
I0612 15:57:07.763638 6181 solver.cpp:406] Test net output #9: loss1/accuracy09 = 0.844
I0612 15:57:07.763649 6181 solver.cpp:406] Test net output #10: loss1/accuracy10 = 0.869
I0612 15:57:07.763661 6181 solver.cpp:406] Test net output #11: loss1/accuracy11 = 0.888
I0612 15:57:07.763674 6181 solver.cpp:406] Test net output #12: loss1/accuracy12 = 0.902
I0612 15:57:07.763686 6181 solver.cpp:406] Test net output #13: loss1/accuracy13 = 0.914
I0612 15:57:07.763697 6181 solver.cpp:406] Test net output #14: loss1/accuracy14 = 0.936
I0612 15:57:07.763710 6181 solver.cpp:406] Test net output #15: loss1/accuracy15 = 0.949
I0612 15:57:07.763721 6181 solver.cpp:406] Test net output #16: loss1/accuracy16 = 0.962
I0612 15:57:07.763733 6181 solver.cpp:406] Test net output #17: loss1/accuracy17 = 0.978
I0612 15:57:07.763746 6181 solver.cpp:406] Test net output #18: loss1/accuracy18 = 0.98
I0612 15:57:07.763756 6181 solver.cpp:406] Test net output #19: loss1/accuracy19 = 0.981
I0612 15:57:07.763768 6181 solver.cpp:406] Test net output #20: loss1/accuracy20 = 0.989
I0612 15:57:07.763780 6181 solver.cpp:406] Test net output #21: loss1/accuracy21 = 0.996
I0612 15:57:07.763792 6181 solver.cpp:406] Test net output #22: loss1/accuracy22 = 0.997
I0612 15:57:07.763804 6181 solver.cpp:406] Test net output #23: loss1/accuracy_incl_empty = 0.840047
I0612 15:57:07.763816 6181 solver.cpp:406] Test net output #24: loss1/accuracy_top3 = 0.791242
I0612 15:57:07.763833 6181 solver.cpp:406] Test net output #25: loss1/cross_entropy_loss = 1.58055 (* 0.3 = 0.474166 loss)
I0612 15:57:07.763847 6181 solver.cpp:406] Test net output #26: loss1/cross_entropy_loss_incl_empty = 0.606634 (* 0.3 = 0.18199 loss)
I0612 15:57:07.763862 6181 solver.cpp:406] Test net output #27: loss1/loss01 = 1.07179 (* 0.0272727 = 0.0292307 loss)
I0612 15:57:07.763876 6181 solver.cpp:406] Test net output #28: loss1/loss02 = 1.40227 (* 0.0272727 = 0.0382437 loss)
I0612 15:57:07.763890 6181 solver.cpp:406] Test net output #29: loss1/loss03 = 1.7461 (* 0.0272727 = 0.047621 loss)
I0612 15:57:07.763906 6181 solver.cpp:406] Test net output #30: loss1/loss04 = 1.89434 (* 0.0272727 = 0.0516637 loss)
I0612 15:57:07.763919 6181 solver.cpp:406] Test net output #31: loss1/loss05 = 1.74324 (* 0.0272727 = 0.0475429 loss)
I0612 15:57:07.763933 6181 solver.cpp:406] Test net output #32: loss1/loss06 = 1.28232 (* 0.0272727 = 0.0349723 loss)
I0612 15:57:07.763947 6181 solver.cpp:406] Test net output #33: loss1/loss07 = 0.85177 (* 0.0272727 = 0.0232301 loss)
I0612 15:57:07.763960 6181 solver.cpp:406] Test net output #34: loss1/loss08 = 0.697243 (* 0.0272727 = 0.0190157 loss)
I0612 15:57:07.763974 6181 solver.cpp:406] Test net output #35: loss1/loss09 = 0.608058 (* 0.0272727 = 0.0165834 loss)
I0612 15:57:07.763988 6181 solver.cpp:406] Test net output #36: loss1/loss10 = 0.529701 (* 0.0272727 = 0.0144464 loss)
I0612 15:57:07.764003 6181 solver.cpp:406] Test net output #37: loss1/loss11 = 0.470834 (* 0.0272727 = 0.0128409 loss)
I0612 15:57:07.764016 6181 solver.cpp:406] Test net output #38: loss1/loss12 = 0.402301 (* 0.0272727 = 0.0109718 loss)
I0612 15:57:07.764052 6181 solver.cpp:406] Test net output #39: loss1/loss13 = 0.362139 (* 0.0272727 = 0.00987653 loss)
I0612 15:57:07.764068 6181 solver.cpp:406] Test net output #40: loss1/loss14 = 0.270206 (* 0.0272727 = 0.00736926 loss)
I0612 15:57:07.764082 6181 solver.cpp:406] Test net output #41: loss1/loss15 = 0.233681 (* 0.0272727 = 0.00637311 loss)
I0612 15:57:07.764096 6181 solver.cpp:406] Test net output #42: loss1/loss16 = 0.19229 (* 0.0272727 = 0.00524427 loss)
I0612 15:57:07.764109 6181 solver.cpp:406] Test net output #43: loss1/loss17 = 0.132188 (* 0.0272727 = 0.00360513 loss)
I0612 15:57:07.764123 6181 solver.cpp:406] Test net output #44: loss1/loss18 = 0.125607 (* 0.0272727 = 0.00342564 loss)
I0612 15:57:07.764137 6181 solver.cpp:406] Test net output #45: loss1/loss19 = 0.115516 (* 0.0272727 = 0.00315045 loss)
I0612 15:57:07.764152 6181 solver.cpp:406] Test net output #46: loss1/loss20 = 0.0691872 (* 0.0272727 = 0.00188692 loss)
I0612 15:57:07.764165 6181 solver.cpp:406] Test net output #47: loss1/loss21 = 0.0295276 (* 0.0272727 = 0.000805299 loss)
I0612 15:57:07.764179 6181 solver.cpp:406] Test net output #48: loss1/loss22 = 0.0256001 (* 0.0272727 = 0.000698183 loss)
I0612 15:57:07.764191 6181 solver.cpp:406] Test net output #49: loss2/accuracy = 0.729222
I0612 15:57:07.764204 6181 solver.cpp:406] Test net output #50: loss2/accuracy01 = 0.856
I0612 15:57:07.764215 6181 solver.cpp:406] Test net output #51: loss2/accuracy02 = 0.823
I0612 15:57:07.764230 6181 solver.cpp:406] Test net output #52: loss2/accuracy03 = 0.727
I0612 15:57:07.764241 6181 solver.cpp:406] Test net output #53: loss2/accuracy04 = 0.631
I0612 15:57:07.764253 6181 solver.cpp:406] Test net output #54: loss2/accuracy05 = 0.657
I0612 15:57:07.764264 6181 solver.cpp:406] Test net output #55: loss2/accuracy06 = 0.727
I0612 15:57:07.764276 6181 solver.cpp:406] Test net output #56: loss2/accuracy07 = 0.808
I0612 15:57:07.764288 6181 solver.cpp:406] Test net output #57: loss2/accuracy08 = 0.832
I0612 15:57:07.764299 6181 solver.cpp:406] Test net output #58: loss2/accuracy09 = 0.864
I0612 15:57:07.764312 6181 solver.cpp:406] Test net output #59: loss2/accuracy10 = 0.878
I0612 15:57:07.764323 6181 solver.cpp:406] Test net output #60: loss2/accuracy11 = 0.898
I0612 15:57:07.764334 6181 solver.cpp:406] Test net output #61: loss2/accuracy12 = 0.912
I0612 15:57:07.764346 6181 solver.cpp:406] Test net output #62: loss2/accuracy13 = 0.915
I0612 15:57:07.764358 6181 solver.cpp:406] Test net output #63: loss2/accuracy14 = 0.937
I0612 15:57:07.764369 6181 solver.cpp:406] Test net output #64: loss2/accuracy15 = 0.95
I0612 15:57:07.764381 6181 solver.cpp:406] Test net output #65: loss2/accuracy16 = 0.962
I0612 15:57:07.764392 6181 solver.cpp:406] Test net output #66: loss2/accuracy17 = 0.978
I0612 15:57:07.764403 6181 solver.cpp:406] Test net output #67: loss2/accuracy18 = 0.98
I0612 15:57:07.764415 6181 solver.cpp:406] Test net output #68: loss2/accuracy19 = 0.981
I0612 15:57:07.764427 6181 solver.cpp:406] Test net output #69: loss2/accuracy20 = 0.989
I0612 15:57:07.764438 6181 solver.cpp:406] Test net output #70: loss2/accuracy21 = 0.996
I0612 15:57:07.764451 6181 solver.cpp:406] Test net output #71: loss2/accuracy22 = 0.997
I0612 15:57:07.764462 6181 solver.cpp:406] Test net output #72: loss2/accuracy_incl_empty = 0.887546
I0612 15:57:07.764473 6181 solver.cpp:406] Test net output #73: loss2/accuracy_top3 = 0.870103
I0612 15:57:07.764487 6181 solver.cpp:406] Test net output #74: loss2/cross_entropy_loss = 1.08351 (* 0.3 = 0.325054 loss)
I0612 15:57:07.764502 6181 solver.cpp:406] Test net output #75: loss2/cross_entropy_loss_incl_empty = 0.443797 (* 0.3 = 0.133139 loss)
I0612 15:57:07.764515 6181 solver.cpp:406] Test net output #76: loss2/loss01 = 0.70306 (* 0.0272727 = 0.0191744 loss)
I0612 15:57:07.764528 6181 solver.cpp:406] Test net output #77: loss2/loss02 = 0.837799 (* 0.0272727 = 0.0228491 loss)
I0612 15:57:07.764556 6181 solver.cpp:406] Test net output #78: loss2/loss03 = 1.0916 (* 0.0272727 = 0.029771 loss)
I0612 15:57:07.764571 6181 solver.cpp:406] Test net output #79: loss2/loss04 = 1.2769 (* 0.0272727 = 0.0348245 loss)
I0612 15:57:07.764585 6181 solver.cpp:406] Test net output #80: loss2/loss05 = 1.26175 (* 0.0272727 = 0.0344114 loss)
I0612 15:57:07.764598 6181 solver.cpp:406] Test net output #81: loss2/loss06 = 1.01067 (* 0.0272727 = 0.0275636 loss)
I0612 15:57:07.764612 6181 solver.cpp:406] Test net output #82: loss2/loss07 = 0.716031 (* 0.0272727 = 0.0195281 loss)
I0612 15:57:07.764626 6181 solver.cpp:406] Test net output #83: loss2/loss08 = 0.617277 (* 0.0272727 = 0.0168348 loss)
I0612 15:57:07.764636 6181 solver.cpp:406] Test net output #84: loss2/loss09 = 0.5355 (* 0.0272727 = 0.0146045 loss)
I0612 15:57:07.764644 6181 solver.cpp:406] Test net output #85: loss2/loss10 = 0.472369 (* 0.0272727 = 0.0128828 loss)
I0612 15:57:07.764658 6181 solver.cpp:406] Test net output #86: loss2/loss11 = 0.4225 (* 0.0272727 = 0.0115227 loss)
I0612 15:57:07.764672 6181 solver.cpp:406] Test net output #87: loss2/loss12 = 0.349525 (* 0.0272727 = 0.00953251 loss)
I0612 15:57:07.764686 6181 solver.cpp:406] Test net output #88: loss2/loss13 = 0.330204 (* 0.0272727 = 0.00900557 loss)
I0612 15:57:07.764700 6181 solver.cpp:406] Test net output #89: loss2/loss14 = 0.243261 (* 0.0272727 = 0.0066344 loss)
I0612 15:57:07.764714 6181 solver.cpp:406] Test net output #90: loss2/loss15 = 0.217336 (* 0.0272727 = 0.00592734 loss)
I0612 15:57:07.764727 6181 solver.cpp:406] Test net output #91: loss2/loss16 = 0.177074 (* 0.0272727 = 0.00482928 loss)
I0612 15:57:07.764741 6181 solver.cpp:406] Test net output #92: loss2/loss17 = 0.124111 (* 0.0272727 = 0.00338485 loss)
I0612 15:57:07.764755 6181 solver.cpp:406] Test net output #93: loss2/loss18 = 0.114172 (* 0.0272727 = 0.00311378 loss)
I0612 15:57:07.764768 6181 solver.cpp:406] Test net output #94: loss2/loss19 = 0.114453 (* 0.0272727 = 0.00312145 loss)
I0612 15:57:07.764782 6181 solver.cpp:406] Test net output #95: loss2/loss20 = 0.0720029 (* 0.0272727 = 0.00196371 loss)
I0612 15:57:07.764796 6181 solver.cpp:406] Test net output #96: loss2/loss21 = 0.0355666 (* 0.0272727 = 0.000969997 loss)
I0612 15:57:07.764809 6181 solver.cpp:406] Test net output #97: loss2/loss22 = 0.0311395 (* 0.0272727 = 0.00084926 loss)
I0612 15:57:07.764822 6181 solver.cpp:406] Test net output #98: loss3/accuracy = 0.811908
I0612 15:57:07.764833 6181 solver.cpp:406] Test net output #99: loss3/accuracy01 = 0.881
I0612 15:57:07.764845 6181 solver.cpp:406] Test net output #100: loss3/accuracy02 = 0.861
I0612 15:57:07.764856 6181 solver.cpp:406] Test net output #101: loss3/accuracy03 = 0.839
I0612 15:57:07.764868 6181 solver.cpp:406] Test net output #102: loss3/accuracy04 = 0.815
I0612 15:57:07.764879 6181 solver.cpp:406] Test net output #103: loss3/accuracy05 = 0.806
I0612 15:57:07.764890 6181 solver.cpp:406] Test net output #104: loss3/accuracy06 = 0.838
I0612 15:57:07.764901 6181 solver.cpp:406] Test net output #105: loss3/accuracy07 = 0.873
I0612 15:57:07.764914 6181 solver.cpp:406] Test net output #106: loss3/accuracy08 = 0.892
I0612 15:57:07.764925 6181 solver.cpp:406] Test net output #107: loss3/accuracy09 = 0.904
I0612 15:57:07.764935 6181 solver.cpp:406] Test net output #108: loss3/accuracy10 = 0.905
I0612 15:57:07.764947 6181 solver.cpp:406] Test net output #109: loss3/accuracy11 = 0.914
I0612 15:57:07.764958 6181 solver.cpp:406] Test net output #110: loss3/accuracy12 = 0.932
I0612 15:57:07.764969 6181 solver.cpp:406] Test net output #111: loss3/accuracy13 = 0.931
I0612 15:57:07.764981 6181 solver.cpp:406] Test net output #112: loss3/accuracy14 = 0.946
I0612 15:57:07.764992 6181 solver.cpp:406] Test net output #113: loss3/accuracy15 = 0.96
I0612 15:57:07.765003 6181 solver.cpp:406] Test net output #114: loss3/accuracy16 = 0.97
I0612 15:57:07.765024 6181 solver.cpp:406] Test net output #115: loss3/accuracy17 = 0.98
I0612 15:57:07.765036 6181 solver.cpp:406] Test net output #116: loss3/accuracy18 = 0.985
I0612 15:57:07.765048 6181 solver.cpp:406] Test net output #117: loss3/accuracy19 = 0.982
I0612 15:57:07.765059 6181 solver.cpp:406] Test net output #118: loss3/accuracy20 = 0.989
I0612 15:57:07.765071 6181 solver.cpp:406] Test net output #119: loss3/accuracy21 = 0.996
I0612 15:57:07.765082 6181 solver.cpp:406] Test net output #120: loss3/accuracy22 = 0.997
I0612 15:57:07.765096 6181 solver.cpp:406] Test net output #121: loss3/accuracy_incl_empty = 0.917273
I0612 15:57:07.765108 6181 solver.cpp:406] Test net output #122: loss3/accuracy_top3 = 0.90783
I0612 15:57:07.765123 6181 solver.cpp:406] Test net output #123: loss3/cross_entropy_loss = 0.812433 (* 1 = 0.812433 loss)
I0612 15:57:07.765136 6181 solver.cpp:406] Test net output #124: loss3/cross_entropy_loss_incl_empty = 0.349531 (* 1 = 0.349531 loss)
I0612 15:57:07.765151 6181 solver.cpp:406] Test net output #125: loss3/loss01 = 0.588224 (* 0.0909091 = 0.0534749 loss)
I0612 15:57:07.765163 6181 solver.cpp:406] Test net output #126: loss3/loss02 = 0.625959 (* 0.0909091 = 0.0569054 loss)
I0612 15:57:07.765177 6181 solver.cpp:406] Test net output #127: loss3/loss03 = 0.779524 (* 0.0909091 = 0.0708659 loss)
I0612 15:57:07.765190 6181 solver.cpp:406] Test net output #128: loss3/loss04 = 0.80569 (* 0.0909091 = 0.0732445 loss)
I0612 15:57:07.765203 6181 solver.cpp:406] Test net output #129: loss3/loss05 = 0.859163 (* 0.0909091 = 0.0781057 loss)
I0612 15:57:07.765218 6181 solver.cpp:406] Test net output #130: loss3/loss06 = 0.683156 (* 0.0909091 = 0.0621051 loss)
I0612 15:57:07.765230 6181 solver.cpp:406] Test net output #131: loss3/loss07 = 0.492138 (* 0.0909091 = 0.0447398 loss)
I0612 15:57:07.765244 6181 solver.cpp:406] Test net output #132: loss3/loss08 = 0.433561 (* 0.0909091 = 0.0394146 loss)
I0612 15:57:07.765257 6181 solver.cpp:406] Test net output #133: loss3/loss09 = 0.387357 (* 0.0909091 = 0.0352143 loss)
I0612 15:57:07.765271 6181 solver.cpp:406] Test net output #134: loss3/loss10 = 0.384922 (* 0.0909091 = 0.0349929 loss)
I0612 15:57:07.765302 6181 solver.cpp:406] Test net output #135: loss3/loss11 = 0.325391 (* 0.0909091 = 0.029581 loss)
I0612 15:57:07.765316 6181 solver.cpp:406] Test net output #136: loss3/loss12 = 0.284327 (* 0.0909091 = 0.0258479 loss)
I0612 15:57:07.765331 6181 solver.cpp:406] Test net output #137: loss3/loss13 = 0.255045 (* 0.0909091 = 0.0231859 loss)
I0612 15:57:07.765343 6181 solver.cpp:406] Test net output #138: loss3/loss14 = 0.210259 (* 0.0909091 = 0.0191144 loss)
I0612 15:57:07.765357 6181 solver.cpp:406] Test net output #139: loss3/loss15 = 0.183545 (* 0.0909091 = 0.0166859 loss)
I0612 15:57:07.765372 6181 solver.cpp:406] Test net output #140: loss3/loss16 = 0.149757 (* 0.0909091 = 0.0136143 loss)
I0612 15:57:07.765384 6181 solver.cpp:406] Test net output #141: loss3/loss17 = 0.112935 (* 0.0909091 = 0.0102668 loss)
I0612 15:57:07.765398 6181 solver.cpp:406] Test net output #142: loss3/loss18 = 0.103442 (* 0.0909091 = 0.00940384 loss)
I0612 15:57:07.765413 6181 solver.cpp:406] Test net output #143: loss3/loss19 = 0.101452 (* 0.0909091 = 0.00922295 loss)
I0612 15:57:07.765426 6181 solver.cpp:406] Test net output #144: loss3/loss20 = 0.0519387 (* 0.0909091 = 0.0047217 loss)
I0612 15:57:07.765440 6181 solver.cpp:406] Test net output #145: loss3/loss21 = 0.0213813 (* 0.0909091 = 0.00194375 loss)
I0612 15:57:07.765455 6181 solver.cpp:406] Test net output #146: loss3/loss22 = 0.0180438 (* 0.0909091 = 0.00164035 loss)
I0612 15:57:07.765466 6181 solver.cpp:406] Test net output #147: total_accuracy = 0.492
I0612 15:57:07.765478 6181 solver.cpp:406] Test net output #148: total_accuracy_not_rec = 0.478
I0612 15:57:07.765489 6181 solver.cpp:406] Test net output #149: total_confidence = 0.406055
I0612 15:57:07.765511 6181 solver.cpp:406] Test net output #150: total_confidence_not_rec = 0.378205
I0612 15:57:08.122671 6181 solver.cpp:229] Iteration 5000, loss = 4.22065
I0612 15:57:08.122723 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.378788
I0612 15:57:08.122740 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 15:57:08.122755 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 15:57:08.122766 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 15:57:08.122778 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 15:57:08.122791 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.25
I0612 15:57:08.122803 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.5
I0612 15:57:08.122815 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.5
I0612 15:57:08.122828 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.625
I0612 15:57:08.122840 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.5
I0612 15:57:08.122853 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.625
I0612 15:57:08.122864 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.625
I0612 15:57:08.122876 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.75
I0612 15:57:08.122889 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 15:57:08.122900 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.75
I0612 15:57:08.122912 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.75
I0612 15:57:08.122925 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 15:57:08.122937 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 15:57:08.122949 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 15:57:08.122961 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 15:57:08.122972 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 15:57:08.122983 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 15:57:08.122995 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 15:57:08.123008 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.761364
I0612 15:57:08.123019 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.636364
I0612 15:57:08.123036 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.15228 (* 0.3 = 0.645684 loss)
I0612 15:57:08.123050 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.855413 (* 0.3 = 0.256624 loss)
I0612 15:57:08.123065 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.0479 (* 0.0272727 = 0.0285791 loss)
I0612 15:57:08.123080 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.1333 (* 0.0272727 = 0.030908 loss)
I0612 15:57:08.123093 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.13623 (* 0.0272727 = 0.0582609 loss)
I0612 15:57:08.123111 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.8549 (* 0.0272727 = 0.0505882 loss)
I0612 15:57:08.123124 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 2.15388 (* 0.0272727 = 0.0587422 loss)
I0612 15:57:08.123138 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.81612 (* 0.0272727 = 0.0495304 loss)
I0612 15:57:08.123152 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.23627 (* 0.0272727 = 0.0337165 loss)
I0612 15:57:08.123167 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.40481 (* 0.0272727 = 0.0383131 loss)
I0612 15:57:08.123181 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 1.23617 (* 0.0272727 = 0.0337138 loss)
I0612 15:57:08.123198 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 1.78334 (* 0.0272727 = 0.0486366 loss)
I0612 15:57:08.123214 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 1.25252 (* 0.0272727 = 0.0341597 loss)
I0612 15:57:08.123250 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 1.17283 (* 0.0272727 = 0.0319863 loss)
I0612 15:57:08.123265 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.387846 (* 0.0272727 = 0.0105776 loss)
I0612 15:57:08.123280 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.927967 (* 0.0272727 = 0.0253082 loss)
I0612 15:57:08.123293 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 1.12688 (* 0.0272727 = 0.0307331 loss)
I0612 15:57:08.123308 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.0346099 (* 0.0272727 = 0.000943906 loss)
I0612 15:57:08.123322 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.0124108 (* 0.0272727 = 0.000338478 loss)
I0612 15:57:08.123337 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.0074363 (* 0.0272727 = 0.000202808 loss)
I0612 15:57:08.123350 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00309195 (* 0.0272727 = 8.4326e-05 loss)
I0612 15:57:08.123368 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00335781 (* 0.0272727 = 9.15765e-05 loss)
I0612 15:57:08.123383 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00114523 (* 0.0272727 = 3.12336e-05 loss)
I0612 15:57:08.123397 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000734658 (* 0.0272727 = 2.00361e-05 loss)
I0612 15:57:08.123409 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.5
I0612 15:57:08.123421 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 15:57:08.123433 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.875
I0612 15:57:08.123445 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.75
I0612 15:57:08.123457 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.625
I0612 15:57:08.123469 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 15:57:08.123481 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.5
I0612 15:57:08.123493 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 15:57:08.123505 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.625
I0612 15:57:08.123517 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.625
I0612 15:57:08.123528 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.625
I0612 15:57:08.123539 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.75
I0612 15:57:08.123551 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.75
I0612 15:57:08.123564 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.75
I0612 15:57:08.123575 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.75
I0612 15:57:08.123587 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 15:57:08.123600 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 15:57:08.123607 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 15:57:08.123620 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 15:57:08.123632 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 15:57:08.123644 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 15:57:08.123656 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 15:57:08.123667 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 15:57:08.123678 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.795455
I0612 15:57:08.123690 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.681818
I0612 15:57:08.123704 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.8539 (* 0.3 = 0.556171 loss)
I0612 15:57:08.123718 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.736774 (* 0.3 = 0.221032 loss)
I0612 15:57:08.123744 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.539619 (* 0.0272727 = 0.0147169 loss)
I0612 15:57:08.123759 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.92232 (* 0.0272727 = 0.0251542 loss)
I0612 15:57:08.123774 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.82779 (* 0.0272727 = 0.0498487 loss)
I0612 15:57:08.123787 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.40855 (* 0.0272727 = 0.0384149 loss)
I0612 15:57:08.123800 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.70345 (* 0.0272727 = 0.0464577 loss)
I0612 15:57:08.123814 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.54165 (* 0.0272727 = 0.0420451 loss)
I0612 15:57:08.123828 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.813654 (* 0.0272727 = 0.0221906 loss)
I0612 15:57:08.123842 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 1.23665 (* 0.0272727 = 0.033727 loss)
I0612 15:57:08.123857 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 1.22861 (* 0.0272727 = 0.0335074 loss)
I0612 15:57:08.123870 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 1.76392 (* 0.0272727 = 0.0481068 loss)
I0612 15:57:08.123884 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.899898 (* 0.0272727 = 0.0245427 loss)
I0612 15:57:08.123898 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 1.06523 (* 0.0272727 = 0.0290518 loss)
I0612 15:57:08.123913 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.561271 (* 0.0272727 = 0.0153074 loss)
I0612 15:57:08.123925 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.798119 (* 0.0272727 = 0.0217669 loss)
I0612 15:57:08.123939 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 1.29041 (* 0.0272727 = 0.035193 loss)
I0612 15:57:08.123953 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.0448064 (* 0.0272727 = 0.00122199 loss)
I0612 15:57:08.123968 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.0016352 (* 0.0272727 = 4.45964e-05 loss)
I0612 15:57:08.123982 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.000733588 (* 0.0272727 = 2.00069e-05 loss)
I0612 15:57:08.123996 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000645091 (* 0.0272727 = 1.75934e-05 loss)
I0612 15:57:08.124011 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00032659 (* 0.0272727 = 8.907e-06 loss)
I0612 15:57:08.124025 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000213673 (* 0.0272727 = 5.82744e-06 loss)
I0612 15:57:08.124039 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000301206 (* 0.0272727 = 8.2147e-06 loss)
I0612 15:57:08.124052 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.651515
I0612 15:57:08.124063 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 15:57:08.124075 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 1
I0612 15:57:08.124086 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 15:57:08.124099 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.75
I0612 15:57:08.124110 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 15:57:08.124122 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 15:57:08.124133 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 15:57:08.124145 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.75
I0612 15:57:08.124160 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.625
I0612 15:57:08.124171 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.625
I0612 15:57:08.124183 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.75
I0612 15:57:08.124194 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.625
I0612 15:57:08.124207 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.75
I0612 15:57:08.124228 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.75
I0612 15:57:08.124246 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.75
I0612 15:57:08.124258 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 15:57:08.124270 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 15:57:08.124281 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 15:57:08.124294 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 15:57:08.124305 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 15:57:08.124315 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 15:57:08.124327 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 15:57:08.124338 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.857955
I0612 15:57:08.124351 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.80303
I0612 15:57:08.124364 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.38207 (* 1 = 1.38207 loss)
I0612 15:57:08.124378 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.558975 (* 1 = 0.558975 loss)
I0612 15:57:08.124392 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.254308 (* 0.0909091 = 0.0231189 loss)
I0612 15:57:08.124408 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.100876 (* 0.0909091 = 0.00917058 loss)
I0612 15:57:08.124423 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 1.12175 (* 0.0909091 = 0.101977 loss)
I0612 15:57:08.124438 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.38939 (* 0.0909091 = 0.126308 loss)
I0612 15:57:08.124451 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.02023 (* 0.0909091 = 0.0927485 loss)
I0612 15:57:08.124465 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.981012 (* 0.0909091 = 0.0891829 loss)
I0612 15:57:08.124480 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.467372 (* 0.0909091 = 0.0424883 loss)
I0612 15:57:08.124493 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.915912 (* 0.0909091 = 0.0832647 loss)
I0612 15:57:08.124506 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 1.57019 (* 0.0909091 = 0.142744 loss)
I0612 15:57:08.124521 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 1.51543 (* 0.0909091 = 0.137766 loss)
I0612 15:57:08.124534 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 1.15034 (* 0.0909091 = 0.104576 loss)
I0612 15:57:08.124548 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.944584 (* 0.0909091 = 0.0858713 loss)
I0612 15:57:08.124562 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.471308 (* 0.0909091 = 0.0428462 loss)
I0612 15:57:08.124575 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.500266 (* 0.0909091 = 0.0454788 loss)
I0612 15:57:08.124589 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.569796 (* 0.0909091 = 0.0517996 loss)
I0612 15:57:08.124603 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.0207506 (* 0.0909091 = 0.00188642 loss)
I0612 15:57:08.124617 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.00828888 (* 0.0909091 = 0.000753535 loss)
I0612 15:57:08.124631 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00481229 (* 0.0909091 = 0.000437481 loss)
I0612 15:57:08.124645 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.0023838 (* 0.0909091 = 0.000216709 loss)
I0612 15:57:08.124660 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00086495 (* 0.0909091 = 7.86319e-05 loss)
I0612 15:57:08.124675 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00100068 (* 0.0909091 = 9.09712e-05 loss)
I0612 15:57:08.124688 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.0012272 (* 0.0909091 = 0.000111563 loss)
I0612 15:57:08.124711 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.25
I0612 15:57:08.124723 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.25
I0612 15:57:08.124735 6181 solver.cpp:245] Train net output #149: total_confidence = 0.318095
I0612 15:57:08.124747 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.3446
I0612 15:57:08.124760 6181 sgd_solver.cpp:106] Iteration 5000, lr = 0.001
I0612 15:58:11.116199 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 43.3466 > 30) by scale factor 0.692095
I0612 15:58:30.461344 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.1696 > 30) by scale factor 0.877974
I0612 15:59:03.744984 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.9275 > 30) by scale factor 0.97001
I0612 16:00:47.456755 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.2662 > 30) by scale factor 0.901815
I0612 16:01:28.458462 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.7958 > 30) by scale factor 0.862172
I0612 16:03:35.007645 6181 solver.cpp:229] Iteration 5500, loss = 4.1599
I0612 16:03:35.007774 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.557692
I0612 16:03:35.007796 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 16:03:35.007808 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 16:03:35.007822 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.75
I0612 16:03:35.007833 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 16:03:35.007845 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.625
I0612 16:03:35.007858 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.375
I0612 16:03:35.007869 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.875
I0612 16:03:35.007882 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.75
I0612 16:03:35.007894 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 16:03:35.007906 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 1
I0612 16:03:35.007920 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 1
I0612 16:03:35.007930 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 16:03:35.007942 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 16:03:35.007954 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 16:03:35.007966 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 16:03:35.007977 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 16:03:35.007989 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 16:03:35.008002 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 16:03:35.008019 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 16:03:35.008033 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 16:03:35.008044 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 16:03:35.008056 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 16:03:35.008069 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.846591
I0612 16:03:35.008080 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.846154
I0612 16:03:35.008097 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.38388 (* 0.3 = 0.415163 loss)
I0612 16:03:35.008112 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.494882 (* 0.3 = 0.148465 loss)
I0612 16:03:35.008126 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.26387 (* 0.0272727 = 0.0344693 loss)
I0612 16:03:35.008141 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.39193 (* 0.0272727 = 0.0379617 loss)
I0612 16:03:35.008154 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 0.970122 (* 0.0272727 = 0.0264579 loss)
I0612 16:03:35.008169 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.61557 (* 0.0272727 = 0.0440611 loss)
I0612 16:03:35.008183 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.31586 (* 0.0272727 = 0.0358872 loss)
I0612 16:03:35.008198 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.29359 (* 0.0272727 = 0.0625525 loss)
I0612 16:03:35.008211 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 0.982524 (* 0.0272727 = 0.0267961 loss)
I0612 16:03:35.008229 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.692026 (* 0.0272727 = 0.0188734 loss)
I0612 16:03:35.008244 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.352647 (* 0.0272727 = 0.00961764 loss)
I0612 16:03:35.008257 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.0395921 (* 0.0272727 = 0.00107979 loss)
I0612 16:03:35.008272 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.00865011 (* 0.0272727 = 0.000235912 loss)
I0612 16:03:35.008287 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.00227461 (* 0.0272727 = 6.20348e-05 loss)
I0612 16:03:35.008321 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.000965623 (* 0.0272727 = 2.63352e-05 loss)
I0612 16:03:35.008337 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.000451343 (* 0.0272727 = 1.23094e-05 loss)
I0612 16:03:35.008350 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.000271102 (* 0.0272727 = 7.3937e-06 loss)
I0612 16:03:35.008365 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.000139307 (* 0.0272727 = 3.79929e-06 loss)
I0612 16:03:35.008379 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.000153604 (* 0.0272727 = 4.1892e-06 loss)
I0612 16:03:35.008394 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000127978 (* 0.0272727 = 3.49032e-06 loss)
I0612 16:03:35.008414 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 7.06999e-05 (* 0.0272727 = 1.92818e-06 loss)
I0612 16:03:35.008429 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000132535 (* 0.0272727 = 3.6146e-06 loss)
I0612 16:03:35.008443 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000164425 (* 0.0272727 = 4.48432e-06 loss)
I0612 16:03:35.008457 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000336495 (* 0.0272727 = 9.17714e-06 loss)
I0612 16:03:35.008471 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.692308
I0612 16:03:35.008482 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.5
I0612 16:03:35.008496 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 16:03:35.008507 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 1
I0612 16:03:35.008518 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.5
I0612 16:03:35.008530 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.625
I0612 16:03:35.008543 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.375
I0612 16:03:35.008554 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 16:03:35.008566 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 16:03:35.008579 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 16:03:35.008589 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 1
I0612 16:03:35.008601 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 1
I0612 16:03:35.008612 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 16:03:35.008625 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 16:03:35.008635 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 16:03:35.008647 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 16:03:35.008658 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 16:03:35.008671 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 16:03:35.008682 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 16:03:35.008693 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 16:03:35.008704 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 16:03:35.008715 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 16:03:35.008728 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 16:03:35.008738 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.880682
I0612 16:03:35.008750 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.942308
I0612 16:03:35.008764 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.01734 (* 0.3 = 0.305203 loss)
I0612 16:03:35.008779 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.374723 (* 0.3 = 0.112417 loss)
I0612 16:03:35.008796 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.58487 (* 0.0272727 = 0.0432237 loss)
I0612 16:03:35.008811 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.934569 (* 0.0272727 = 0.0254882 loss)
I0612 16:03:35.008837 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.755021 (* 0.0272727 = 0.0205915 loss)
I0612 16:03:35.008853 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.19433 (* 0.0272727 = 0.0325728 loss)
I0612 16:03:35.008867 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.31388 (* 0.0272727 = 0.0358332 loss)
I0612 16:03:35.008882 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.45961 (* 0.0272727 = 0.0398075 loss)
I0612 16:03:35.008895 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.833468 (* 0.0272727 = 0.0227309 loss)
I0612 16:03:35.008909 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.492197 (* 0.0272727 = 0.0134235 loss)
I0612 16:03:35.008924 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.309761 (* 0.0272727 = 0.00844803 loss)
I0612 16:03:35.008937 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.05941 (* 0.0272727 = 0.00162027 loss)
I0612 16:03:35.008951 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.00812372 (* 0.0272727 = 0.000221556 loss)
I0612 16:03:35.008966 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.00204629 (* 0.0272727 = 5.58079e-05 loss)
I0612 16:03:35.008980 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.000784295 (* 0.0272727 = 2.13899e-05 loss)
I0612 16:03:35.008994 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.000280825 (* 0.0272727 = 7.65887e-06 loss)
I0612 16:03:35.009008 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.000259427 (* 0.0272727 = 7.07527e-06 loss)
I0612 16:03:35.009022 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 9.96383e-05 (* 0.0272727 = 2.71741e-06 loss)
I0612 16:03:35.009037 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.000565806 (* 0.0272727 = 1.54311e-05 loss)
I0612 16:03:35.009050 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 8.02144e-05 (* 0.0272727 = 2.18767e-06 loss)
I0612 16:03:35.009064 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 7.27574e-05 (* 0.0272727 = 1.98429e-06 loss)
I0612 16:03:35.009078 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 3.98286e-05 (* 0.0272727 = 1.08623e-06 loss)
I0612 16:03:35.009093 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 8.34481e-06 (* 0.0272727 = 2.27586e-07 loss)
I0612 16:03:35.009107 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 8.01705e-05 (* 0.0272727 = 2.18647e-06 loss)
I0612 16:03:35.009119 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.788462
I0612 16:03:35.009131 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.625
I0612 16:03:35.009143 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 16:03:35.009155 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 16:03:35.009167 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 1
I0612 16:03:35.009179 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.625
I0612 16:03:35.009191 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 16:03:35.009203 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 16:03:35.009215 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.625
I0612 16:03:35.009227 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 1
I0612 16:03:35.009238 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 16:03:35.009250 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 16:03:35.009263 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 16:03:35.009287 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 16:03:35.009302 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 16:03:35.009315 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 16:03:35.009338 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 16:03:35.009351 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 16:03:35.009363 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 16:03:35.009374 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 16:03:35.009387 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 16:03:35.009397 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 16:03:35.009409 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 16:03:35.009421 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.926136
I0612 16:03:35.009433 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.942308
I0612 16:03:35.009449 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.70563 (* 1 = 0.70563 loss)
I0612 16:03:35.009462 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.301647 (* 1 = 0.301647 loss)
I0612 16:03:35.009476 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 1.16615 (* 0.0909091 = 0.106014 loss)
I0612 16:03:35.009491 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.176663 (* 0.0909091 = 0.0160603 loss)
I0612 16:03:35.009505 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.467839 (* 0.0909091 = 0.0425308 loss)
I0612 16:03:35.009519 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.183386 (* 0.0909091 = 0.0166714 loss)
I0612 16:03:35.009533 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.15942 (* 0.0909091 = 0.105402 loss)
I0612 16:03:35.009547 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 1.31568 (* 0.0909091 = 0.119607 loss)
I0612 16:03:35.009562 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.32116 (* 0.0909091 = 0.120106 loss)
I0612 16:03:35.009575 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.882467 (* 0.0909091 = 0.0802243 loss)
I0612 16:03:35.009589 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.101322 (* 0.0909091 = 0.00921111 loss)
I0612 16:03:35.009604 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.0165251 (* 0.0909091 = 0.00150228 loss)
I0612 16:03:35.009618 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.00202823 (* 0.0909091 = 0.000184385 loss)
I0612 16:03:35.009632 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.000301303 (* 0.0909091 = 2.73912e-05 loss)
I0612 16:03:35.009646 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.000104066 (* 0.0909091 = 9.46057e-06 loss)
I0612 16:03:35.009660 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 5.26569e-05 (* 0.0909091 = 4.78699e-06 loss)
I0612 16:03:35.009675 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 2.68826e-05 (* 0.0909091 = 2.44388e-06 loss)
I0612 16:03:35.009688 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 2.13242e-05 (* 0.0909091 = 1.93856e-06 loss)
I0612 16:03:35.009702 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 1.82843e-05 (* 0.0909091 = 1.66221e-06 loss)
I0612 16:03:35.009716 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 2.43792e-05 (* 0.0909091 = 2.21629e-06 loss)
I0612 16:03:35.009730 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 3.25609e-05 (* 0.0909091 = 2.96008e-06 loss)
I0612 16:03:35.009748 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 2.34998e-05 (* 0.0909091 = 2.13635e-06 loss)
I0612 16:03:35.009763 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 3.96623e-05 (* 0.0909091 = 3.60567e-06 loss)
I0612 16:03:35.009776 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 2.83884e-05 (* 0.0909091 = 2.58077e-06 loss)
I0612 16:03:35.009789 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.625
I0612 16:03:35.009806 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.375
I0612 16:03:35.009835 6181 solver.cpp:245] Train net output #149: total_confidence = 0.364987
I0612 16:03:35.009852 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.39431
I0612 16:03:35.009866 6181 sgd_solver.cpp:106] Iteration 5500, lr = 0.001
I0612 16:05:14.440244 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.5707 > 30) by scale factor 0.820329
I0612 16:05:22.180692 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.1155 > 30) by scale factor 0.964148
I0612 16:05:51.590502 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.0363 > 30) by scale factor 0.936439
I0612 16:06:16.378460 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.026 > 30) by scale factor 0.936739
I0612 16:06:50.448719 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.6563 > 30) by scale factor 0.978591
I0612 16:07:12.095978 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.6768 > 30) by scale factor 0.890821
I0612 16:07:56.165534 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.9194 > 30) by scale factor 0.835204
I0612 16:08:24.806677 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.1536 > 30) by scale factor 0.96297
I0612 16:08:44.166721 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.7983 > 30) by scale factor 0.974079
I0612 16:09:04.287072 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.8563 > 30) by scale factor 0.860677
I0612 16:09:46.051237 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.1338 > 30) by scale factor 0.905421
I0612 16:10:01.924479 6181 solver.cpp:229] Iteration 6000, loss = 4.12387
I0612 16:10:01.924554 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.533333
I0612 16:10:01.924583 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.5
I0612 16:10:01.924607 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 16:10:01.924629 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 16:10:01.924651 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.375
I0612 16:10:01.924674 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 16:10:01.924697 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.5
I0612 16:10:01.924721 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 16:10:01.924746 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 16:10:01.924769 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 16:10:01.924793 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 16:10:01.924814 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 16:10:01.924842 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 16:10:01.924866 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 16:10:01.924890 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 16:10:01.924913 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 16:10:01.924935 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 16:10:01.924958 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 0.875
I0612 16:10:01.924980 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 0.875
I0612 16:10:01.925004 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 16:10:01.925026 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 16:10:01.925050 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 16:10:01.925071 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 16:10:01.925094 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.829545
I0612 16:10:01.925119 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.716667
I0612 16:10:01.925149 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.74721 (* 0.3 = 0.524162 loss)
I0612 16:10:01.925182 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.648201 (* 0.3 = 0.19446 loss)
I0612 16:10:01.925210 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.99985 (* 0.0272727 = 0.0545413 loss)
I0612 16:10:01.925240 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 0.798455 (* 0.0272727 = 0.021776 loss)
I0612 16:10:01.925266 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.56079 (* 0.0272727 = 0.0425669 loss)
I0612 16:10:01.925295 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.8704 (* 0.0272727 = 0.051011 loss)
I0612 16:10:01.925343 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.86793 (* 0.0272727 = 0.0509436 loss)
I0612 16:10:01.925375 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.34647 (* 0.0272727 = 0.0367218 loss)
I0612 16:10:01.925403 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.56494 (* 0.0272727 = 0.0426801 loss)
I0612 16:10:01.925431 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.765447 (* 0.0272727 = 0.0208758 loss)
I0612 16:10:01.925458 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.452055 (* 0.0272727 = 0.0123288 loss)
I0612 16:10:01.925487 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.271801 (* 0.0272727 = 0.00741274 loss)
I0612 16:10:01.925513 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.282815 (* 0.0272727 = 0.00771313 loss)
I0612 16:10:01.925541 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.226071 (* 0.0272727 = 0.00616557 loss)
I0612 16:10:01.925606 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.13297 (* 0.0272727 = 0.00362645 loss)
I0612 16:10:01.925637 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.512049 (* 0.0272727 = 0.013965 loss)
I0612 16:10:01.925663 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.333868 (* 0.0272727 = 0.00910548 loss)
I0612 16:10:01.925691 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.666843 (* 0.0272727 = 0.0181866 loss)
I0612 16:10:01.925719 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.599554 (* 0.0272727 = 0.0163515 loss)
I0612 16:10:01.925746 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.561653 (* 0.0272727 = 0.0153178 loss)
I0612 16:10:01.925775 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000987617 (* 0.0272727 = 2.6935e-05 loss)
I0612 16:10:01.925803 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000111179 (* 0.0272727 = 3.03215e-06 loss)
I0612 16:10:01.925830 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 2.79416e-05 (* 0.0272727 = 7.62044e-07 loss)
I0612 16:10:01.925858 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 7.01855e-06 (* 0.0272727 = 1.91415e-07 loss)
I0612 16:10:01.925886 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.566667
I0612 16:10:01.925911 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.625
I0612 16:10:01.925935 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.875
I0612 16:10:01.925956 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.75
I0612 16:10:01.925981 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.625
I0612 16:10:01.926003 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.75
I0612 16:10:01.926025 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.75
I0612 16:10:01.926048 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 16:10:01.926071 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.625
I0612 16:10:01.926093 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 16:10:01.926116 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 1
I0612 16:10:01.926138 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 16:10:01.926162 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 16:10:01.926185 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 16:10:01.926208 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 16:10:01.926234 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 16:10:01.926257 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 16:10:01.926280 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 0.875
I0612 16:10:01.926304 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 0.875
I0612 16:10:01.926327 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 16:10:01.926349 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 16:10:01.926373 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 16:10:01.926394 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 16:10:01.926416 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.840909
I0612 16:10:01.926440 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.816667
I0612 16:10:01.926466 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.3297 (* 0.3 = 0.398909 loss)
I0612 16:10:01.926493 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.487361 (* 0.3 = 0.146208 loss)
I0612 16:10:01.926522 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.34769 (* 0.0272727 = 0.0367552 loss)
I0612 16:10:01.926566 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.666524 (* 0.0272727 = 0.0181779 loss)
I0612 16:10:01.926594 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.966554 (* 0.0272727 = 0.0263606 loss)
I0612 16:10:01.926620 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.27351 (* 0.0272727 = 0.0347321 loss)
I0612 16:10:01.926648 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.36223 (* 0.0272727 = 0.0371518 loss)
I0612 16:10:01.926674 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 0.993994 (* 0.0272727 = 0.0271089 loss)
I0612 16:10:01.926700 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.04969 (* 0.0272727 = 0.028628 loss)
I0612 16:10:01.926728 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.861952 (* 0.0272727 = 0.0235078 loss)
I0612 16:10:01.926755 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.297134 (* 0.0272727 = 0.00810366 loss)
I0612 16:10:01.926782 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.21648 (* 0.0272727 = 0.005904 loss)
I0612 16:10:01.926810 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.311486 (* 0.0272727 = 0.00849509 loss)
I0612 16:10:01.926837 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.361806 (* 0.0272727 = 0.00986745 loss)
I0612 16:10:01.926862 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.313919 (* 0.0272727 = 0.00856142 loss)
I0612 16:10:01.926889 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.42279 (* 0.0272727 = 0.0115306 loss)
I0612 16:10:01.926915 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.494066 (* 0.0272727 = 0.0134745 loss)
I0612 16:10:01.926949 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.245791 (* 0.0272727 = 0.00670338 loss)
I0612 16:10:01.926982 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.355666 (* 0.0272727 = 0.00969998 loss)
I0612 16:10:01.927012 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.334594 (* 0.0272727 = 0.0091253 loss)
I0612 16:10:01.927042 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.0206073 (* 0.0272727 = 0.000562016 loss)
I0612 16:10:01.927068 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00407596 (* 0.0272727 = 0.000111163 loss)
I0612 16:10:01.927095 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000358532 (* 0.0272727 = 9.77815e-06 loss)
I0612 16:10:01.927124 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000197222 (* 0.0272727 = 5.37877e-06 loss)
I0612 16:10:01.927146 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.766667
I0612 16:10:01.927168 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 1
I0612 16:10:01.927192 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 16:10:01.927213 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 16:10:01.927237 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 1
I0612 16:10:01.927258 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 16:10:01.927284 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 16:10:01.927305 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 16:10:01.927328 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.75
I0612 16:10:01.927351 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 1
I0612 16:10:01.927372 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 16:10:01.927394 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 16:10:01.927417 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 16:10:01.927439 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 16:10:01.927459 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.875
I0612 16:10:01.927481 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 16:10:01.927521 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 0.875
I0612 16:10:01.927544 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 16:10:01.927567 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 0.875
I0612 16:10:01.927588 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 16:10:01.927610 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 16:10:01.927631 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 16:10:01.927654 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 16:10:01.927675 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.909091
I0612 16:10:01.927696 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.916667
I0612 16:10:01.927723 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.755722 (* 1 = 0.755722 loss)
I0612 16:10:01.927750 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.277016 (* 1 = 0.277016 loss)
I0612 16:10:01.927777 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.512953 (* 0.0909091 = 0.0466321 loss)
I0612 16:10:01.927803 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.456106 (* 0.0909091 = 0.0414642 loss)
I0612 16:10:01.927831 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 1.03746 (* 0.0909091 = 0.0943146 loss)
I0612 16:10:01.927858 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.10865 (* 0.0909091 = 0.00987731 loss)
I0612 16:10:01.927886 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.7062 (* 0.0909091 = 0.0642 loss)
I0612 16:10:01.927912 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.379565 (* 0.0909091 = 0.0345059 loss)
I0612 16:10:01.927938 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.529925 (* 0.0909091 = 0.048175 loss)
I0612 16:10:01.927964 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.338815 (* 0.0909091 = 0.0308013 loss)
I0612 16:10:01.927995 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.131831 (* 0.0909091 = 0.0119846 loss)
I0612 16:10:01.928023 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.203748 (* 0.0909091 = 0.0185225 loss)
I0612 16:10:01.928050 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.135437 (* 0.0909091 = 0.0123125 loss)
I0612 16:10:01.928077 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.221259 (* 0.0909091 = 0.0201145 loss)
I0612 16:10:01.928102 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.179743 (* 0.0909091 = 0.0163403 loss)
I0612 16:10:01.928129 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.379137 (* 0.0909091 = 0.034467 loss)
I0612 16:10:01.928156 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.38558 (* 0.0909091 = 0.0350528 loss)
I0612 16:10:01.928181 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.272967 (* 0.0909091 = 0.0248152 loss)
I0612 16:10:01.928208 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.140952 (* 0.0909091 = 0.0128138 loss)
I0612 16:10:01.928234 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.359089 (* 0.0909091 = 0.0326444 loss)
I0612 16:10:01.928261 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.0444887 (* 0.0909091 = 0.00404443 loss)
I0612 16:10:01.928287 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.0122405 (* 0.0909091 = 0.00111277 loss)
I0612 16:10:01.928314 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00127452 (* 0.0909091 = 0.000115866 loss)
I0612 16:10:01.928345 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000820161 (* 0.0909091 = 7.45601e-05 loss)
I0612 16:10:01.928369 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.375
I0612 16:10:01.928392 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 16:10:01.928429 6181 solver.cpp:245] Train net output #149: total_confidence = 0.385697
I0612 16:10:01.928454 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.401084
I0612 16:10:01.928478 6181 sgd_solver.cpp:106] Iteration 6000, lr = 0.001
I0612 16:10:20.098404 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.4068 > 30) by scale factor 0.781112
I0612 16:10:23.970257 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.7601 > 30) by scale factor 0.794488
I0612 16:11:15.794358 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.6726 > 30) by scale factor 0.978072
I0612 16:11:37.416280 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.2618 > 30) by scale factor 0.85078
I0612 16:11:44.394134 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.2902 > 30) by scale factor 0.99042
I0612 16:12:58.628484 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.9437 > 30) by scale factor 0.969502
I0612 16:13:40.335355 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.0374 > 30) by scale factor 0.998754
I0612 16:15:01.540300 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 61.5278 > 30) by scale factor 0.487585
I0612 16:15:44.805621 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.7195 > 30) by scale factor 0.889693
I0612 16:16:28.499460 6181 solver.cpp:229] Iteration 6500, loss = 4.18915
I0612 16:16:28.499580 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.318182
I0612 16:16:28.499600 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.625
I0612 16:16:28.499614 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 16:16:28.499625 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.375
I0612 16:16:28.499637 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0
I0612 16:16:28.499650 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.375
I0612 16:16:28.499663 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.25
I0612 16:16:28.499675 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.5
I0612 16:16:28.499687 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 16:16:28.499699 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.75
I0612 16:16:28.499711 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 16:16:28.499722 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.75
I0612 16:16:28.499734 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.75
I0612 16:16:28.499747 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.75
I0612 16:16:28.499758 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.75
I0612 16:16:28.499770 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.75
I0612 16:16:28.499783 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 16:16:28.499794 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 0.875
I0612 16:16:28.499806 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 16:16:28.499817 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 16:16:28.499830 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 16:16:28.499840 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 16:16:28.499852 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 16:16:28.499864 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.715909
I0612 16:16:28.499876 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.560606
I0612 16:16:28.499893 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.49407 (* 0.3 = 0.748221 loss)
I0612 16:16:28.499908 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 1.08264 (* 0.3 = 0.324791 loss)
I0612 16:16:28.499922 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.04323 (* 0.0272727 = 0.0284516 loss)
I0612 16:16:28.499936 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 2.15601 (* 0.0272727 = 0.0588003 loss)
I0612 16:16:28.499951 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.92153 (* 0.0272727 = 0.0796781 loss)
I0612 16:16:28.499964 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.61083 (* 0.0272727 = 0.0712046 loss)
I0612 16:16:28.499987 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 2.45124 (* 0.0272727 = 0.066852 loss)
I0612 16:16:28.500017 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.27974 (* 0.0272727 = 0.0621746 loss)
I0612 16:16:28.500046 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.6318 (* 0.0272727 = 0.0445036 loss)
I0612 16:16:28.500072 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.00372 (* 0.0272727 = 0.0273742 loss)
I0612 16:16:28.500087 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.983324 (* 0.0272727 = 0.0268179 loss)
I0612 16:16:28.500102 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.890663 (* 0.0272727 = 0.0242908 loss)
I0612 16:16:28.500116 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 1.18823 (* 0.0272727 = 0.0324061 loss)
I0612 16:16:28.500129 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 1.12315 (* 0.0272727 = 0.0306313 loss)
I0612 16:16:28.500161 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 1.03795 (* 0.0272727 = 0.0283077 loss)
I0612 16:16:28.500177 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 1.01973 (* 0.0272727 = 0.0278109 loss)
I0612 16:16:28.500190 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 1.26317 (* 0.0272727 = 0.0344501 loss)
I0612 16:16:28.500205 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.808283 (* 0.0272727 = 0.0220441 loss)
I0612 16:16:28.500218 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.681882 (* 0.0272727 = 0.0185968 loss)
I0612 16:16:28.500236 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.0502483 (* 0.0272727 = 0.00137041 loss)
I0612 16:16:28.500252 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.0520203 (* 0.0272727 = 0.00141873 loss)
I0612 16:16:28.500265 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.0377254 (* 0.0272727 = 0.00102887 loss)
I0612 16:16:28.500279 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.0282037 (* 0.0272727 = 0.000769191 loss)
I0612 16:16:28.500293 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.0350088 (* 0.0272727 = 0.000954786 loss)
I0612 16:16:28.500305 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.454545
I0612 16:16:28.500319 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 16:16:28.500330 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.5
I0612 16:16:28.500342 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.375
I0612 16:16:28.500354 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.5
I0612 16:16:28.500365 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 16:16:28.500377 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.5
I0612 16:16:28.500390 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.375
I0612 16:16:28.500401 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 16:16:28.500412 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 16:16:28.500424 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 16:16:28.500435 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.75
I0612 16:16:28.500447 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.75
I0612 16:16:28.500458 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 16:16:28.500470 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.75
I0612 16:16:28.500483 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.75
I0612 16:16:28.500494 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 16:16:28.500505 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 0.875
I0612 16:16:28.500517 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 16:16:28.500530 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 16:16:28.500540 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 16:16:28.500551 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 16:16:28.500563 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 16:16:28.500576 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.789773
I0612 16:16:28.500586 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.666667
I0612 16:16:28.500605 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 2.04192 (* 0.3 = 0.612575 loss)
I0612 16:16:28.500620 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.816031 (* 0.3 = 0.244809 loss)
I0612 16:16:28.500634 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.668406 (* 0.0272727 = 0.0182293 loss)
I0612 16:16:28.500645 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.10201 (* 0.0272727 = 0.0300548 loss)
I0612 16:16:28.500685 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.95574 (* 0.0272727 = 0.0533384 loss)
I0612 16:16:28.500718 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.70305 (* 0.0272727 = 0.0464469 loss)
I0612 16:16:28.500752 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.57851 (* 0.0272727 = 0.0430502 loss)
I0612 16:16:28.500782 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.61929 (* 0.0272727 = 0.0441626 loss)
I0612 16:16:28.500798 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.56927 (* 0.0272727 = 0.0427982 loss)
I0612 16:16:28.500813 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 1.12426 (* 0.0272727 = 0.0306617 loss)
I0612 16:16:28.500826 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 1.11973 (* 0.0272727 = 0.0305381 loss)
I0612 16:16:28.500840 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.778627 (* 0.0272727 = 0.0212353 loss)
I0612 16:16:28.500854 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 1.42987 (* 0.0272727 = 0.0389963 loss)
I0612 16:16:28.500869 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.991213 (* 0.0272727 = 0.0270331 loss)
I0612 16:16:28.500882 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 1.15417 (* 0.0272727 = 0.0314775 loss)
I0612 16:16:28.500895 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 1.13593 (* 0.0272727 = 0.0309799 loss)
I0612 16:16:28.500910 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 1.39277 (* 0.0272727 = 0.0379846 loss)
I0612 16:16:28.500923 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.864446 (* 0.0272727 = 0.0235758 loss)
I0612 16:16:28.500937 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.963683 (* 0.0272727 = 0.0262823 loss)
I0612 16:16:28.500952 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.0296054 (* 0.0272727 = 0.000807421 loss)
I0612 16:16:28.500965 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.0155549 (* 0.0272727 = 0.000424226 loss)
I0612 16:16:28.500980 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00742149 (* 0.0272727 = 0.000202404 loss)
I0612 16:16:28.500994 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.0150739 (* 0.0272727 = 0.000411107 loss)
I0612 16:16:28.501008 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.0184083 (* 0.0272727 = 0.000502045 loss)
I0612 16:16:28.501020 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.666667
I0612 16:16:28.501031 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 1
I0612 16:16:28.501044 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 16:16:28.501055 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.625
I0612 16:16:28.501067 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 16:16:28.501078 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 1
I0612 16:16:28.501090 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 16:16:28.501101 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.625
I0612 16:16:28.501112 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 16:16:28.501124 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.75
I0612 16:16:28.501135 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 16:16:28.501147 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.75
I0612 16:16:28.501158 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.75
I0612 16:16:28.501170 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 16:16:28.501183 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.75
I0612 16:16:28.501194 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.75
I0612 16:16:28.501205 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 0.875
I0612 16:16:28.501229 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 0.875
I0612 16:16:28.501242 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 16:16:28.501253 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 16:16:28.501266 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 16:16:28.501296 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 16:16:28.501310 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 16:16:28.501322 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.875
I0612 16:16:28.501333 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.818182
I0612 16:16:28.501348 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.59949 (* 1 = 1.59949 loss)
I0612 16:16:28.501363 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.628874 (* 1 = 0.628874 loss)
I0612 16:16:28.501376 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.241206 (* 0.0909091 = 0.0219278 loss)
I0612 16:16:28.501390 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.50848 (* 0.0909091 = 0.0462255 loss)
I0612 16:16:28.501405 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 1.10281 (* 0.0909091 = 0.100255 loss)
I0612 16:16:28.501418 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.568177 (* 0.0909091 = 0.0516524 loss)
I0612 16:16:28.501431 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.361508 (* 0.0909091 = 0.0328644 loss)
I0612 16:16:28.501446 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.712681 (* 0.0909091 = 0.0647892 loss)
I0612 16:16:28.501459 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.77339 (* 0.0909091 = 0.161217 loss)
I0612 16:16:28.501473 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.837952 (* 0.0909091 = 0.0761774 loss)
I0612 16:16:28.501487 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 1.07172 (* 0.0909091 = 0.0974294 loss)
I0612 16:16:28.501502 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.514588 (* 0.0909091 = 0.0467808 loss)
I0612 16:16:28.501514 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 1.64508 (* 0.0909091 = 0.149553 loss)
I0612 16:16:28.501528 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 1.2635 (* 0.0909091 = 0.114864 loss)
I0612 16:16:28.501543 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 1.24016 (* 0.0909091 = 0.112742 loss)
I0612 16:16:28.501556 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 1.49459 (* 0.0909091 = 0.135872 loss)
I0612 16:16:28.501570 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 1.39504 (* 0.0909091 = 0.126822 loss)
I0612 16:16:28.501585 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 1.21525 (* 0.0909091 = 0.110478 loss)
I0612 16:16:28.501597 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 1.17728 (* 0.0909091 = 0.107025 loss)
I0612 16:16:28.501611 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00266969 (* 0.0909091 = 0.000242699 loss)
I0612 16:16:28.501626 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00163096 (* 0.0909091 = 0.000148269 loss)
I0612 16:16:28.501639 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000915279 (* 0.0909091 = 8.32072e-05 loss)
I0612 16:16:28.501658 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00106015 (* 0.0909091 = 9.6377e-05 loss)
I0612 16:16:28.501673 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000699647 (* 0.0909091 = 6.36043e-05 loss)
I0612 16:16:28.501685 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.375
I0612 16:16:28.501698 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.25
I0612 16:16:28.501709 6181 solver.cpp:245] Train net output #149: total_confidence = 0.306175
I0612 16:16:28.501732 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.288351
I0612 16:16:28.501746 6181 sgd_solver.cpp:106] Iteration 6500, lr = 0.001
I0612 16:17:53.926492 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.7873 > 30) by scale factor 0.974428
I0612 16:18:28.778460 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.1161 > 30) by scale factor 0.996146
I0612 16:20:44.007654 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.0762 > 30) by scale factor 0.997466
I0612 16:21:00.999207 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.4336 > 30) by scale factor 0.846654
I0612 16:21:23.388447 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.9195 > 30) by scale factor 0.911313
I0612 16:22:54.986263 6181 solver.cpp:229] Iteration 7000, loss = 4.1346
I0612 16:22:54.986387 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.551724
I0612 16:22:54.986407 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.875
I0612 16:22:54.986420 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.75
I0612 16:22:54.986433 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.625
I0612 16:22:54.986444 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.625
I0612 16:22:54.986456 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.375
I0612 16:22:54.986469 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.25
I0612 16:22:54.986481 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.5
I0612 16:22:54.986495 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 16:22:54.986506 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.75
I0612 16:22:54.986518 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 16:22:54.986531 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 1
I0612 16:22:54.986542 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 16:22:54.986554 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 16:22:54.986567 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 16:22:54.986578 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 16:22:54.986590 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 16:22:54.986601 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 16:22:54.986613 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 16:22:54.986625 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 16:22:54.986637 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 16:22:54.986649 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 16:22:54.986660 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 16:22:54.986672 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.852273
I0612 16:22:54.986685 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.741379
I0612 16:22:54.986701 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.70206 (* 0.3 = 0.510617 loss)
I0612 16:22:54.986716 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.571133 (* 0.3 = 0.17134 loss)
I0612 16:22:54.986729 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.833064 (* 0.0272727 = 0.0227199 loss)
I0612 16:22:54.986743 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.0814 (* 0.0272727 = 0.0294927 loss)
I0612 16:22:54.986757 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.63024 (* 0.0272727 = 0.0444611 loss)
I0612 16:22:54.986771 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.70092 (* 0.0272727 = 0.0463887 loss)
I0612 16:22:54.986786 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.9976 (* 0.0272727 = 0.05448 loss)
I0612 16:22:54.986800 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.70156 (* 0.0272727 = 0.0464061 loss)
I0612 16:22:54.986814 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.60812 (* 0.0272727 = 0.0438579 loss)
I0612 16:22:54.986829 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.243 (* 0.0272727 = 0.0339001 loss)
I0612 16:22:54.986842 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 1.87619 (* 0.0272727 = 0.0511689 loss)
I0612 16:22:54.986856 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.659763 (* 0.0272727 = 0.0179935 loss)
I0612 16:22:54.986871 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.00585158 (* 0.0272727 = 0.000159589 loss)
I0612 16:22:54.986886 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.00260966 (* 0.0272727 = 7.11726e-05 loss)
I0612 16:22:54.986899 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.00277679 (* 0.0272727 = 7.57305e-05 loss)
I0612 16:22:54.986930 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.00235051 (* 0.0272727 = 6.41047e-05 loss)
I0612 16:22:54.986946 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.00169681 (* 0.0272727 = 4.62767e-05 loss)
I0612 16:22:54.986960 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.00142275 (* 0.0272727 = 3.88022e-05 loss)
I0612 16:22:54.986974 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.000179567 (* 0.0272727 = 4.89727e-06 loss)
I0612 16:22:54.986989 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 5.14864e-05 (* 0.0272727 = 1.40417e-06 loss)
I0612 16:22:54.987002 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 4.16705e-05 (* 0.0272727 = 1.13647e-06 loss)
I0612 16:22:54.987016 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 6.65587e-05 (* 0.0272727 = 1.81524e-06 loss)
I0612 16:22:54.987030 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 3.47617e-05 (* 0.0272727 = 9.48047e-07 loss)
I0612 16:22:54.987045 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 4.23709e-05 (* 0.0272727 = 1.15557e-06 loss)
I0612 16:22:54.987056 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.655172
I0612 16:22:54.987068 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 16:22:54.987082 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 16:22:54.987092 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.625
I0612 16:22:54.987104 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.625
I0612 16:22:54.987117 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 16:22:54.987128 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.625
I0612 16:22:54.987140 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.875
I0612 16:22:54.987151 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 16:22:54.987164 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 16:22:54.987175 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 16:22:54.987187 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 1
I0612 16:22:54.987198 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 16:22:54.987210 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 16:22:54.987223 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 16:22:54.987236 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 16:22:54.987248 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 16:22:54.987259 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 16:22:54.987270 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 16:22:54.987282 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 16:22:54.987293 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 16:22:54.987305 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 16:22:54.987316 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 16:22:54.987328 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.880682
I0612 16:22:54.987339 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.844828
I0612 16:22:54.987354 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.53628 (* 0.3 = 0.460884 loss)
I0612 16:22:54.987367 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.532214 (* 0.3 = 0.159664 loss)
I0612 16:22:54.987382 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.69246 (* 0.0272727 = 0.0188853 loss)
I0612 16:22:54.987396 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.0814 (* 0.0272727 = 0.0294926 loss)
I0612 16:22:54.987426 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.28211 (* 0.0272727 = 0.0349667 loss)
I0612 16:22:54.987440 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.235 (* 0.0272727 = 0.0336818 loss)
I0612 16:22:54.987454 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.711 (* 0.0272727 = 0.0466637 loss)
I0612 16:22:54.987468 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.34846 (* 0.0272727 = 0.0367762 loss)
I0612 16:22:54.987481 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.0789 (* 0.0272727 = 0.0294245 loss)
I0612 16:22:54.987495 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 1.06203 (* 0.0272727 = 0.0289645 loss)
I0612 16:22:54.987509 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 1.52154 (* 0.0272727 = 0.0414965 loss)
I0612 16:22:54.987522 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.84084 (* 0.0272727 = 0.022932 loss)
I0612 16:22:54.987536 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.0258203 (* 0.0272727 = 0.000704191 loss)
I0612 16:22:54.987550 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.0244454 (* 0.0272727 = 0.000666692 loss)
I0612 16:22:54.987565 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.0186281 (* 0.0272727 = 0.000508038 loss)
I0612 16:22:54.987578 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.0107172 (* 0.0272727 = 0.000292288 loss)
I0612 16:22:54.987592 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.0146904 (* 0.0272727 = 0.000400648 loss)
I0612 16:22:54.987607 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.0129057 (* 0.0272727 = 0.000351974 loss)
I0612 16:22:54.987619 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.00530312 (* 0.0272727 = 0.00014463 loss)
I0612 16:22:54.987633 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.0022927 (* 0.0272727 = 6.25283e-05 loss)
I0612 16:22:54.987648 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00221194 (* 0.0272727 = 6.03256e-05 loss)
I0612 16:22:54.987661 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.0014671 (* 0.0272727 = 4.00119e-05 loss)
I0612 16:22:54.987675 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.00191605 (* 0.0272727 = 5.22558e-05 loss)
I0612 16:22:54.987689 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.00140099 (* 0.0272727 = 3.82089e-05 loss)
I0612 16:22:54.987701 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.810345
I0612 16:22:54.987714 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 16:22:54.987726 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.75
I0612 16:22:54.987738 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 16:22:54.987746 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 16:22:54.987754 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 16:22:54.987766 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 16:22:54.987778 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 16:22:54.987790 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 16:22:54.987802 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.75
I0612 16:22:54.987813 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 16:22:54.987825 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 16:22:54.987836 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 16:22:54.987848 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 16:22:54.987859 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 16:22:54.987870 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 16:22:54.987882 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 16:22:54.987902 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 16:22:54.987915 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 16:22:54.987927 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 16:22:54.987938 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 16:22:54.987949 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 16:22:54.987962 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 16:22:54.987972 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.931818
I0612 16:22:54.987984 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.913793
I0612 16:22:54.987998 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.618432 (* 1 = 0.618432 loss)
I0612 16:22:54.988011 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.216765 (* 1 = 0.216765 loss)
I0612 16:22:54.988025 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.338306 (* 0.0909091 = 0.0307551 loss)
I0612 16:22:54.988039 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.685013 (* 0.0909091 = 0.0622739 loss)
I0612 16:22:54.988054 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.518845 (* 0.0909091 = 0.0471677 loss)
I0612 16:22:54.988066 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.537094 (* 0.0909091 = 0.0488267 loss)
I0612 16:22:54.988080 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.567122 (* 0.0909091 = 0.0515565 loss)
I0612 16:22:54.988095 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.707579 (* 0.0909091 = 0.0643254 loss)
I0612 16:22:54.988108 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.753529 (* 0.0909091 = 0.0685027 loss)
I0612 16:22:54.988121 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.807997 (* 0.0909091 = 0.0734543 loss)
I0612 16:22:54.988135 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 1.17998 (* 0.0909091 = 0.107271 loss)
I0612 16:22:54.988148 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.147157 (* 0.0909091 = 0.0133779 loss)
I0612 16:22:54.988162 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.103838 (* 0.0909091 = 0.00943981 loss)
I0612 16:22:54.988176 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.0666499 (* 0.0909091 = 0.00605908 loss)
I0612 16:22:54.988190 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.03651 (* 0.0909091 = 0.00331909 loss)
I0612 16:22:54.988204 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.0308545 (* 0.0909091 = 0.00280496 loss)
I0612 16:22:54.988217 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.027051 (* 0.0909091 = 0.00245918 loss)
I0612 16:22:54.988231 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.0155842 (* 0.0909091 = 0.00141675 loss)
I0612 16:22:54.988245 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0133946 (* 0.0909091 = 0.00121769 loss)
I0612 16:22:54.988260 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.0123884 (* 0.0909091 = 0.00112622 loss)
I0612 16:22:54.988272 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00977896 (* 0.0909091 = 0.000888997 loss)
I0612 16:22:54.988289 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.0136178 (* 0.0909091 = 0.00123798 loss)
I0612 16:22:54.988304 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00804005 (* 0.0909091 = 0.000730914 loss)
I0612 16:22:54.988318 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.00897596 (* 0.0909091 = 0.000815996 loss)
I0612 16:22:54.988330 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.625
I0612 16:22:54.988343 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.625
I0612 16:22:54.988363 6181 solver.cpp:245] Train net output #149: total_confidence = 0.493033
I0612 16:22:54.988376 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.457829
I0612 16:22:54.988389 6181 sgd_solver.cpp:106] Iteration 7000, lr = 0.001
I0612 16:24:22.685796 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 40.6246 > 30) by scale factor 0.738468
I0612 16:24:27.425341 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 40.9891 > 30) by scale factor 0.731902
I0612 16:24:57.584754 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.3476 > 30) by scale factor 0.988545
I0612 16:25:03.775053 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.6303 > 30) by scale factor 0.797229
I0612 16:26:59.681572 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 51.6555 > 30) by scale factor 0.580771
I0612 16:27:56.913152 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.4074 > 30) by scale factor 0.824009
I0612 16:28:26.298308 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.4076 > 30) by scale factor 0.847277
I0612 16:28:51.012599 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.0465 > 30) by scale factor 0.907813
I0612 16:29:04.921824 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.4353 > 30) by scale factor 0.897255
I0612 16:29:21.564831 6181 solver.cpp:229] Iteration 7500, loss = 4.21911
I0612 16:29:21.564985 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.414286
I0612 16:29:21.565007 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 16:29:21.565021 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.75
I0612 16:29:21.565033 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.75
I0612 16:29:21.565045 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.375
I0612 16:29:21.565057 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 16:29:21.565069 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.375
I0612 16:29:21.565081 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 16:29:21.565098 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.625
I0612 16:29:21.565110 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.625
I0612 16:29:21.565122 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.625
I0612 16:29:21.565135 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.75
I0612 16:29:21.565147 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.75
I0612 16:29:21.565160 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 16:29:21.565171 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.75
I0612 16:29:21.565183 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 16:29:21.565196 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 16:29:21.565207 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 0.875
I0612 16:29:21.565222 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 16:29:21.565234 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 16:29:21.565246 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 16:29:21.565258 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 16:29:21.565270 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 16:29:21.565282 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.755682
I0612 16:29:21.565294 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.742857
I0612 16:29:21.565310 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.72739 (* 0.3 = 0.518216 loss)
I0612 16:29:21.565340 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.737654 (* 0.3 = 0.221296 loss)
I0612 16:29:21.565356 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.709306 (* 0.0272727 = 0.0193447 loss)
I0612 16:29:21.565371 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.07963 (* 0.0272727 = 0.0294444 loss)
I0612 16:29:21.565384 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.34614 (* 0.0272727 = 0.036713 loss)
I0612 16:29:21.565398 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.06164 (* 0.0272727 = 0.0562267 loss)
I0612 16:29:21.565413 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.3791 (* 0.0272727 = 0.0376118 loss)
I0612 16:29:21.565428 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.18434 (* 0.0272727 = 0.0595728 loss)
I0612 16:29:21.565443 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.69079 (* 0.0272727 = 0.0461124 loss)
I0612 16:29:21.565456 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.15622 (* 0.0272727 = 0.0315331 loss)
I0612 16:29:21.565482 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.945157 (* 0.0272727 = 0.025777 loss)
I0612 16:29:21.565498 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.894114 (* 0.0272727 = 0.0243849 loss)
I0612 16:29:21.565512 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.869614 (* 0.0272727 = 0.0237167 loss)
I0612 16:29:21.565526 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.618831 (* 0.0272727 = 0.0168772 loss)
I0612 16:29:21.565556 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.545223 (* 0.0272727 = 0.0148697 loss)
I0612 16:29:21.565572 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 1.11447 (* 0.0272727 = 0.0303945 loss)
I0612 16:29:21.565585 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.316863 (* 0.0272727 = 0.00864172 loss)
I0612 16:29:21.565599 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.240589 (* 0.0272727 = 0.00656152 loss)
I0612 16:29:21.565614 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.728377 (* 0.0272727 = 0.0198648 loss)
I0612 16:29:21.565629 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.00499722 (* 0.0272727 = 0.000136288 loss)
I0612 16:29:21.565642 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00337958 (* 0.0272727 = 9.21703e-05 loss)
I0612 16:29:21.565656 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000509582 (* 0.0272727 = 1.38977e-05 loss)
I0612 16:29:21.565670 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000781534 (* 0.0272727 = 2.13146e-05 loss)
I0612 16:29:21.565685 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 7.60027e-05 (* 0.0272727 = 2.0728e-06 loss)
I0612 16:29:21.565696 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.571429
I0612 16:29:21.565709 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 16:29:21.565721 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.875
I0612 16:29:21.565732 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 1
I0612 16:29:21.565744 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.875
I0612 16:29:21.565755 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.625
I0612 16:29:21.565768 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.5
I0612 16:29:21.565779 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 16:29:21.565790 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.625
I0612 16:29:21.565803 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 16:29:21.565814 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 16:29:21.565825 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.75
I0612 16:29:21.565837 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 16:29:21.565850 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 16:29:21.565860 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.75
I0612 16:29:21.565872 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 16:29:21.565884 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 16:29:21.565896 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 0.875
I0612 16:29:21.565907 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 16:29:21.565918 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 16:29:21.565930 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 16:29:21.565942 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 16:29:21.565953 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 16:29:21.565964 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.823864
I0612 16:29:21.565975 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.842857
I0612 16:29:21.565989 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.27975 (* 0.3 = 0.383925 loss)
I0612 16:29:21.566004 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.538847 (* 0.3 = 0.161654 loss)
I0612 16:29:21.566017 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.03811 (* 0.0272727 = 0.028312 loss)
I0612 16:29:21.566031 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.415658 (* 0.0272727 = 0.0113361 loss)
I0612 16:29:21.566056 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.362205 (* 0.0272727 = 0.00987831 loss)
I0612 16:29:21.566071 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 0.683696 (* 0.0272727 = 0.0186462 loss)
I0612 16:29:21.566085 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.35572 (* 0.0272727 = 0.0369742 loss)
I0612 16:29:21.566098 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.50386 (* 0.0272727 = 0.0410145 loss)
I0612 16:29:21.566112 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.807641 (* 0.0272727 = 0.0220266 loss)
I0612 16:29:21.566126 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 1.4606 (* 0.0272727 = 0.0398346 loss)
I0612 16:29:21.566143 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.772565 (* 0.0272727 = 0.02107 loss)
I0612 16:29:21.566157 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.539299 (* 0.0272727 = 0.0147082 loss)
I0612 16:29:21.566171 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.757542 (* 0.0272727 = 0.0206602 loss)
I0612 16:29:21.566185 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.406165 (* 0.0272727 = 0.0110772 loss)
I0612 16:29:21.566200 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.421497 (* 0.0272727 = 0.0114954 loss)
I0612 16:29:21.566213 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.608896 (* 0.0272727 = 0.0166063 loss)
I0612 16:29:21.566227 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.323581 (* 0.0272727 = 0.00882494 loss)
I0612 16:29:21.566241 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.382649 (* 0.0272727 = 0.0104359 loss)
I0612 16:29:21.566256 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.657535 (* 0.0272727 = 0.0179328 loss)
I0612 16:29:21.566268 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.0052528 (* 0.0272727 = 0.000143258 loss)
I0612 16:29:21.566284 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00148108 (* 0.0272727 = 4.03931e-05 loss)
I0612 16:29:21.566298 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00325165 (* 0.0272727 = 8.86812e-05 loss)
I0612 16:29:21.566313 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 8.1343e-05 (* 0.0272727 = 2.21844e-06 loss)
I0612 16:29:21.566328 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 2.36725e-05 (* 0.0272727 = 6.45613e-07 loss)
I0612 16:29:21.566339 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.785714
I0612 16:29:21.566351 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.75
I0612 16:29:21.566364 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 1
I0612 16:29:21.566375 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 1
I0612 16:29:21.566386 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 1
I0612 16:29:21.566397 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 1
I0612 16:29:21.566408 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 16:29:21.566419 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 16:29:21.566431 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 16:29:21.566442 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 16:29:21.566454 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 16:29:21.566465 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 16:29:21.566476 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 16:29:21.566488 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 16:29:21.566498 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.875
I0612 16:29:21.566510 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 16:29:21.566531 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 16:29:21.566545 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 16:29:21.566555 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 16:29:21.566566 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 16:29:21.566578 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 16:29:21.566589 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 16:29:21.566601 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 16:29:21.566612 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.886364
I0612 16:29:21.566624 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.942857
I0612 16:29:21.566637 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.662777 (* 1 = 0.662777 loss)
I0612 16:29:21.566651 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.321965 (* 1 = 0.321965 loss)
I0612 16:29:21.566665 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.747452 (* 0.0909091 = 0.0679502 loss)
I0612 16:29:21.566679 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.0177115 (* 0.0909091 = 0.00161014 loss)
I0612 16:29:21.566694 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.123536 (* 0.0909091 = 0.0112306 loss)
I0612 16:29:21.566707 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.0120135 (* 0.0909091 = 0.00109214 loss)
I0612 16:29:21.566721 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.0839797 (* 0.0909091 = 0.00763452 loss)
I0612 16:29:21.566735 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.618091 (* 0.0909091 = 0.05619 loss)
I0612 16:29:21.566748 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.418463 (* 0.0909091 = 0.0380421 loss)
I0612 16:29:21.566762 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.271269 (* 0.0909091 = 0.0246608 loss)
I0612 16:29:21.566776 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.4041 (* 0.0909091 = 0.0367363 loss)
I0612 16:29:21.566789 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.12187 (* 0.0909091 = 0.0110791 loss)
I0612 16:29:21.566803 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.240552 (* 0.0909091 = 0.0218683 loss)
I0612 16:29:21.566817 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.21577 (* 0.0909091 = 0.0196155 loss)
I0612 16:29:21.566830 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.354949 (* 0.0909091 = 0.0322681 loss)
I0612 16:29:21.566844 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.437805 (* 0.0909091 = 0.0398004 loss)
I0612 16:29:21.566859 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.260082 (* 0.0909091 = 0.0236438 loss)
I0612 16:29:21.566871 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.14173 (* 0.0909091 = 0.0128845 loss)
I0612 16:29:21.566885 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.204848 (* 0.0909091 = 0.0186226 loss)
I0612 16:29:21.566900 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.112468 (* 0.0909091 = 0.0102243 loss)
I0612 16:29:21.566912 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.0197388 (* 0.0909091 = 0.00179444 loss)
I0612 16:29:21.566926 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00323466 (* 0.0909091 = 0.00029406 loss)
I0612 16:29:21.566941 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000606751 (* 0.0909091 = 5.51592e-05 loss)
I0612 16:29:21.566954 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000662081 (* 0.0909091 = 6.01892e-05 loss)
I0612 16:29:21.566967 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.375
I0612 16:29:21.566978 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.25
I0612 16:29:21.566998 6181 solver.cpp:245] Train net output #149: total_confidence = 0.343654
I0612 16:29:21.567013 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.337455
I0612 16:29:21.567025 6181 sgd_solver.cpp:106] Iteration 7500, lr = 0.001
I0612 16:30:03.699846 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.11 > 30) by scale factor 0.934287
I0612 16:30:11.432972 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 44.4225 > 30) by scale factor 0.675333
I0612 16:30:36.893146 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.0745 > 30) by scale factor 0.907043
I0612 16:32:35.046856 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.8936 > 30) by scale factor 0.859758
I0612 16:32:39.680465 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.1418 > 30) by scale factor 0.933364
I0612 16:33:15.989573 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.4355 > 30) by scale factor 0.954336
I0612 16:33:29.163905 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.2606 > 30) by scale factor 0.901969
I0612 16:34:50.292867 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.8245 > 30) by scale factor 0.973252
I0612 16:35:47.864903 6181 solver.cpp:229] Iteration 8000, loss = 4.11657
I0612 16:35:47.865010 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.491803
I0612 16:35:47.865030 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.875
I0612 16:35:47.865042 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.75
I0612 16:35:47.865057 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.125
I0612 16:35:47.865070 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.625
I0612 16:35:47.865083 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 16:35:47.865095 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.625
I0612 16:35:47.865108 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 16:35:47.865119 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.625
I0612 16:35:47.865131 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.625
I0612 16:35:47.865144 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 16:35:47.865156 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.625
I0612 16:35:47.865169 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 16:35:47.865180 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 16:35:47.865192 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 16:35:47.865205 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 16:35:47.865216 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 16:35:47.865227 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 16:35:47.865239 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 16:35:47.865252 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 16:35:47.865263 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 16:35:47.865275 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 16:35:47.865288 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 16:35:47.865299 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.8125
I0612 16:35:47.865312 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.704918
I0612 16:35:47.865341 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.71761 (* 0.3 = 0.515284 loss)
I0612 16:35:47.865358 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.641292 (* 0.3 = 0.192388 loss)
I0612 16:35:47.865375 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.644809 (* 0.0272727 = 0.0175857 loss)
I0612 16:35:47.865388 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 0.7746 (* 0.0272727 = 0.0211255 loss)
I0612 16:35:47.865402 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.2474 (* 0.0272727 = 0.0612927 loss)
I0612 16:35:47.865424 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.13843 (* 0.0272727 = 0.0310481 loss)
I0612 16:35:47.865439 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 2.13584 (* 0.0272727 = 0.0582501 loss)
I0612 16:35:47.865453 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.76818 (* 0.0272727 = 0.048223 loss)
I0612 16:35:47.865468 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.15408 (* 0.0272727 = 0.0314748 loss)
I0612 16:35:47.865481 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.24241 (* 0.0272727 = 0.0338839 loss)
I0612 16:35:47.865495 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 1.4942 (* 0.0272727 = 0.0407508 loss)
I0612 16:35:47.865509 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 1.29586 (* 0.0272727 = 0.0353416 loss)
I0612 16:35:47.865523 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 1.38903 (* 0.0272727 = 0.0378826 loss)
I0612 16:35:47.865537 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.470668 (* 0.0272727 = 0.0128364 loss)
I0612 16:35:47.865552 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.0153354 (* 0.0272727 = 0.000418239 loss)
I0612 16:35:47.865586 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.00821881 (* 0.0272727 = 0.000224149 loss)
I0612 16:35:47.865602 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.00216698 (* 0.0272727 = 5.90993e-05 loss)
I0612 16:35:47.865615 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.000825815 (* 0.0272727 = 2.25222e-05 loss)
I0612 16:35:47.865629 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.000480339 (* 0.0272727 = 1.31001e-05 loss)
I0612 16:35:47.865648 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000430208 (* 0.0272727 = 1.17329e-05 loss)
I0612 16:35:47.865664 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000199974 (* 0.0272727 = 5.45384e-06 loss)
I0612 16:35:47.865677 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000252625 (* 0.0272727 = 6.88977e-06 loss)
I0612 16:35:47.865691 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000152532 (* 0.0272727 = 4.15996e-06 loss)
I0612 16:35:47.865705 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000100225 (* 0.0272727 = 2.73342e-06 loss)
I0612 16:35:47.865717 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.57377
I0612 16:35:47.865731 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.625
I0612 16:35:47.865742 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 1
I0612 16:35:47.865754 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.5
I0612 16:35:47.865766 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.625
I0612 16:35:47.865777 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 16:35:47.865788 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.5
I0612 16:35:47.865800 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 16:35:47.865811 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 16:35:47.865823 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 16:35:47.865835 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.625
I0612 16:35:47.865847 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.625
I0612 16:35:47.865859 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 16:35:47.865870 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 16:35:47.865882 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 16:35:47.865895 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 16:35:47.865906 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 16:35:47.865917 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 16:35:47.865929 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 16:35:47.865941 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 16:35:47.865952 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 16:35:47.865963 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 16:35:47.865975 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 16:35:47.865988 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.829545
I0612 16:35:47.865999 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.786885
I0612 16:35:47.866013 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.44938 (* 0.3 = 0.434813 loss)
I0612 16:35:47.866026 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.565563 (* 0.3 = 0.169669 loss)
I0612 16:35:47.866041 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.29727 (* 0.0272727 = 0.03538 loss)
I0612 16:35:47.866051 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.286755 (* 0.0272727 = 0.00782059 loss)
I0612 16:35:47.866077 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.63612 (* 0.0272727 = 0.0446215 loss)
I0612 16:35:47.866092 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.13343 (* 0.0272727 = 0.0309118 loss)
I0612 16:35:47.866114 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.52469 (* 0.0272727 = 0.0415825 loss)
I0612 16:35:47.866130 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.95155 (* 0.0272727 = 0.0532242 loss)
I0612 16:35:47.866143 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.38187 (* 0.0272727 = 0.0376873 loss)
I0612 16:35:47.866158 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.918117 (* 0.0272727 = 0.0250396 loss)
I0612 16:35:47.866171 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 1.06724 (* 0.0272727 = 0.0291066 loss)
I0612 16:35:47.866185 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 1.27902 (* 0.0272727 = 0.0348824 loss)
I0612 16:35:47.866199 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 1.35141 (* 0.0272727 = 0.0368567 loss)
I0612 16:35:47.866212 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.267962 (* 0.0272727 = 0.00730804 loss)
I0612 16:35:47.866226 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.100248 (* 0.0272727 = 0.00273403 loss)
I0612 16:35:47.866240 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.0398153 (* 0.0272727 = 0.00108587 loss)
I0612 16:35:47.866255 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.0106615 (* 0.0272727 = 0.000290767 loss)
I0612 16:35:47.866268 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.00464295 (* 0.0272727 = 0.000126626 loss)
I0612 16:35:47.866282 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.000533658 (* 0.0272727 = 1.45543e-05 loss)
I0612 16:35:47.866297 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.000230515 (* 0.0272727 = 6.28676e-06 loss)
I0612 16:35:47.866310 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000276981 (* 0.0272727 = 7.55402e-06 loss)
I0612 16:35:47.866324 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.000191651 (* 0.0272727 = 5.22684e-06 loss)
I0612 16:35:47.866338 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000116798 (* 0.0272727 = 3.18539e-06 loss)
I0612 16:35:47.866353 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000161004 (* 0.0272727 = 4.39103e-06 loss)
I0612 16:35:47.866364 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.704918
I0612 16:35:47.866376 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.75
I0612 16:35:47.866389 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 16:35:47.866400 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 16:35:47.866411 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.75
I0612 16:35:47.866423 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 16:35:47.866436 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.625
I0612 16:35:47.866446 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 16:35:47.866458 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 16:35:47.866469 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.625
I0612 16:35:47.866482 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 16:35:47.866492 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.625
I0612 16:35:47.866504 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 16:35:47.866515 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 16:35:47.866528 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 16:35:47.866539 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 16:35:47.866560 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 16:35:47.866574 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 16:35:47.866585 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 16:35:47.866596 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 16:35:47.866607 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 16:35:47.866619 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 16:35:47.866631 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 16:35:47.866642 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.892045
I0612 16:35:47.866654 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.95082
I0612 16:35:47.866668 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.866081 (* 1 = 0.866081 loss)
I0612 16:35:47.866686 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.324212 (* 1 = 0.324212 loss)
I0612 16:35:47.866701 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.601123 (* 0.0909091 = 0.0546475 loss)
I0612 16:35:47.866716 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.2538 (* 0.0909091 = 0.0230727 loss)
I0612 16:35:47.866730 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.808008 (* 0.0909091 = 0.0734553 loss)
I0612 16:35:47.866744 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.759143 (* 0.0909091 = 0.069013 loss)
I0612 16:35:47.866758 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.495815 (* 0.0909091 = 0.0450741 loss)
I0612 16:35:47.866772 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 1.41694 (* 0.0909091 = 0.128813 loss)
I0612 16:35:47.866786 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.959698 (* 0.0909091 = 0.0872453 loss)
I0612 16:35:47.866801 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.498145 (* 0.0909091 = 0.0452859 loss)
I0612 16:35:47.866813 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.931767 (* 0.0909091 = 0.0847061 loss)
I0612 16:35:47.866827 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.884158 (* 0.0909091 = 0.080378 loss)
I0612 16:35:47.866842 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 1.02593 (* 0.0909091 = 0.093266 loss)
I0612 16:35:47.866855 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.47294 (* 0.0909091 = 0.0429945 loss)
I0612 16:35:47.866869 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.026595 (* 0.0909091 = 0.00241773 loss)
I0612 16:35:47.866883 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.0182149 (* 0.0909091 = 0.0016559 loss)
I0612 16:35:47.866897 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.00988212 (* 0.0909091 = 0.000898375 loss)
I0612 16:35:47.866911 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.00200791 (* 0.0909091 = 0.000182537 loss)
I0612 16:35:47.866925 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.000929469 (* 0.0909091 = 8.44971e-05 loss)
I0612 16:35:47.866940 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.0008228 (* 0.0909091 = 7.48e-05 loss)
I0612 16:35:47.866953 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.000528455 (* 0.0909091 = 4.80414e-05 loss)
I0612 16:35:47.866968 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000451665 (* 0.0909091 = 4.10604e-05 loss)
I0612 16:35:47.866982 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000545792 (* 0.0909091 = 4.96175e-05 loss)
I0612 16:35:47.866997 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000130892 (* 0.0909091 = 1.18993e-05 loss)
I0612 16:35:47.867008 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.25
I0612 16:35:47.867020 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.25
I0612 16:35:47.867043 6181 solver.cpp:245] Train net output #149: total_confidence = 0.301113
I0612 16:35:47.867055 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.27138
I0612 16:35:47.867069 6181 sgd_solver.cpp:106] Iteration 8000, lr = 0.001
I0612 16:36:22.224355 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 43.5047 > 30) by scale factor 0.68958
I0612 16:36:55.448747 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 39.7113 > 30) by scale factor 0.755453
I0612 16:36:56.222462 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.8526 > 30) by scale factor 0.972365
I0612 16:38:38.169561 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.8973 > 30) by scale factor 0.940518
I0612 16:39:12.922482 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.2227 > 30) by scale factor 0.992633
I0612 16:39:22.971249 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.2701 > 30) by scale factor 0.850579
I0612 16:40:49.458945 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.6138 > 30) by scale factor 0.866708
I0612 16:42:14.055169 6181 solver.cpp:229] Iteration 8500, loss = 4.1658
I0612 16:42:14.055279 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.318182
I0612 16:42:14.055299 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.5
I0612 16:42:14.055312 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.375
I0612 16:42:14.055328 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 16:42:14.055341 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 16:42:14.055353 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 16:42:14.055366 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.5
I0612 16:42:14.055377 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 16:42:14.055390 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.75
I0612 16:42:14.055402 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.75
I0612 16:42:14.055414 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.625
I0612 16:42:14.055426 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 16:42:14.055445 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 16:42:14.055459 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.75
I0612 16:42:14.055470 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.75
I0612 16:42:14.055483 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 16:42:14.055495 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 16:42:14.055507 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 16:42:14.055519 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 16:42:14.055531 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 16:42:14.055542 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 16:42:14.055554 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 16:42:14.055567 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 16:42:14.055577 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.738636
I0612 16:42:14.055590 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.651515
I0612 16:42:14.055608 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.02907 (* 0.3 = 0.608722 loss)
I0612 16:42:14.055622 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.805897 (* 0.3 = 0.241769 loss)
I0612 16:42:14.055636 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.37916 (* 0.0272727 = 0.0376135 loss)
I0612 16:42:14.055651 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.8607 (* 0.0272727 = 0.0507463 loss)
I0612 16:42:14.055665 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.13448 (* 0.0272727 = 0.0582132 loss)
I0612 16:42:14.055680 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.64751 (* 0.0272727 = 0.044932 loss)
I0612 16:42:14.055692 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.88197 (* 0.0272727 = 0.0513264 loss)
I0612 16:42:14.055706 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.92266 (* 0.0272727 = 0.0524363 loss)
I0612 16:42:14.055721 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.11753 (* 0.0272727 = 0.030478 loss)
I0612 16:42:14.055734 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.15629 (* 0.0272727 = 0.0315353 loss)
I0612 16:42:14.055747 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 1.06125 (* 0.0272727 = 0.0289432 loss)
I0612 16:42:14.055762 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 1.45327 (* 0.0272727 = 0.0396347 loss)
I0612 16:42:14.055775 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.426614 (* 0.0272727 = 0.0116349 loss)
I0612 16:42:14.055789 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.531345 (* 0.0272727 = 0.0144912 loss)
I0612 16:42:14.055821 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.597453 (* 0.0272727 = 0.0162942 loss)
I0612 16:42:14.055837 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.517172 (* 0.0272727 = 0.0141047 loss)
I0612 16:42:14.055851 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.53471 (* 0.0272727 = 0.014583 loss)
I0612 16:42:14.055865 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.705061 (* 0.0272727 = 0.0192289 loss)
I0612 16:42:14.055879 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.0231065 (* 0.0272727 = 0.000630178 loss)
I0612 16:42:14.055893 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.00372292 (* 0.0272727 = 0.000101534 loss)
I0612 16:42:14.055908 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00213558 (* 0.0272727 = 5.82431e-05 loss)
I0612 16:42:14.055922 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00155326 (* 0.0272727 = 4.23615e-05 loss)
I0612 16:42:14.055937 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00135909 (* 0.0272727 = 3.70662e-05 loss)
I0612 16:42:14.055950 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00131959 (* 0.0272727 = 3.59888e-05 loss)
I0612 16:42:14.055963 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.545455
I0612 16:42:14.055975 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 16:42:14.055987 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 1
I0612 16:42:14.055999 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.75
I0612 16:42:14.056010 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.5
I0612 16:42:14.056023 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 16:42:14.056035 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.5
I0612 16:42:14.056046 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 16:42:14.056057 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 16:42:14.056069 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 16:42:14.056082 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.625
I0612 16:42:14.056092 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 16:42:14.056104 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.75
I0612 16:42:14.056115 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 16:42:14.056130 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.75
I0612 16:42:14.056143 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.75
I0612 16:42:14.056154 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 16:42:14.056165 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 16:42:14.056177 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 16:42:14.056188 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 16:42:14.056200 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 16:42:14.056211 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 16:42:14.056222 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 16:42:14.056234 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.823864
I0612 16:42:14.056246 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.772727
I0612 16:42:14.056260 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.70613 (* 0.3 = 0.51184 loss)
I0612 16:42:14.056274 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.657999 (* 0.3 = 0.1974 loss)
I0612 16:42:14.056288 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.403163 (* 0.0272727 = 0.0109953 loss)
I0612 16:42:14.056303 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.447196 (* 0.0272727 = 0.0121963 loss)
I0612 16:42:14.056327 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.06997 (* 0.0272727 = 0.0291811 loss)
I0612 16:42:14.056344 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.73996 (* 0.0272727 = 0.0474535 loss)
I0612 16:42:14.056354 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.92507 (* 0.0272727 = 0.0525019 loss)
I0612 16:42:14.056363 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 2.06447 (* 0.0272727 = 0.0563038 loss)
I0612 16:42:14.056381 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.44724 (* 0.0272727 = 0.0394701 loss)
I0612 16:42:14.056403 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 1.21432 (* 0.0272727 = 0.0331179 loss)
I0612 16:42:14.056417 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 1.54886 (* 0.0272727 = 0.0422416 loss)
I0612 16:42:14.056437 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 1.74723 (* 0.0272727 = 0.0476517 loss)
I0612 16:42:14.056452 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.474605 (* 0.0272727 = 0.0129438 loss)
I0612 16:42:14.056465 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.558692 (* 0.0272727 = 0.015237 loss)
I0612 16:42:14.056478 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.444428 (* 0.0272727 = 0.0121208 loss)
I0612 16:42:14.056493 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.531889 (* 0.0272727 = 0.0145061 loss)
I0612 16:42:14.056506 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.647924 (* 0.0272727 = 0.0176707 loss)
I0612 16:42:14.056520 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.80294 (* 0.0272727 = 0.0218984 loss)
I0612 16:42:14.056535 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.0116394 (* 0.0272727 = 0.000317437 loss)
I0612 16:42:14.056550 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.00368487 (* 0.0272727 = 0.000100497 loss)
I0612 16:42:14.056563 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00101631 (* 0.0272727 = 2.77176e-05 loss)
I0612 16:42:14.056577 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.000339679 (* 0.0272727 = 9.26398e-06 loss)
I0612 16:42:14.056591 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000356042 (* 0.0272727 = 9.71025e-06 loss)
I0612 16:42:14.056605 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000873246 (* 0.0272727 = 2.38158e-05 loss)
I0612 16:42:14.056617 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.651515
I0612 16:42:14.056629 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.75
I0612 16:42:14.056641 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 16:42:14.056653 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 1
I0612 16:42:14.056664 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 16:42:14.056676 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 16:42:14.056689 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 16:42:14.056699 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 16:42:14.056711 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.75
I0612 16:42:14.056722 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.75
I0612 16:42:14.056735 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.75
I0612 16:42:14.056746 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 16:42:14.056757 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.75
I0612 16:42:14.056768 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.75
I0612 16:42:14.056780 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 16:42:14.056792 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.75
I0612 16:42:14.056803 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 0.875
I0612 16:42:14.056825 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 16:42:14.056838 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 16:42:14.056849 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 16:42:14.056861 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 16:42:14.056872 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 16:42:14.056885 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 16:42:14.056895 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.863636
I0612 16:42:14.056907 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.878788
I0612 16:42:14.056921 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.1171 (* 1 = 1.1171 loss)
I0612 16:42:14.056936 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.441985 (* 1 = 0.441985 loss)
I0612 16:42:14.056951 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.439412 (* 0.0909091 = 0.0399466 loss)
I0612 16:42:14.056964 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.428555 (* 0.0909091 = 0.0389596 loss)
I0612 16:42:14.056978 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.189757 (* 0.0909091 = 0.0172507 loss)
I0612 16:42:14.056993 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.00534 (* 0.0909091 = 0.0913948 loss)
I0612 16:42:14.057005 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.04192 (* 0.0909091 = 0.0947201 loss)
I0612 16:42:14.057019 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.911069 (* 0.0909091 = 0.0828245 loss)
I0612 16:42:14.057034 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.385055 (* 0.0909091 = 0.035005 loss)
I0612 16:42:14.057047 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.739319 (* 0.0909091 = 0.0672108 loss)
I0612 16:42:14.057061 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.864401 (* 0.0909091 = 0.078582 loss)
I0612 16:42:14.057075 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 1.04646 (* 0.0909091 = 0.0951331 loss)
I0612 16:42:14.057090 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.184126 (* 0.0909091 = 0.0167387 loss)
I0612 16:42:14.057102 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.52681 (* 0.0909091 = 0.0478918 loss)
I0612 16:42:14.057116 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.480659 (* 0.0909091 = 0.0436963 loss)
I0612 16:42:14.057131 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.259591 (* 0.0909091 = 0.0235992 loss)
I0612 16:42:14.057143 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.533779 (* 0.0909091 = 0.0485253 loss)
I0612 16:42:14.057157 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.454418 (* 0.0909091 = 0.0413107 loss)
I0612 16:42:14.057171 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0171886 (* 0.0909091 = 0.0015626 loss)
I0612 16:42:14.057188 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00529753 (* 0.0909091 = 0.000481594 loss)
I0612 16:42:14.057204 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.0029498 (* 0.0909091 = 0.000268164 loss)
I0612 16:42:14.057217 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00230647 (* 0.0909091 = 0.000209679 loss)
I0612 16:42:14.057231 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00257155 (* 0.0909091 = 0.000233777 loss)
I0612 16:42:14.057245 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.00077303 (* 0.0909091 = 7.02755e-05 loss)
I0612 16:42:14.057257 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 16:42:14.057270 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.375
I0612 16:42:14.057281 6181 solver.cpp:245] Train net output #149: total_confidence = 0.261101
I0612 16:42:14.057303 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.253901
I0612 16:42:14.057318 6181 sgd_solver.cpp:106] Iteration 8500, lr = 0.001
I0612 16:43:09.972951 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.4057 > 30) by scale factor 0.925762
I0612 16:44:44.165825 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.6102 > 30) by scale factor 0.892586
I0612 16:44:59.584374 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.9397 > 30) by scale factor 0.939271
I0612 16:45:18.900722 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.0236 > 30) by scale factor 0.967005
I0612 16:45:52.897974 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.8353 > 30) by scale factor 0.792911
I0612 16:46:01.395972 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.2935 > 30) by scale factor 0.92898
I0612 16:47:07.033380 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 52.8593 > 30) by scale factor 0.567544
I0612 16:48:40.123019 6181 solver.cpp:229] Iteration 9000, loss = 4.0775
I0612 16:48:40.123162 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.428571
I0612 16:48:40.123183 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 16:48:40.123198 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.375
I0612 16:48:40.123209 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.25
I0612 16:48:40.123224 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.375
I0612 16:48:40.123237 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 16:48:40.123250 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.25
I0612 16:48:40.123263 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 16:48:40.123275 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 16:48:40.123287 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.75
I0612 16:48:40.123299 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 16:48:40.123311 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 16:48:40.123323 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 16:48:40.123335 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 16:48:40.123348 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 16:48:40.123360 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 16:48:40.123371 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 16:48:40.123383 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 16:48:40.123395 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 16:48:40.123407 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 16:48:40.123419 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 16:48:40.123430 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 16:48:40.123442 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 16:48:40.123455 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.789773
I0612 16:48:40.123466 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.642857
I0612 16:48:40.123483 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.16412 (* 0.3 = 0.649235 loss)
I0612 16:48:40.123498 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.803322 (* 0.3 = 0.240997 loss)
I0612 16:48:40.123512 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.50607 (* 0.0272727 = 0.0410745 loss)
I0612 16:48:40.123528 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 3.23678 (* 0.0272727 = 0.0882758 loss)
I0612 16:48:40.123541 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.44938 (* 0.0272727 = 0.0668012 loss)
I0612 16:48:40.123555 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.5748 (* 0.0272727 = 0.0702219 loss)
I0612 16:48:40.123569 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.96487 (* 0.0272727 = 0.0535874 loss)
I0612 16:48:40.123584 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.0362 (* 0.0272727 = 0.0555327 loss)
I0612 16:48:40.123597 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 0.990035 (* 0.0272727 = 0.027001 loss)
I0612 16:48:40.123611 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.657977 (* 0.0272727 = 0.0179448 loss)
I0612 16:48:40.123625 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.446565 (* 0.0272727 = 0.012179 loss)
I0612 16:48:40.123639 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.68368 (* 0.0272727 = 0.0186458 loss)
I0612 16:48:40.123653 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.402011 (* 0.0272727 = 0.0109639 loss)
I0612 16:48:40.123668 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.321954 (* 0.0272727 = 0.00878058 loss)
I0612 16:48:40.123697 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.321002 (* 0.0272727 = 0.00875461 loss)
I0612 16:48:40.123713 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.0373583 (* 0.0272727 = 0.00101886 loss)
I0612 16:48:40.123728 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.0252049 (* 0.0272727 = 0.000687406 loss)
I0612 16:48:40.123741 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.00456486 (* 0.0272727 = 0.000124496 loss)
I0612 16:48:40.123755 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00161951 (* 0.0272727 = 4.41683e-05 loss)
I0612 16:48:40.123770 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.00113412 (* 0.0272727 = 3.09306e-05 loss)
I0612 16:48:40.123785 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000651915 (* 0.0272727 = 1.77795e-05 loss)
I0612 16:48:40.123798 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000203638 (* 0.0272727 = 5.55376e-06 loss)
I0612 16:48:40.123812 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000149198 (* 0.0272727 = 4.06905e-06 loss)
I0612 16:48:40.123826 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000206558 (* 0.0272727 = 5.63339e-06 loss)
I0612 16:48:40.123839 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.625
I0612 16:48:40.123852 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 16:48:40.123863 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.875
I0612 16:48:40.123875 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.625
I0612 16:48:40.123888 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.375
I0612 16:48:40.123899 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.75
I0612 16:48:40.123911 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.375
I0612 16:48:40.123924 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 16:48:40.123935 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 1
I0612 16:48:40.123946 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 16:48:40.123958 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 16:48:40.123970 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 1
I0612 16:48:40.123982 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 16:48:40.123994 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 16:48:40.124006 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 16:48:40.124017 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 16:48:40.124029 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 16:48:40.124040 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 16:48:40.124053 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 16:48:40.124063 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 16:48:40.124075 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 16:48:40.124088 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 16:48:40.124099 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 16:48:40.124109 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.852273
I0612 16:48:40.124122 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.857143
I0612 16:48:40.124135 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.19059 (* 0.3 = 0.357176 loss)
I0612 16:48:40.124150 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.485303 (* 0.3 = 0.145591 loss)
I0612 16:48:40.124164 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.930688 (* 0.0272727 = 0.0253824 loss)
I0612 16:48:40.124181 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.755777 (* 0.0272727 = 0.0206121 loss)
I0612 16:48:40.124207 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.08389 (* 0.0272727 = 0.0295606 loss)
I0612 16:48:40.124223 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 2.02147 (* 0.0272727 = 0.055131 loss)
I0612 16:48:40.124238 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.20862 (* 0.0272727 = 0.0329623 loss)
I0612 16:48:40.124253 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.34891 (* 0.0272727 = 0.0367884 loss)
I0612 16:48:40.124265 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.879581 (* 0.0272727 = 0.0239886 loss)
I0612 16:48:40.124282 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.361592 (* 0.0272727 = 0.0098616 loss)
I0612 16:48:40.124297 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.491328 (* 0.0272727 = 0.0133999 loss)
I0612 16:48:40.124311 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.394466 (* 0.0272727 = 0.0107582 loss)
I0612 16:48:40.124325 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.2493 (* 0.0272727 = 0.00679909 loss)
I0612 16:48:40.124342 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.439344 (* 0.0272727 = 0.0119821 loss)
I0612 16:48:40.124352 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.263046 (* 0.0272727 = 0.00717399 loss)
I0612 16:48:40.124367 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.0997451 (* 0.0272727 = 0.00272032 loss)
I0612 16:48:40.124382 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.0539131 (* 0.0272727 = 0.00147036 loss)
I0612 16:48:40.124395 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.0139238 (* 0.0272727 = 0.000379739 loss)
I0612 16:48:40.124409 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.00678985 (* 0.0272727 = 0.000185178 loss)
I0612 16:48:40.124423 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.0040782 (* 0.0272727 = 0.000111224 loss)
I0612 16:48:40.124438 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00100774 (* 0.0272727 = 2.74839e-05 loss)
I0612 16:48:40.124451 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00142186 (* 0.0272727 = 3.87781e-05 loss)
I0612 16:48:40.124465 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000284045 (* 0.0272727 = 7.74667e-06 loss)
I0612 16:48:40.124480 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000161826 (* 0.0272727 = 4.41344e-06 loss)
I0612 16:48:40.124491 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.803571
I0612 16:48:40.124503 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 16:48:40.124516 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 16:48:40.124527 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 16:48:40.124539 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 1
I0612 16:48:40.124550 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 1
I0612 16:48:40.124562 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 16:48:40.124573 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 16:48:40.124585 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 1
I0612 16:48:40.124596 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.75
I0612 16:48:40.124608 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 16:48:40.124620 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 16:48:40.124631 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 16:48:40.124644 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 16:48:40.124655 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 16:48:40.124666 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 16:48:40.124687 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 16:48:40.124701 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 16:48:40.124711 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 16:48:40.124723 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 16:48:40.124734 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 16:48:40.124747 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 16:48:40.124758 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 16:48:40.124768 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.909091
I0612 16:48:40.124780 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.928571
I0612 16:48:40.124794 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.747552 (* 1 = 0.747552 loss)
I0612 16:48:40.124809 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.300436 (* 1 = 0.300436 loss)
I0612 16:48:40.124822 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.692681 (* 0.0909091 = 0.062971 loss)
I0612 16:48:40.124837 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.749728 (* 0.0909091 = 0.0681571 loss)
I0612 16:48:40.124851 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 1.25406 (* 0.0909091 = 0.114006 loss)
I0612 16:48:40.124866 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.358653 (* 0.0909091 = 0.0326049 loss)
I0612 16:48:40.124879 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.23015 (* 0.0909091 = 0.0209227 loss)
I0612 16:48:40.124893 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.472959 (* 0.0909091 = 0.0429963 loss)
I0612 16:48:40.124907 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.391306 (* 0.0909091 = 0.0355733 loss)
I0612 16:48:40.124922 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.191022 (* 0.0909091 = 0.0173656 loss)
I0612 16:48:40.124934 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.339889 (* 0.0909091 = 0.030899 loss)
I0612 16:48:40.124948 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.372583 (* 0.0909091 = 0.0338712 loss)
I0612 16:48:40.124963 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.26738 (* 0.0909091 = 0.0243073 loss)
I0612 16:48:40.124976 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.410141 (* 0.0909091 = 0.0372856 loss)
I0612 16:48:40.124990 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.266314 (* 0.0909091 = 0.0242103 loss)
I0612 16:48:40.125005 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.132394 (* 0.0909091 = 0.0120358 loss)
I0612 16:48:40.125017 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.0345455 (* 0.0909091 = 0.0031405 loss)
I0612 16:48:40.125031 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.00866272 (* 0.0909091 = 0.00078752 loss)
I0612 16:48:40.125046 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0105124 (* 0.0909091 = 0.00095567 loss)
I0612 16:48:40.125061 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00309447 (* 0.0909091 = 0.000281315 loss)
I0612 16:48:40.125074 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00257921 (* 0.0909091 = 0.000234474 loss)
I0612 16:48:40.125088 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00138597 (* 0.0909091 = 0.000125997 loss)
I0612 16:48:40.125102 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00261295 (* 0.0909091 = 0.000237541 loss)
I0612 16:48:40.125116 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.00056191 (* 0.0909091 = 5.10828e-05 loss)
I0612 16:48:40.125129 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.375
I0612 16:48:40.125141 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 16:48:40.125162 6181 solver.cpp:245] Train net output #149: total_confidence = 0.145077
I0612 16:48:40.125176 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.235856
I0612 16:48:40.125190 6181 sgd_solver.cpp:106] Iteration 9000, lr = 0.001
I0612 16:49:39.935817 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.1356 > 30) by scale factor 0.807849
I0612 16:50:28.668800 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.0553 > 30) by scale factor 0.907569
I0612 16:50:42.583147 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.2028 > 30) by scale factor 0.961452
I0612 16:51:09.584115 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.9613 > 30) by scale factor 0.883359
I0612 16:51:12.667829 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 43.928 > 30) by scale factor 0.682936
I0612 16:51:19.608880 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.0627 > 30) by scale factor 0.809438
I0612 16:51:21.926403 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.0893 > 30) by scale factor 0.93489
I0612 16:51:31.974159 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 54.2161 > 30) by scale factor 0.553341
I0612 16:51:42.771576 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.0304 > 30) by scale factor 0.966793
I0612 16:52:00.516966 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.1233 > 30) by scale factor 0.854133
I0612 16:52:36.804390 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.8561 > 30) by scale factor 0.860681
I0612 16:52:52.248564 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 52.135 > 30) by scale factor 0.575429
I0612 16:53:23.144201 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.7566 > 30) by scale factor 0.863147
I0612 16:55:06.232306 6181 solver.cpp:229] Iteration 9500, loss = 3.99149
I0612 16:55:06.232393 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.396552
I0612 16:55:06.232412 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.5
I0612 16:55:06.232425 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.5
I0612 16:55:06.232437 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.625
I0612 16:55:06.232450 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.375
I0612 16:55:06.232461 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 16:55:06.232473 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.375
I0612 16:55:06.232486 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.5
I0612 16:55:06.232498 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.75
I0612 16:55:06.232511 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 16:55:06.232522 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 16:55:06.232534 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 16:55:06.232547 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 16:55:06.232558 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 16:55:06.232570 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 16:55:06.232583 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 16:55:06.232595 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 16:55:06.232607 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 0.875
I0612 16:55:06.232620 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 0.875
I0612 16:55:06.232631 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 16:55:06.232643 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 16:55:06.232655 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 16:55:06.232666 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 16:55:06.232678 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.795455
I0612 16:55:06.232691 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.62069
I0612 16:55:06.232705 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.49735 (* 0.3 = 0.749205 loss)
I0612 16:55:06.232720 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.846354 (* 0.3 = 0.253906 loss)
I0612 16:55:06.232735 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 2.20348 (* 0.0272727 = 0.0600949 loss)
I0612 16:55:06.232749 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 2.72458 (* 0.0272727 = 0.0743067 loss)
I0612 16:55:06.232764 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.34275 (* 0.0272727 = 0.0638931 loss)
I0612 16:55:06.232777 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 3.11375 (* 0.0272727 = 0.0849205 loss)
I0612 16:55:06.232791 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.5019 (* 0.0272727 = 0.040961 loss)
I0612 16:55:06.232805 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 3.23737 (* 0.0272727 = 0.0882918 loss)
I0612 16:55:06.232820 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.32934 (* 0.0272727 = 0.0362547 loss)
I0612 16:55:06.232833 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.35945 (* 0.0272727 = 0.0370759 loss)
I0612 16:55:06.232847 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.465201 (* 0.0272727 = 0.0126873 loss)
I0612 16:55:06.232862 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.297232 (* 0.0272727 = 0.00810634 loss)
I0612 16:55:06.232877 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.453203 (* 0.0272727 = 0.0123601 loss)
I0612 16:55:06.232890 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.289926 (* 0.0272727 = 0.00790707 loss)
I0612 16:55:06.232923 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.327247 (* 0.0272727 = 0.00892491 loss)
I0612 16:55:06.232938 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.247713 (* 0.0272727 = 0.00675581 loss)
I0612 16:55:06.232952 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.256282 (* 0.0272727 = 0.00698951 loss)
I0612 16:55:06.232966 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.453966 (* 0.0272727 = 0.0123809 loss)
I0612 16:55:06.232980 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.313261 (* 0.0272727 = 0.00854349 loss)
I0612 16:55:06.232995 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.289305 (* 0.0272727 = 0.00789014 loss)
I0612 16:55:06.233008 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00239988 (* 0.0272727 = 6.54514e-05 loss)
I0612 16:55:06.233023 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000426697 (* 0.0272727 = 1.16372e-05 loss)
I0612 16:55:06.233037 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000189948 (* 0.0272727 = 5.18039e-06 loss)
I0612 16:55:06.233052 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000277318 (* 0.0272727 = 7.56322e-06 loss)
I0612 16:55:06.233064 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.448276
I0612 16:55:06.233077 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 16:55:06.233088 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 16:55:06.233100 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.625
I0612 16:55:06.233111 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.5
I0612 16:55:06.233124 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.625
I0612 16:55:06.233135 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.5
I0612 16:55:06.233147 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.5
I0612 16:55:06.233158 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 16:55:06.233175 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 16:55:06.233188 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 16:55:06.233199 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 16:55:06.233211 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 16:55:06.233223 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 16:55:06.233235 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 16:55:06.233247 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 16:55:06.233258 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 16:55:06.233270 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 0.875
I0612 16:55:06.233283 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 0.875
I0612 16:55:06.233294 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 16:55:06.233305 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 16:55:06.233316 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 16:55:06.233343 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 16:55:06.233356 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.801136
I0612 16:55:06.233368 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.706897
I0612 16:55:06.233382 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 2.32713 (* 0.3 = 0.69814 loss)
I0612 16:55:06.233397 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.828278 (* 0.3 = 0.248483 loss)
I0612 16:55:06.233410 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.72212 (* 0.0272727 = 0.046967 loss)
I0612 16:55:06.233425 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 2.8851 (* 0.0272727 = 0.0786847 loss)
I0612 16:55:06.233451 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 2.40583 (* 0.0272727 = 0.0656135 loss)
I0612 16:55:06.233467 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 2.92864 (* 0.0272727 = 0.079872 loss)
I0612 16:55:06.233480 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 2.06744 (* 0.0272727 = 0.0563846 loss)
I0612 16:55:06.233494 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 2.95373 (* 0.0272727 = 0.0805563 loss)
I0612 16:55:06.233508 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.67847 (* 0.0272727 = 0.0457765 loss)
I0612 16:55:06.233522 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 1.55468 (* 0.0272727 = 0.0424004 loss)
I0612 16:55:06.233536 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.374515 (* 0.0272727 = 0.010214 loss)
I0612 16:55:06.233549 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.188212 (* 0.0272727 = 0.00513306 loss)
I0612 16:55:06.233563 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.357451 (* 0.0272727 = 0.00974868 loss)
I0612 16:55:06.233577 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.311115 (* 0.0272727 = 0.00848495 loss)
I0612 16:55:06.233592 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.391129 (* 0.0272727 = 0.0106671 loss)
I0612 16:55:06.233604 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.248188 (* 0.0272727 = 0.00676876 loss)
I0612 16:55:06.233618 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.416568 (* 0.0272727 = 0.0113609 loss)
I0612 16:55:06.233633 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.56396 (* 0.0272727 = 0.0153807 loss)
I0612 16:55:06.233645 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.581877 (* 0.0272727 = 0.0158694 loss)
I0612 16:55:06.233659 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.691001 (* 0.0272727 = 0.0188455 loss)
I0612 16:55:06.233674 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00101334 (* 0.0272727 = 2.76365e-05 loss)
I0612 16:55:06.233688 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.000305178 (* 0.0272727 = 8.32303e-06 loss)
I0612 16:55:06.233702 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000195319 (* 0.0272727 = 5.32689e-06 loss)
I0612 16:55:06.233716 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.00016846 (* 0.0272727 = 4.59437e-06 loss)
I0612 16:55:06.233728 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.672414
I0612 16:55:06.233741 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.75
I0612 16:55:06.233752 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 16:55:06.233763 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 16:55:06.233775 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.625
I0612 16:55:06.233786 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 16:55:06.233798 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 16:55:06.233810 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.625
I0612 16:55:06.233821 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 16:55:06.233834 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 16:55:06.233844 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 16:55:06.233856 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 16:55:06.233867 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 16:55:06.233880 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 16:55:06.233891 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 16:55:06.233902 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 16:55:06.233919 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 16:55:06.233928 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 16:55:06.233942 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 0.875
I0612 16:55:06.233954 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 16:55:06.233965 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 16:55:06.233976 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 16:55:06.233989 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 16:55:06.233999 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.886364
I0612 16:55:06.234011 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.810345
I0612 16:55:06.234025 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.61166 (* 1 = 1.61166 loss)
I0612 16:55:06.234038 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.579146 (* 1 = 0.579146 loss)
I0612 16:55:06.234052 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 2.01226 (* 0.0909091 = 0.182933 loss)
I0612 16:55:06.234066 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 1.43143 (* 0.0909091 = 0.13013 loss)
I0612 16:55:06.234079 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 1.96455 (* 0.0909091 = 0.178595 loss)
I0612 16:55:06.234093 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.69441 (* 0.0909091 = 0.154037 loss)
I0612 16:55:06.234107 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.61579 (* 0.0909091 = 0.14689 loss)
I0612 16:55:06.234120 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 1.81775 (* 0.0909091 = 0.16525 loss)
I0612 16:55:06.234133 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.195 (* 0.0909091 = 0.108637 loss)
I0612 16:55:06.234148 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.812282 (* 0.0909091 = 0.0738438 loss)
I0612 16:55:06.234160 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.169903 (* 0.0909091 = 0.0154457 loss)
I0612 16:55:06.234175 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.0267884 (* 0.0909091 = 0.00243531 loss)
I0612 16:55:06.234189 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.276017 (* 0.0909091 = 0.0250925 loss)
I0612 16:55:06.234202 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.204722 (* 0.0909091 = 0.0186111 loss)
I0612 16:55:06.234220 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.0798179 (* 0.0909091 = 0.00725618 loss)
I0612 16:55:06.234235 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.0102093 (* 0.0909091 = 0.000928119 loss)
I0612 16:55:06.234249 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.182955 (* 0.0909091 = 0.0166323 loss)
I0612 16:55:06.234263 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.0700223 (* 0.0909091 = 0.00636566 loss)
I0612 16:55:06.234277 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0984384 (* 0.0909091 = 0.00894894 loss)
I0612 16:55:06.234290 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.191234 (* 0.0909091 = 0.0173849 loss)
I0612 16:55:06.234304 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00177519 (* 0.0909091 = 0.000161381 loss)
I0612 16:55:06.234318 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00159536 (* 0.0909091 = 0.000145033 loss)
I0612 16:55:06.234331 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000709531 (* 0.0909091 = 6.45028e-05 loss)
I0612 16:55:06.234345 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000301052 (* 0.0909091 = 2.73683e-05 loss)
I0612 16:55:06.234357 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.375
I0612 16:55:06.234369 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 16:55:06.234390 6181 solver.cpp:245] Train net output #149: total_confidence = 0.42281
I0612 16:55:06.234403 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.348524
I0612 16:55:06.234416 6181 sgd_solver.cpp:106] Iteration 9500, lr = 0.001
I0612 16:57:58.804543 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 39.1469 > 30) by scale factor 0.766344
I0612 16:58:52.815533 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 45.6421 > 30) by scale factor 0.657288
I0612 16:59:27.542361 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.0525 > 30) by scale factor 0.966107
I0612 17:00:20.022415 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.406 > 30) by scale factor 0.898041
I0612 17:00:23.123388 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.0434 > 30) by scale factor 0.936231
I0612 17:00:58.680922 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.1528 > 30) by scale factor 0.853418
I0612 17:01:31.911170 6181 solver.cpp:456] Snapshotting to binary proto file /mnt/snapshots/mixed_lstm21_iter_10000.caffemodel
I0612 17:01:32.492070 6181 sgd_solver.cpp:273] Snapshotting solver state to binary proto file /mnt/snapshots/mixed_lstm21_iter_10000.solverstate
I0612 17:01:32.759300 6181 solver.cpp:338] Iteration 10000, Testing net (#0)
I0612 17:02:30.349463 6181 solver.cpp:393] Test loss: 2.84486
I0612 17:02:30.349592 6181 solver.cpp:406] Test net output #0: loss1/accuracy = 0.58284
I0612 17:02:30.349612 6181 solver.cpp:406] Test net output #1: loss1/accuracy01 = 0.771
I0612 17:02:30.349627 6181 solver.cpp:406] Test net output #2: loss1/accuracy02 = 0.642
I0612 17:02:30.349639 6181 solver.cpp:406] Test net output #3: loss1/accuracy03 = 0.499
I0612 17:02:30.349653 6181 solver.cpp:406] Test net output #4: loss1/accuracy04 = 0.485
I0612 17:02:30.349664 6181 solver.cpp:406] Test net output #5: loss1/accuracy05 = 0.486
I0612 17:02:30.349678 6181 solver.cpp:406] Test net output #6: loss1/accuracy06 = 0.697
I0612 17:02:30.349691 6181 solver.cpp:406] Test net output #7: loss1/accuracy07 = 0.852
I0612 17:02:30.349704 6181 solver.cpp:406] Test net output #8: loss1/accuracy08 = 0.921
I0612 17:02:30.349716 6181 solver.cpp:406] Test net output #9: loss1/accuracy09 = 0.966
I0612 17:02:30.349730 6181 solver.cpp:406] Test net output #10: loss1/accuracy10 = 0.986
I0612 17:02:30.349741 6181 solver.cpp:406] Test net output #11: loss1/accuracy11 = 0.996
I0612 17:02:30.349755 6181 solver.cpp:406] Test net output #12: loss1/accuracy12 = 0.999
I0612 17:02:30.349766 6181 solver.cpp:406] Test net output #13: loss1/accuracy13 = 1
I0612 17:02:30.349778 6181 solver.cpp:406] Test net output #14: loss1/accuracy14 = 1
I0612 17:02:30.349791 6181 solver.cpp:406] Test net output #15: loss1/accuracy15 = 1
I0612 17:02:30.349802 6181 solver.cpp:406] Test net output #16: loss1/accuracy16 = 1
I0612 17:02:30.349814 6181 solver.cpp:406] Test net output #17: loss1/accuracy17 = 1
I0612 17:02:30.349825 6181 solver.cpp:406] Test net output #18: loss1/accuracy18 = 1
I0612 17:02:30.349838 6181 solver.cpp:406] Test net output #19: loss1/accuracy19 = 1
I0612 17:02:30.349849 6181 solver.cpp:406] Test net output #20: loss1/accuracy20 = 1
I0612 17:02:30.349860 6181 solver.cpp:406] Test net output #21: loss1/accuracy21 = 1
I0612 17:02:30.349872 6181 solver.cpp:406] Test net output #22: loss1/accuracy22 = 1
I0612 17:02:30.349884 6181 solver.cpp:406] Test net output #23: loss1/accuracy_incl_empty = 0.883138
I0612 17:02:30.349896 6181 solver.cpp:406] Test net output #24: loss1/accuracy_top3 = 0.816667
I0612 17:02:30.349913 6181 solver.cpp:406] Test net output #25: loss1/cross_entropy_loss = 1.50009 (* 0.3 = 0.450028 loss)
I0612 17:02:30.349928 6181 solver.cpp:406] Test net output #26: loss1/cross_entropy_loss_incl_empty = 0.431025 (* 0.3 = 0.129307 loss)
I0612 17:02:30.349942 6181 solver.cpp:406] Test net output #27: loss1/loss01 = 1.00719 (* 0.0272727 = 0.0274688 loss)
I0612 17:02:30.349957 6181 solver.cpp:406] Test net output #28: loss1/loss02 = 1.42549 (* 0.0272727 = 0.0388769 loss)
I0612 17:02:30.349972 6181 solver.cpp:406] Test net output #29: loss1/loss03 = 1.78433 (* 0.0272727 = 0.0486636 loss)
I0612 17:02:30.349985 6181 solver.cpp:406] Test net output #30: loss1/loss04 = 1.87739 (* 0.0272727 = 0.0512015 loss)
I0612 17:02:30.349998 6181 solver.cpp:406] Test net output #31: loss1/loss05 = 1.71446 (* 0.0272727 = 0.0467581 loss)
I0612 17:02:30.350013 6181 solver.cpp:406] Test net output #32: loss1/loss06 = 1.07255 (* 0.0272727 = 0.0292514 loss)
I0612 17:02:30.350026 6181 solver.cpp:406] Test net output #33: loss1/loss07 = 0.543324 (* 0.0272727 = 0.0148179 loss)
I0612 17:02:30.350040 6181 solver.cpp:406] Test net output #34: loss1/loss08 = 0.281678 (* 0.0272727 = 0.00768214 loss)
I0612 17:02:30.350054 6181 solver.cpp:406] Test net output #35: loss1/loss09 = 0.141678 (* 0.0272727 = 0.00386396 loss)
I0612 17:02:30.350069 6181 solver.cpp:406] Test net output #36: loss1/loss10 = 0.0714613 (* 0.0272727 = 0.00194894 loss)
I0612 17:02:30.350083 6181 solver.cpp:406] Test net output #37: loss1/loss11 = 0.0212718 (* 0.0272727 = 0.000580139 loss)
I0612 17:02:30.350097 6181 solver.cpp:406] Test net output #38: loss1/loss12 = 0.011937 (* 0.0272727 = 0.000325554 loss)
I0612 17:02:30.350111 6181 solver.cpp:406] Test net output #39: loss1/loss13 = 0.0078558 (* 0.0272727 = 0.000214249 loss)
I0612 17:02:30.350145 6181 solver.cpp:406] Test net output #40: loss1/loss14 = 0.00545348 (* 0.0272727 = 0.000148731 loss)
I0612 17:02:30.350160 6181 solver.cpp:406] Test net output #41: loss1/loss15 = 0.00432662 (* 0.0272727 = 0.000117999 loss)
I0612 17:02:30.350174 6181 solver.cpp:406] Test net output #42: loss1/loss16 = 0.00364622 (* 0.0272727 = 9.94424e-05 loss)
I0612 17:02:30.350188 6181 solver.cpp:406] Test net output #43: loss1/loss17 = 0.00339211 (* 0.0272727 = 9.25122e-05 loss)
I0612 17:02:30.350203 6181 solver.cpp:406] Test net output #44: loss1/loss18 = 0.00294975 (* 0.0272727 = 8.04477e-05 loss)
I0612 17:02:30.350216 6181 solver.cpp:406] Test net output #45: loss1/loss19 = 0.00283517 (* 0.0272727 = 7.73227e-05 loss)
I0612 17:02:30.350234 6181 solver.cpp:406] Test net output #46: loss1/loss20 = 0.00239763 (* 0.0272727 = 6.53899e-05 loss)
I0612 17:02:30.350249 6181 solver.cpp:406] Test net output #47: loss1/loss21 = 0.00284724 (* 0.0272727 = 7.7652e-05 loss)
I0612 17:02:30.350262 6181 solver.cpp:406] Test net output #48: loss1/loss22 = 0.0026757 (* 0.0272727 = 7.29736e-05 loss)
I0612 17:02:30.350275 6181 solver.cpp:406] Test net output #49: loss2/accuracy = 0.752834
I0612 17:02:30.350287 6181 solver.cpp:406] Test net output #50: loss2/accuracy01 = 0.857
I0612 17:02:30.350299 6181 solver.cpp:406] Test net output #51: loss2/accuracy02 = 0.808
I0612 17:02:30.350311 6181 solver.cpp:406] Test net output #52: loss2/accuracy03 = 0.743
I0612 17:02:30.350322 6181 solver.cpp:406] Test net output #53: loss2/accuracy04 = 0.637
I0612 17:02:30.350334 6181 solver.cpp:406] Test net output #54: loss2/accuracy05 = 0.63
I0612 17:02:30.350347 6181 solver.cpp:406] Test net output #55: loss2/accuracy06 = 0.786
I0612 17:02:30.350358 6181 solver.cpp:406] Test net output #56: loss2/accuracy07 = 0.874
I0612 17:02:30.350369 6181 solver.cpp:406] Test net output #57: loss2/accuracy08 = 0.939
I0612 17:02:30.350381 6181 solver.cpp:406] Test net output #58: loss2/accuracy09 = 0.965
I0612 17:02:30.350392 6181 solver.cpp:406] Test net output #59: loss2/accuracy10 = 0.985
I0612 17:02:30.350404 6181 solver.cpp:406] Test net output #60: loss2/accuracy11 = 0.994
I0612 17:02:30.350415 6181 solver.cpp:406] Test net output #61: loss2/accuracy12 = 0.999
I0612 17:02:30.350427 6181 solver.cpp:406] Test net output #62: loss2/accuracy13 = 0.999
I0612 17:02:30.350440 6181 solver.cpp:406] Test net output #63: loss2/accuracy14 = 1
I0612 17:02:30.350450 6181 solver.cpp:406] Test net output #64: loss2/accuracy15 = 1
I0612 17:02:30.350462 6181 solver.cpp:406] Test net output #65: loss2/accuracy16 = 1
I0612 17:02:30.350473 6181 solver.cpp:406] Test net output #66: loss2/accuracy17 = 1
I0612 17:02:30.350484 6181 solver.cpp:406] Test net output #67: loss2/accuracy18 = 1
I0612 17:02:30.350495 6181 solver.cpp:406] Test net output #68: loss2/accuracy19 = 1
I0612 17:02:30.350507 6181 solver.cpp:406] Test net output #69: loss2/accuracy20 = 1
I0612 17:02:30.350518 6181 solver.cpp:406] Test net output #70: loss2/accuracy21 = 1
I0612 17:02:30.350529 6181 solver.cpp:406] Test net output #71: loss2/accuracy22 = 1
I0612 17:02:30.350541 6181 solver.cpp:406] Test net output #72: loss2/accuracy_incl_empty = 0.927047
I0612 17:02:30.350553 6181 solver.cpp:406] Test net output #73: loss2/accuracy_top3 = 0.893688
I0612 17:02:30.350567 6181 solver.cpp:406] Test net output #74: loss2/cross_entropy_loss = 1.00045 (* 0.3 = 0.300135 loss)
I0612 17:02:30.350581 6181 solver.cpp:406] Test net output #75: loss2/cross_entropy_loss_incl_empty = 0.29787 (* 0.3 = 0.0893611 loss)
I0612 17:02:30.350595 6181 solver.cpp:406] Test net output #76: loss2/loss01 = 0.721312 (* 0.0272727 = 0.0196721 loss)
I0612 17:02:30.350612 6181 solver.cpp:406] Test net output #77: loss2/loss02 = 0.900712 (* 0.0272727 = 0.0245649 loss)
I0612 17:02:30.350637 6181 solver.cpp:406] Test net output #78: loss2/loss03 = 1.12502 (* 0.0272727 = 0.0306823 loss)
I0612 17:02:30.350652 6181 solver.cpp:406] Test net output #79: loss2/loss04 = 1.31025 (* 0.0272727 = 0.0357341 loss)
I0612 17:02:30.350666 6181 solver.cpp:406] Test net output #80: loss2/loss05 = 1.20541 (* 0.0272727 = 0.0328748 loss)
I0612 17:02:30.350680 6181 solver.cpp:406] Test net output #81: loss2/loss06 = 0.799733 (* 0.0272727 = 0.0218109 loss)
I0612 17:02:30.350694 6181 solver.cpp:406] Test net output #82: loss2/loss07 = 0.448595 (* 0.0272727 = 0.0122344 loss)
I0612 17:02:30.350708 6181 solver.cpp:406] Test net output #83: loss2/loss08 = 0.233495 (* 0.0272727 = 0.00636805 loss)
I0612 17:02:30.350723 6181 solver.cpp:406] Test net output #84: loss2/loss09 = 0.122736 (* 0.0272727 = 0.00334733 loss)
I0612 17:02:30.350736 6181 solver.cpp:406] Test net output #85: loss2/loss10 = 0.0648696 (* 0.0272727 = 0.00176917 loss)
I0612 17:02:30.350750 6181 solver.cpp:406] Test net output #86: loss2/loss11 = 0.0198007 (* 0.0272727 = 0.000540019 loss)
I0612 17:02:30.350764 6181 solver.cpp:406] Test net output #87: loss2/loss12 = 0.00959624 (* 0.0272727 = 0.000261716 loss)
I0612 17:02:30.350778 6181 solver.cpp:406] Test net output #88: loss2/loss13 = 0.00577063 (* 0.0272727 = 0.000157381 loss)
I0612 17:02:30.350791 6181 solver.cpp:406] Test net output #89: loss2/loss14 = 0.00386598 (* 0.0272727 = 0.000105436 loss)
I0612 17:02:30.350805 6181 solver.cpp:406] Test net output #90: loss2/loss15 = 0.00287552 (* 0.0272727 = 7.84232e-05 loss)
I0612 17:02:30.350816 6181 solver.cpp:406] Test net output #91: loss2/loss16 = 0.0025124 (* 0.0272727 = 6.85199e-05 loss)
I0612 17:02:30.350826 6181 solver.cpp:406] Test net output #92: loss2/loss17 = 0.00236054 (* 0.0272727 = 6.43785e-05 loss)
I0612 17:02:30.350841 6181 solver.cpp:406] Test net output #93: loss2/loss18 = 0.00231403 (* 0.0272727 = 6.311e-05 loss)
I0612 17:02:30.350855 6181 solver.cpp:406] Test net output #94: loss2/loss19 = 0.00191369 (* 0.0272727 = 5.21916e-05 loss)
I0612 17:02:30.350869 6181 solver.cpp:406] Test net output #95: loss2/loss20 = 0.00171724 (* 0.0272727 = 4.68338e-05 loss)
I0612 17:02:30.350883 6181 solver.cpp:406] Test net output #96: loss2/loss21 = 0.00151292 (* 0.0272727 = 4.12613e-05 loss)
I0612 17:02:30.350898 6181 solver.cpp:406] Test net output #97: loss2/loss22 = 0.00112509 (* 0.0272727 = 3.06842e-05 loss)
I0612 17:02:30.350909 6181 solver.cpp:406] Test net output #98: loss3/accuracy = 0.842286
I0612 17:02:30.350921 6181 solver.cpp:406] Test net output #99: loss3/accuracy01 = 0.868
I0612 17:02:30.350934 6181 solver.cpp:406] Test net output #100: loss3/accuracy02 = 0.849
I0612 17:02:30.350945 6181 solver.cpp:406] Test net output #101: loss3/accuracy03 = 0.826
I0612 17:02:30.350957 6181 solver.cpp:406] Test net output #102: loss3/accuracy04 = 0.832
I0612 17:02:30.350968 6181 solver.cpp:406] Test net output #103: loss3/accuracy05 = 0.831
I0612 17:02:30.350980 6181 solver.cpp:406] Test net output #104: loss3/accuracy06 = 0.878
I0612 17:02:30.350991 6181 solver.cpp:406] Test net output #105: loss3/accuracy07 = 0.918
I0612 17:02:30.351002 6181 solver.cpp:406] Test net output #106: loss3/accuracy08 = 0.951
I0612 17:02:30.351013 6181 solver.cpp:406] Test net output #107: loss3/accuracy09 = 0.974
I0612 17:02:30.351025 6181 solver.cpp:406] Test net output #108: loss3/accuracy10 = 0.985
I0612 17:02:30.351037 6181 solver.cpp:406] Test net output #109: loss3/accuracy11 = 0.995
I0612 17:02:30.351048 6181 solver.cpp:406] Test net output #110: loss3/accuracy12 = 0.997
I0612 17:02:30.351059 6181 solver.cpp:406] Test net output #111: loss3/accuracy13 = 0.997
I0612 17:02:30.351070 6181 solver.cpp:406] Test net output #112: loss3/accuracy14 = 0.998
I0612 17:02:30.351083 6181 solver.cpp:406] Test net output #113: loss3/accuracy15 = 0.998
I0612 17:02:30.351094 6181 solver.cpp:406] Test net output #114: loss3/accuracy16 = 1
I0612 17:02:30.351114 6181 solver.cpp:406] Test net output #115: loss3/accuracy17 = 1
I0612 17:02:30.351126 6181 solver.cpp:406] Test net output #116: loss3/accuracy18 = 1
I0612 17:02:30.351138 6181 solver.cpp:406] Test net output #117: loss3/accuracy19 = 1
I0612 17:02:30.351150 6181 solver.cpp:406] Test net output #118: loss3/accuracy20 = 1
I0612 17:02:30.351161 6181 solver.cpp:406] Test net output #119: loss3/accuracy21 = 1
I0612 17:02:30.351171 6181 solver.cpp:406] Test net output #120: loss3/accuracy22 = 1
I0612 17:02:30.351182 6181 solver.cpp:406] Test net output #121: loss3/accuracy_incl_empty = 0.950864
I0612 17:02:30.351194 6181 solver.cpp:406] Test net output #122: loss3/accuracy_top3 = 0.912912
I0612 17:02:30.351208 6181 solver.cpp:406] Test net output #123: loss3/cross_entropy_loss = 0.72829 (* 1 = 0.72829 loss)
I0612 17:02:30.351222 6181 solver.cpp:406] Test net output #124: loss3/cross_entropy_loss_incl_empty = 0.230301 (* 1 = 0.230301 loss)
I0612 17:02:30.351235 6181 solver.cpp:406] Test net output #125: loss3/loss01 = 0.615025 (* 0.0909091 = 0.0559113 loss)
I0612 17:02:30.351248 6181 solver.cpp:406] Test net output #126: loss3/loss02 = 0.710971 (* 0.0909091 = 0.0646337 loss)
I0612 17:02:30.351261 6181 solver.cpp:406] Test net output #127: loss3/loss03 = 0.782309 (* 0.0909091 = 0.071119 loss)
I0612 17:02:30.351279 6181 solver.cpp:406] Test net output #128: loss3/loss04 = 0.784455 (* 0.0909091 = 0.0713141 loss)
I0612 17:02:30.351294 6181 solver.cpp:406] Test net output #129: loss3/loss05 = 0.766877 (* 0.0909091 = 0.0697161 loss)
I0612 17:02:30.351307 6181 solver.cpp:406] Test net output #130: loss3/loss06 = 0.555643 (* 0.0909091 = 0.050513 loss)
I0612 17:02:30.351321 6181 solver.cpp:406] Test net output #131: loss3/loss07 = 0.346912 (* 0.0909091 = 0.0315375 loss)
I0612 17:02:30.351336 6181 solver.cpp:406] Test net output #132: loss3/loss08 = 0.211283 (* 0.0909091 = 0.0192076 loss)
I0612 17:02:30.351348 6181 solver.cpp:406] Test net output #133: loss3/loss09 = 0.0989194 (* 0.0909091 = 0.00899268 loss)
I0612 17:02:30.351362 6181 solver.cpp:406] Test net output #134: loss3/loss10 = 0.0568215 (* 0.0909091 = 0.00516559 loss)
I0612 17:02:30.351377 6181 solver.cpp:406] Test net output #135: loss3/loss11 = 0.0229864 (* 0.0909091 = 0.00208967 loss)
I0612 17:02:30.351389 6181 solver.cpp:406] Test net output #136: loss3/loss12 = 0.0139232 (* 0.0909091 = 0.00126574 loss)
I0612 17:02:30.351404 6181 solver.cpp:406] Test net output #137: loss3/loss13 = 0.0087824 (* 0.0909091 = 0.0007984 loss)
I0612 17:02:30.351418 6181 solver.cpp:406] Test net output #138: loss3/loss14 = 0.00504597 (* 0.0909091 = 0.000458725 loss)
I0612 17:02:30.351433 6181 solver.cpp:406] Test net output #139: loss3/loss15 = 0.00368455 (* 0.0909091 = 0.000334959 loss)
I0612 17:02:30.351446 6181 solver.cpp:406] Test net output #140: loss3/loss16 = 0.00278035 (* 0.0909091 = 0.000252759 loss)
I0612 17:02:30.351459 6181 solver.cpp:406] Test net output #141: loss3/loss17 = 0.00263879 (* 0.0909091 = 0.00023989 loss)
I0612 17:02:30.351474 6181 solver.cpp:406] Test net output #142: loss3/loss18 = 0.00227333 (* 0.0909091 = 0.000206666 loss)
I0612 17:02:30.351487 6181 solver.cpp:406] Test net output #143: loss3/loss19 = 0.00219292 (* 0.0909091 = 0.000199356 loss)
I0612 17:02:30.351501 6181 solver.cpp:406] Test net output #144: loss3/loss20 = 0.00209581 (* 0.0909091 = 0.000190528 loss)
I0612 17:02:30.351516 6181 solver.cpp:406] Test net output #145: loss3/loss21 = 0.00158453 (* 0.0909091 = 0.000144048 loss)
I0612 17:02:30.351529 6181 solver.cpp:406] Test net output #146: loss3/loss22 = 0.000998169 (* 0.0909091 = 9.07427e-05 loss)
I0612 17:02:30.351541 6181 solver.cpp:406] Test net output #147: total_accuracy = 0.578
I0612 17:02:30.351553 6181 solver.cpp:406] Test net output #148: total_accuracy_not_rec = 0.546
I0612 17:02:30.351564 6181 solver.cpp:406] Test net output #149: total_confidence = 0.485671
I0612 17:02:30.351584 6181 solver.cpp:406] Test net output #150: total_confidence_not_rec = 0.431795
I0612 17:02:30.351599 6181 solver.cpp:338] Iteration 10000, Testing net (#1)
I0612 17:03:28.001616 6181 solver.cpp:393] Test loss: 3.75726
I0612 17:03:28.001749 6181 solver.cpp:406] Test net output #0: loss1/accuracy = 0.557617
I0612 17:03:28.001771 6181 solver.cpp:406] Test net output #1: loss1/accuracy01 = 0.764
I0612 17:03:28.001785 6181 solver.cpp:406] Test net output #2: loss1/accuracy02 = 0.647
I0612 17:03:28.001798 6181 solver.cpp:406] Test net output #3: loss1/accuracy03 = 0.522
I0612 17:03:28.001811 6181 solver.cpp:406] Test net output #4: loss1/accuracy04 = 0.476
I0612 17:03:28.001823 6181 solver.cpp:406] Test net output #5: loss1/accuracy05 = 0.484
I0612 17:03:28.001835 6181 solver.cpp:406] Test net output #6: loss1/accuracy06 = 0.656
I0612 17:03:28.001848 6181 solver.cpp:406] Test net output #7: loss1/accuracy07 = 0.75
I0612 17:03:28.001862 6181 solver.cpp:406] Test net output #8: loss1/accuracy08 = 0.809
I0612 17:03:28.001873 6181 solver.cpp:406] Test net output #9: loss1/accuracy09 = 0.839
I0612 17:03:28.001886 6181 solver.cpp:406] Test net output #10: loss1/accuracy10 = 0.866
I0612 17:03:28.001899 6181 solver.cpp:406] Test net output #11: loss1/accuracy11 = 0.885
I0612 17:03:28.001911 6181 solver.cpp:406] Test net output #12: loss1/accuracy12 = 0.899
I0612 17:03:28.001924 6181 solver.cpp:406] Test net output #13: loss1/accuracy13 = 0.916
I0612 17:03:28.001935 6181 solver.cpp:406] Test net output #14: loss1/accuracy14 = 0.937
I0612 17:03:28.001948 6181 solver.cpp:406] Test net output #15: loss1/accuracy15 = 0.955
I0612 17:03:28.001960 6181 solver.cpp:406] Test net output #16: loss1/accuracy16 = 0.966
I0612 17:03:28.001971 6181 solver.cpp:406] Test net output #17: loss1/accuracy17 = 0.982
I0612 17:03:28.001983 6181 solver.cpp:406] Test net output #18: loss1/accuracy18 = 0.984
I0612 17:03:28.001996 6181 solver.cpp:406] Test net output #19: loss1/accuracy19 = 0.985
I0612 17:03:28.002007 6181 solver.cpp:406] Test net output #20: loss1/accuracy20 = 0.993
I0612 17:03:28.002018 6181 solver.cpp:406] Test net output #21: loss1/accuracy21 = 0.998
I0612 17:03:28.002030 6181 solver.cpp:406] Test net output #22: loss1/accuracy22 = 0.999
I0612 17:03:28.002043 6181 solver.cpp:406] Test net output #23: loss1/accuracy_incl_empty = 0.840183
I0612 17:03:28.002055 6181 solver.cpp:406] Test net output #24: loss1/accuracy_top3 = 0.794109
I0612 17:03:28.002071 6181 solver.cpp:406] Test net output #25: loss1/cross_entropy_loss = 1.60643 (* 0.3 = 0.48193 loss)
I0612 17:03:28.002086 6181 solver.cpp:406] Test net output #26: loss1/cross_entropy_loss_incl_empty = 0.595793 (* 0.3 = 0.178738 loss)
I0612 17:03:28.002101 6181 solver.cpp:406] Test net output #27: loss1/loss01 = 1.04478 (* 0.0272727 = 0.0284941 loss)
I0612 17:03:28.002115 6181 solver.cpp:406] Test net output #28: loss1/loss02 = 1.37408 (* 0.0272727 = 0.037475 loss)
I0612 17:03:28.002130 6181 solver.cpp:406] Test net output #29: loss1/loss03 = 1.73161 (* 0.0272727 = 0.0472256 loss)
I0612 17:03:28.002143 6181 solver.cpp:406] Test net output #30: loss1/loss04 = 1.89089 (* 0.0272727 = 0.0515697 loss)
I0612 17:03:28.002157 6181 solver.cpp:406] Test net output #31: loss1/loss05 = 1.77645 (* 0.0272727 = 0.0484488 loss)
I0612 17:03:28.002171 6181 solver.cpp:406] Test net output #32: loss1/loss06 = 1.25252 (* 0.0272727 = 0.0341596 loss)
I0612 17:03:28.002184 6181 solver.cpp:406] Test net output #33: loss1/loss07 = 0.877268 (* 0.0272727 = 0.0239255 loss)
I0612 17:03:28.002198 6181 solver.cpp:406] Test net output #34: loss1/loss08 = 0.721259 (* 0.0272727 = 0.0196707 loss)
I0612 17:03:28.002212 6181 solver.cpp:406] Test net output #35: loss1/loss09 = 0.635991 (* 0.0272727 = 0.0173452 loss)
I0612 17:03:28.002228 6181 solver.cpp:406] Test net output #36: loss1/loss10 = 0.544643 (* 0.0272727 = 0.0148539 loss)
I0612 17:03:28.002243 6181 solver.cpp:406] Test net output #37: loss1/loss11 = 0.474393 (* 0.0272727 = 0.012938 loss)
I0612 17:03:28.002257 6181 solver.cpp:406] Test net output #38: loss1/loss12 = 0.406429 (* 0.0272727 = 0.0110844 loss)
I0612 17:03:28.002290 6181 solver.cpp:406] Test net output #39: loss1/loss13 = 0.359411 (* 0.0272727 = 0.00980211 loss)
I0612 17:03:28.002306 6181 solver.cpp:406] Test net output #40: loss1/loss14 = 0.275688 (* 0.0272727 = 0.00751877 loss)
I0612 17:03:28.002321 6181 solver.cpp:406] Test net output #41: loss1/loss15 = 0.233344 (* 0.0272727 = 0.00636393 loss)
I0612 17:03:28.002334 6181 solver.cpp:406] Test net output #42: loss1/loss16 = 0.186604 (* 0.0272727 = 0.0050892 loss)
I0612 17:03:28.002348 6181 solver.cpp:406] Test net output #43: loss1/loss17 = 0.119498 (* 0.0272727 = 0.00325903 loss)
I0612 17:03:28.002362 6181 solver.cpp:406] Test net output #44: loss1/loss18 = 0.110714 (* 0.0272727 = 0.00301947 loss)
I0612 17:03:28.002377 6181 solver.cpp:406] Test net output #45: loss1/loss19 = 0.103757 (* 0.0272727 = 0.00282973 loss)
I0612 17:03:28.002390 6181 solver.cpp:406] Test net output #46: loss1/loss20 = 0.0517907 (* 0.0272727 = 0.00141247 loss)
I0612 17:03:28.002404 6181 solver.cpp:406] Test net output #47: loss1/loss21 = 0.0154149 (* 0.0272727 = 0.000420407 loss)
I0612 17:03:28.002418 6181 solver.cpp:406] Test net output #48: loss1/loss22 = 0.0108564 (* 0.0272727 = 0.000296084 loss)
I0612 17:03:28.002430 6181 solver.cpp:406] Test net output #49: loss2/accuracy = 0.704104
I0612 17:03:28.002442 6181 solver.cpp:406] Test net output #50: loss2/accuracy01 = 0.857
I0612 17:03:28.002454 6181 solver.cpp:406] Test net output #51: loss2/accuracy02 = 0.825
I0612 17:03:28.002466 6181 solver.cpp:406] Test net output #52: loss2/accuracy03 = 0.723
I0612 17:03:28.002478 6181 solver.cpp:406] Test net output #53: loss2/accuracy04 = 0.632
I0612 17:03:28.002490 6181 solver.cpp:406] Test net output #54: loss2/accuracy05 = 0.635
I0612 17:03:28.002501 6181 solver.cpp:406] Test net output #55: loss2/accuracy06 = 0.73
I0612 17:03:28.002513 6181 solver.cpp:406] Test net output #56: loss2/accuracy07 = 0.791
I0612 17:03:28.002524 6181 solver.cpp:406] Test net output #57: loss2/accuracy08 = 0.827
I0612 17:03:28.002537 6181 solver.cpp:406] Test net output #58: loss2/accuracy09 = 0.854
I0612 17:03:28.002547 6181 solver.cpp:406] Test net output #59: loss2/accuracy10 = 0.877
I0612 17:03:28.002559 6181 solver.cpp:406] Test net output #60: loss2/accuracy11 = 0.896
I0612 17:03:28.002570 6181 solver.cpp:406] Test net output #61: loss2/accuracy12 = 0.91
I0612 17:03:28.002583 6181 solver.cpp:406] Test net output #62: loss2/accuracy13 = 0.919
I0612 17:03:28.002593 6181 solver.cpp:406] Test net output #63: loss2/accuracy14 = 0.937
I0612 17:03:28.002605 6181 solver.cpp:406] Test net output #64: loss2/accuracy15 = 0.954
I0612 17:03:28.002617 6181 solver.cpp:406] Test net output #65: loss2/accuracy16 = 0.966
I0612 17:03:28.002629 6181 solver.cpp:406] Test net output #66: loss2/accuracy17 = 0.982
I0612 17:03:28.002640 6181 solver.cpp:406] Test net output #67: loss2/accuracy18 = 0.984
I0612 17:03:28.002652 6181 solver.cpp:406] Test net output #68: loss2/accuracy19 = 0.985
I0612 17:03:28.002665 6181 solver.cpp:406] Test net output #69: loss2/accuracy20 = 0.993
I0612 17:03:28.002676 6181 solver.cpp:406] Test net output #70: loss2/accuracy21 = 0.998
I0612 17:03:28.002687 6181 solver.cpp:406] Test net output #71: loss2/accuracy22 = 0.999
I0612 17:03:28.002699 6181 solver.cpp:406] Test net output #72: loss2/accuracy_incl_empty = 0.880956
I0612 17:03:28.002712 6181 solver.cpp:406] Test net output #73: loss2/accuracy_top3 = 0.868071
I0612 17:03:28.002725 6181 solver.cpp:406] Test net output #74: loss2/cross_entropy_loss = 1.14304 (* 0.3 = 0.342913 loss)
I0612 17:03:28.002738 6181 solver.cpp:406] Test net output #75: loss2/cross_entropy_loss_incl_empty = 0.46335 (* 0.3 = 0.139005 loss)
I0612 17:03:28.002753 6181 solver.cpp:406] Test net output #76: loss2/loss01 = 0.700048 (* 0.0272727 = 0.0190922 loss)
I0612 17:03:28.002770 6181 solver.cpp:406] Test net output #77: loss2/loss02 = 0.824306 (* 0.0272727 = 0.0224811 loss)
I0612 17:03:28.002795 6181 solver.cpp:406] Test net output #78: loss2/loss03 = 1.12499 (* 0.0272727 = 0.0306814 loss)
I0612 17:03:28.002810 6181 solver.cpp:406] Test net output #79: loss2/loss04 = 1.29978 (* 0.0272727 = 0.0354486 loss)
I0612 17:03:28.002825 6181 solver.cpp:406] Test net output #80: loss2/loss05 = 1.30006 (* 0.0272727 = 0.0354561 loss)
I0612 17:03:28.002838 6181 solver.cpp:406] Test net output #81: loss2/loss06 = 0.985676 (* 0.0272727 = 0.0268821 loss)
I0612 17:03:28.002852 6181 solver.cpp:406] Test net output #82: loss2/loss07 = 0.730716 (* 0.0272727 = 0.0199286 loss)
I0612 17:03:28.002866 6181 solver.cpp:406] Test net output #83: loss2/loss08 = 0.643833 (* 0.0272727 = 0.0175591 loss)
I0612 17:03:28.002879 6181 solver.cpp:406] Test net output #84: loss2/loss09 = 0.570286 (* 0.0272727 = 0.0155533 loss)
I0612 17:03:28.002893 6181 solver.cpp:406] Test net output #85: loss2/loss10 = 0.485492 (* 0.0272727 = 0.0132407 loss)
I0612 17:03:28.002907 6181 solver.cpp:406] Test net output #86: loss2/loss11 = 0.431803 (* 0.0272727 = 0.0117764 loss)
I0612 17:03:28.002917 6181 solver.cpp:406] Test net output #87: loss2/loss12 = 0.357227 (* 0.0272727 = 0.00974255 loss)
I0612 17:03:28.002926 6181 solver.cpp:406] Test net output #88: loss2/loss13 = 0.333875 (* 0.0272727 = 0.00910568 loss)
I0612 17:03:28.002935 6181 solver.cpp:406] Test net output #89: loss2/loss14 = 0.257823 (* 0.0272727 = 0.00703153 loss)
I0612 17:03:28.002950 6181 solver.cpp:406] Test net output #90: loss2/loss15 = 0.229202 (* 0.0272727 = 0.00625097 loss)
I0612 17:03:28.002964 6181 solver.cpp:406] Test net output #91: loss2/loss16 = 0.18668 (* 0.0272727 = 0.00509129 loss)
I0612 17:03:28.002977 6181 solver.cpp:406] Test net output #92: loss2/loss17 = 0.131398 (* 0.0272727 = 0.00358358 loss)
I0612 17:03:28.002991 6181 solver.cpp:406] Test net output #93: loss2/loss18 = 0.117136 (* 0.0272727 = 0.00319462 loss)
I0612 17:03:28.003005 6181 solver.cpp:406] Test net output #94: loss2/loss19 = 0.119378 (* 0.0272727 = 0.00325577 loss)
I0612 17:03:28.003020 6181 solver.cpp:406] Test net output #95: loss2/loss20 = 0.0509042 (* 0.0272727 = 0.0013883 loss)
I0612 17:03:28.003033 6181 solver.cpp:406] Test net output #96: loss2/loss21 = 0.0168917 (* 0.0272727 = 0.000460682 loss)
I0612 17:03:28.003047 6181 solver.cpp:406] Test net output #97: loss2/loss22 = 0.00985338 (* 0.0272727 = 0.000268729 loss)
I0612 17:03:28.003059 6181 solver.cpp:406] Test net output #98: loss3/accuracy = 0.802594
I0612 17:03:28.003072 6181 solver.cpp:406] Test net output #99: loss3/accuracy01 = 0.868
I0612 17:03:28.003083 6181 solver.cpp:406] Test net output #100: loss3/accuracy02 = 0.86
I0612 17:03:28.003095 6181 solver.cpp:406] Test net output #101: loss3/accuracy03 = 0.819
I0612 17:03:28.003106 6181 solver.cpp:406] Test net output #102: loss3/accuracy04 = 0.81
I0612 17:03:28.003118 6181 solver.cpp:406] Test net output #103: loss3/accuracy05 = 0.796
I0612 17:03:28.003129 6181 solver.cpp:406] Test net output #104: loss3/accuracy06 = 0.837
I0612 17:03:28.003140 6181 solver.cpp:406] Test net output #105: loss3/accuracy07 = 0.879
I0612 17:03:28.003152 6181 solver.cpp:406] Test net output #106: loss3/accuracy08 = 0.877
I0612 17:03:28.003163 6181 solver.cpp:406] Test net output #107: loss3/accuracy09 = 0.893
I0612 17:03:28.003175 6181 solver.cpp:406] Test net output #108: loss3/accuracy10 = 0.901
I0612 17:03:28.003186 6181 solver.cpp:406] Test net output #109: loss3/accuracy11 = 0.914
I0612 17:03:28.003198 6181 solver.cpp:406] Test net output #110: loss3/accuracy12 = 0.935
I0612 17:03:28.003209 6181 solver.cpp:406] Test net output #111: loss3/accuracy13 = 0.935
I0612 17:03:28.003221 6181 solver.cpp:406] Test net output #112: loss3/accuracy14 = 0.948
I0612 17:03:28.003233 6181 solver.cpp:406] Test net output #113: loss3/accuracy15 = 0.963
I0612 17:03:28.003244 6181 solver.cpp:406] Test net output #114: loss3/accuracy16 = 0.969
I0612 17:03:28.003265 6181 solver.cpp:406] Test net output #115: loss3/accuracy17 = 0.984
I0612 17:03:28.003281 6181 solver.cpp:406] Test net output #116: loss3/accuracy18 = 0.985
I0612 17:03:28.003293 6181 solver.cpp:406] Test net output #117: loss3/accuracy19 = 0.984
I0612 17:03:28.003304 6181 solver.cpp:406] Test net output #118: loss3/accuracy20 = 0.994
I0612 17:03:28.003315 6181 solver.cpp:406] Test net output #119: loss3/accuracy21 = 0.998
I0612 17:03:28.003327 6181 solver.cpp:406] Test net output #120: loss3/accuracy22 = 0.999
I0612 17:03:28.003339 6181 solver.cpp:406] Test net output #121: loss3/accuracy_incl_empty = 0.918046
I0612 17:03:28.003350 6181 solver.cpp:406] Test net output #122: loss3/accuracy_top3 = 0.906638
I0612 17:03:28.003363 6181 solver.cpp:406] Test net output #123: loss3/cross_entropy_loss = 0.839087 (* 1 = 0.839087 loss)
I0612 17:03:28.003377 6181 solver.cpp:406] Test net output #124: loss3/cross_entropy_loss_incl_empty = 0.350248 (* 1 = 0.350248 loss)
I0612 17:03:28.003391 6181 solver.cpp:406] Test net output #125: loss3/loss01 = 0.610579 (* 0.0909091 = 0.0555072 loss)
I0612 17:03:28.003404 6181 solver.cpp:406] Test net output #126: loss3/loss02 = 0.668132 (* 0.0909091 = 0.0607393 loss)
I0612 17:03:28.003418 6181 solver.cpp:406] Test net output #127: loss3/loss03 = 0.819625 (* 0.0909091 = 0.0745113 loss)
I0612 17:03:28.003432 6181 solver.cpp:406] Test net output #128: loss3/loss04 = 0.82796 (* 0.0909091 = 0.0752691 loss)
I0612 17:03:28.003446 6181 solver.cpp:406] Test net output #129: loss3/loss05 = 0.914598 (* 0.0909091 = 0.0831453 loss)
I0612 17:03:28.003459 6181 solver.cpp:406] Test net output #130: loss3/loss06 = 0.683131 (* 0.0909091 = 0.0621029 loss)
I0612 17:03:28.003473 6181 solver.cpp:406] Test net output #131: loss3/loss07 = 0.52357 (* 0.0909091 = 0.0475973 loss)
I0612 17:03:28.003487 6181 solver.cpp:406] Test net output #132: loss3/loss08 = 0.47485 (* 0.0909091 = 0.0431682 loss)
I0612 17:03:28.003500 6181 solver.cpp:406] Test net output #133: loss3/loss09 = 0.425279 (* 0.0909091 = 0.0386618 loss)
I0612 17:03:28.003515 6181 solver.cpp:406] Test net output #134: loss3/loss10 = 0.381472 (* 0.0909091 = 0.0346793 loss)
I0612 17:03:28.003527 6181 solver.cpp:406] Test net output #135: loss3/loss11 = 0.3399 (* 0.0909091 = 0.0309 loss)
I0612 17:03:28.003541 6181 solver.cpp:406] Test net output #136: loss3/loss12 = 0.287735 (* 0.0909091 = 0.0261577 loss)
I0612 17:03:28.003556 6181 solver.cpp:406] Test net output #137: loss3/loss13 = 0.249709 (* 0.0909091 = 0.0227008 loss)
I0612 17:03:28.003569 6181 solver.cpp:406] Test net output #138: loss3/loss14 = 0.209892 (* 0.0909091 = 0.0190811 loss)
I0612 17:03:28.003582 6181 solver.cpp:406] Test net output #139: loss3/loss15 = 0.187759 (* 0.0909091 = 0.017069 loss)
I0612 17:03:28.003597 6181 solver.cpp:406] Test net output #140: loss3/loss16 = 0.157956 (* 0.0909091 = 0.0143597 loss)
I0612 17:03:28.003609 6181 solver.cpp:406] Test net output #141: loss3/loss17 = 0.114677 (* 0.0909091 = 0.0104252 loss)
I0612 17:03:28.003624 6181 solver.cpp:406] Test net output #142: loss3/loss18 = 0.104318 (* 0.0909091 = 0.00948347 loss)
I0612 17:03:28.003638 6181 solver.cpp:406] Test net output #143: loss3/loss19 = 0.104308 (* 0.0909091 = 0.00948255 loss)
I0612 17:03:28.003651 6181 solver.cpp:406] Test net output #144: loss3/loss20 = 0.0393379 (* 0.0909091 = 0.00357617 loss)
I0612 17:03:28.003665 6181 solver.cpp:406] Test net output #145: loss3/loss21 = 0.013775 (* 0.0909091 = 0.00125228 loss)
I0612 17:03:28.003679 6181 solver.cpp:406] Test net output #146: loss3/loss22 = 0.00879136 (* 0.0909091 = 0.000799215 loss)
I0612 17:03:28.003691 6181 solver.cpp:406] Test net output #147: total_accuracy = 0.499
I0612 17:03:28.003702 6181 solver.cpp:406] Test net output #148: total_accuracy_not_rec = 0.454
I0612 17:03:28.003713 6181 solver.cpp:406] Test net output #149: total_confidence = 0.426673
I0612 17:03:28.003734 6181 solver.cpp:406] Test net output #150: total_confidence_not_rec = 0.374191
I0612 17:03:28.362526 6181 solver.cpp:229] Iteration 10000, loss = 4.17127
I0612 17:03:28.362583 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.478261
I0612 17:03:28.362602 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 1
I0612 17:03:28.362615 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 17:03:28.362628 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 17:03:28.362642 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.25
I0612 17:03:28.362653 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 17:03:28.362666 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.25
I0612 17:03:28.362679 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 17:03:28.362690 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.625
I0612 17:03:28.362702 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.625
I0612 17:03:28.362715 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 17:03:28.362727 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.75
I0612 17:03:28.362738 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.75
I0612 17:03:28.362754 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 17:03:28.362767 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 17:03:28.362781 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 17:03:28.362792 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 17:03:28.362804 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 17:03:28.362817 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 17:03:28.362828 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 17:03:28.362839 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 17:03:28.362851 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 17:03:28.362864 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 17:03:28.362875 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.789773
I0612 17:03:28.362887 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.681159
I0612 17:03:28.362903 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.87104 (* 0.3 = 0.561312 loss)
I0612 17:03:28.362918 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.762356 (* 0.3 = 0.228707 loss)
I0612 17:03:28.362933 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.492877 (* 0.0272727 = 0.0134421 loss)
I0612 17:03:28.362947 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.44441 (* 0.0272727 = 0.0393931 loss)
I0612 17:03:28.362962 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.3965 (* 0.0272727 = 0.0380864 loss)
I0612 17:03:28.362977 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.363 (* 0.0272727 = 0.0644455 loss)
I0612 17:03:28.362998 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.65054 (* 0.0272727 = 0.0450148 loss)
I0612 17:03:28.363013 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.31059 (* 0.0272727 = 0.0630161 loss)
I0612 17:03:28.363026 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.42016 (* 0.0272727 = 0.0387316 loss)
I0612 17:03:28.363039 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.65169 (* 0.0272727 = 0.0450462 loss)
I0612 17:03:28.363054 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.994051 (* 0.0272727 = 0.0271105 loss)
I0612 17:03:28.363068 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.949748 (* 0.0272727 = 0.0259022 loss)
I0612 17:03:28.363082 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.777114 (* 0.0272727 = 0.021194 loss)
I0612 17:03:28.363123 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.841249 (* 0.0272727 = 0.0229432 loss)
I0612 17:03:28.363139 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.442525 (* 0.0272727 = 0.0120689 loss)
I0612 17:03:28.363154 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.782492 (* 0.0272727 = 0.0213407 loss)
I0612 17:03:28.363168 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.759824 (* 0.0272727 = 0.0207225 loss)
I0612 17:03:28.363181 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.872202 (* 0.0272727 = 0.0237873 loss)
I0612 17:03:28.363196 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00387243 (* 0.0272727 = 0.000105612 loss)
I0612 17:03:28.363211 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000821544 (* 0.0272727 = 2.24058e-05 loss)
I0612 17:03:28.363225 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00160492 (* 0.0272727 = 4.37705e-05 loss)
I0612 17:03:28.363240 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00112153 (* 0.0272727 = 3.05872e-05 loss)
I0612 17:03:28.363253 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00170442 (* 0.0272727 = 4.64843e-05 loss)
I0612 17:03:28.363267 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00160029 (* 0.0272727 = 4.36442e-05 loss)
I0612 17:03:28.363281 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.594203
I0612 17:03:28.363292 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 17:03:28.363304 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.875
I0612 17:03:28.363317 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.875
I0612 17:03:28.363328 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.375
I0612 17:03:28.363340 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.375
I0612 17:03:28.363353 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.375
I0612 17:03:28.363363 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 17:03:28.363375 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 17:03:28.363387 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 17:03:28.363399 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.75
I0612 17:03:28.363410 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.75
I0612 17:03:28.363422 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.75
I0612 17:03:28.363433 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 17:03:28.363445 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 17:03:28.363457 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 17:03:28.363468 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 17:03:28.363481 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 17:03:28.363492 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 17:03:28.363503 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 17:03:28.363515 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 17:03:28.363526 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 17:03:28.363538 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 17:03:28.363549 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.835227
I0612 17:03:28.363561 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.826087
I0612 17:03:28.363575 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.39105 (* 0.3 = 0.417315 loss)
I0612 17:03:28.363590 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.572055 (* 0.3 = 0.171617 loss)
I0612 17:03:28.363615 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.448459 (* 0.0272727 = 0.0122307 loss)
I0612 17:03:28.363631 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.45637 (* 0.0272727 = 0.0124464 loss)
I0612 17:03:28.363641 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.633059 (* 0.0272727 = 0.0172652 loss)
I0612 17:03:28.363651 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.21518 (* 0.0272727 = 0.0331413 loss)
I0612 17:03:28.363667 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 2.65343 (* 0.0272727 = 0.0723662 loss)
I0612 17:03:28.363680 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.50157 (* 0.0272727 = 0.040952 loss)
I0612 17:03:28.363700 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.10573 (* 0.0272727 = 0.0301562 loss)
I0612 17:03:28.363715 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.904093 (* 0.0272727 = 0.0246571 loss)
I0612 17:03:28.363729 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.929035 (* 0.0272727 = 0.0253373 loss)
I0612 17:03:28.363744 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.775029 (* 0.0272727 = 0.0211372 loss)
I0612 17:03:28.363759 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.643835 (* 0.0272727 = 0.0175591 loss)
I0612 17:03:28.363772 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.91276 (* 0.0272727 = 0.0248935 loss)
I0612 17:03:28.363786 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.336241 (* 0.0272727 = 0.00917021 loss)
I0612 17:03:28.363803 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.582675 (* 0.0272727 = 0.0158911 loss)
I0612 17:03:28.363818 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.585449 (* 0.0272727 = 0.0159668 loss)
I0612 17:03:28.363832 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.998081 (* 0.0272727 = 0.0272204 loss)
I0612 17:03:28.363847 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 7.78733e-05 (* 0.0272727 = 2.12382e-06 loss)
I0612 17:03:28.363862 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 9.37311e-06 (* 0.0272727 = 2.5563e-07 loss)
I0612 17:03:28.363875 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 4.69392e-06 (* 0.0272727 = 1.28016e-07 loss)
I0612 17:03:28.363889 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 3.20379e-06 (* 0.0272727 = 8.7376e-08 loss)
I0612 17:03:28.363904 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 3.20378e-06 (* 0.0272727 = 8.73759e-08 loss)
I0612 17:03:28.363919 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 1.81795e-06 (* 0.0272727 = 4.95806e-08 loss)
I0612 17:03:28.363930 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.797101
I0612 17:03:28.363942 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 1
I0612 17:03:28.363955 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 1
I0612 17:03:28.363965 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 1
I0612 17:03:28.363977 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 1
I0612 17:03:28.363988 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.625
I0612 17:03:28.364001 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 17:03:28.364012 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 17:03:28.364023 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.75
I0612 17:03:28.364034 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.75
I0612 17:03:28.364047 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.75
I0612 17:03:28.364058 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.75
I0612 17:03:28.364069 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 17:03:28.364080 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 17:03:28.364102 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.875
I0612 17:03:28.364116 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 17:03:28.364127 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 0.875
I0612 17:03:28.364140 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 17:03:28.364151 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 17:03:28.364161 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 17:03:28.364176 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 17:03:28.364188 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 17:03:28.364199 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 17:03:28.364212 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.909091
I0612 17:03:28.364223 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.956522
I0612 17:03:28.364238 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.695626 (* 1 = 0.695626 loss)
I0612 17:03:28.364251 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.301596 (* 1 = 0.301596 loss)
I0612 17:03:28.364266 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.043629 (* 0.0909091 = 0.00396627 loss)
I0612 17:03:28.364280 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.074562 (* 0.0909091 = 0.00677837 loss)
I0612 17:03:28.364295 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.1851 (* 0.0909091 = 0.0168272 loss)
I0612 17:03:28.364308 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.053012 (* 0.0909091 = 0.00481927 loss)
I0612 17:03:28.364322 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.01928 (* 0.0909091 = 0.0926621 loss)
I0612 17:03:28.364336 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.692696 (* 0.0909091 = 0.0629724 loss)
I0612 17:03:28.364349 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.399892 (* 0.0909091 = 0.0363538 loss)
I0612 17:03:28.364363 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.558715 (* 0.0909091 = 0.0507923 loss)
I0612 17:03:28.364377 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.524657 (* 0.0909091 = 0.0476961 loss)
I0612 17:03:28.364392 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.493071 (* 0.0909091 = 0.0448246 loss)
I0612 17:03:28.364405 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.643206 (* 0.0909091 = 0.0584733 loss)
I0612 17:03:28.364418 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.144671 (* 0.0909091 = 0.0131519 loss)
I0612 17:03:28.364434 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.260411 (* 0.0909091 = 0.0236737 loss)
I0612 17:03:28.364446 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.246426 (* 0.0909091 = 0.0224023 loss)
I0612 17:03:28.364460 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.221832 (* 0.0909091 = 0.0201665 loss)
I0612 17:03:28.364475 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.485132 (* 0.0909091 = 0.0441029 loss)
I0612 17:03:28.364488 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.00220689 (* 0.0909091 = 0.000200626 loss)
I0612 17:03:28.364502 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.000652391 (* 0.0909091 = 5.93082e-05 loss)
I0612 17:03:28.364516 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.000778729 (* 0.0909091 = 7.07936e-05 loss)
I0612 17:03:28.364531 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00016587 (* 0.0909091 = 1.50791e-05 loss)
I0612 17:03:28.364544 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 5.5371e-05 (* 0.0909091 = 5.03373e-06 loss)
I0612 17:03:28.364558 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 1.80013e-05 (* 0.0909091 = 1.63648e-06 loss)
I0612 17:03:28.364580 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 17:03:28.364593 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 17:03:28.364605 6181 solver.cpp:245] Train net output #149: total_confidence = 0.384742
I0612 17:03:28.364617 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.367248
I0612 17:03:28.364630 6181 sgd_solver.cpp:106] Iteration 10000, lr = 0.001
I0612 17:03:29.503659 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 41.7649 > 30) by scale factor 0.718307
I0612 17:03:37.222497 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.799 > 30) by scale factor 0.81524
I0612 17:04:38.288465 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.2914 > 30) by scale factor 0.990381
I0612 17:05:00.776741 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.7824 > 30) by scale factor 0.94392
I0612 17:05:36.340175 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.6725 > 30) by scale factor 0.775745
I0612 17:07:33.963937 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 42.439 > 30) by scale factor 0.706898
I0612 17:08:45.121105 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.4281 > 30) by scale factor 0.871382
I0612 17:09:09.071055 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 57.7252 > 30) by scale factor 0.519703
I0612 17:09:55.115447 6181 solver.cpp:229] Iteration 10500, loss = 4.03087
I0612 17:09:55.115562 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.433962
I0612 17:09:55.115584 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.625
I0612 17:09:55.115599 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 17:09:55.115612 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 17:09:55.115625 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.125
I0612 17:09:55.115638 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.625
I0612 17:09:55.115653 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.375
I0612 17:09:55.115664 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 17:09:55.115677 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.75
I0612 17:09:55.115690 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 17:09:55.115703 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 17:09:55.115716 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 1
I0612 17:09:55.115730 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 17:09:55.115741 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 17:09:55.115753 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 17:09:55.115766 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 17:09:55.115777 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 17:09:55.115789 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 17:09:55.115802 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 17:09:55.115813 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 17:09:55.115824 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 17:09:55.115836 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 17:09:55.115849 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 17:09:55.115860 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.818182
I0612 17:09:55.115874 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.735849
I0612 17:09:55.115895 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.71913 (* 0.3 = 0.515739 loss)
I0612 17:09:55.115911 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.560836 (* 0.3 = 0.168251 loss)
I0612 17:09:55.115924 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.13207 (* 0.0272727 = 0.0308746 loss)
I0612 17:09:55.115947 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.42602 (* 0.0272727 = 0.0388915 loss)
I0612 17:09:55.115960 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.86333 (* 0.0272727 = 0.0508181 loss)
I0612 17:09:55.115975 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.15111 (* 0.0272727 = 0.0586666 loss)
I0612 17:09:55.115989 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.58124 (* 0.0272727 = 0.0431248 loss)
I0612 17:09:55.116003 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.84764 (* 0.0272727 = 0.0503902 loss)
I0612 17:09:55.116017 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.31966 (* 0.0272727 = 0.0359907 loss)
I0612 17:09:55.116031 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.813526 (* 0.0272727 = 0.0221871 loss)
I0612 17:09:55.116045 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.439577 (* 0.0272727 = 0.0119885 loss)
I0612 17:09:55.116060 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.596568 (* 0.0272727 = 0.01627 loss)
I0612 17:09:55.116075 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.0505105 (* 0.0272727 = 0.00137756 loss)
I0612 17:09:55.116088 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.0243855 (* 0.0272727 = 0.000665059 loss)
I0612 17:09:55.116122 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.00992114 (* 0.0272727 = 0.000270576 loss)
I0612 17:09:55.116138 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.00556375 (* 0.0272727 = 0.000151739 loss)
I0612 17:09:55.116153 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.0045381 (* 0.0272727 = 0.000123766 loss)
I0612 17:09:55.116168 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.0021077 (* 0.0272727 = 5.74827e-05 loss)
I0612 17:09:55.116181 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00299905 (* 0.0272727 = 8.17923e-05 loss)
I0612 17:09:55.116196 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.0029467 (* 0.0272727 = 8.03646e-05 loss)
I0612 17:09:55.116210 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00200887 (* 0.0272727 = 5.47874e-05 loss)
I0612 17:09:55.116227 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.001076 (* 0.0272727 = 2.93455e-05 loss)
I0612 17:09:55.116243 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00295521 (* 0.0272727 = 8.05966e-05 loss)
I0612 17:09:55.116257 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00203495 (* 0.0272727 = 5.54987e-05 loss)
I0612 17:09:55.116271 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.566038
I0612 17:09:55.116282 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 17:09:55.116295 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 17:09:55.116307 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.75
I0612 17:09:55.116319 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.375
I0612 17:09:55.116333 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 17:09:55.116344 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.75
I0612 17:09:55.116356 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 17:09:55.116367 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 17:09:55.116379 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 17:09:55.116391 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 17:09:55.116403 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 1
I0612 17:09:55.116415 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 17:09:55.116426 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 17:09:55.116438 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 17:09:55.116451 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 17:09:55.116462 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 17:09:55.116473 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 17:09:55.116485 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 17:09:55.116497 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 17:09:55.116508 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 17:09:55.116520 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 17:09:55.116533 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 17:09:55.116544 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.852273
I0612 17:09:55.116554 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.773585
I0612 17:09:55.116564 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.53619 (* 0.3 = 0.460857 loss)
I0612 17:09:55.116581 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.518518 (* 0.3 = 0.155555 loss)
I0612 17:09:55.116596 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.20551 (* 0.0272727 = 0.0328776 loss)
I0612 17:09:55.116611 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.01994 (* 0.0272727 = 0.0278167 loss)
I0612 17:09:55.116637 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.00637 (* 0.0272727 = 0.0274465 loss)
I0612 17:09:55.116652 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 2.09274 (* 0.0272727 = 0.0570747 loss)
I0612 17:09:55.116667 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.49912 (* 0.0272727 = 0.0408851 loss)
I0612 17:09:55.116680 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.26738 (* 0.0272727 = 0.0345649 loss)
I0612 17:09:55.116694 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.14323 (* 0.0272727 = 0.0311791 loss)
I0612 17:09:55.116708 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.772386 (* 0.0272727 = 0.0210651 loss)
I0612 17:09:55.116722 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.837856 (* 0.0272727 = 0.0228506 loss)
I0612 17:09:55.116737 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.409196 (* 0.0272727 = 0.0111599 loss)
I0612 17:09:55.116751 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.119785 (* 0.0272727 = 0.00326686 loss)
I0612 17:09:55.116765 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.0544056 (* 0.0272727 = 0.00148379 loss)
I0612 17:09:55.116780 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.0434882 (* 0.0272727 = 0.00118604 loss)
I0612 17:09:55.116793 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.0223173 (* 0.0272727 = 0.000608653 loss)
I0612 17:09:55.116808 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.0139012 (* 0.0272727 = 0.000379123 loss)
I0612 17:09:55.116822 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.0158467 (* 0.0272727 = 0.000432182 loss)
I0612 17:09:55.116837 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.0109249 (* 0.0272727 = 0.000297951 loss)
I0612 17:09:55.116850 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.0145963 (* 0.0272727 = 0.000398082 loss)
I0612 17:09:55.116864 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.0181896 (* 0.0272727 = 0.00049608 loss)
I0612 17:09:55.116878 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.012006 (* 0.0272727 = 0.000327437 loss)
I0612 17:09:55.116894 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.0141048 (* 0.0272727 = 0.000384677 loss)
I0612 17:09:55.116907 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.0162982 (* 0.0272727 = 0.000444495 loss)
I0612 17:09:55.116919 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.735849
I0612 17:09:55.116931 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.75
I0612 17:09:55.116943 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 17:09:55.116955 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 17:09:55.116967 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 17:09:55.116979 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 17:09:55.116991 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.625
I0612 17:09:55.117002 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 17:09:55.117014 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 1
I0612 17:09:55.117027 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.75
I0612 17:09:55.117038 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 17:09:55.117049 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 17:09:55.117061 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 17:09:55.117074 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 17:09:55.117084 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 17:09:55.117096 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 17:09:55.117108 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 17:09:55.117130 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 17:09:55.117143 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 17:09:55.117156 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 17:09:55.117167 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 17:09:55.117179 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 17:09:55.117192 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 17:09:55.117202 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.914773
I0612 17:09:55.117214 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.849057
I0612 17:09:55.117228 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.991506 (* 1 = 0.991506 loss)
I0612 17:09:55.117243 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.320385 (* 1 = 0.320385 loss)
I0612 17:09:55.117257 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.72298 (* 0.0909091 = 0.0657255 loss)
I0612 17:09:55.117271 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.579304 (* 0.0909091 = 0.052664 loss)
I0612 17:09:55.117302 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.814592 (* 0.0909091 = 0.0740538 loss)
I0612 17:09:55.117317 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.5582 (* 0.0909091 = 0.0507455 loss)
I0612 17:09:55.117331 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.929542 (* 0.0909091 = 0.0845038 loss)
I0612 17:09:55.117346 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 1.19871 (* 0.0909091 = 0.108973 loss)
I0612 17:09:55.117360 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.03674 (* 0.0909091 = 0.0942488 loss)
I0612 17:09:55.117374 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.302604 (* 0.0909091 = 0.0275095 loss)
I0612 17:09:55.117388 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.633342 (* 0.0909091 = 0.0575765 loss)
I0612 17:09:55.117403 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.203352 (* 0.0909091 = 0.0184866 loss)
I0612 17:09:55.117418 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.10496 (* 0.0909091 = 0.0095418 loss)
I0612 17:09:55.117432 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.0220716 (* 0.0909091 = 0.00200651 loss)
I0612 17:09:55.117446 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.00384349 (* 0.0909091 = 0.000349408 loss)
I0612 17:09:55.117461 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.00171334 (* 0.0909091 = 0.000155758 loss)
I0612 17:09:55.117475 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.000735678 (* 0.0909091 = 6.68798e-05 loss)
I0612 17:09:55.117491 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.000589293 (* 0.0909091 = 5.35721e-05 loss)
I0612 17:09:55.117506 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.000512403 (* 0.0909091 = 4.65821e-05 loss)
I0612 17:09:55.117534 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.000608491 (* 0.0909091 = 5.53174e-05 loss)
I0612 17:09:55.117569 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.000660782 (* 0.0909091 = 6.00711e-05 loss)
I0612 17:09:55.117595 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000628227 (* 0.0909091 = 5.71115e-05 loss)
I0612 17:09:55.117610 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00108505 (* 0.0909091 = 9.86407e-05 loss)
I0612 17:09:55.117629 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000878446 (* 0.0909091 = 7.98587e-05 loss)
I0612 17:09:55.117641 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 17:09:55.117655 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 17:09:55.117679 6181 solver.cpp:245] Train net output #149: total_confidence = 0.382825
I0612 17:09:55.117692 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.390017
I0612 17:09:55.117707 6181 sgd_solver.cpp:106] Iteration 10500, lr = 0.001
I0612 17:10:06.317948 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.9647 > 30) by scale factor 0.968846
I0612 17:11:06.693161 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.0865 > 30) by scale factor 0.934973
I0612 17:11:10.559882 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.0283 > 30) by scale factor 0.936672
I0612 17:12:10.838280 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 69.6807 > 30) by scale factor 0.430535
I0612 17:13:11.978929 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.427 > 30) by scale factor 0.897477
I0612 17:13:15.845185 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.2326 > 30) by scale factor 0.960533
I0612 17:13:49.871265 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.875 > 30) by scale factor 0.771705
I0612 17:14:54.060112 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 41.2591 > 30) by scale factor 0.727113
I0612 17:16:00.561991 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.0503 > 30) by scale factor 0.788431
I0612 17:16:21.861727 6181 solver.cpp:229] Iteration 11000, loss = 4.04844
I0612 17:16:21.861799 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.534483
I0612 17:16:21.861819 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 17:16:21.861832 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 17:16:21.861845 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.75
I0612 17:16:21.861858 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.25
I0612 17:16:21.861871 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.625
I0612 17:16:21.861884 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.75
I0612 17:16:21.861897 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.75
I0612 17:16:21.861910 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.75
I0612 17:16:21.861922 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.75
I0612 17:16:21.861938 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 17:16:21.861953 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.75
I0612 17:16:21.861965 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.75
I0612 17:16:21.861979 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 17:16:21.861991 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 17:16:21.862004 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 17:16:21.862015 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 17:16:21.862028 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 17:16:21.862040 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 17:16:21.862052 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 17:16:21.862063 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 17:16:21.862076 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 17:16:21.862087 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 17:16:21.862099 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.829545
I0612 17:16:21.862112 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.672414
I0612 17:16:21.862129 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.61541 (* 0.3 = 0.484622 loss)
I0612 17:16:21.862148 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.569585 (* 0.3 = 0.170875 loss)
I0612 17:16:21.862162 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.717224 (* 0.0272727 = 0.0195607 loss)
I0612 17:16:21.862177 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.16949 (* 0.0272727 = 0.0318951 loss)
I0612 17:16:21.862191 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.11738 (* 0.0272727 = 0.0304739 loss)
I0612 17:16:21.862206 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.13146 (* 0.0272727 = 0.0581307 loss)
I0612 17:16:21.862221 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.49653 (* 0.0272727 = 0.0408144 loss)
I0612 17:16:21.862234 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 0.969366 (* 0.0272727 = 0.0264372 loss)
I0612 17:16:21.862248 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 0.63974 (* 0.0272727 = 0.0174475 loss)
I0612 17:16:21.862262 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.827932 (* 0.0272727 = 0.02258 loss)
I0612 17:16:21.862277 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.767453 (* 0.0272727 = 0.0209305 loss)
I0612 17:16:21.862290 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.925351 (* 0.0272727 = 0.0252368 loss)
I0612 17:16:21.862305 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.941799 (* 0.0272727 = 0.0256854 loss)
I0612 17:16:21.862319 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.51367 (* 0.0272727 = 0.0140092 loss)
I0612 17:16:21.862366 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.584746 (* 0.0272727 = 0.0159476 loss)
I0612 17:16:21.862382 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.673755 (* 0.0272727 = 0.0183751 loss)
I0612 17:16:21.862396 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.644973 (* 0.0272727 = 0.0175902 loss)
I0612 17:16:21.862411 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.00349693 (* 0.0272727 = 9.53709e-05 loss)
I0612 17:16:21.862426 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.000967695 (* 0.0272727 = 2.63917e-05 loss)
I0612 17:16:21.862440 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000320368 (* 0.0272727 = 8.7373e-06 loss)
I0612 17:16:21.862455 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000164968 (* 0.0272727 = 4.49914e-06 loss)
I0612 17:16:21.862470 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 6.01755e-05 (* 0.0272727 = 1.64115e-06 loss)
I0612 17:16:21.862484 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 6.34992e-05 (* 0.0272727 = 1.7318e-06 loss)
I0612 17:16:21.862499 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 1.19808e-05 (* 0.0272727 = 3.26749e-07 loss)
I0612 17:16:21.862511 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.672414
I0612 17:16:21.862524 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 17:16:21.862536 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.625
I0612 17:16:21.862548 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.75
I0612 17:16:21.862560 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.625
I0612 17:16:21.862572 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.375
I0612 17:16:21.862584 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.875
I0612 17:16:21.862596 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 17:16:21.862608 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 17:16:21.862620 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 17:16:21.862632 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.75
I0612 17:16:21.862644 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 17:16:21.862656 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.75
I0612 17:16:21.862668 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.75
I0612 17:16:21.862680 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 17:16:21.862692 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 17:16:21.862704 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 17:16:21.862716 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 17:16:21.862727 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 17:16:21.862740 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 17:16:21.862751 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 17:16:21.862762 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 17:16:21.862774 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 17:16:21.862787 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.863636
I0612 17:16:21.862798 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.810345
I0612 17:16:21.862812 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.35801 (* 0.3 = 0.407404 loss)
I0612 17:16:21.862826 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.526581 (* 0.3 = 0.157974 loss)
I0612 17:16:21.862840 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.18652 (* 0.0272727 = 0.0323595 loss)
I0612 17:16:21.862865 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.55135 (* 0.0272727 = 0.0423096 loss)
I0612 17:16:21.862880 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.753532 (* 0.0272727 = 0.0205509 loss)
I0612 17:16:21.862895 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.12383 (* 0.0272727 = 0.0306498 loss)
I0612 17:16:21.862910 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.373 (* 0.0272727 = 0.0374454 loss)
I0612 17:16:21.862922 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 0.568421 (* 0.0272727 = 0.0155024 loss)
I0612 17:16:21.862936 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.532083 (* 0.0272727 = 0.0145113 loss)
I0612 17:16:21.862951 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.784489 (* 0.0272727 = 0.0213951 loss)
I0612 17:16:21.862965 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.62192 (* 0.0272727 = 0.0169615 loss)
I0612 17:16:21.862979 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.672957 (* 0.0272727 = 0.0183534 loss)
I0612 17:16:21.862996 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.522413 (* 0.0272727 = 0.0142476 loss)
I0612 17:16:21.863010 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.537812 (* 0.0272727 = 0.0146676 loss)
I0612 17:16:21.863024 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.511181 (* 0.0272727 = 0.0139413 loss)
I0612 17:16:21.863039 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.600187 (* 0.0272727 = 0.0163687 loss)
I0612 17:16:21.863054 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.426945 (* 0.0272727 = 0.011644 loss)
I0612 17:16:21.863068 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.199358 (* 0.0272727 = 0.00543703 loss)
I0612 17:16:21.863081 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.154646 (* 0.0272727 = 0.00421762 loss)
I0612 17:16:21.863096 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.062519 (* 0.0272727 = 0.00170506 loss)
I0612 17:16:21.863111 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.0181465 (* 0.0272727 = 0.000494904 loss)
I0612 17:16:21.863124 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.0049615 (* 0.0272727 = 0.000135314 loss)
I0612 17:16:21.863139 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.00324439 (* 0.0272727 = 8.84833e-05 loss)
I0612 17:16:21.863153 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 6.96293e-05 (* 0.0272727 = 1.89898e-06 loss)
I0612 17:16:21.863165 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.741379
I0612 17:16:21.863178 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.625
I0612 17:16:21.863193 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 17:16:21.863206 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 17:16:21.863219 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 17:16:21.863230 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 17:16:21.863242 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 1
I0612 17:16:21.863255 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 17:16:21.863266 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.75
I0612 17:16:21.863278 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 17:16:21.863289 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.75
I0612 17:16:21.863301 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 17:16:21.863313 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 17:16:21.863325 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 17:16:21.863337 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.875
I0612 17:16:21.863348 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 17:16:21.863370 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 17:16:21.863384 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 17:16:21.863394 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 17:16:21.863406 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 17:16:21.863418 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 17:16:21.863430 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 17:16:21.863441 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 17:16:21.863453 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.897727
I0612 17:16:21.863466 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.931035
I0612 17:16:21.863479 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.883842 (* 1 = 0.883842 loss)
I0612 17:16:21.863493 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.345982 (* 1 = 0.345982 loss)
I0612 17:16:21.863507 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 1.57703 (* 0.0909091 = 0.143366 loss)
I0612 17:16:21.863522 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.81444 (* 0.0909091 = 0.07404 loss)
I0612 17:16:21.863535 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.274958 (* 0.0909091 = 0.0249962 loss)
I0612 17:16:21.863549 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.281324 (* 0.0909091 = 0.0255749 loss)
I0612 17:16:21.863564 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.294048 (* 0.0909091 = 0.0267316 loss)
I0612 17:16:21.863577 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.152596 (* 0.0909091 = 0.0138723 loss)
I0612 17:16:21.863591 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.264023 (* 0.0909091 = 0.0240021 loss)
I0612 17:16:21.863605 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.880132 (* 0.0909091 = 0.080012 loss)
I0612 17:16:21.863620 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.428362 (* 0.0909091 = 0.038942 loss)
I0612 17:16:21.863633 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.652643 (* 0.0909091 = 0.0593312 loss)
I0612 17:16:21.863647 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.315457 (* 0.0909091 = 0.0286779 loss)
I0612 17:16:21.863662 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.316002 (* 0.0909091 = 0.0287274 loss)
I0612 17:16:21.863675 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.267252 (* 0.0909091 = 0.0242956 loss)
I0612 17:16:21.863689 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.523665 (* 0.0909091 = 0.0476059 loss)
I0612 17:16:21.863703 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.287139 (* 0.0909091 = 0.0261036 loss)
I0612 17:16:21.863718 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.0844071 (* 0.0909091 = 0.00767337 loss)
I0612 17:16:21.863731 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0299339 (* 0.0909091 = 0.00272127 loss)
I0612 17:16:21.863745 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00959654 (* 0.0909091 = 0.000872412 loss)
I0612 17:16:21.863759 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00730218 (* 0.0909091 = 0.000663835 loss)
I0612 17:16:21.863773 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00128002 (* 0.0909091 = 0.000116365 loss)
I0612 17:16:21.863787 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000753862 (* 0.0909091 = 6.85329e-05 loss)
I0612 17:16:21.863802 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000305851 (* 0.0909091 = 2.78046e-05 loss)
I0612 17:16:21.863814 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.375
I0612 17:16:21.863836 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.375
I0612 17:16:21.863845 6181 solver.cpp:245] Train net output #149: total_confidence = 0.375362
I0612 17:16:21.863853 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.363254
I0612 17:16:21.863862 6181 sgd_solver.cpp:106] Iteration 11000, lr = 0.001
I0612 17:16:29.196876 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.801 > 30) by scale factor 0.943368
I0612 17:16:34.630314 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.2606 > 30) by scale factor 0.929926
I0612 17:17:01.692621 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.8225 > 30) by scale factor 0.861513
I0612 17:19:39.998337 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.1483 > 30) by scale factor 0.933175
I0612 17:21:02.586324 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.2704 > 30) by scale factor 0.804928
I0612 17:21:09.550046 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.8818 > 30) by scale factor 0.885432
I0612 17:21:50.395095 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.5531 > 30) by scale factor 0.894105
I0612 17:22:20.494644 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.5115 > 30) by scale factor 0.821659
I0612 17:22:34.424360 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.0545 > 30) by scale factor 0.998185
I0612 17:22:47.967528 6181 solver.cpp:229] Iteration 11500, loss = 4.06842
I0612 17:22:47.967602 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.60274
I0612 17:22:47.967622 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.875
I0612 17:22:47.967635 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 17:22:47.967649 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 17:22:47.967663 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 17:22:47.967675 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.375
I0612 17:22:47.967689 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.375
I0612 17:22:47.967700 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.25
I0612 17:22:47.967713 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.75
I0612 17:22:47.967726 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 17:22:47.967739 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 17:22:47.967751 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.75
I0612 17:22:47.967766 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 17:22:47.967778 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.75
I0612 17:22:47.967790 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 17:22:47.967803 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.75
I0612 17:22:47.967815 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.75
I0612 17:22:47.967828 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 17:22:47.967840 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 17:22:47.967852 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 17:22:47.967864 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 17:22:47.967876 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 17:22:47.967890 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 17:22:47.967901 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.829545
I0612 17:22:47.967914 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.739726
I0612 17:22:47.967931 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.56633 (* 0.3 = 0.4699 loss)
I0612 17:22:47.967946 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.683307 (* 0.3 = 0.204992 loss)
I0612 17:22:47.967962 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.620398 (* 0.0272727 = 0.0169199 loss)
I0612 17:22:47.967975 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 0.960007 (* 0.0272727 = 0.026182 loss)
I0612 17:22:47.967990 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.66812 (* 0.0272727 = 0.0454942 loss)
I0612 17:22:47.968004 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.36366 (* 0.0272727 = 0.0371908 loss)
I0612 17:22:47.968019 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.52426 (* 0.0272727 = 0.0415707 loss)
I0612 17:22:47.968034 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.69449 (* 0.0272727 = 0.0734861 loss)
I0612 17:22:47.968047 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.5598 (* 0.0272727 = 0.0425401 loss)
I0612 17:22:47.968062 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.0244 (* 0.0272727 = 0.0279383 loss)
I0612 17:22:47.968076 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.502671 (* 0.0272727 = 0.0137092 loss)
I0612 17:22:47.968091 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.677694 (* 0.0272727 = 0.0184826 loss)
I0612 17:22:47.968104 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.661096 (* 0.0272727 = 0.0180299 loss)
I0612 17:22:47.968159 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.605564 (* 0.0272727 = 0.0165154 loss)
I0612 17:22:47.968175 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.641413 (* 0.0272727 = 0.0174931 loss)
I0612 17:22:47.968190 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.555184 (* 0.0272727 = 0.0151414 loss)
I0612 17:22:47.968204 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.631789 (* 0.0272727 = 0.0172306 loss)
I0612 17:22:47.968219 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 1.08664 (* 0.0272727 = 0.0296357 loss)
I0612 17:22:47.968233 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.0134063 (* 0.0272727 = 0.000365628 loss)
I0612 17:22:47.968248 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.00493212 (* 0.0272727 = 0.000134512 loss)
I0612 17:22:47.968262 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00337953 (* 0.0272727 = 9.21689e-05 loss)
I0612 17:22:47.968278 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00118942 (* 0.0272727 = 3.24386e-05 loss)
I0612 17:22:47.968292 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00185667 (* 0.0272727 = 5.06364e-05 loss)
I0612 17:22:47.968307 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00162137 (* 0.0272727 = 4.42193e-05 loss)
I0612 17:22:47.968319 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.520548
I0612 17:22:47.968333 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 17:22:47.968344 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 17:22:47.968356 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.5
I0612 17:22:47.968369 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.375
I0612 17:22:47.968380 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.75
I0612 17:22:47.968392 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.25
I0612 17:22:47.968405 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 17:22:47.968416 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 17:22:47.968428 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 17:22:47.968441 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.75
I0612 17:22:47.968457 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.75
I0612 17:22:47.968482 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 17:22:47.968504 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.75
I0612 17:22:47.968526 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.75
I0612 17:22:47.968552 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 17:22:47.968576 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.75
I0612 17:22:47.968598 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 17:22:47.968621 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 17:22:47.968641 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 17:22:47.968658 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 17:22:47.968677 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 17:22:47.968694 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 17:22:47.968713 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.789773
I0612 17:22:47.968732 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.767123
I0612 17:22:47.968755 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.31306 (* 0.3 = 0.393919 loss)
I0612 17:22:47.968778 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.594598 (* 0.3 = 0.178379 loss)
I0612 17:22:47.968819 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.438332 (* 0.0272727 = 0.0119545 loss)
I0612 17:22:47.968844 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.635316 (* 0.0272727 = 0.0173268 loss)
I0612 17:22:47.968868 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.14207 (* 0.0272727 = 0.0311473 loss)
I0612 17:22:47.968896 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.28697 (* 0.0272727 = 0.0350992 loss)
I0612 17:22:47.968921 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.43841 (* 0.0272727 = 0.0392293 loss)
I0612 17:22:47.968950 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 2.41673 (* 0.0272727 = 0.0659109 loss)
I0612 17:22:47.968967 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.07933 (* 0.0272727 = 0.0294363 loss)
I0612 17:22:47.968982 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.709495 (* 0.0272727 = 0.0193499 loss)
I0612 17:22:47.968997 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.578162 (* 0.0272727 = 0.0157681 loss)
I0612 17:22:47.969010 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.642751 (* 0.0272727 = 0.0175296 loss)
I0612 17:22:47.969028 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.680436 (* 0.0272727 = 0.0185573 loss)
I0612 17:22:47.969043 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.433632 (* 0.0272727 = 0.0118263 loss)
I0612 17:22:47.969058 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.61695 (* 0.0272727 = 0.0168259 loss)
I0612 17:22:47.969071 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.60737 (* 0.0272727 = 0.0165647 loss)
I0612 17:22:47.969085 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.597599 (* 0.0272727 = 0.0162982 loss)
I0612 17:22:47.969099 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.770657 (* 0.0272727 = 0.0210179 loss)
I0612 17:22:47.969115 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.0356571 (* 0.0272727 = 0.000972466 loss)
I0612 17:22:47.969130 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.0146836 (* 0.0272727 = 0.000400463 loss)
I0612 17:22:47.969144 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.034648 (* 0.0272727 = 0.000944946 loss)
I0612 17:22:47.969158 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00700044 (* 0.0272727 = 0.000190921 loss)
I0612 17:22:47.969177 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.0140787 (* 0.0272727 = 0.000383965 loss)
I0612 17:22:47.969192 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.014112 (* 0.0272727 = 0.000384874 loss)
I0612 17:22:47.969204 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.821918
I0612 17:22:47.969216 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 1
I0612 17:22:47.969229 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 1
I0612 17:22:47.969241 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 17:22:47.969254 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 1
I0612 17:22:47.969265 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 1
I0612 17:22:47.969277 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 17:22:47.969290 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 17:22:47.969302 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 17:22:47.969315 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 17:22:47.969341 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 17:22:47.969355 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.75
I0612 17:22:47.969368 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 17:22:47.969380 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.75
I0612 17:22:47.969405 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.75
I0612 17:22:47.969420 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.75
I0612 17:22:47.969431 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 0.875
I0612 17:22:47.969444 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 17:22:47.969455 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 17:22:47.969468 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 17:22:47.969480 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 17:22:47.969492 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 17:22:47.969503 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 17:22:47.969516 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.926136
I0612 17:22:47.969528 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.931507
I0612 17:22:47.969543 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.540968 (* 1 = 0.540968 loss)
I0612 17:22:47.969558 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.238146 (* 1 = 0.238146 loss)
I0612 17:22:47.969571 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.206038 (* 0.0909091 = 0.0187308 loss)
I0612 17:22:47.969586 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.071788 (* 0.0909091 = 0.00652618 loss)
I0612 17:22:47.969606 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.335807 (* 0.0909091 = 0.0305279 loss)
I0612 17:22:47.969621 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.149722 (* 0.0909091 = 0.0136111 loss)
I0612 17:22:47.969636 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.157751 (* 0.0909091 = 0.014341 loss)
I0612 17:22:47.969647 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.704312 (* 0.0909091 = 0.0640284 loss)
I0612 17:22:47.969657 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.391536 (* 0.0909091 = 0.0355942 loss)
I0612 17:22:47.969676 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.210388 (* 0.0909091 = 0.0191262 loss)
I0612 17:22:47.969689 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.406132 (* 0.0909091 = 0.0369211 loss)
I0612 17:22:47.969703 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.32294 (* 0.0909091 = 0.0293582 loss)
I0612 17:22:47.969718 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.42525 (* 0.0909091 = 0.0386591 loss)
I0612 17:22:47.969732 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.391227 (* 0.0909091 = 0.0355661 loss)
I0612 17:22:47.969746 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.414935 (* 0.0909091 = 0.0377214 loss)
I0612 17:22:47.969760 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.45788 (* 0.0909091 = 0.0416254 loss)
I0612 17:22:47.969774 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.449413 (* 0.0909091 = 0.0408557 loss)
I0612 17:22:47.969789 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.25441 (* 0.0909091 = 0.0231282 loss)
I0612 17:22:47.969802 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0185443 (* 0.0909091 = 0.00168585 loss)
I0612 17:22:47.969816 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00146803 (* 0.0909091 = 0.000133458 loss)
I0612 17:22:47.969831 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00123341 (* 0.0909091 = 0.000112128 loss)
I0612 17:22:47.969846 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000529826 (* 0.0909091 = 4.8166e-05 loss)
I0612 17:22:47.969859 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000343033 (* 0.0909091 = 3.11849e-05 loss)
I0612 17:22:47.969873 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000128159 (* 0.0909091 = 1.16508e-05 loss)
I0612 17:22:47.969895 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.625
I0612 17:22:47.969909 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.625
I0612 17:22:47.969929 6181 solver.cpp:245] Train net output #149: total_confidence = 0.406638
I0612 17:22:47.969950 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.34492
I0612 17:22:47.969965 6181 sgd_solver.cpp:106] Iteration 11500, lr = 0.001
I0612 17:23:33.085671 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.8249 > 30) by scale factor 0.814665
I0612 17:24:29.396625 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.4334 > 30) by scale factor 0.985758
I0612 17:25:59.686774 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 48.6111 > 30) by scale factor 0.617143
I0612 17:26:24.427610 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.6737 > 30) by scale factor 0.775721
I0612 17:28:44.152704 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.3516 > 30) by scale factor 0.988415
I0612 17:29:13.903719 6181 solver.cpp:229] Iteration 12000, loss = 3.9224
I0612 17:29:13.903774 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.311475
I0612 17:29:13.903792 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.375
I0612 17:29:13.903806 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.5
I0612 17:29:13.903820 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.25
I0612 17:29:13.903832 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 17:29:13.903846 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.125
I0612 17:29:13.903858 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.375
I0612 17:29:13.903872 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.5
I0612 17:29:13.903883 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.5
I0612 17:29:13.903897 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.625
I0612 17:29:13.903909 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 17:29:13.903923 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 17:29:13.903935 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 17:29:13.903947 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 17:29:13.903960 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 17:29:13.903972 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 17:29:13.903985 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 17:29:13.903996 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 17:29:13.904008 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 17:29:13.904019 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 17:29:13.904031 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 17:29:13.904047 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 17:29:13.904059 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 17:29:13.904072 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.755682
I0612 17:29:13.904084 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.47541
I0612 17:29:13.904100 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.66896 (* 0.3 = 0.800688 loss)
I0612 17:29:13.904115 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.983533 (* 0.3 = 0.29506 loss)
I0612 17:29:13.904130 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 2.32248 (* 0.0272727 = 0.0633403 loss)
I0612 17:29:13.904145 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 2.52692 (* 0.0272727 = 0.068916 loss)
I0612 17:29:13.904158 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.76162 (* 0.0272727 = 0.0753168 loss)
I0612 17:29:13.904172 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.39877 (* 0.0272727 = 0.0654211 loss)
I0612 17:29:13.904187 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 3.21232 (* 0.0272727 = 0.0876086 loss)
I0612 17:29:13.904201 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.41514 (* 0.0272727 = 0.0658674 loss)
I0612 17:29:13.904216 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.97911 (* 0.0272727 = 0.0539757 loss)
I0612 17:29:13.904229 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 2.55281 (* 0.0272727 = 0.0696222 loss)
I0612 17:29:13.904248 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 2.43323 (* 0.0272727 = 0.0663608 loss)
I0612 17:29:13.904263 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 1.09697 (* 0.0272727 = 0.0299174 loss)
I0612 17:29:13.904278 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.41042 (* 0.0272727 = 0.0111933 loss)
I0612 17:29:13.904292 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.142451 (* 0.0272727 = 0.00388503 loss)
I0612 17:29:13.904336 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.111215 (* 0.0272727 = 0.00303313 loss)
I0612 17:29:13.904353 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.106944 (* 0.0272727 = 0.00291665 loss)
I0612 17:29:13.904367 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.0883253 (* 0.0272727 = 0.00240887 loss)
I0612 17:29:13.904382 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.0577036 (* 0.0272727 = 0.00157373 loss)
I0612 17:29:13.904397 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.0142327 (* 0.0272727 = 0.000388164 loss)
I0612 17:29:13.904410 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.00794096 (* 0.0272727 = 0.000216572 loss)
I0612 17:29:13.904425 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00493708 (* 0.0272727 = 0.000134648 loss)
I0612 17:29:13.904439 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00308918 (* 0.0272727 = 8.42504e-05 loss)
I0612 17:29:13.904454 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00378624 (* 0.0272727 = 0.000103261 loss)
I0612 17:29:13.904469 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00239186 (* 0.0272727 = 6.52327e-05 loss)
I0612 17:29:13.904481 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.42623
I0612 17:29:13.904494 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.5
I0612 17:29:13.904506 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.625
I0612 17:29:13.904518 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.5
I0612 17:29:13.904531 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.375
I0612 17:29:13.904542 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 17:29:13.904554 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.625
I0612 17:29:13.904567 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 17:29:13.904578 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.5
I0612 17:29:13.904590 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.625
I0612 17:29:13.904603 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.75
I0612 17:29:13.904614 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 17:29:13.904626 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 17:29:13.904639 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 17:29:13.904649 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 17:29:13.904661 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 17:29:13.904673 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 17:29:13.904685 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 17:29:13.904696 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 17:29:13.904708 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 17:29:13.904719 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 17:29:13.904731 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 17:29:13.904743 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 17:29:13.904754 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.795455
I0612 17:29:13.904767 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.606557
I0612 17:29:13.904781 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 2.1475 (* 0.3 = 0.644251 loss)
I0612 17:29:13.904795 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.778134 (* 0.3 = 0.23344 loss)
I0612 17:29:13.904809 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.64915 (* 0.0272727 = 0.0449767 loss)
I0612 17:29:13.904824 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.73781 (* 0.0272727 = 0.0473948 loss)
I0612 17:29:13.904849 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.5879 (* 0.0272727 = 0.0433064 loss)
I0612 17:29:13.904863 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 2.94563 (* 0.0272727 = 0.0803353 loss)
I0612 17:29:13.904878 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 2.40706 (* 0.0272727 = 0.065647 loss)
I0612 17:29:13.904892 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.85553 (* 0.0272727 = 0.0506054 loss)
I0612 17:29:13.904906 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.81524 (* 0.0272727 = 0.0495065 loss)
I0612 17:29:13.904920 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 2.24135 (* 0.0272727 = 0.0611276 loss)
I0612 17:29:13.904934 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 2.1497 (* 0.0272727 = 0.0586283 loss)
I0612 17:29:13.904948 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.963779 (* 0.0272727 = 0.0262849 loss)
I0612 17:29:13.904963 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.645429 (* 0.0272727 = 0.0176026 loss)
I0612 17:29:13.904976 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.0444735 (* 0.0272727 = 0.00121291 loss)
I0612 17:29:13.904990 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.0294872 (* 0.0272727 = 0.000804198 loss)
I0612 17:29:13.905004 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.0198134 (* 0.0272727 = 0.000540364 loss)
I0612 17:29:13.905019 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.0132468 (* 0.0272727 = 0.000361277 loss)
I0612 17:29:13.905033 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.0114125 (* 0.0272727 = 0.000311249 loss)
I0612 17:29:13.905047 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.00804262 (* 0.0272727 = 0.000219344 loss)
I0612 17:29:13.905061 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.00529293 (* 0.0272727 = 0.000144353 loss)
I0612 17:29:13.905076 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.0049713 (* 0.0272727 = 0.000135581 loss)
I0612 17:29:13.905093 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00325167 (* 0.0272727 = 8.86818e-05 loss)
I0612 17:29:13.905108 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.00467004 (* 0.0272727 = 0.000127365 loss)
I0612 17:29:13.905122 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.00320211 (* 0.0272727 = 8.73303e-05 loss)
I0612 17:29:13.905135 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.655738
I0612 17:29:13.905148 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.5
I0612 17:29:13.905160 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.75
I0612 17:29:13.905172 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.625
I0612 17:29:13.905184 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.625
I0612 17:29:13.905195 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.5
I0612 17:29:13.905207 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 17:29:13.905220 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.625
I0612 17:29:13.905231 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.5
I0612 17:29:13.905243 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.75
I0612 17:29:13.905256 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.75
I0612 17:29:13.905270 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 17:29:13.905283 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 17:29:13.905309 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 17:29:13.905323 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 17:29:13.905335 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 17:29:13.905359 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 17:29:13.905371 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 17:29:13.905383 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 17:29:13.905395 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 17:29:13.905406 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 17:29:13.905417 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 17:29:13.905429 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 17:29:13.905441 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.869318
I0612 17:29:13.905453 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.819672
I0612 17:29:13.905467 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.25985 (* 1 = 1.25985 loss)
I0612 17:29:13.905481 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.47779 (* 1 = 0.47779 loss)
I0612 17:29:13.905495 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 1.57346 (* 0.0909091 = 0.143042 loss)
I0612 17:29:13.905509 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 1.08227 (* 0.0909091 = 0.0983879 loss)
I0612 17:29:13.905524 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 1.10286 (* 0.0909091 = 0.10026 loss)
I0612 17:29:13.905537 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.39422 (* 0.0909091 = 0.126747 loss)
I0612 17:29:13.905551 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.91078 (* 0.0909091 = 0.173707 loss)
I0612 17:29:13.905565 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.898206 (* 0.0909091 = 0.0816551 loss)
I0612 17:29:13.905580 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.64611 (* 0.0909091 = 0.149647 loss)
I0612 17:29:13.905593 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 1.56695 (* 0.0909091 = 0.14245 loss)
I0612 17:29:13.905607 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 1.87162 (* 0.0909091 = 0.170148 loss)
I0612 17:29:13.905622 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.829423 (* 0.0909091 = 0.0754021 loss)
I0612 17:29:13.905635 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.384021 (* 0.0909091 = 0.034911 loss)
I0612 17:29:13.905649 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.0188008 (* 0.0909091 = 0.00170916 loss)
I0612 17:29:13.905663 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.00580774 (* 0.0909091 = 0.000527976 loss)
I0612 17:29:13.905678 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.00303233 (* 0.0909091 = 0.000275666 loss)
I0612 17:29:13.905691 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.00165877 (* 0.0909091 = 0.000150797 loss)
I0612 17:29:13.905704 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.00146472 (* 0.0909091 = 0.000133156 loss)
I0612 17:29:13.905719 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.000874251 (* 0.0909091 = 7.94774e-05 loss)
I0612 17:29:13.905733 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.000671535 (* 0.0909091 = 6.10486e-05 loss)
I0612 17:29:13.905747 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.000515115 (* 0.0909091 = 4.68286e-05 loss)
I0612 17:29:13.905761 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000247533 (* 0.0909091 = 2.2503e-05 loss)
I0612 17:29:13.905776 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000258061 (* 0.0909091 = 2.346e-05 loss)
I0612 17:29:13.905791 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000157748 (* 0.0909091 = 1.43407e-05 loss)
I0612 17:29:13.905802 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.375
I0612 17:29:13.905814 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.25
I0612 17:29:13.905836 6181 solver.cpp:245] Train net output #149: total_confidence = 0.178464
I0612 17:29:13.905849 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.162658
I0612 17:29:13.905863 6181 sgd_solver.cpp:106] Iteration 12000, lr = 0.001
I0612 17:29:41.324246 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.1828 > 30) by scale factor 0.932174
I0612 17:30:36.201341 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.2426 > 30) by scale factor 0.851242
I0612 17:30:36.971071 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.619 > 30) by scale factor 0.979783
I0612 17:33:02.552765 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.2078 > 30) by scale factor 0.993122
I0612 17:33:09.492539 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.697 > 30) by scale factor 0.977295
I0612 17:33:42.682713 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 42.3118 > 30) by scale factor 0.709023
I0612 17:34:56.808907 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.9696 > 30) by scale factor 0.769831
I0612 17:35:04.546509 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.9526 > 30) by scale factor 0.938892
I0612 17:35:40.500792 6181 solver.cpp:229] Iteration 12500, loss = 3.87753
I0612 17:35:40.500915 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.378788
I0612 17:35:40.500936 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.875
I0612 17:35:40.500951 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.5
I0612 17:35:40.500963 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.375
I0612 17:35:40.500977 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.375
I0612 17:35:40.500989 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 17:35:40.501003 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.5
I0612 17:35:40.501015 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.75
I0612 17:35:40.501029 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.75
I0612 17:35:40.501041 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.75
I0612 17:35:40.501055 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 17:35:40.501067 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.625
I0612 17:35:40.501080 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.625
I0612 17:35:40.501092 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.625
I0612 17:35:40.501106 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.625
I0612 17:35:40.501124 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.625
I0612 17:35:40.501138 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 17:35:40.501150 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 0.875
I0612 17:35:40.501163 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 17:35:40.501175 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 17:35:40.501186 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 17:35:40.501199 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 17:35:40.501210 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 17:35:40.501225 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.727273
I0612 17:35:40.501238 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.575758
I0612 17:35:40.501255 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.74191 (* 0.3 = 0.822572 loss)
I0612 17:35:40.501269 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 1.17626 (* 0.3 = 0.352877 loss)
I0612 17:35:40.501283 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.14425 (* 0.0272727 = 0.0312068 loss)
I0612 17:35:40.501298 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 2.25715 (* 0.0272727 = 0.0615586 loss)
I0612 17:35:40.501312 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.99224 (* 0.0272727 = 0.0816067 loss)
I0612 17:35:40.501344 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.80114 (* 0.0272727 = 0.0763947 loss)
I0612 17:35:40.501360 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.96009 (* 0.0272727 = 0.0534569 loss)
I0612 17:35:40.501374 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.84824 (* 0.0272727 = 0.0504064 loss)
I0612 17:35:40.501389 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.47191 (* 0.0272727 = 0.040143 loss)
I0612 17:35:40.501404 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.51215 (* 0.0272727 = 0.0412404 loss)
I0612 17:35:40.501417 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.837715 (* 0.0272727 = 0.0228468 loss)
I0612 17:35:40.501431 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 1.40876 (* 0.0272727 = 0.0384207 loss)
I0612 17:35:40.501446 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 1.46295 (* 0.0272727 = 0.0398987 loss)
I0612 17:35:40.501459 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 1.6281 (* 0.0272727 = 0.0444028 loss)
I0612 17:35:40.501492 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 1.12164 (* 0.0272727 = 0.0305901 loss)
I0612 17:35:40.501507 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 1.20842 (* 0.0272727 = 0.032957 loss)
I0612 17:35:40.501520 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 1.34444 (* 0.0272727 = 0.0366666 loss)
I0612 17:35:40.501534 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.578714 (* 0.0272727 = 0.0157831 loss)
I0612 17:35:40.501549 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.462293 (* 0.0272727 = 0.012608 loss)
I0612 17:35:40.501564 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.0440854 (* 0.0272727 = 0.00120233 loss)
I0612 17:35:40.501579 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00979113 (* 0.0272727 = 0.000267031 loss)
I0612 17:35:40.501592 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00248959 (* 0.0272727 = 6.7898e-05 loss)
I0612 17:35:40.501606 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00146815 (* 0.0272727 = 4.00406e-05 loss)
I0612 17:35:40.501621 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000324335 (* 0.0272727 = 8.84551e-06 loss)
I0612 17:35:40.501633 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.424242
I0612 17:35:40.501646 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 17:35:40.501657 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.5
I0612 17:35:40.501669 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.375
I0612 17:35:40.501682 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.375
I0612 17:35:40.501693 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.375
I0612 17:35:40.501704 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.625
I0612 17:35:40.501716 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 17:35:40.501729 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 17:35:40.501740 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.625
I0612 17:35:40.501752 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.625
I0612 17:35:40.501763 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.625
I0612 17:35:40.501775 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.75
I0612 17:35:40.501787 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.75
I0612 17:35:40.501799 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.625
I0612 17:35:40.501811 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.625
I0612 17:35:40.501823 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 17:35:40.501834 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 0.875
I0612 17:35:40.501847 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 17:35:40.501858 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 17:35:40.501869 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 17:35:40.501881 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 17:35:40.501893 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 17:35:40.501904 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.738636
I0612 17:35:40.501917 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.666667
I0612 17:35:40.501935 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 2.16688 (* 0.3 = 0.650065 loss)
I0612 17:35:40.501950 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.97976 (* 0.3 = 0.293928 loss)
I0612 17:35:40.501965 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.578919 (* 0.0272727 = 0.0157887 loss)
I0612 17:35:40.501978 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.57119 (* 0.0272727 = 0.0428507 loss)
I0612 17:35:40.502004 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.5704 (* 0.0272727 = 0.042829 loss)
I0612 17:35:40.502019 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 2.28848 (* 0.0272727 = 0.0624132 loss)
I0612 17:35:40.502033 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.88478 (* 0.0272727 = 0.0514032 loss)
I0612 17:35:40.502048 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.30828 (* 0.0272727 = 0.0356804 loss)
I0612 17:35:40.502061 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.32959 (* 0.0272727 = 0.0362616 loss)
I0612 17:35:40.502075 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 1.47551 (* 0.0272727 = 0.0402413 loss)
I0612 17:35:40.502089 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.924858 (* 0.0272727 = 0.0252234 loss)
I0612 17:35:40.502104 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 1.33959 (* 0.0272727 = 0.0365344 loss)
I0612 17:35:40.502116 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 1.37575 (* 0.0272727 = 0.0375206 loss)
I0612 17:35:40.502130 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 1.10617 (* 0.0272727 = 0.0301683 loss)
I0612 17:35:40.502145 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.858071 (* 0.0272727 = 0.0234019 loss)
I0612 17:35:40.502158 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 1.02371 (* 0.0272727 = 0.0279195 loss)
I0612 17:35:40.502172 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 1.30198 (* 0.0272727 = 0.0355086 loss)
I0612 17:35:40.502187 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.546418 (* 0.0272727 = 0.0149023 loss)
I0612 17:35:40.502200 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.377567 (* 0.0272727 = 0.0102973 loss)
I0612 17:35:40.502214 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.0612483 (* 0.0272727 = 0.00167041 loss)
I0612 17:35:40.502229 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.0220969 (* 0.0272727 = 0.000602642 loss)
I0612 17:35:40.502240 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.0148251 (* 0.0272727 = 0.000404321 loss)
I0612 17:35:40.502250 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.00251261 (* 0.0272727 = 6.85258e-05 loss)
I0612 17:35:40.502265 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.0010266 (* 0.0272727 = 2.79982e-05 loss)
I0612 17:35:40.502285 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.560606
I0612 17:35:40.502300 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 17:35:40.502312 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.75
I0612 17:35:40.502327 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 17:35:40.502341 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 17:35:40.502352 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 17:35:40.502364 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 17:35:40.502377 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 17:35:40.502388 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.625
I0612 17:35:40.502399 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.625
I0612 17:35:40.502410 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.75
I0612 17:35:40.502423 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.75
I0612 17:35:40.502434 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.625
I0612 17:35:40.502445 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.625
I0612 17:35:40.502457 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.625
I0612 17:35:40.502468 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.625
I0612 17:35:40.502490 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 0.875
I0612 17:35:40.502503 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 17:35:40.502516 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 17:35:40.502526 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 17:35:40.502538 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 17:35:40.502549 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 17:35:40.502560 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 17:35:40.502573 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.784091
I0612 17:35:40.502584 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.681818
I0612 17:35:40.502598 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 2.20391 (* 1 = 2.20391 loss)
I0612 17:35:40.502611 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.966024 (* 1 = 0.966024 loss)
I0612 17:35:40.502625 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 1.0279 (* 0.0909091 = 0.0934457 loss)
I0612 17:35:40.502640 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 1.38891 (* 0.0909091 = 0.126264 loss)
I0612 17:35:40.502653 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 1.6751 (* 0.0909091 = 0.152282 loss)
I0612 17:35:40.502667 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.01191 (* 0.0909091 = 0.0919919 loss)
I0612 17:35:40.502681 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.858842 (* 0.0909091 = 0.0780765 loss)
I0612 17:35:40.502694 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 1.50549 (* 0.0909091 = 0.136863 loss)
I0612 17:35:40.502708 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.29772 (* 0.0909091 = 0.117974 loss)
I0612 17:35:40.502722 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 1.78781 (* 0.0909091 = 0.162528 loss)
I0612 17:35:40.502735 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 1.15395 (* 0.0909091 = 0.104905 loss)
I0612 17:35:40.502749 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 1.21395 (* 0.0909091 = 0.110359 loss)
I0612 17:35:40.502763 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 1.6322 (* 0.0909091 = 0.148382 loss)
I0612 17:35:40.502776 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 1.47135 (* 0.0909091 = 0.133759 loss)
I0612 17:35:40.502790 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 1.18399 (* 0.0909091 = 0.107635 loss)
I0612 17:35:40.502804 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 1.39051 (* 0.0909091 = 0.12641 loss)
I0612 17:35:40.502817 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 1.22527 (* 0.0909091 = 0.111388 loss)
I0612 17:35:40.502831 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.51817 (* 0.0909091 = 0.0471064 loss)
I0612 17:35:40.502846 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.215059 (* 0.0909091 = 0.0195508 loss)
I0612 17:35:40.502859 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.147532 (* 0.0909091 = 0.013412 loss)
I0612 17:35:40.502873 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.0731799 (* 0.0909091 = 0.00665271 loss)
I0612 17:35:40.502887 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.0222894 (* 0.0909091 = 0.00202631 loss)
I0612 17:35:40.502902 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00543698 (* 0.0909091 = 0.000494271 loss)
I0612 17:35:40.502915 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.00267477 (* 0.0909091 = 0.000243161 loss)
I0612 17:35:40.502928 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.375
I0612 17:35:40.502939 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 17:35:40.502951 6181 solver.cpp:245] Train net output #149: total_confidence = 0.343786
I0612 17:35:40.502972 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.336795
I0612 17:35:40.502990 6181 sgd_solver.cpp:106] Iteration 12500, lr = 0.001
I0612 17:36:18.637253 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.6983 > 30) by scale factor 0.977254
I0612 17:37:15.786763 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.1684 > 30) by scale factor 0.994419
I0612 17:39:02.252974 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.7237 > 30) by scale factor 0.839778
I0612 17:41:11.963557 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.0011 > 30) by scale factor 0.857115
I0612 17:41:12.736387 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.9759 > 30) by scale factor 0.882979
I0612 17:42:06.473217 6181 solver.cpp:229] Iteration 13000, loss = 3.91575
I0612 17:42:06.473358 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.426471
I0612 17:42:06.473381 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 17:42:06.473394 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.75
I0612 17:42:06.473407 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.25
I0612 17:42:06.473420 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 17:42:06.473433 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.625
I0612 17:42:06.473446 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.75
I0612 17:42:06.473459 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 17:42:06.473474 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.75
I0612 17:42:06.473485 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.625
I0612 17:42:06.473498 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 17:42:06.473511 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 17:42:06.473525 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 17:42:06.473536 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 17:42:06.473549 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.75
I0612 17:42:06.473562 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 17:42:06.473574 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 17:42:06.473587 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 0.875
I0612 17:42:06.473598 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 17:42:06.473613 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 17:42:06.473624 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 17:42:06.473635 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 17:42:06.473647 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 17:42:06.473659 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.772727
I0612 17:42:06.473671 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.661765
I0612 17:42:06.473688 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.86659 (* 0.3 = 0.559977 loss)
I0612 17:42:06.473703 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.774684 (* 0.3 = 0.232405 loss)
I0612 17:42:06.473717 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.09124 (* 0.0272727 = 0.029761 loss)
I0612 17:42:06.473731 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 0.671129 (* 0.0272727 = 0.0183035 loss)
I0612 17:42:06.473747 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.6059 (* 0.0272727 = 0.07107 loss)
I0612 17:42:06.473760 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.99899 (* 0.0272727 = 0.054518 loss)
I0612 17:42:06.473774 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.56525 (* 0.0272727 = 0.0426887 loss)
I0612 17:42:06.473788 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.09615 (* 0.0272727 = 0.0298951 loss)
I0612 17:42:06.473803 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.80767 (* 0.0272727 = 0.0493001 loss)
I0612 17:42:06.473816 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.899529 (* 0.0272727 = 0.0245326 loss)
I0612 17:42:06.473830 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 1.29289 (* 0.0272727 = 0.0352606 loss)
I0612 17:42:06.473845 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.870118 (* 0.0272727 = 0.0237305 loss)
I0612 17:42:06.473870 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.818199 (* 0.0272727 = 0.0223145 loss)
I0612 17:42:06.473887 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.614557 (* 0.0272727 = 0.0167607 loss)
I0612 17:42:06.473917 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.634761 (* 0.0272727 = 0.0173117 loss)
I0612 17:42:06.473932 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.616419 (* 0.0272727 = 0.0168114 loss)
I0612 17:42:06.473947 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.657498 (* 0.0272727 = 0.0179318 loss)
I0612 17:42:06.473960 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.509542 (* 0.0272727 = 0.0138966 loss)
I0612 17:42:06.473974 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.374325 (* 0.0272727 = 0.0102089 loss)
I0612 17:42:06.473989 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.0469959 (* 0.0272727 = 0.00128171 loss)
I0612 17:42:06.474004 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.018357 (* 0.0272727 = 0.000500644 loss)
I0612 17:42:06.474019 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00534887 (* 0.0272727 = 0.000145878 loss)
I0612 17:42:06.474032 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00183354 (* 0.0272727 = 5.00058e-05 loss)
I0612 17:42:06.474046 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00046263 (* 0.0272727 = 1.26172e-05 loss)
I0612 17:42:06.474058 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.573529
I0612 17:42:06.474071 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 17:42:06.474083 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.875
I0612 17:42:06.474095 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.875
I0612 17:42:06.474108 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.375
I0612 17:42:06.474117 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.75
I0612 17:42:06.474123 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.75
I0612 17:42:06.474136 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.5
I0612 17:42:06.474148 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 17:42:06.474160 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.625
I0612 17:42:06.474171 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.75
I0612 17:42:06.474184 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.75
I0612 17:42:06.474195 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.75
I0612 17:42:06.474207 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.75
I0612 17:42:06.474220 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 17:42:06.474234 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 17:42:06.474246 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 17:42:06.474258 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 0.875
I0612 17:42:06.474270 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 17:42:06.474282 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 17:42:06.474293 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 17:42:06.474304 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 17:42:06.474316 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 17:42:06.474328 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.829545
I0612 17:42:06.474339 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.764706
I0612 17:42:06.474354 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.35072 (* 0.3 = 0.405215 loss)
I0612 17:42:06.474369 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.557217 (* 0.3 = 0.167165 loss)
I0612 17:42:06.474385 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.630361 (* 0.0272727 = 0.0171917 loss)
I0612 17:42:06.474400 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.46695 (* 0.0272727 = 0.012735 loss)
I0612 17:42:06.474426 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.645223 (* 0.0272727 = 0.017597 loss)
I0612 17:42:06.474441 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.88531 (* 0.0272727 = 0.0514177 loss)
I0612 17:42:06.474454 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.51038 (* 0.0272727 = 0.0411921 loss)
I0612 17:42:06.474468 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 0.995341 (* 0.0272727 = 0.0271457 loss)
I0612 17:42:06.474483 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.18261 (* 0.0272727 = 0.032253 loss)
I0612 17:42:06.474496 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.923611 (* 0.0272727 = 0.0251894 loss)
I0612 17:42:06.474510 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.968884 (* 0.0272727 = 0.0264241 loss)
I0612 17:42:06.474524 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.788625 (* 0.0272727 = 0.021508 loss)
I0612 17:42:06.474539 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.971693 (* 0.0272727 = 0.0265007 loss)
I0612 17:42:06.474552 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.652314 (* 0.0272727 = 0.0177904 loss)
I0612 17:42:06.474566 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.777657 (* 0.0272727 = 0.0212088 loss)
I0612 17:42:06.474581 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.526502 (* 0.0272727 = 0.0143591 loss)
I0612 17:42:06.474593 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.720318 (* 0.0272727 = 0.019645 loss)
I0612 17:42:06.474607 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.457936 (* 0.0272727 = 0.0124892 loss)
I0612 17:42:06.474622 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.397211 (* 0.0272727 = 0.010833 loss)
I0612 17:42:06.474635 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.0155635 (* 0.0272727 = 0.000424458 loss)
I0612 17:42:06.474649 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00421966 (* 0.0272727 = 0.000115082 loss)
I0612 17:42:06.474663 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00350699 (* 0.0272727 = 9.56451e-05 loss)
I0612 17:42:06.474678 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000809694 (* 0.0272727 = 2.20826e-05 loss)
I0612 17:42:06.474692 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000548642 (* 0.0272727 = 1.4963e-05 loss)
I0612 17:42:06.474705 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.808824
I0612 17:42:06.474717 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 1
I0612 17:42:06.474730 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 1
I0612 17:42:06.474741 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 1
I0612 17:42:06.474752 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 17:42:06.474764 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 1
I0612 17:42:06.474776 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 1
I0612 17:42:06.474787 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 17:42:06.474799 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.625
I0612 17:42:06.474812 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.75
I0612 17:42:06.474823 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.625
I0612 17:42:06.474835 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.75
I0612 17:42:06.474848 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 17:42:06.474858 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.75
I0612 17:42:06.474871 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.75
I0612 17:42:06.474884 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 17:42:06.474895 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 0.875
I0612 17:42:06.474916 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 0.875
I0612 17:42:06.474930 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 17:42:06.474941 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 17:42:06.474953 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 17:42:06.474964 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 17:42:06.474977 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 17:42:06.474988 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.926136
I0612 17:42:06.475000 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.882353
I0612 17:42:06.475014 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.701901 (* 1 = 0.701901 loss)
I0612 17:42:06.475029 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.279554 (* 1 = 0.279554 loss)
I0612 17:42:06.475044 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.0390429 (* 0.0909091 = 0.00354936 loss)
I0612 17:42:06.475057 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.0393185 (* 0.0909091 = 0.00357441 loss)
I0612 17:42:06.475072 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.12462 (* 0.0909091 = 0.0113291 loss)
I0612 17:42:06.475086 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.378778 (* 0.0909091 = 0.0344344 loss)
I0612 17:42:06.475101 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.0865602 (* 0.0909091 = 0.00786911 loss)
I0612 17:42:06.475116 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.0980884 (* 0.0909091 = 0.00891712 loss)
I0612 17:42:06.475129 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.278424 (* 0.0909091 = 0.0253113 loss)
I0612 17:42:06.475143 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.640251 (* 0.0909091 = 0.0582046 loss)
I0612 17:42:06.475157 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.810948 (* 0.0909091 = 0.0737226 loss)
I0612 17:42:06.475172 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.662696 (* 0.0909091 = 0.0602451 loss)
I0612 17:42:06.475185 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.679294 (* 0.0909091 = 0.061754 loss)
I0612 17:42:06.475199 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.39654 (* 0.0909091 = 0.0360491 loss)
I0612 17:42:06.475214 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.492618 (* 0.0909091 = 0.0447834 loss)
I0612 17:42:06.475227 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.58073 (* 0.0909091 = 0.0527937 loss)
I0612 17:42:06.475241 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.663346 (* 0.0909091 = 0.0603042 loss)
I0612 17:42:06.475255 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.607578 (* 0.0909091 = 0.0552343 loss)
I0612 17:42:06.475270 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.569853 (* 0.0909091 = 0.0518048 loss)
I0612 17:42:06.475286 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00952873 (* 0.0909091 = 0.000866248 loss)
I0612 17:42:06.475301 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00225507 (* 0.0909091 = 0.000205006 loss)
I0612 17:42:06.475316 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00161103 (* 0.0909091 = 0.000146457 loss)
I0612 17:42:06.475329 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000146213 (* 0.0909091 = 1.32921e-05 loss)
I0612 17:42:06.475344 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 9.70567e-05 (* 0.0909091 = 8.82334e-06 loss)
I0612 17:42:06.475356 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.625
I0612 17:42:06.475368 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 17:42:06.475390 6181 solver.cpp:245] Train net output #149: total_confidence = 0.469962
I0612 17:42:06.475404 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.442964
I0612 17:42:06.475417 6181 sgd_solver.cpp:106] Iteration 13000, lr = 0.001
I0612 17:42:13.779525 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.8988 > 30) by scale factor 0.940474
I0612 17:42:46.189854 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 66.2044 > 30) by scale factor 0.453142
I0612 17:43:39.414546 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.5874 > 30) by scale factor 0.819955
I0612 17:45:32.811995 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.7243 > 30) by scale factor 0.816898
I0612 17:46:47.699672 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.1345 > 30) by scale factor 0.905401
I0612 17:46:48.473903 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.424 > 30) by scale factor 0.897558
I0612 17:47:17.073868 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.5124 > 30) by scale factor 0.952007
I0612 17:47:59.607650 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.4613 > 30) by scale factor 0.953551
I0612 17:48:32.444504 6181 solver.cpp:229] Iteration 13500, loss = 3.97279
I0612 17:48:32.444622 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.46875
I0612 17:48:32.444643 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 17:48:32.444656 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.75
I0612 17:48:32.444669 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.625
I0612 17:48:32.444682 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.375
I0612 17:48:32.444694 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.375
I0612 17:48:32.444708 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.875
I0612 17:48:32.444721 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 17:48:32.444735 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 17:48:32.444746 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.625
I0612 17:48:32.444759 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 17:48:32.444772 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 17:48:32.444785 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 17:48:32.444798 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 17:48:32.444810 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 17:48:32.444823 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 17:48:32.444835 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 17:48:32.444847 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 17:48:32.444860 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 17:48:32.444872 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 17:48:32.444883 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 17:48:32.444895 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 17:48:32.444907 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 17:48:32.444919 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.789773
I0612 17:48:32.444931 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.703125
I0612 17:48:32.444948 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.85623 (* 0.3 = 0.556871 loss)
I0612 17:48:32.444963 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.76179 (* 0.3 = 0.228537 loss)
I0612 17:48:32.444978 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.6312 (* 0.0272727 = 0.0444872 loss)
I0612 17:48:32.444993 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.45023 (* 0.0272727 = 0.0395517 loss)
I0612 17:48:32.445006 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.98934 (* 0.0272727 = 0.0542546 loss)
I0612 17:48:32.445021 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.52834 (* 0.0272727 = 0.0689547 loss)
I0612 17:48:32.445035 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.62781 (* 0.0272727 = 0.0443949 loss)
I0612 17:48:32.445050 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.52892 (* 0.0272727 = 0.0416977 loss)
I0612 17:48:32.445065 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.44267 (* 0.0272727 = 0.0393455 loss)
I0612 17:48:32.445078 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.643038 (* 0.0272727 = 0.0175374 loss)
I0612 17:48:32.445093 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 1.11905 (* 0.0272727 = 0.0305194 loss)
I0612 17:48:32.445107 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 1.14899 (* 0.0272727 = 0.0313362 loss)
I0612 17:48:32.445122 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.73727 (* 0.0272727 = 0.0201074 loss)
I0612 17:48:32.445135 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.633815 (* 0.0272727 = 0.0172859 loss)
I0612 17:48:32.445168 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.41724 (* 0.0272727 = 0.0113793 loss)
I0612 17:48:32.445183 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.418347 (* 0.0272727 = 0.0114095 loss)
I0612 17:48:32.445199 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.273322 (* 0.0272727 = 0.00745424 loss)
I0612 17:48:32.445212 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.0545659 (* 0.0272727 = 0.00148816 loss)
I0612 17:48:32.445230 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00387972 (* 0.0272727 = 0.000105811 loss)
I0612 17:48:32.445245 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.00214091 (* 0.0272727 = 5.83885e-05 loss)
I0612 17:48:32.445260 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00100484 (* 0.0272727 = 2.74047e-05 loss)
I0612 17:48:32.445274 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00142438 (* 0.0272727 = 3.88467e-05 loss)
I0612 17:48:32.445288 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00132795 (* 0.0272727 = 3.62169e-05 loss)
I0612 17:48:32.445302 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000636402 (* 0.0272727 = 1.73564e-05 loss)
I0612 17:48:32.445315 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.59375
I0612 17:48:32.445341 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 17:48:32.445355 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 17:48:32.445368 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.625
I0612 17:48:32.445379 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.5
I0612 17:48:32.445391 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 17:48:32.445405 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.75
I0612 17:48:32.445416 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 17:48:32.445427 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 1
I0612 17:48:32.445439 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 17:48:32.445451 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.75
I0612 17:48:32.445463 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.75
I0612 17:48:32.445475 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 17:48:32.445487 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 17:48:32.445499 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 17:48:32.445511 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 17:48:32.445523 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 17:48:32.445534 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 17:48:32.445546 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 17:48:32.445557 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 17:48:32.445569 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 17:48:32.445580 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 17:48:32.445592 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 17:48:32.445605 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.840909
I0612 17:48:32.445616 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.78125
I0612 17:48:32.445631 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.43942 (* 0.3 = 0.431825 loss)
I0612 17:48:32.445648 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.577033 (* 0.3 = 0.17311 loss)
I0612 17:48:32.445664 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.330347 (* 0.0272727 = 0.00900945 loss)
I0612 17:48:32.445678 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.80127 (* 0.0272727 = 0.0218528 loss)
I0612 17:48:32.445706 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.35971 (* 0.0272727 = 0.037083 loss)
I0612 17:48:32.445722 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.82794 (* 0.0272727 = 0.0498528 loss)
I0612 17:48:32.445736 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.76615 (* 0.0272727 = 0.0481677 loss)
I0612 17:48:32.445750 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.02956 (* 0.0272727 = 0.0280788 loss)
I0612 17:48:32.445765 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.02614 (* 0.0272727 = 0.0279857 loss)
I0612 17:48:32.445780 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.447838 (* 0.0272727 = 0.0122138 loss)
I0612 17:48:32.445793 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.725065 (* 0.0272727 = 0.0197745 loss)
I0612 17:48:32.445807 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.771366 (* 0.0272727 = 0.0210372 loss)
I0612 17:48:32.445822 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.836034 (* 0.0272727 = 0.0228009 loss)
I0612 17:48:32.445835 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.559961 (* 0.0272727 = 0.0152717 loss)
I0612 17:48:32.445850 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.525405 (* 0.0272727 = 0.0143292 loss)
I0612 17:48:32.445864 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.340966 (* 0.0272727 = 0.00929907 loss)
I0612 17:48:32.445878 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.231014 (* 0.0272727 = 0.00630038 loss)
I0612 17:48:32.445894 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.0463682 (* 0.0272727 = 0.00126459 loss)
I0612 17:48:32.445907 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.0018115 (* 0.0272727 = 4.94046e-05 loss)
I0612 17:48:32.445921 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.000896458 (* 0.0272727 = 2.44489e-05 loss)
I0612 17:48:32.445936 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000416692 (* 0.0272727 = 1.13643e-05 loss)
I0612 17:48:32.445951 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.000291636 (* 0.0272727 = 7.95371e-06 loss)
I0612 17:48:32.445966 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000322568 (* 0.0272727 = 8.79731e-06 loss)
I0612 17:48:32.445979 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000183362 (* 0.0272727 = 5.00079e-06 loss)
I0612 17:48:32.445992 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.78125
I0612 17:48:32.446004 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 17:48:32.446017 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 17:48:32.446028 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 17:48:32.446040 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.75
I0612 17:48:32.446053 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 17:48:32.446064 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 17:48:32.446076 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 1
I0612 17:48:32.446089 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 1
I0612 17:48:32.446100 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.75
I0612 17:48:32.446112 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 17:48:32.446120 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.75
I0612 17:48:32.446128 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 17:48:32.446141 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 17:48:32.446154 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 17:48:32.446166 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 17:48:32.446187 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 17:48:32.446200 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 17:48:32.446211 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 17:48:32.446223 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 17:48:32.446234 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 17:48:32.446245 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 17:48:32.446257 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 17:48:32.446269 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.914773
I0612 17:48:32.446285 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.890625
I0612 17:48:32.446298 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.84657 (* 1 = 0.84657 loss)
I0612 17:48:32.446312 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.361703 (* 1 = 0.361703 loss)
I0612 17:48:32.446327 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.45766 (* 0.0909091 = 0.0416055 loss)
I0612 17:48:32.446341 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.763718 (* 0.0909091 = 0.0694289 loss)
I0612 17:48:32.446355 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 1.01428 (* 0.0909091 = 0.0922068 loss)
I0612 17:48:32.446369 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.11457 (* 0.0909091 = 0.101325 loss)
I0612 17:48:32.446383 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.04243 (* 0.0909091 = 0.0947666 loss)
I0612 17:48:32.446396 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.739128 (* 0.0909091 = 0.0671935 loss)
I0612 17:48:32.446410 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.241989 (* 0.0909091 = 0.021999 loss)
I0612 17:48:32.446425 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.229436 (* 0.0909091 = 0.0208578 loss)
I0612 17:48:32.446439 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.878628 (* 0.0909091 = 0.0798752 loss)
I0612 17:48:32.446452 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.538624 (* 0.0909091 = 0.0489658 loss)
I0612 17:48:32.446466 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.621068 (* 0.0909091 = 0.0564607 loss)
I0612 17:48:32.446480 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.380655 (* 0.0909091 = 0.034605 loss)
I0612 17:48:32.446494 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.253551 (* 0.0909091 = 0.0230501 loss)
I0612 17:48:32.446508 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.153815 (* 0.0909091 = 0.0139832 loss)
I0612 17:48:32.446522 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.234544 (* 0.0909091 = 0.0213222 loss)
I0612 17:48:32.446537 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.0302274 (* 0.0909091 = 0.00274794 loss)
I0612 17:48:32.446550 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.00504073 (* 0.0909091 = 0.000458248 loss)
I0612 17:48:32.446564 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.0017764 (* 0.0909091 = 0.000161491 loss)
I0612 17:48:32.446579 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00113709 (* 0.0909091 = 0.000103371 loss)
I0612 17:48:32.446593 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00117231 (* 0.0909091 = 0.000106574 loss)
I0612 17:48:32.446607 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00133147 (* 0.0909091 = 0.000121042 loss)
I0612 17:48:32.446621 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000990788 (* 0.0909091 = 9.00717e-05 loss)
I0612 17:48:32.446633 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 17:48:32.446645 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.375
I0612 17:48:32.446667 6181 solver.cpp:245] Train net output #149: total_confidence = 0.251301
I0612 17:48:32.446681 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.248872
I0612 17:48:32.446697 6181 sgd_solver.cpp:106] Iteration 13500, lr = 0.001
I0612 17:48:59.808317 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.6443 > 30) by scale factor 0.84165
I0612 17:49:16.037410 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.4691 > 30) by scale factor 0.923955
I0612 17:49:34.559288 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.2507 > 30) by scale factor 0.902238
I0612 17:50:10.053587 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.2307 > 30) by scale factor 0.828027
I0612 17:51:01.007455 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.9617 > 30) by scale factor 0.834221
I0612 17:51:02.556872 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.6533 > 30) by scale factor 0.918742
I0612 17:51:47.363178 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.81 > 30) by scale factor 0.861822
I0612 17:52:10.559895 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.5448 > 30) by scale factor 0.844004
I0612 17:53:18.393774 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.4872 > 30) by scale factor 0.822206
I0612 17:54:12.368660 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.6165 > 30) by scale factor 0.979865
I0612 17:54:33.961352 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 49.3045 > 30) by scale factor 0.608463
I0612 17:54:58.291550 6181 solver.cpp:229] Iteration 14000, loss = 3.86379
I0612 17:54:58.291677 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.546875
I0612 17:54:58.291697 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 17:54:58.291712 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 1
I0612 17:54:58.291724 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 17:54:58.291738 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.625
I0612 17:54:58.291751 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.625
I0612 17:54:58.291764 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.625
I0612 17:54:58.291777 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.75
I0612 17:54:58.291790 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 17:54:58.291803 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 17:54:58.291815 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 17:54:58.291828 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.75
I0612 17:54:58.291841 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 17:54:58.291854 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.75
I0612 17:54:58.291867 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.75
I0612 17:54:58.291878 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.75
I0612 17:54:58.291889 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 17:54:58.291901 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 17:54:58.291913 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 17:54:58.291925 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 17:54:58.291936 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 17:54:58.291949 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 17:54:58.291960 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 17:54:58.291972 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.829545
I0612 17:54:58.291985 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.78125
I0612 17:54:58.292001 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.37876 (* 0.3 = 0.413628 loss)
I0612 17:54:58.292016 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.533759 (* 0.3 = 0.160128 loss)
I0612 17:54:58.292031 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.525926 (* 0.0272727 = 0.0143434 loss)
I0612 17:54:58.292045 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 0.357442 (* 0.0272727 = 0.00974842 loss)
I0612 17:54:58.292060 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.73381 (* 0.0272727 = 0.0472857 loss)
I0612 17:54:58.292074 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.07672 (* 0.0272727 = 0.029365 loss)
I0612 17:54:58.292088 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.49111 (* 0.0272727 = 0.0406665 loss)
I0612 17:54:58.292103 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 0.932446 (* 0.0272727 = 0.0254304 loss)
I0612 17:54:58.292116 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.31574 (* 0.0272727 = 0.0358837 loss)
I0612 17:54:58.292130 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.598396 (* 0.0272727 = 0.0163199 loss)
I0612 17:54:58.292145 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.547843 (* 0.0272727 = 0.0149412 loss)
I0612 17:54:58.292160 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.841628 (* 0.0272727 = 0.0229535 loss)
I0612 17:54:58.292173 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.533176 (* 0.0272727 = 0.0145412 loss)
I0612 17:54:58.292187 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.337812 (* 0.0272727 = 0.00921304 loss)
I0612 17:54:58.292219 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 1.00001 (* 0.0272727 = 0.027273 loss)
I0612 17:54:58.292238 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.803884 (* 0.0272727 = 0.0219241 loss)
I0612 17:54:58.292253 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.539959 (* 0.0272727 = 0.0147262 loss)
I0612 17:54:58.292266 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.554944 (* 0.0272727 = 0.0151348 loss)
I0612 17:54:58.292280 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00119471 (* 0.0272727 = 3.25829e-05 loss)
I0612 17:54:58.292295 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 4.82033e-05 (* 0.0272727 = 1.31464e-06 loss)
I0612 17:54:58.292309 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 2.16225e-05 (* 0.0272727 = 5.89704e-07 loss)
I0612 17:54:58.292325 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 1.82397e-05 (* 0.0272727 = 4.97447e-07 loss)
I0612 17:54:58.292338 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 2.22037e-05 (* 0.0272727 = 6.05556e-07 loss)
I0612 17:54:58.292352 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 3.42088e-05 (* 0.0272727 = 9.32967e-07 loss)
I0612 17:54:58.292364 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.59375
I0612 17:54:58.292377 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 17:54:58.292389 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 1
I0612 17:54:58.292402 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.375
I0612 17:54:58.292413 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.75
I0612 17:54:58.292425 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.875
I0612 17:54:58.292438 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.75
I0612 17:54:58.292449 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 17:54:58.292461 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 17:54:58.292472 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 17:54:58.292484 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.75
I0612 17:54:58.292496 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.75
I0612 17:54:58.292508 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.75
I0612 17:54:58.292520 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.75
I0612 17:54:58.292532 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.75
I0612 17:54:58.292544 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.75
I0612 17:54:58.292556 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.75
I0612 17:54:58.292567 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 17:54:58.292579 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 17:54:58.292590 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 17:54:58.292603 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 17:54:58.292614 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 17:54:58.292625 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 17:54:58.292637 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.840909
I0612 17:54:58.292649 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.796875
I0612 17:54:58.292664 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.24951 (* 0.3 = 0.374853 loss)
I0612 17:54:58.292676 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.482792 (* 0.3 = 0.144838 loss)
I0612 17:54:58.292695 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.539835 (* 0.0272727 = 0.0147228 loss)
I0612 17:54:58.292709 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.486387 (* 0.0272727 = 0.0132651 loss)
I0612 17:54:58.292735 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.758 (* 0.0272727 = 0.0479456 loss)
I0612 17:54:58.292750 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 0.792198 (* 0.0272727 = 0.0216054 loss)
I0612 17:54:58.292764 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 0.740359 (* 0.0272727 = 0.0201916 loss)
I0612 17:54:58.292778 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 0.866038 (* 0.0272727 = 0.0236192 loss)
I0612 17:54:58.292793 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.702717 (* 0.0272727 = 0.019165 loss)
I0612 17:54:58.292806 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.469772 (* 0.0272727 = 0.012812 loss)
I0612 17:54:58.292820 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.601959 (* 0.0272727 = 0.0164171 loss)
I0612 17:54:58.292834 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.782267 (* 0.0272727 = 0.0213346 loss)
I0612 17:54:58.292848 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.652751 (* 0.0272727 = 0.0178023 loss)
I0612 17:54:58.292862 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.55606 (* 0.0272727 = 0.0151653 loss)
I0612 17:54:58.292876 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.789833 (* 0.0272727 = 0.0215409 loss)
I0612 17:54:58.292889 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.794971 (* 0.0272727 = 0.021681 loss)
I0612 17:54:58.292903 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.833868 (* 0.0272727 = 0.0227419 loss)
I0612 17:54:58.292917 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.585312 (* 0.0272727 = 0.0159631 loss)
I0612 17:54:58.292932 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.00654036 (* 0.0272727 = 0.000178373 loss)
I0612 17:54:58.292946 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.000656546 (* 0.0272727 = 1.79058e-05 loss)
I0612 17:54:58.292960 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000137306 (* 0.0272727 = 3.7447e-06 loss)
I0612 17:54:58.292974 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 9.08843e-05 (* 0.0272727 = 2.47866e-06 loss)
I0612 17:54:58.292989 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 4.16898e-05 (* 0.0272727 = 1.13699e-06 loss)
I0612 17:54:58.293002 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 3.59372e-05 (* 0.0272727 = 9.80107e-07 loss)
I0612 17:54:58.293015 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.6875
I0612 17:54:58.293026 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 17:54:58.293040 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 1
I0612 17:54:58.293051 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 17:54:58.293062 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 17:54:58.293074 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 17:54:58.293087 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 17:54:58.293098 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 17:54:58.293110 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 17:54:58.293121 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 17:54:58.293133 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 17:54:58.293144 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 17:54:58.293156 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 17:54:58.293169 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.75
I0612 17:54:58.293180 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.75
I0612 17:54:58.293192 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 17:54:58.293213 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 17:54:58.293226 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 17:54:58.293238 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 17:54:58.293249 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 17:54:58.293262 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 17:54:58.293284 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 17:54:58.293301 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 17:54:58.293313 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.880682
I0612 17:54:58.293325 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.953125
I0612 17:54:58.293340 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.838122 (* 1 = 0.838122 loss)
I0612 17:54:58.293354 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.317748 (* 1 = 0.317748 loss)
I0612 17:54:58.293368 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.394965 (* 0.0909091 = 0.0359059 loss)
I0612 17:54:58.293382 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.0741047 (* 0.0909091 = 0.00673679 loss)
I0612 17:54:58.293396 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.745 (* 0.0909091 = 0.0677273 loss)
I0612 17:54:58.293411 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.289712 (* 0.0909091 = 0.0263375 loss)
I0612 17:54:58.293424 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.525938 (* 0.0909091 = 0.0478125 loss)
I0612 17:54:58.293438 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.739052 (* 0.0909091 = 0.0671866 loss)
I0612 17:54:58.293452 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.7465 (* 0.0909091 = 0.0678637 loss)
I0612 17:54:58.293467 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.382782 (* 0.0909091 = 0.0347983 loss)
I0612 17:54:58.293480 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.405874 (* 0.0909091 = 0.0368976 loss)
I0612 17:54:58.293494 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.307652 (* 0.0909091 = 0.0279684 loss)
I0612 17:54:58.293509 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.281517 (* 0.0909091 = 0.0255925 loss)
I0612 17:54:58.293524 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.323731 (* 0.0909091 = 0.0294301 loss)
I0612 17:54:58.293534 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.619393 (* 0.0909091 = 0.0563084 loss)
I0612 17:54:58.293543 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.630037 (* 0.0909091 = 0.0572761 loss)
I0612 17:54:58.293557 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.72921 (* 0.0909091 = 0.0662918 loss)
I0612 17:54:58.293572 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.102987 (* 0.0909091 = 0.00936245 loss)
I0612 17:54:58.293586 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.00896155 (* 0.0909091 = 0.000814686 loss)
I0612 17:54:58.293601 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.000827709 (* 0.0909091 = 7.52463e-05 loss)
I0612 17:54:58.293614 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.000205598 (* 0.0909091 = 1.86907e-05 loss)
I0612 17:54:58.293628 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 7.49075e-05 (* 0.0909091 = 6.80978e-06 loss)
I0612 17:54:58.293643 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 7.86092e-05 (* 0.0909091 = 7.14629e-06 loss)
I0612 17:54:58.293658 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 1.37541e-05 (* 0.0909091 = 1.25038e-06 loss)
I0612 17:54:58.293669 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.625
I0612 17:54:58.293681 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.625
I0612 17:54:58.293704 6181 solver.cpp:245] Train net output #149: total_confidence = 0.546978
I0612 17:54:58.293718 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.577594
I0612 17:54:58.293730 6181 sgd_solver.cpp:106] Iteration 14000, lr = 0.001
I0612 17:55:24.903228 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.6294 > 30) by scale factor 0.979451
I0612 17:55:44.973006 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 46.6294 > 30) by scale factor 0.643371
I0612 17:55:52.697398 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 48.7604 > 30) by scale factor 0.615254
I0612 17:55:58.087215 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.9019 > 30) by scale factor 0.970814
I0612 17:56:24.334683 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.9101 > 30) by scale factor 0.83542
I0612 17:56:30.516391 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.1384 > 30) by scale factor 0.830143
I0612 17:57:40.758623 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 45.8534 > 30) by scale factor 0.654259
I0612 17:58:34.048012 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.1476 > 30) by scale factor 0.963156
I0612 17:58:44.103217 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.459 > 30) by scale factor 0.953622
I0612 17:58:58.770596 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.9828 > 30) by scale factor 0.909565
I0612 17:59:25.774478 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 43.4763 > 30) by scale factor 0.690031
I0612 18:00:23.635515 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.237 > 30) by scale factor 0.930607
I0612 18:01:24.216934 6181 solver.cpp:229] Iteration 14500, loss = 4.0246
I0612 18:01:24.217058 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.58
I0612 18:01:24.217079 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 18:01:24.217093 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 18:01:24.217106 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.625
I0612 18:01:24.217119 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.375
I0612 18:01:24.217133 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.625
I0612 18:01:24.217145 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.75
I0612 18:01:24.217157 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 1
I0612 18:01:24.217170 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 18:01:24.217182 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 1
I0612 18:01:24.217195 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 1
I0612 18:01:24.217207 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 18:01:24.217221 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 18:01:24.217236 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 18:01:24.217249 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 18:01:24.217262 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 18:01:24.217274 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 18:01:24.217285 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 18:01:24.217298 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 18:01:24.217309 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 18:01:24.217332 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 18:01:24.217347 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 18:01:24.217360 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 18:01:24.217371 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.857955
I0612 18:01:24.217383 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.78
I0612 18:01:24.217399 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.48151 (* 0.3 = 0.444452 loss)
I0612 18:01:24.217414 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.477126 (* 0.3 = 0.143138 loss)
I0612 18:01:24.217429 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.60669 (* 0.0272727 = 0.0165461 loss)
I0612 18:01:24.217443 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.52275 (* 0.0272727 = 0.0415295 loss)
I0612 18:01:24.217458 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.59003 (* 0.0272727 = 0.0433645 loss)
I0612 18:01:24.217473 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.9256 (* 0.0272727 = 0.0525163 loss)
I0612 18:01:24.217488 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.54091 (* 0.0272727 = 0.0420247 loss)
I0612 18:01:24.217501 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.25818 (* 0.0272727 = 0.034314 loss)
I0612 18:01:24.217515 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 0.357511 (* 0.0272727 = 0.0097503 loss)
I0612 18:01:24.217530 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.475878 (* 0.0272727 = 0.0129785 loss)
I0612 18:01:24.217545 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.258098 (* 0.0272727 = 0.00703904 loss)
I0612 18:01:24.217558 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.251056 (* 0.0272727 = 0.00684697 loss)
I0612 18:01:24.217572 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.419187 (* 0.0272727 = 0.0114324 loss)
I0612 18:01:24.217587 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.130279 (* 0.0272727 = 0.00355307 loss)
I0612 18:01:24.217602 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.216639 (* 0.0272727 = 0.00590833 loss)
I0612 18:01:24.217634 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.0872283 (* 0.0272727 = 0.00237895 loss)
I0612 18:01:24.217650 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.0194253 (* 0.0272727 = 0.00052978 loss)
I0612 18:01:24.217664 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.00504893 (* 0.0272727 = 0.000137698 loss)
I0612 18:01:24.217679 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00163271 (* 0.0272727 = 4.45286e-05 loss)
I0612 18:01:24.217694 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000571674 (* 0.0272727 = 1.55911e-05 loss)
I0612 18:01:24.217707 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00054315 (* 0.0272727 = 1.48132e-05 loss)
I0612 18:01:24.217721 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000827108 (* 0.0272727 = 2.25575e-05 loss)
I0612 18:01:24.217736 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000575383 (* 0.0272727 = 1.56923e-05 loss)
I0612 18:01:24.217749 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00084379 (* 0.0272727 = 2.30125e-05 loss)
I0612 18:01:24.217761 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.66
I0612 18:01:24.217774 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 18:01:24.217787 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.625
I0612 18:01:24.217798 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.625
I0612 18:01:24.217810 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.625
I0612 18:01:24.217823 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 18:01:24.217834 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.875
I0612 18:01:24.217846 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 18:01:24.217859 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 18:01:24.217870 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 18:01:24.217881 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 18:01:24.217893 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 18:01:24.217905 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 18:01:24.217916 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 18:01:24.217928 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 18:01:24.217939 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 18:01:24.217952 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 18:01:24.217962 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 18:01:24.217974 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 18:01:24.217985 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 18:01:24.217998 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 18:01:24.218008 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 18:01:24.218020 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 18:01:24.218031 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.903409
I0612 18:01:24.218044 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.84
I0612 18:01:24.218057 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.20988 (* 0.3 = 0.362964 loss)
I0612 18:01:24.218075 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.363815 (* 0.3 = 0.109145 loss)
I0612 18:01:24.218089 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.397022 (* 0.0272727 = 0.0108279 loss)
I0612 18:01:24.218103 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.36043 (* 0.0272727 = 0.0371025 loss)
I0612 18:01:24.218129 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.22632 (* 0.0272727 = 0.0334451 loss)
I0612 18:01:24.218144 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.04063 (* 0.0272727 = 0.0283809 loss)
I0612 18:01:24.218158 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.39067 (* 0.0272727 = 0.0379272 loss)
I0612 18:01:24.218173 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 0.442182 (* 0.0272727 = 0.0120595 loss)
I0612 18:01:24.218188 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.473875 (* 0.0272727 = 0.0129239 loss)
I0612 18:01:24.218201 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.492539 (* 0.0272727 = 0.0134329 loss)
I0612 18:01:24.218215 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.334798 (* 0.0272727 = 0.00913085 loss)
I0612 18:01:24.218230 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.286121 (* 0.0272727 = 0.00780329 loss)
I0612 18:01:24.218243 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.387804 (* 0.0272727 = 0.0105765 loss)
I0612 18:01:24.218258 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.0748612 (* 0.0272727 = 0.00204167 loss)
I0612 18:01:24.218274 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.365148 (* 0.0272727 = 0.00995859 loss)
I0612 18:01:24.218291 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.0229953 (* 0.0272727 = 0.000627144 loss)
I0612 18:01:24.218304 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.00230379 (* 0.0272727 = 6.28306e-05 loss)
I0612 18:01:24.218318 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.00149764 (* 0.0272727 = 4.08447e-05 loss)
I0612 18:01:24.218333 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.000497565 (* 0.0272727 = 1.357e-05 loss)
I0612 18:01:24.218348 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 4.01266e-05 (* 0.0272727 = 1.09436e-06 loss)
I0612 18:01:24.218363 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 1.07589e-05 (* 0.0272727 = 2.93424e-07 loss)
I0612 18:01:24.218376 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 8.00202e-06 (* 0.0272727 = 2.18237e-07 loss)
I0612 18:01:24.218390 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 7.19735e-06 (* 0.0272727 = 1.96291e-07 loss)
I0612 18:01:24.218405 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 7.45066e-06 (* 0.0272727 = 2.032e-07 loss)
I0612 18:01:24.218417 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.86
I0612 18:01:24.218430 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 1
I0612 18:01:24.218441 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 18:01:24.218453 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.625
I0612 18:01:24.218466 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 18:01:24.218477 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 1
I0612 18:01:24.218489 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 18:01:24.218500 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 18:01:24.218513 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 1
I0612 18:01:24.218524 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 1
I0612 18:01:24.218536 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 18:01:24.218547 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 18:01:24.218559 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 18:01:24.218570 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 18:01:24.218582 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 18:01:24.218595 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 18:01:24.218605 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 18:01:24.218627 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 18:01:24.218641 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 18:01:24.218652 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 18:01:24.218663 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 18:01:24.218675 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 18:01:24.218688 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 18:01:24.218698 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.960227
I0612 18:01:24.218710 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.94
I0612 18:01:24.218724 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.755594 (* 1 = 0.755594 loss)
I0612 18:01:24.218739 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.22808 (* 1 = 0.22808 loss)
I0612 18:01:24.218750 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.0942326 (* 0.0909091 = 0.0085666 loss)
I0612 18:01:24.218760 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.379234 (* 0.0909091 = 0.0344758 loss)
I0612 18:01:24.218775 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 2.67883 (* 0.0909091 = 0.24353 loss)
I0612 18:01:24.218788 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.933605 (* 0.0909091 = 0.0848732 loss)
I0612 18:01:24.218802 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.315422 (* 0.0909091 = 0.0286747 loss)
I0612 18:01:24.218816 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.483186 (* 0.0909091 = 0.043926 loss)
I0612 18:01:24.218830 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.181908 (* 0.0909091 = 0.0165371 loss)
I0612 18:01:24.218844 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.0801041 (* 0.0909091 = 0.00728219 loss)
I0612 18:01:24.218858 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.0822668 (* 0.0909091 = 0.0074788 loss)
I0612 18:01:24.218873 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.102666 (* 0.0909091 = 0.00933324 loss)
I0612 18:01:24.218888 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.098523 (* 0.0909091 = 0.00895664 loss)
I0612 18:01:24.218901 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.0748439 (* 0.0909091 = 0.00680399 loss)
I0612 18:01:24.218916 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.182635 (* 0.0909091 = 0.0166032 loss)
I0612 18:01:24.218930 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.0113354 (* 0.0909091 = 0.00103049 loss)
I0612 18:01:24.218945 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.00243917 (* 0.0909091 = 0.000221743 loss)
I0612 18:01:24.218958 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.000377497 (* 0.0909091 = 3.43179e-05 loss)
I0612 18:01:24.218973 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 4.00791e-05 (* 0.0909091 = 3.64355e-06 loss)
I0612 18:01:24.218987 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 2.21887e-05 (* 0.0909091 = 2.01715e-06 loss)
I0612 18:01:24.219002 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 2.37087e-05 (* 0.0909091 = 2.15533e-06 loss)
I0612 18:01:24.219015 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 1.89103e-05 (* 0.0909091 = 1.71912e-06 loss)
I0612 18:01:24.219029 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 2.04303e-05 (* 0.0909091 = 1.8573e-06 loss)
I0612 18:01:24.219044 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 2.1429e-05 (* 0.0909091 = 1.94809e-06 loss)
I0612 18:01:24.219056 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 18:01:24.219069 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 18:01:24.219089 6181 solver.cpp:245] Train net output #149: total_confidence = 0.413668
I0612 18:01:24.219102 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.360957
I0612 18:01:24.219116 6181 sgd_solver.cpp:106] Iteration 14500, lr = 0.001
I0612 18:02:25.550959 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.03 > 30) by scale factor 0.966806
I0612 18:04:20.570942 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.8308 > 30) by scale factor 0.913775
I0612 18:05:25.355268 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 47.8045 > 30) by scale factor 0.627556
I0612 18:05:39.243542 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.9387 > 30) by scale factor 0.858647
I0612 18:05:48.486340 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.8915 > 30) by scale factor 0.971139
I0612 18:06:30.911415 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.2879 > 30) by scale factor 0.990493
I0612 18:06:34.769986 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.1091 > 30) by scale factor 0.906095
I0612 18:07:37.368152 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.8755 > 30) by scale factor 0.885597
I0612 18:07:49.723551 6181 solver.cpp:338] Iteration 15000, Testing net (#0)
I0612 18:08:47.370841 6181 solver.cpp:393] Test loss: 2.77531
I0612 18:08:47.370949 6181 solver.cpp:406] Test net output #0: loss1/accuracy = 0.605191
I0612 18:08:47.370970 6181 solver.cpp:406] Test net output #1: loss1/accuracy01 = 0.775
I0612 18:08:47.370985 6181 solver.cpp:406] Test net output #2: loss1/accuracy02 = 0.641
I0612 18:08:47.370997 6181 solver.cpp:406] Test net output #3: loss1/accuracy03 = 0.498
I0612 18:08:47.371009 6181 solver.cpp:406] Test net output #4: loss1/accuracy04 = 0.49
I0612 18:08:47.371022 6181 solver.cpp:406] Test net output #5: loss1/accuracy05 = 0.496
I0612 18:08:47.371036 6181 solver.cpp:406] Test net output #6: loss1/accuracy06 = 0.707
I0612 18:08:47.371048 6181 solver.cpp:406] Test net output #7: loss1/accuracy07 = 0.837
I0612 18:08:47.371062 6181 solver.cpp:406] Test net output #8: loss1/accuracy08 = 0.926
I0612 18:08:47.371073 6181 solver.cpp:406] Test net output #9: loss1/accuracy09 = 0.968
I0612 18:08:47.371085 6181 solver.cpp:406] Test net output #10: loss1/accuracy10 = 0.984
I0612 18:08:47.371098 6181 solver.cpp:406] Test net output #11: loss1/accuracy11 = 0.996
I0612 18:08:47.371110 6181 solver.cpp:406] Test net output #12: loss1/accuracy12 = 0.999
I0612 18:08:47.371124 6181 solver.cpp:406] Test net output #13: loss1/accuracy13 = 1
I0612 18:08:47.371135 6181 solver.cpp:406] Test net output #14: loss1/accuracy14 = 1
I0612 18:08:47.371147 6181 solver.cpp:406] Test net output #15: loss1/accuracy15 = 1
I0612 18:08:47.371160 6181 solver.cpp:406] Test net output #16: loss1/accuracy16 = 1
I0612 18:08:47.371171 6181 solver.cpp:406] Test net output #17: loss1/accuracy17 = 1
I0612 18:08:47.371183 6181 solver.cpp:406] Test net output #18: loss1/accuracy18 = 1
I0612 18:08:47.371194 6181 solver.cpp:406] Test net output #19: loss1/accuracy19 = 1
I0612 18:08:47.371206 6181 solver.cpp:406] Test net output #20: loss1/accuracy20 = 1
I0612 18:08:47.371217 6181 solver.cpp:406] Test net output #21: loss1/accuracy21 = 1
I0612 18:08:47.371232 6181 solver.cpp:406] Test net output #22: loss1/accuracy22 = 1
I0612 18:08:47.371244 6181 solver.cpp:406] Test net output #23: loss1/accuracy_incl_empty = 0.885093
I0612 18:08:47.371258 6181 solver.cpp:406] Test net output #24: loss1/accuracy_top3 = 0.827576
I0612 18:08:47.371273 6181 solver.cpp:406] Test net output #25: loss1/cross_entropy_loss = 1.44702 (* 0.3 = 0.434105 loss)
I0612 18:08:47.371289 6181 solver.cpp:406] Test net output #26: loss1/cross_entropy_loss_incl_empty = 0.430485 (* 0.3 = 0.129145 loss)
I0612 18:08:47.371304 6181 solver.cpp:406] Test net output #27: loss1/loss01 = 0.962303 (* 0.0272727 = 0.0262446 loss)
I0612 18:08:47.371318 6181 solver.cpp:406] Test net output #28: loss1/loss02 = 1.3949 (* 0.0272727 = 0.0380426 loss)
I0612 18:08:47.371332 6181 solver.cpp:406] Test net output #29: loss1/loss03 = 1.78248 (* 0.0272727 = 0.0486131 loss)
I0612 18:08:47.371345 6181 solver.cpp:406] Test net output #30: loss1/loss04 = 1.86807 (* 0.0272727 = 0.0509473 loss)
I0612 18:08:47.371359 6181 solver.cpp:406] Test net output #31: loss1/loss05 = 1.70832 (* 0.0272727 = 0.0465906 loss)
I0612 18:08:47.371373 6181 solver.cpp:406] Test net output #32: loss1/loss06 = 1.02387 (* 0.0272727 = 0.0279238 loss)
I0612 18:08:47.371387 6181 solver.cpp:406] Test net output #33: loss1/loss07 = 0.569868 (* 0.0272727 = 0.0155419 loss)
I0612 18:08:47.371402 6181 solver.cpp:406] Test net output #34: loss1/loss08 = 0.290058 (* 0.0272727 = 0.00791068 loss)
I0612 18:08:47.371415 6181 solver.cpp:406] Test net output #35: loss1/loss09 = 0.14902 (* 0.0272727 = 0.00406418 loss)
I0612 18:08:47.371430 6181 solver.cpp:406] Test net output #36: loss1/loss10 = 0.0838656 (* 0.0272727 = 0.00228724 loss)
I0612 18:08:47.371444 6181 solver.cpp:406] Test net output #37: loss1/loss11 = 0.0248508 (* 0.0272727 = 0.000677748 loss)
I0612 18:08:47.371459 6181 solver.cpp:406] Test net output #38: loss1/loss12 = 0.0142184 (* 0.0272727 = 0.000387774 loss)
I0612 18:08:47.371472 6181 solver.cpp:406] Test net output #39: loss1/loss13 = 0.00898984 (* 0.0272727 = 0.000245177 loss)
I0612 18:08:47.371506 6181 solver.cpp:406] Test net output #40: loss1/loss14 = 0.00592113 (* 0.0272727 = 0.000161485 loss)
I0612 18:08:47.371522 6181 solver.cpp:406] Test net output #41: loss1/loss15 = 0.0043747 (* 0.0272727 = 0.00011931 loss)
I0612 18:08:47.371536 6181 solver.cpp:406] Test net output #42: loss1/loss16 = 0.00327533 (* 0.0272727 = 8.93271e-05 loss)
I0612 18:08:47.371551 6181 solver.cpp:406] Test net output #43: loss1/loss17 = 0.00282585 (* 0.0272727 = 7.70686e-05 loss)
I0612 18:08:47.371564 6181 solver.cpp:406] Test net output #44: loss1/loss18 = 0.00248992 (* 0.0272727 = 6.7907e-05 loss)
I0612 18:08:47.371578 6181 solver.cpp:406] Test net output #45: loss1/loss19 = 0.00236494 (* 0.0272727 = 6.44984e-05 loss)
I0612 18:08:47.371593 6181 solver.cpp:406] Test net output #46: loss1/loss20 = 0.00200014 (* 0.0272727 = 5.45493e-05 loss)
I0612 18:08:47.371608 6181 solver.cpp:406] Test net output #47: loss1/loss21 = 0.00187875 (* 0.0272727 = 5.12387e-05 loss)
I0612 18:08:47.371621 6181 solver.cpp:406] Test net output #48: loss1/loss22 = 0.00164442 (* 0.0272727 = 4.48479e-05 loss)
I0612 18:08:47.371634 6181 solver.cpp:406] Test net output #49: loss2/accuracy = 0.754271
I0612 18:08:47.371646 6181 solver.cpp:406] Test net output #50: loss2/accuracy01 = 0.858
I0612 18:08:47.371659 6181 solver.cpp:406] Test net output #51: loss2/accuracy02 = 0.817
I0612 18:08:47.371670 6181 solver.cpp:406] Test net output #52: loss2/accuracy03 = 0.744
I0612 18:08:47.371681 6181 solver.cpp:406] Test net output #53: loss2/accuracy04 = 0.64
I0612 18:08:47.371693 6181 solver.cpp:406] Test net output #54: loss2/accuracy05 = 0.649
I0612 18:08:47.371706 6181 solver.cpp:406] Test net output #55: loss2/accuracy06 = 0.801
I0612 18:08:47.371716 6181 solver.cpp:406] Test net output #56: loss2/accuracy07 = 0.866
I0612 18:08:47.371728 6181 solver.cpp:406] Test net output #57: loss2/accuracy08 = 0.937
I0612 18:08:47.371740 6181 solver.cpp:406] Test net output #58: loss2/accuracy09 = 0.967
I0612 18:08:47.371752 6181 solver.cpp:406] Test net output #59: loss2/accuracy10 = 0.984
I0612 18:08:47.371763 6181 solver.cpp:406] Test net output #60: loss2/accuracy11 = 0.994
I0612 18:08:47.371774 6181 solver.cpp:406] Test net output #61: loss2/accuracy12 = 0.998
I0612 18:08:47.371786 6181 solver.cpp:406] Test net output #62: loss2/accuracy13 = 0.999
I0612 18:08:47.371798 6181 solver.cpp:406] Test net output #63: loss2/accuracy14 = 1
I0612 18:08:47.371810 6181 solver.cpp:406] Test net output #64: loss2/accuracy15 = 1
I0612 18:08:47.371821 6181 solver.cpp:406] Test net output #65: loss2/accuracy16 = 1
I0612 18:08:47.371834 6181 solver.cpp:406] Test net output #66: loss2/accuracy17 = 1
I0612 18:08:47.371845 6181 solver.cpp:406] Test net output #67: loss2/accuracy18 = 1
I0612 18:08:47.371856 6181 solver.cpp:406] Test net output #68: loss2/accuracy19 = 1
I0612 18:08:47.371867 6181 solver.cpp:406] Test net output #69: loss2/accuracy20 = 1
I0612 18:08:47.371878 6181 solver.cpp:406] Test net output #70: loss2/accuracy21 = 1
I0612 18:08:47.371891 6181 solver.cpp:406] Test net output #71: loss2/accuracy22 = 1
I0612 18:08:47.371901 6181 solver.cpp:406] Test net output #72: loss2/accuracy_incl_empty = 0.926183
I0612 18:08:47.371913 6181 solver.cpp:406] Test net output #73: loss2/accuracy_top3 = 0.899144
I0612 18:08:47.371927 6181 solver.cpp:406] Test net output #74: loss2/cross_entropy_loss = 0.978858 (* 0.3 = 0.293658 loss)
I0612 18:08:47.371940 6181 solver.cpp:406] Test net output #75: loss2/cross_entropy_loss_incl_empty = 0.298941 (* 0.3 = 0.0896823 loss)
I0612 18:08:47.371954 6181 solver.cpp:406] Test net output #76: loss2/loss01 = 0.704845 (* 0.0272727 = 0.019223 loss)
I0612 18:08:47.371971 6181 solver.cpp:406] Test net output #77: loss2/loss02 = 0.877843 (* 0.0272727 = 0.0239412 loss)
I0612 18:08:47.371999 6181 solver.cpp:406] Test net output #78: loss2/loss03 = 1.10696 (* 0.0272727 = 0.0301898 loss)
I0612 18:08:47.372014 6181 solver.cpp:406] Test net output #79: loss2/loss04 = 1.31828 (* 0.0272727 = 0.0359531 loss)
I0612 18:08:47.372026 6181 solver.cpp:406] Test net output #80: loss2/loss05 = 1.18291 (* 0.0272727 = 0.0322613 loss)
I0612 18:08:47.372040 6181 solver.cpp:406] Test net output #81: loss2/loss06 = 0.762109 (* 0.0272727 = 0.0207848 loss)
I0612 18:08:47.372054 6181 solver.cpp:406] Test net output #82: loss2/loss07 = 0.463398 (* 0.0272727 = 0.0126381 loss)
I0612 18:08:47.372068 6181 solver.cpp:406] Test net output #83: loss2/loss08 = 0.240988 (* 0.0272727 = 0.0065724 loss)
I0612 18:08:47.372082 6181 solver.cpp:406] Test net output #84: loss2/loss09 = 0.132086 (* 0.0272727 = 0.00360233 loss)
I0612 18:08:47.372097 6181 solver.cpp:406] Test net output #85: loss2/loss10 = 0.0781026 (* 0.0272727 = 0.00213007 loss)
I0612 18:08:47.372110 6181 solver.cpp:406] Test net output #86: loss2/loss11 = 0.026055 (* 0.0272727 = 0.000710592 loss)
I0612 18:08:47.372124 6181 solver.cpp:406] Test net output #87: loss2/loss12 = 0.0152863 (* 0.0272727 = 0.000416899 loss)
I0612 18:08:47.372138 6181 solver.cpp:406] Test net output #88: loss2/loss13 = 0.0105784 (* 0.0272727 = 0.000288502 loss)
I0612 18:08:47.372153 6181 solver.cpp:406] Test net output #89: loss2/loss14 = 0.00821073 (* 0.0272727 = 0.000223929 loss)
I0612 18:08:47.372166 6181 solver.cpp:406] Test net output #90: loss2/loss15 = 0.00680291 (* 0.0272727 = 0.000185534 loss)
I0612 18:08:47.372179 6181 solver.cpp:406] Test net output #91: loss2/loss16 = 0.00582102 (* 0.0272727 = 0.000158755 loss)
I0612 18:08:47.372195 6181 solver.cpp:406] Test net output #92: loss2/loss17 = 0.00552201 (* 0.0272727 = 0.0001506 loss)
I0612 18:08:47.372208 6181 solver.cpp:406] Test net output #93: loss2/loss18 = 0.00542108 (* 0.0272727 = 0.000147848 loss)
I0612 18:08:47.372222 6181 solver.cpp:406] Test net output #94: loss2/loss19 = 0.00498845 (* 0.0272727 = 0.000136049 loss)
I0612 18:08:47.372236 6181 solver.cpp:406] Test net output #95: loss2/loss20 = 0.00484581 (* 0.0272727 = 0.000132158 loss)
I0612 18:08:47.372249 6181 solver.cpp:406] Test net output #96: loss2/loss21 = 0.00428315 (* 0.0272727 = 0.000116813 loss)
I0612 18:08:47.372264 6181 solver.cpp:406] Test net output #97: loss2/loss22 = 0.00378133 (* 0.0272727 = 0.000103127 loss)
I0612 18:08:47.372280 6181 solver.cpp:406] Test net output #98: loss3/accuracy = 0.845519
I0612 18:08:47.372293 6181 solver.cpp:406] Test net output #99: loss3/accuracy01 = 0.875
I0612 18:08:47.372305 6181 solver.cpp:406] Test net output #100: loss3/accuracy02 = 0.863
I0612 18:08:47.372318 6181 solver.cpp:406] Test net output #101: loss3/accuracy03 = 0.849
I0612 18:08:47.372328 6181 solver.cpp:406] Test net output #102: loss3/accuracy04 = 0.84
I0612 18:08:47.372340 6181 solver.cpp:406] Test net output #103: loss3/accuracy05 = 0.828
I0612 18:08:47.372352 6181 solver.cpp:406] Test net output #104: loss3/accuracy06 = 0.887
I0612 18:08:47.372364 6181 solver.cpp:406] Test net output #105: loss3/accuracy07 = 0.908
I0612 18:08:47.372375 6181 solver.cpp:406] Test net output #106: loss3/accuracy08 = 0.945
I0612 18:08:47.372387 6181 solver.cpp:406] Test net output #107: loss3/accuracy09 = 0.966
I0612 18:08:47.372398 6181 solver.cpp:406] Test net output #108: loss3/accuracy10 = 0.984
I0612 18:08:47.372411 6181 solver.cpp:406] Test net output #109: loss3/accuracy11 = 0.996
I0612 18:08:47.372421 6181 solver.cpp:406] Test net output #110: loss3/accuracy12 = 0.997
I0612 18:08:47.372433 6181 solver.cpp:406] Test net output #111: loss3/accuracy13 = 0.999
I0612 18:08:47.372444 6181 solver.cpp:406] Test net output #112: loss3/accuracy14 = 0.999
I0612 18:08:47.372457 6181 solver.cpp:406] Test net output #113: loss3/accuracy15 = 0.999
I0612 18:08:47.372467 6181 solver.cpp:406] Test net output #114: loss3/accuracy16 = 1
I0612 18:08:47.372489 6181 solver.cpp:406] Test net output #115: loss3/accuracy17 = 1
I0612 18:08:47.372503 6181 solver.cpp:406] Test net output #116: loss3/accuracy18 = 1
I0612 18:08:47.372514 6181 solver.cpp:406] Test net output #117: loss3/accuracy19 = 1
I0612 18:08:47.372525 6181 solver.cpp:406] Test net output #118: loss3/accuracy20 = 1
I0612 18:08:47.372536 6181 solver.cpp:406] Test net output #119: loss3/accuracy21 = 1
I0612 18:08:47.372547 6181 solver.cpp:406] Test net output #120: loss3/accuracy22 = 1
I0612 18:08:47.372558 6181 solver.cpp:406] Test net output #121: loss3/accuracy_incl_empty = 0.949636
I0612 18:08:47.372570 6181 solver.cpp:406] Test net output #122: loss3/accuracy_top3 = 0.921351
I0612 18:08:47.372584 6181 solver.cpp:406] Test net output #123: loss3/cross_entropy_loss = 0.695762 (* 1 = 0.695762 loss)
I0612 18:08:47.372598 6181 solver.cpp:406] Test net output #124: loss3/cross_entropy_loss_incl_empty = 0.231548 (* 1 = 0.231548 loss)
I0612 18:08:47.372612 6181 solver.cpp:406] Test net output #125: loss3/loss01 = 0.559659 (* 0.0909091 = 0.0508781 loss)
I0612 18:08:47.372627 6181 solver.cpp:406] Test net output #126: loss3/loss02 = 0.664192 (* 0.0909091 = 0.0603811 loss)
I0612 18:08:47.372640 6181 solver.cpp:406] Test net output #127: loss3/loss03 = 0.763308 (* 0.0909091 = 0.0693916 loss)
I0612 18:08:47.372654 6181 solver.cpp:406] Test net output #128: loss3/loss04 = 0.774106 (* 0.0909091 = 0.0703733 loss)
I0612 18:08:47.372668 6181 solver.cpp:406] Test net output #129: loss3/loss05 = 0.74839 (* 0.0909091 = 0.0680354 loss)
I0612 18:08:47.372681 6181 solver.cpp:406] Test net output #130: loss3/loss06 = 0.511492 (* 0.0909091 = 0.0464993 loss)
I0612 18:08:47.372695 6181 solver.cpp:406] Test net output #131: loss3/loss07 = 0.359306 (* 0.0909091 = 0.0326642 loss)
I0612 18:08:47.372709 6181 solver.cpp:406] Test net output #132: loss3/loss08 = 0.211222 (* 0.0909091 = 0.019202 loss)
I0612 18:08:47.372723 6181 solver.cpp:406] Test net output #133: loss3/loss09 = 0.115196 (* 0.0909091 = 0.0104723 loss)
I0612 18:08:47.372737 6181 solver.cpp:406] Test net output #134: loss3/loss10 = 0.064892 (* 0.0909091 = 0.00589927 loss)
I0612 18:08:47.372752 6181 solver.cpp:406] Test net output #135: loss3/loss11 = 0.0247325 (* 0.0909091 = 0.0022484 loss)
I0612 18:08:47.372766 6181 solver.cpp:406] Test net output #136: loss3/loss12 = 0.0151055 (* 0.0909091 = 0.00137322 loss)
I0612 18:08:47.372781 6181 solver.cpp:406] Test net output #137: loss3/loss13 = 0.00946909 (* 0.0909091 = 0.000860826 loss)
I0612 18:08:47.372793 6181 solver.cpp:406] Test net output #138: loss3/loss14 = 0.00646504 (* 0.0909091 = 0.00058773 loss)
I0612 18:08:47.372808 6181 solver.cpp:406] Test net output #139: loss3/loss15 = 0.00473794 (* 0.0909091 = 0.000430722 loss)
I0612 18:08:47.372822 6181 solver.cpp:406] Test net output #140: loss3/loss16 = 0.00370043 (* 0.0909091 = 0.000336403 loss)
I0612 18:08:47.372836 6181 solver.cpp:406] Test net output #141: loss3/loss17 = 0.00351792 (* 0.0909091 = 0.000319811 loss)
I0612 18:08:47.372850 6181 solver.cpp:406] Test net output #142: loss3/loss18 = 0.00316238 (* 0.0909091 = 0.000287489 loss)
I0612 18:08:47.372864 6181 solver.cpp:406] Test net output #143: loss3/loss19 = 0.00305448 (* 0.0909091 = 0.00027768 loss)
I0612 18:08:47.372879 6181 solver.cpp:406] Test net output #144: loss3/loss20 = 0.00264431 (* 0.0909091 = 0.000240392 loss)
I0612 18:08:47.372894 6181 solver.cpp:406] Test net output #145: loss3/loss21 = 0.00239143 (* 0.0909091 = 0.000217403 loss)
I0612 18:08:47.372907 6181 solver.cpp:406] Test net output #146: loss3/loss22 = 0.00173956 (* 0.0909091 = 0.000158142 loss)
I0612 18:08:47.372920 6181 solver.cpp:406] Test net output #147: total_accuracy = 0.577
I0612 18:08:47.372931 6181 solver.cpp:406] Test net output #148: total_accuracy_not_rec = 0.564
I0612 18:08:47.372943 6181 solver.cpp:406] Test net output #149: total_confidence = 0.470483
I0612 18:08:47.372963 6181 solver.cpp:406] Test net output #150: total_confidence_not_rec = 0.415947
I0612 18:08:47.372978 6181 solver.cpp:338] Iteration 15000, Testing net (#1)
I0612 18:09:44.967357 6181 solver.cpp:393] Test loss: 3.6945
I0612 18:09:44.967500 6181 solver.cpp:406] Test net output #0: loss1/accuracy = 0.564779
I0612 18:09:44.967522 6181 solver.cpp:406] Test net output #1: loss1/accuracy01 = 0.755
I0612 18:09:44.967536 6181 solver.cpp:406] Test net output #2: loss1/accuracy02 = 0.64
I0612 18:09:44.967550 6181 solver.cpp:406] Test net output #3: loss1/accuracy03 = 0.521
I0612 18:09:44.967562 6181 solver.cpp:406] Test net output #4: loss1/accuracy04 = 0.475
I0612 18:09:44.967576 6181 solver.cpp:406] Test net output #5: loss1/accuracy05 = 0.482
I0612 18:09:44.967588 6181 solver.cpp:406] Test net output #6: loss1/accuracy06 = 0.648
I0612 18:09:44.967600 6181 solver.cpp:406] Test net output #7: loss1/accuracy07 = 0.768
I0612 18:09:44.967613 6181 solver.cpp:406] Test net output #8: loss1/accuracy08 = 0.818
I0612 18:09:44.967625 6181 solver.cpp:406] Test net output #9: loss1/accuracy09 = 0.841
I0612 18:09:44.967638 6181 solver.cpp:406] Test net output #10: loss1/accuracy10 = 0.873
I0612 18:09:44.967651 6181 solver.cpp:406] Test net output #11: loss1/accuracy11 = 0.889
I0612 18:09:44.967664 6181 solver.cpp:406] Test net output #12: loss1/accuracy12 = 0.898
I0612 18:09:44.967676 6181 solver.cpp:406] Test net output #13: loss1/accuracy13 = 0.91
I0612 18:09:44.967687 6181 solver.cpp:406] Test net output #14: loss1/accuracy14 = 0.936
I0612 18:09:44.967700 6181 solver.cpp:406] Test net output #15: loss1/accuracy15 = 0.953
I0612 18:09:44.967711 6181 solver.cpp:406] Test net output #16: loss1/accuracy16 = 0.967
I0612 18:09:44.967723 6181 solver.cpp:406] Test net output #17: loss1/accuracy17 = 0.98
I0612 18:09:44.967736 6181 solver.cpp:406] Test net output #18: loss1/accuracy18 = 0.982
I0612 18:09:44.967747 6181 solver.cpp:406] Test net output #19: loss1/accuracy19 = 0.983
I0612 18:09:44.967758 6181 solver.cpp:406] Test net output #20: loss1/accuracy20 = 0.989
I0612 18:09:44.967770 6181 solver.cpp:406] Test net output #21: loss1/accuracy21 = 0.996
I0612 18:09:44.967782 6181 solver.cpp:406] Test net output #22: loss1/accuracy22 = 0.998
I0612 18:09:44.967794 6181 solver.cpp:406] Test net output #23: loss1/accuracy_incl_empty = 0.839184
I0612 18:09:44.967806 6181 solver.cpp:406] Test net output #24: loss1/accuracy_top3 = 0.78972
I0612 18:09:44.967823 6181 solver.cpp:406] Test net output #25: loss1/cross_entropy_loss = 1.60617 (* 0.3 = 0.48185 loss)
I0612 18:09:44.967838 6181 solver.cpp:406] Test net output #26: loss1/cross_entropy_loss_incl_empty = 0.603141 (* 0.3 = 0.180942 loss)
I0612 18:09:44.967852 6181 solver.cpp:406] Test net output #27: loss1/loss01 = 1.06278 (* 0.0272727 = 0.0289848 loss)
I0612 18:09:44.967866 6181 solver.cpp:406] Test net output #28: loss1/loss02 = 1.44483 (* 0.0272727 = 0.0394044 loss)
I0612 18:09:44.967880 6181 solver.cpp:406] Test net output #29: loss1/loss03 = 1.78224 (* 0.0272727 = 0.0486065 loss)
I0612 18:09:44.967893 6181 solver.cpp:406] Test net output #30: loss1/loss04 = 1.85591 (* 0.0272727 = 0.0506156 loss)
I0612 18:09:44.967907 6181 solver.cpp:406] Test net output #31: loss1/loss05 = 1.80107 (* 0.0272727 = 0.0491201 loss)
I0612 18:09:44.967921 6181 solver.cpp:406] Test net output #32: loss1/loss06 = 1.27099 (* 0.0272727 = 0.0346634 loss)
I0612 18:09:44.967936 6181 solver.cpp:406] Test net output #33: loss1/loss07 = 0.86608 (* 0.0272727 = 0.0236204 loss)
I0612 18:09:44.967948 6181 solver.cpp:406] Test net output #34: loss1/loss08 = 0.713433 (* 0.0272727 = 0.0194573 loss)
I0612 18:09:44.967962 6181 solver.cpp:406] Test net output #35: loss1/loss09 = 0.623042 (* 0.0272727 = 0.0169921 loss)
I0612 18:09:44.967977 6181 solver.cpp:406] Test net output #36: loss1/loss10 = 0.543189 (* 0.0272727 = 0.0148142 loss)
I0612 18:09:44.967990 6181 solver.cpp:406] Test net output #37: loss1/loss11 = 0.498367 (* 0.0272727 = 0.0135918 loss)
I0612 18:09:44.968004 6181 solver.cpp:406] Test net output #38: loss1/loss12 = 0.416038 (* 0.0272727 = 0.0113465 loss)
I0612 18:09:44.968037 6181 solver.cpp:406] Test net output #39: loss1/loss13 = 0.373396 (* 0.0272727 = 0.0101835 loss)
I0612 18:09:44.968052 6181 solver.cpp:406] Test net output #40: loss1/loss14 = 0.281009 (* 0.0272727 = 0.00766389 loss)
I0612 18:09:44.968066 6181 solver.cpp:406] Test net output #41: loss1/loss15 = 0.226555 (* 0.0272727 = 0.00617877 loss)
I0612 18:09:44.968080 6181 solver.cpp:406] Test net output #42: loss1/loss16 = 0.181462 (* 0.0272727 = 0.00494896 loss)
I0612 18:09:44.968094 6181 solver.cpp:406] Test net output #43: loss1/loss17 = 0.12996 (* 0.0272727 = 0.00354437 loss)
I0612 18:09:44.968108 6181 solver.cpp:406] Test net output #44: loss1/loss18 = 0.12046 (* 0.0272727 = 0.00328528 loss)
I0612 18:09:44.968122 6181 solver.cpp:406] Test net output #45: loss1/loss19 = 0.110261 (* 0.0272727 = 0.00300711 loss)
I0612 18:09:44.968137 6181 solver.cpp:406] Test net output #46: loss1/loss20 = 0.0725195 (* 0.0272727 = 0.00197781 loss)
I0612 18:09:44.968150 6181 solver.cpp:406] Test net output #47: loss1/loss21 = 0.0266989 (* 0.0272727 = 0.000728152 loss)
I0612 18:09:44.968165 6181 solver.cpp:406] Test net output #48: loss1/loss22 = 0.016326 (* 0.0272727 = 0.000445254 loss)
I0612 18:09:44.968178 6181 solver.cpp:406] Test net output #49: loss2/accuracy = 0.704277
I0612 18:09:44.968189 6181 solver.cpp:406] Test net output #50: loss2/accuracy01 = 0.851
I0612 18:09:44.968201 6181 solver.cpp:406] Test net output #51: loss2/accuracy02 = 0.822
I0612 18:09:44.968212 6181 solver.cpp:406] Test net output #52: loss2/accuracy03 = 0.726
I0612 18:09:44.968228 6181 solver.cpp:406] Test net output #53: loss2/accuracy04 = 0.64
I0612 18:09:44.968240 6181 solver.cpp:406] Test net output #54: loss2/accuracy05 = 0.63
I0612 18:09:44.968252 6181 solver.cpp:406] Test net output #55: loss2/accuracy06 = 0.726
I0612 18:09:44.968263 6181 solver.cpp:406] Test net output #56: loss2/accuracy07 = 0.807
I0612 18:09:44.968276 6181 solver.cpp:406] Test net output #57: loss2/accuracy08 = 0.822
I0612 18:09:44.968287 6181 solver.cpp:406] Test net output #58: loss2/accuracy09 = 0.864
I0612 18:09:44.968298 6181 solver.cpp:406] Test net output #59: loss2/accuracy10 = 0.882
I0612 18:09:44.968310 6181 solver.cpp:406] Test net output #60: loss2/accuracy11 = 0.895
I0612 18:09:44.968322 6181 solver.cpp:406] Test net output #61: loss2/accuracy12 = 0.917
I0612 18:09:44.968334 6181 solver.cpp:406] Test net output #62: loss2/accuracy13 = 0.921
I0612 18:09:44.968346 6181 solver.cpp:406] Test net output #63: loss2/accuracy14 = 0.938
I0612 18:09:44.968358 6181 solver.cpp:406] Test net output #64: loss2/accuracy15 = 0.952
I0612 18:09:44.968369 6181 solver.cpp:406] Test net output #65: loss2/accuracy16 = 0.967
I0612 18:09:44.968381 6181 solver.cpp:406] Test net output #66: loss2/accuracy17 = 0.981
I0612 18:09:44.968394 6181 solver.cpp:406] Test net output #67: loss2/accuracy18 = 0.982
I0612 18:09:44.968405 6181 solver.cpp:406] Test net output #68: loss2/accuracy19 = 0.983
I0612 18:09:44.968416 6181 solver.cpp:406] Test net output #69: loss2/accuracy20 = 0.989
I0612 18:09:44.968427 6181 solver.cpp:406] Test net output #70: loss2/accuracy21 = 0.996
I0612 18:09:44.968439 6181 solver.cpp:406] Test net output #71: loss2/accuracy22 = 0.998
I0612 18:09:44.968451 6181 solver.cpp:406] Test net output #72: loss2/accuracy_incl_empty = 0.881955
I0612 18:09:44.968463 6181 solver.cpp:406] Test net output #73: loss2/accuracy_top3 = 0.870185
I0612 18:09:44.968477 6181 solver.cpp:406] Test net output #74: loss2/cross_entropy_loss = 1.11251 (* 0.3 = 0.333754 loss)
I0612 18:09:44.968490 6181 solver.cpp:406] Test net output #75: loss2/cross_entropy_loss_incl_empty = 0.44478 (* 0.3 = 0.133434 loss)
I0612 18:09:44.968504 6181 solver.cpp:406] Test net output #76: loss2/loss01 = 0.718785 (* 0.0272727 = 0.0196032 loss)
I0612 18:09:44.968521 6181 solver.cpp:406] Test net output #77: loss2/loss02 = 0.834741 (* 0.0272727 = 0.0227657 loss)
I0612 18:09:44.968546 6181 solver.cpp:406] Test net output #78: loss2/loss03 = 1.13034 (* 0.0272727 = 0.0308274 loss)
I0612 18:09:44.968562 6181 solver.cpp:406] Test net output #79: loss2/loss04 = 1.26761 (* 0.0272727 = 0.0345713 loss)
I0612 18:09:44.968575 6181 solver.cpp:406] Test net output #80: loss2/loss05 = 1.28686 (* 0.0272727 = 0.035096 loss)
I0612 18:09:44.968590 6181 solver.cpp:406] Test net output #81: loss2/loss06 = 1.00907 (* 0.0272727 = 0.02752 loss)
I0612 18:09:44.968603 6181 solver.cpp:406] Test net output #82: loss2/loss07 = 0.724647 (* 0.0272727 = 0.0197631 loss)
I0612 18:09:44.968617 6181 solver.cpp:406] Test net output #83: loss2/loss08 = 0.624827 (* 0.0272727 = 0.0170407 loss)
I0612 18:09:44.968631 6181 solver.cpp:406] Test net output #84: loss2/loss09 = 0.534399 (* 0.0272727 = 0.0145745 loss)
I0612 18:09:44.968644 6181 solver.cpp:406] Test net output #85: loss2/loss10 = 0.474981 (* 0.0272727 = 0.012954 loss)
I0612 18:09:44.968658 6181 solver.cpp:406] Test net output #86: loss2/loss11 = 0.439028 (* 0.0272727 = 0.0119735 loss)
I0612 18:09:44.968672 6181 solver.cpp:406] Test net output #87: loss2/loss12 = 0.345102 (* 0.0272727 = 0.00941187 loss)
I0612 18:09:44.968685 6181 solver.cpp:406] Test net output #88: loss2/loss13 = 0.316133 (* 0.0272727 = 0.00862181 loss)
I0612 18:09:44.968699 6181 solver.cpp:406] Test net output #89: loss2/loss14 = 0.244952 (* 0.0272727 = 0.00668052 loss)
I0612 18:09:44.968713 6181 solver.cpp:406] Test net output #90: loss2/loss15 = 0.201451 (* 0.0272727 = 0.00549412 loss)
I0612 18:09:44.968727 6181 solver.cpp:406] Test net output #91: loss2/loss16 = 0.160518 (* 0.0272727 = 0.00437777 loss)
I0612 18:09:44.968740 6181 solver.cpp:406] Test net output #92: loss2/loss17 = 0.11208 (* 0.0272727 = 0.00305674 loss)
I0612 18:09:44.968755 6181 solver.cpp:406] Test net output #93: loss2/loss18 = 0.0997187 (* 0.0272727 = 0.0027196 loss)
I0612 18:09:44.968770 6181 solver.cpp:406] Test net output #94: loss2/loss19 = 0.0968107 (* 0.0272727 = 0.00264029 loss)
I0612 18:09:44.968783 6181 solver.cpp:406] Test net output #95: loss2/loss20 = 0.0628839 (* 0.0272727 = 0.00171502 loss)
I0612 18:09:44.968797 6181 solver.cpp:406] Test net output #96: loss2/loss21 = 0.0279995 (* 0.0272727 = 0.000763623 loss)
I0612 18:09:44.968811 6181 solver.cpp:406] Test net output #97: loss2/loss22 = 0.0169938 (* 0.0272727 = 0.000463468 loss)
I0612 18:09:44.968823 6181 solver.cpp:406] Test net output #98: loss3/accuracy = 0.8052
I0612 18:09:44.968835 6181 solver.cpp:406] Test net output #99: loss3/accuracy01 = 0.868
I0612 18:09:44.968847 6181 solver.cpp:406] Test net output #100: loss3/accuracy02 = 0.857
I0612 18:09:44.968859 6181 solver.cpp:406] Test net output #101: loss3/accuracy03 = 0.826
I0612 18:09:44.968870 6181 solver.cpp:406] Test net output #102: loss3/accuracy04 = 0.817
I0612 18:09:44.968883 6181 solver.cpp:406] Test net output #103: loss3/accuracy05 = 0.801
I0612 18:09:44.968894 6181 solver.cpp:406] Test net output #104: loss3/accuracy06 = 0.857
I0612 18:09:44.968905 6181 solver.cpp:406] Test net output #105: loss3/accuracy07 = 0.881
I0612 18:09:44.968917 6181 solver.cpp:406] Test net output #106: loss3/accuracy08 = 0.889
I0612 18:09:44.968930 6181 solver.cpp:406] Test net output #107: loss3/accuracy09 = 0.904
I0612 18:09:44.968940 6181 solver.cpp:406] Test net output #108: loss3/accuracy10 = 0.903
I0612 18:09:44.968952 6181 solver.cpp:406] Test net output #109: loss3/accuracy11 = 0.909
I0612 18:09:44.968964 6181 solver.cpp:406] Test net output #110: loss3/accuracy12 = 0.935
I0612 18:09:44.968976 6181 solver.cpp:406] Test net output #111: loss3/accuracy13 = 0.939
I0612 18:09:44.968987 6181 solver.cpp:406] Test net output #112: loss3/accuracy14 = 0.947
I0612 18:09:44.968999 6181 solver.cpp:406] Test net output #113: loss3/accuracy15 = 0.961
I0612 18:09:44.969012 6181 solver.cpp:406] Test net output #114: loss3/accuracy16 = 0.97
I0612 18:09:44.969033 6181 solver.cpp:406] Test net output #115: loss3/accuracy17 = 0.982
I0612 18:09:44.969045 6181 solver.cpp:406] Test net output #116: loss3/accuracy18 = 0.985
I0612 18:09:44.969058 6181 solver.cpp:406] Test net output #117: loss3/accuracy19 = 0.984
I0612 18:09:44.969069 6181 solver.cpp:406] Test net output #118: loss3/accuracy20 = 0.991
I0612 18:09:44.969080 6181 solver.cpp:406] Test net output #119: loss3/accuracy21 = 0.996
I0612 18:09:44.969092 6181 solver.cpp:406] Test net output #120: loss3/accuracy22 = 0.998
I0612 18:09:44.969105 6181 solver.cpp:406] Test net output #121: loss3/accuracy_incl_empty = 0.916819
I0612 18:09:44.969116 6181 solver.cpp:406] Test net output #122: loss3/accuracy_top3 = 0.907815
I0612 18:09:44.969130 6181 solver.cpp:406] Test net output #123: loss3/cross_entropy_loss = 0.816782 (* 1 = 0.816782 loss)
I0612 18:09:44.969144 6181 solver.cpp:406] Test net output #124: loss3/cross_entropy_loss_incl_empty = 0.342963 (* 1 = 0.342963 loss)
I0612 18:09:44.969158 6181 solver.cpp:406] Test net output #125: loss3/loss01 = 0.600968 (* 0.0909091 = 0.0546335 loss)
I0612 18:09:44.969172 6181 solver.cpp:406] Test net output #126: loss3/loss02 = 0.653549 (* 0.0909091 = 0.0594135 loss)
I0612 18:09:44.969187 6181 solver.cpp:406] Test net output #127: loss3/loss03 = 0.805906 (* 0.0909091 = 0.0732642 loss)
I0612 18:09:44.969200 6181 solver.cpp:406] Test net output #128: loss3/loss04 = 0.812895 (* 0.0909091 = 0.0738996 loss)
I0612 18:09:44.969213 6181 solver.cpp:406] Test net output #129: loss3/loss05 = 0.884039 (* 0.0909091 = 0.0803671 loss)
I0612 18:09:44.969228 6181 solver.cpp:406] Test net output #130: loss3/loss06 = 0.674181 (* 0.0909091 = 0.0612892 loss)
I0612 18:09:44.969241 6181 solver.cpp:406] Test net output #131: loss3/loss07 = 0.511696 (* 0.0909091 = 0.0465178 loss)
I0612 18:09:44.969255 6181 solver.cpp:406] Test net output #132: loss3/loss08 = 0.439279 (* 0.0909091 = 0.0399345 loss)
I0612 18:09:44.969269 6181 solver.cpp:406] Test net output #133: loss3/loss09 = 0.385553 (* 0.0909091 = 0.0350503 loss)
I0612 18:09:44.969300 6181 solver.cpp:406] Test net output #134: loss3/loss10 = 0.384861 (* 0.0909091 = 0.0349874 loss)
I0612 18:09:44.969316 6181 solver.cpp:406] Test net output #135: loss3/loss11 = 0.336428 (* 0.0909091 = 0.0305844 loss)
I0612 18:09:44.969331 6181 solver.cpp:406] Test net output #136: loss3/loss12 = 0.279825 (* 0.0909091 = 0.0254387 loss)
I0612 18:09:44.969343 6181 solver.cpp:406] Test net output #137: loss3/loss13 = 0.236419 (* 0.0909091 = 0.0214926 loss)
I0612 18:09:44.969357 6181 solver.cpp:406] Test net output #138: loss3/loss14 = 0.201799 (* 0.0909091 = 0.0183453 loss)
I0612 18:09:44.969372 6181 solver.cpp:406] Test net output #139: loss3/loss15 = 0.173616 (* 0.0909091 = 0.0157832 loss)
I0612 18:09:44.969385 6181 solver.cpp:406] Test net output #140: loss3/loss16 = 0.138313 (* 0.0909091 = 0.0125739 loss)
I0612 18:09:44.969399 6181 solver.cpp:406] Test net output #141: loss3/loss17 = 0.10725 (* 0.0909091 = 0.00974996 loss)
I0612 18:09:44.969413 6181 solver.cpp:406] Test net output #142: loss3/loss18 = 0.0950907 (* 0.0909091 = 0.00864461 loss)
I0612 18:09:44.969424 6181 solver.cpp:406] Test net output #143: loss3/loss19 = 0.0938393 (* 0.0909091 = 0.00853084 loss)
I0612 18:09:44.969439 6181 solver.cpp:406] Test net output #144: loss3/loss20 = 0.0550988 (* 0.0909091 = 0.00500898 loss)
I0612 18:09:44.969452 6181 solver.cpp:406] Test net output #145: loss3/loss21 = 0.0241048 (* 0.0909091 = 0.00219135 loss)
I0612 18:09:44.969467 6181 solver.cpp:406] Test net output #146: loss3/loss22 = 0.0139038 (* 0.0909091 = 0.00126398 loss)
I0612 18:09:44.969480 6181 solver.cpp:406] Test net output #147: total_accuracy = 0.493
I0612 18:09:44.969491 6181 solver.cpp:406] Test net output #148: total_accuracy_not_rec = 0.485
I0612 18:09:44.969502 6181 solver.cpp:406] Test net output #149: total_confidence = 0.409975
I0612 18:09:44.969526 6181 solver.cpp:406] Test net output #150: total_confidence_not_rec = 0.36584
I0612 18:09:45.327286 6181 solver.cpp:229] Iteration 15000, loss = 4.09618
I0612 18:09:45.327344 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.457627
I0612 18:09:45.327363 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.5
I0612 18:09:45.327376 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 18:09:45.327389 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 18:09:45.327402 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.25
I0612 18:09:45.327415 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 18:09:45.327427 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.625
I0612 18:09:45.327440 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.5
I0612 18:09:45.327452 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.625
I0612 18:09:45.327466 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 18:09:45.327477 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 18:09:45.327491 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 18:09:45.327503 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 18:09:45.327515 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 18:09:45.327527 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 18:09:45.327540 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 18:09:45.327553 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 18:09:45.327564 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 18:09:45.327576 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 18:09:45.327589 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 18:09:45.327600 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 18:09:45.327612 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 18:09:45.327625 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 18:09:45.327636 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.8125
I0612 18:09:45.327648 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.59322
I0612 18:09:45.327664 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.46255 (* 0.3 = 0.738765 loss)
I0612 18:09:45.327678 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.866456 (* 0.3 = 0.259937 loss)
I0612 18:09:45.327693 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.8578 (* 0.0272727 = 0.0506673 loss)
I0612 18:09:45.327708 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.76179 (* 0.0272727 = 0.0480489 loss)
I0612 18:09:45.327721 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.20754 (* 0.0272727 = 0.0602057 loss)
I0612 18:09:45.327736 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.8361 (* 0.0272727 = 0.0773482 loss)
I0612 18:09:45.327750 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 2.92585 (* 0.0272727 = 0.079796 loss)
I0612 18:09:45.327764 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.08387 (* 0.0272727 = 0.0568328 loss)
I0612 18:09:45.327778 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 2.20371 (* 0.0272727 = 0.0601013 loss)
I0612 18:09:45.327791 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 2.08138 (* 0.0272727 = 0.0567648 loss)
I0612 18:09:45.327805 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.71851 (* 0.0272727 = 0.0195957 loss)
I0612 18:09:45.327819 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.473826 (* 0.0272727 = 0.0129225 loss)
I0612 18:09:45.327834 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.538002 (* 0.0272727 = 0.0146728 loss)
I0612 18:09:45.327874 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.386748 (* 0.0272727 = 0.0105477 loss)
I0612 18:09:45.327889 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.373263 (* 0.0272727 = 0.0101799 loss)
I0612 18:09:45.327904 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.431839 (* 0.0272727 = 0.0117774 loss)
I0612 18:09:45.327919 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.468982 (* 0.0272727 = 0.0127904 loss)
I0612 18:09:45.327932 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.613533 (* 0.0272727 = 0.0167327 loss)
I0612 18:09:45.327946 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.000446808 (* 0.0272727 = 1.21857e-05 loss)
I0612 18:09:45.327960 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000139501 (* 0.0272727 = 3.80458e-06 loss)
I0612 18:09:45.327975 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 9.10958e-05 (* 0.0272727 = 2.48443e-06 loss)
I0612 18:09:45.327989 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 7.8755e-05 (* 0.0272727 = 2.14786e-06 loss)
I0612 18:09:45.328007 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000104377 (* 0.0272727 = 2.84664e-06 loss)
I0612 18:09:45.328022 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 4.33885e-05 (* 0.0272727 = 1.18332e-06 loss)
I0612 18:09:45.328035 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.440678
I0612 18:09:45.328048 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.5
I0612 18:09:45.328060 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.875
I0612 18:09:45.328073 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.75
I0612 18:09:45.328084 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.375
I0612 18:09:45.328096 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.375
I0612 18:09:45.328109 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.375
I0612 18:09:45.328120 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 18:09:45.328132 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.625
I0612 18:09:45.328145 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 18:09:45.328156 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 18:09:45.328168 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 18:09:45.328179 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 18:09:45.328191 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 18:09:45.328203 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 18:09:45.328215 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 18:09:45.328227 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 18:09:45.328238 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 18:09:45.328250 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 18:09:45.328263 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 18:09:45.328274 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 18:09:45.328285 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 18:09:45.328296 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 18:09:45.328308 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.8125
I0612 18:09:45.328320 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.627119
I0612 18:09:45.328335 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 2.11441 (* 0.3 = 0.634322 loss)
I0612 18:09:45.328348 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.752867 (* 0.3 = 0.22586 loss)
I0612 18:09:45.328373 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 2.64816 (* 0.0272727 = 0.0722225 loss)
I0612 18:09:45.328388 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.997305 (* 0.0272727 = 0.0271992 loss)
I0612 18:09:45.328403 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.35021 (* 0.0272727 = 0.0368239 loss)
I0612 18:09:45.328418 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 2.23205 (* 0.0272727 = 0.0608741 loss)
I0612 18:09:45.328431 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 2.37081 (* 0.0272727 = 0.0646584 loss)
I0612 18:09:45.328445 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.67246 (* 0.0272727 = 0.0456125 loss)
I0612 18:09:45.328460 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.41146 (* 0.0272727 = 0.0384943 loss)
I0612 18:09:45.328474 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 1.7739 (* 0.0272727 = 0.048379 loss)
I0612 18:09:45.328488 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.497518 (* 0.0272727 = 0.0135687 loss)
I0612 18:09:45.328502 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.70325 (* 0.0272727 = 0.0191795 loss)
I0612 18:09:45.328516 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.592568 (* 0.0272727 = 0.0161609 loss)
I0612 18:09:45.328531 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.343591 (* 0.0272727 = 0.00937067 loss)
I0612 18:09:45.328544 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.347867 (* 0.0272727 = 0.00948728 loss)
I0612 18:09:45.328558 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.503398 (* 0.0272727 = 0.013729 loss)
I0612 18:09:45.328573 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.2599 (* 0.0272727 = 0.00708819 loss)
I0612 18:09:45.328588 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.274377 (* 0.0272727 = 0.007483 loss)
I0612 18:09:45.328601 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.0265865 (* 0.0272727 = 0.000725086 loss)
I0612 18:09:45.328615 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.0158491 (* 0.0272727 = 0.000432249 loss)
I0612 18:09:45.328629 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.0182475 (* 0.0272727 = 0.000497659 loss)
I0612 18:09:45.328644 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.0166615 (* 0.0272727 = 0.000454405 loss)
I0612 18:09:45.328658 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.0115203 (* 0.0272727 = 0.00031419 loss)
I0612 18:09:45.328672 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.0186735 (* 0.0272727 = 0.000509278 loss)
I0612 18:09:45.328685 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.627119
I0612 18:09:45.328696 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.625
I0612 18:09:45.328708 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.75
I0612 18:09:45.328721 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 18:09:45.328732 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.625
I0612 18:09:45.328744 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 18:09:45.328757 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 18:09:45.328768 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 18:09:45.328780 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.75
I0612 18:09:45.328791 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 18:09:45.328804 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 18:09:45.328815 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 18:09:45.328827 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 18:09:45.328840 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 18:09:45.328861 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 18:09:45.328874 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 18:09:45.328886 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 0.875
I0612 18:09:45.328898 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 18:09:45.328910 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 18:09:45.328922 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 18:09:45.328933 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 18:09:45.328944 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 18:09:45.328956 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 18:09:45.328969 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.875
I0612 18:09:45.328980 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.745763
I0612 18:09:45.328994 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.51879 (* 1 = 1.51879 loss)
I0612 18:09:45.329008 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.529616 (* 1 = 0.529616 loss)
I0612 18:09:45.329022 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 1.3178 (* 0.0909091 = 0.1198 loss)
I0612 18:09:45.329036 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.85641 (* 0.0909091 = 0.0778555 loss)
I0612 18:09:45.329054 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.973821 (* 0.0909091 = 0.0885292 loss)
I0612 18:09:45.329069 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.55258 (* 0.0909091 = 0.141144 loss)
I0612 18:09:45.329083 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.1303 (* 0.0909091 = 0.102754 loss)
I0612 18:09:45.329097 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 1.39144 (* 0.0909091 = 0.126494 loss)
I0612 18:09:45.329111 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.11583 (* 0.0909091 = 0.101439 loss)
I0612 18:09:45.329125 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 1.42333 (* 0.0909091 = 0.129394 loss)
I0612 18:09:45.329139 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.560859 (* 0.0909091 = 0.0509872 loss)
I0612 18:09:45.329154 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.685356 (* 0.0909091 = 0.0623051 loss)
I0612 18:09:45.329169 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.59378 (* 0.0909091 = 0.05398 loss)
I0612 18:09:45.329182 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.189881 (* 0.0909091 = 0.0172619 loss)
I0612 18:09:45.329196 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.237836 (* 0.0909091 = 0.0216214 loss)
I0612 18:09:45.329210 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.154152 (* 0.0909091 = 0.0140138 loss)
I0612 18:09:45.329224 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.315462 (* 0.0909091 = 0.0286783 loss)
I0612 18:09:45.329238 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.289269 (* 0.0909091 = 0.0262971 loss)
I0612 18:09:45.329253 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0211569 (* 0.0909091 = 0.00192336 loss)
I0612 18:09:45.329267 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00894501 (* 0.0909091 = 0.000813182 loss)
I0612 18:09:45.329282 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00671682 (* 0.0909091 = 0.00061062 loss)
I0612 18:09:45.329295 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00808608 (* 0.0909091 = 0.000735098 loss)
I0612 18:09:45.329309 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00688457 (* 0.0909091 = 0.00062587 loss)
I0612 18:09:45.329341 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.0073548 (* 0.0909091 = 0.000668618 loss)
I0612 18:09:45.329367 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.25
I0612 18:09:45.329381 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.25
I0612 18:09:45.329393 6181 solver.cpp:245] Train net output #149: total_confidence = 0.334473
I0612 18:09:45.329404 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.374221
I0612 18:09:45.329417 6181 sgd_solver.cpp:106] Iteration 15000, lr = 0.001
I0612 18:09:46.459949 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 44.195 > 30) by scale factor 0.67881
I0612 18:10:59.744096 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.6455 > 30) by scale factor 0.948003
I0612 18:11:25.231271 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.2591 > 30) by scale factor 0.902009
I0612 18:12:53.966806 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.4218 > 30) by scale factor 0.986134
I0612 18:13:00.900626 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.4218 > 30) by scale factor 0.801671
I0612 18:14:04.186883 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.2681 > 30) by scale factor 0.929711
I0612 18:15:13.582032 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 44.5606 > 30) by scale factor 0.673241
I0612 18:16:11.027101 6181 solver.cpp:229] Iteration 15500, loss = 4.02686
I0612 18:16:11.027223 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.483871
I0612 18:16:11.027245 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 18:16:11.027259 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.5
I0612 18:16:11.027272 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.75
I0612 18:16:11.027286 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 18:16:11.027298 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.25
I0612 18:16:11.027312 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.25
I0612 18:16:11.027323 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.875
I0612 18:16:11.027336 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.75
I0612 18:16:11.027349 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.75
I0612 18:16:11.027362 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.75
I0612 18:16:11.027375 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 18:16:11.027387 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 18:16:11.027400 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 18:16:11.027412 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 18:16:11.027425 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 18:16:11.027437 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 18:16:11.027449 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 18:16:11.027462 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 18:16:11.027473 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 18:16:11.027487 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 18:16:11.027498 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 18:16:11.027509 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 18:16:11.027521 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.806818
I0612 18:16:11.027534 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.725806
I0612 18:16:11.027550 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.74589 (* 0.3 = 0.523768 loss)
I0612 18:16:11.027565 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.674864 (* 0.3 = 0.202459 loss)
I0612 18:16:11.027580 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.575338 (* 0.0272727 = 0.015691 loss)
I0612 18:16:11.027595 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.52985 (* 0.0272727 = 0.0417233 loss)
I0612 18:16:11.027609 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.37928 (* 0.0272727 = 0.0376168 loss)
I0612 18:16:11.027623 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.75556 (* 0.0272727 = 0.0478788 loss)
I0612 18:16:11.027637 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 2.43988 (* 0.0272727 = 0.0665423 loss)
I0612 18:16:11.027652 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.13721 (* 0.0272727 = 0.0582876 loss)
I0612 18:16:11.027667 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 0.844542 (* 0.0272727 = 0.023033 loss)
I0612 18:16:11.027679 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.710751 (* 0.0272727 = 0.0193841 loss)
I0612 18:16:11.027694 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.535129 (* 0.0272727 = 0.0145944 loss)
I0612 18:16:11.027709 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.724787 (* 0.0272727 = 0.0197669 loss)
I0612 18:16:11.027722 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.406041 (* 0.0272727 = 0.0110738 loss)
I0612 18:16:11.027737 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.408118 (* 0.0272727 = 0.0111305 loss)
I0612 18:16:11.027770 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.332359 (* 0.0272727 = 0.00906433 loss)
I0612 18:16:11.027786 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.401661 (* 0.0272727 = 0.0109544 loss)
I0612 18:16:11.027799 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.304243 (* 0.0272727 = 0.00829752 loss)
I0612 18:16:11.027814 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.258135 (* 0.0272727 = 0.00704005 loss)
I0612 18:16:11.027828 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.0705459 (* 0.0272727 = 0.00192398 loss)
I0612 18:16:11.027842 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.0123514 (* 0.0272727 = 0.000336857 loss)
I0612 18:16:11.027858 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00654021 (* 0.0272727 = 0.000178369 loss)
I0612 18:16:11.027871 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000136722 (* 0.0272727 = 3.72879e-06 loss)
I0612 18:16:11.027885 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 3.33449e-05 (* 0.0272727 = 9.09406e-07 loss)
I0612 18:16:11.027900 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 6.58645e-06 (* 0.0272727 = 1.7963e-07 loss)
I0612 18:16:11.027912 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.532258
I0612 18:16:11.027925 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.625
I0612 18:16:11.027937 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.5
I0612 18:16:11.027950 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.625
I0612 18:16:11.027961 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.5
I0612 18:16:11.027973 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 18:16:11.027986 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.375
I0612 18:16:11.027997 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 18:16:11.028008 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.75
I0612 18:16:11.028022 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 18:16:11.028033 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.75
I0612 18:16:11.028045 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 18:16:11.028056 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 18:16:11.028069 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 18:16:11.028080 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 18:16:11.028092 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 18:16:11.028105 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 18:16:11.028116 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 18:16:11.028127 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 18:16:11.028139 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 18:16:11.028151 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 18:16:11.028163 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 18:16:11.028175 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 18:16:11.028187 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.806818
I0612 18:16:11.028198 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.741935
I0612 18:16:11.028213 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.46311 (* 0.3 = 0.438932 loss)
I0612 18:16:11.028230 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.592689 (* 0.3 = 0.177807 loss)
I0612 18:16:11.028245 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.93858 (* 0.0272727 = 0.0255976 loss)
I0612 18:16:11.028259 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.55437 (* 0.0272727 = 0.0423919 loss)
I0612 18:16:11.028285 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.09058 (* 0.0272727 = 0.029743 loss)
I0612 18:16:11.028301 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.84616 (* 0.0272727 = 0.0503499 loss)
I0612 18:16:11.028314 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 0.963846 (* 0.0272727 = 0.0262867 loss)
I0612 18:16:11.028328 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.64965 (* 0.0272727 = 0.0449905 loss)
I0612 18:16:11.028342 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.828981 (* 0.0272727 = 0.0226086 loss)
I0612 18:16:11.028357 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.805179 (* 0.0272727 = 0.0219594 loss)
I0612 18:16:11.028370 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.703769 (* 0.0272727 = 0.0191937 loss)
I0612 18:16:11.028384 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 1.01477 (* 0.0272727 = 0.0276754 loss)
I0612 18:16:11.028399 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.278365 (* 0.0272727 = 0.00759177 loss)
I0612 18:16:11.028414 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.304151 (* 0.0272727 = 0.00829502 loss)
I0612 18:16:11.028427 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.0793845 (* 0.0272727 = 0.00216503 loss)
I0612 18:16:11.028441 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.318439 (* 0.0272727 = 0.00868469 loss)
I0612 18:16:11.028455 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.350729 (* 0.0272727 = 0.00956534 loss)
I0612 18:16:11.028470 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.163399 (* 0.0272727 = 0.00445635 loss)
I0612 18:16:11.028483 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.110996 (* 0.0272727 = 0.00302716 loss)
I0612 18:16:11.028497 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.00702545 (* 0.0272727 = 0.000191603 loss)
I0612 18:16:11.028512 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000899092 (* 0.0272727 = 2.45207e-05 loss)
I0612 18:16:11.028527 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.000697159 (* 0.0272727 = 1.90134e-05 loss)
I0612 18:16:11.028540 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000368822 (* 0.0272727 = 1.00588e-05 loss)
I0612 18:16:11.028554 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 7.67342e-05 (* 0.0272727 = 2.09275e-06 loss)
I0612 18:16:11.028566 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.693548
I0612 18:16:11.028578 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 18:16:11.028591 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.75
I0612 18:16:11.028604 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 18:16:11.028615 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 18:16:11.028627 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 18:16:11.028640 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 18:16:11.028651 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 18:16:11.028662 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 18:16:11.028674 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 18:16:11.028687 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 18:16:11.028698 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 18:16:11.028710 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 18:16:11.028722 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 18:16:11.028733 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 18:16:11.028744 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 18:16:11.028765 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 18:16:11.028779 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 18:16:11.028790 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 18:16:11.028802 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 18:16:11.028815 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 18:16:11.028825 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 18:16:11.028837 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 18:16:11.028849 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.886364
I0612 18:16:11.028861 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.903226
I0612 18:16:11.028875 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.899637 (* 1 = 0.899637 loss)
I0612 18:16:11.028889 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.344378 (* 1 = 0.344378 loss)
I0612 18:16:11.028903 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.614304 (* 0.0909091 = 0.0558458 loss)
I0612 18:16:11.028918 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 1.03699 (* 0.0909091 = 0.0942717 loss)
I0612 18:16:11.028931 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.582658 (* 0.0909091 = 0.0529689 loss)
I0612 18:16:11.028945 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.623603 (* 0.0909091 = 0.0566912 loss)
I0612 18:16:11.028960 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.439274 (* 0.0909091 = 0.039934 loss)
I0612 18:16:11.028975 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.957399 (* 0.0909091 = 0.0870362 loss)
I0612 18:16:11.028990 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.416814 (* 0.0909091 = 0.0378922 loss)
I0612 18:16:11.029005 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.361808 (* 0.0909091 = 0.0328916 loss)
I0612 18:16:11.029018 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.441805 (* 0.0909091 = 0.0401641 loss)
I0612 18:16:11.029032 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.476439 (* 0.0909091 = 0.0433127 loss)
I0612 18:16:11.029047 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.132941 (* 0.0909091 = 0.0120856 loss)
I0612 18:16:11.029062 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.104989 (* 0.0909091 = 0.00954442 loss)
I0612 18:16:11.029075 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.0984235 (* 0.0909091 = 0.00894759 loss)
I0612 18:16:11.029090 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.039474 (* 0.0909091 = 0.00358854 loss)
I0612 18:16:11.029104 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.111627 (* 0.0909091 = 0.0101479 loss)
I0612 18:16:11.029119 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.0405145 (* 0.0909091 = 0.00368314 loss)
I0612 18:16:11.029132 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.105589 (* 0.0909091 = 0.00959896 loss)
I0612 18:16:11.029147 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.0078161 (* 0.0909091 = 0.000710555 loss)
I0612 18:16:11.029161 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00221354 (* 0.0909091 = 0.00020123 loss)
I0612 18:16:11.029175 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00052144 (* 0.0909091 = 4.74036e-05 loss)
I0612 18:16:11.029189 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000201785 (* 0.0909091 = 1.83441e-05 loss)
I0612 18:16:11.029203 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000141352 (* 0.0909091 = 1.28502e-05 loss)
I0612 18:16:11.029217 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 18:16:11.029228 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.625
I0612 18:16:11.029249 6181 solver.cpp:245] Train net output #149: total_confidence = 0.407823
I0612 18:16:11.029263 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.323452
I0612 18:16:11.029289 6181 sgd_solver.cpp:106] Iteration 15500, lr = 0.001
I0612 18:18:23.360355 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.3662 > 30) by scale factor 0.899112
I0612 18:19:28.907802 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.4929 > 30) by scale factor 0.822079
I0612 18:19:59.776371 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.559 > 30) by scale factor 0.798743
I0612 18:20:35.273360 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.8265 > 30) by scale factor 0.942609
I0612 18:22:07.917279 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.6746 > 30) by scale factor 0.918144
I0612 18:22:36.865912 6181 solver.cpp:229] Iteration 16000, loss = 4.062
I0612 18:22:36.865974 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.590909
I0612 18:22:36.866003 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 18:22:36.866029 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.375
I0612 18:22:36.866052 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 18:22:36.866082 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 18:22:36.866103 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 18:22:36.866127 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.75
I0612 18:22:36.866152 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.875
I0612 18:22:36.866174 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 1
I0612 18:22:36.866196 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 1
I0612 18:22:36.866219 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 1
I0612 18:22:36.866242 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 1
I0612 18:22:36.866264 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 18:22:36.866288 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 18:22:36.866312 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 18:22:36.866333 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 18:22:36.866354 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 18:22:36.866376 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 18:22:36.866399 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 18:22:36.866420 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 18:22:36.866442 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 18:22:36.866466 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 18:22:36.866489 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 18:22:36.866511 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.880682
I0612 18:22:36.866533 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.886364
I0612 18:22:36.866562 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.21489 (* 0.3 = 0.364466 loss)
I0612 18:22:36.866590 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.354108 (* 0.3 = 0.106232 loss)
I0612 18:22:36.866616 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.957086 (* 0.0272727 = 0.0261023 loss)
I0612 18:22:36.866643 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.64838 (* 0.0272727 = 0.0449557 loss)
I0612 18:22:36.866669 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.34117 (* 0.0272727 = 0.0365774 loss)
I0612 18:22:36.866696 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.56767 (* 0.0272727 = 0.0427546 loss)
I0612 18:22:36.866724 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.34329 (* 0.0272727 = 0.0366352 loss)
I0612 18:22:36.866755 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 0.897393 (* 0.0272727 = 0.0244743 loss)
I0612 18:22:36.866783 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 0.345753 (* 0.0272727 = 0.00942964 loss)
I0612 18:22:36.866809 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.0310188 (* 0.0272727 = 0.000845966 loss)
I0612 18:22:36.866837 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.00802919 (* 0.0272727 = 0.000218978 loss)
I0612 18:22:36.866863 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.001623 (* 0.0272727 = 4.42637e-05 loss)
I0612 18:22:36.866889 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.000390745 (* 0.0272727 = 1.06567e-05 loss)
I0612 18:22:36.866915 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 8.36832e-05 (* 0.0272727 = 2.28227e-06 loss)
I0612 18:22:36.866978 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 6.63384e-05 (* 0.0272727 = 1.80923e-06 loss)
I0612 18:22:36.867007 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 6.77878e-05 (* 0.0272727 = 1.84876e-06 loss)
I0612 18:22:36.867036 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 2.00429e-05 (* 0.0272727 = 5.46626e-07 loss)
I0612 18:22:36.867063 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 2.80686e-05 (* 0.0272727 = 7.65507e-07 loss)
I0612 18:22:36.867089 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 1.12656e-05 (* 0.0272727 = 3.07242e-07 loss)
I0612 18:22:36.867117 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 9.984e-06 (* 0.0272727 = 2.72291e-07 loss)
I0612 18:22:36.867144 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 7.98715e-06 (* 0.0272727 = 2.17831e-07 loss)
I0612 18:22:36.867171 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 1.64366e-05 (* 0.0272727 = 4.4827e-07 loss)
I0612 18:22:36.867197 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 5.17076e-06 (* 0.0272727 = 1.41021e-07 loss)
I0612 18:22:36.867223 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 1.03416e-05 (* 0.0272727 = 2.82045e-07 loss)
I0612 18:22:36.867245 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.818182
I0612 18:22:36.867267 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 18:22:36.867290 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 18:22:36.867312 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.625
I0612 18:22:36.867333 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.75
I0612 18:22:36.867354 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.875
I0612 18:22:36.867377 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.75
I0612 18:22:36.867400 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.75
I0612 18:22:36.867422 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 1
I0612 18:22:36.867444 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 1
I0612 18:22:36.867465 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 1
I0612 18:22:36.867486 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 1
I0612 18:22:36.867506 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 18:22:36.867528 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 18:22:36.867550 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 18:22:36.867571 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 18:22:36.867593 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 18:22:36.867614 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 18:22:36.867635 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 18:22:36.867656 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 18:22:36.867677 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 18:22:36.867698 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 18:22:36.867719 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 18:22:36.867740 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.926136
I0612 18:22:36.867763 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.909091
I0612 18:22:36.867791 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 0.877743 (* 0.3 = 0.263323 loss)
I0612 18:22:36.867822 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.291086 (* 0.3 = 0.0873259 loss)
I0612 18:22:36.867851 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.614534 (* 0.0272727 = 0.01676 loss)
I0612 18:22:36.867892 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.726463 (* 0.0272727 = 0.0198126 loss)
I0612 18:22:36.867921 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.06742 (* 0.0272727 = 0.0291115 loss)
I0612 18:22:36.867949 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.33041 (* 0.0272727 = 0.036284 loss)
I0612 18:22:36.867975 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 0.726735 (* 0.0272727 = 0.01982 loss)
I0612 18:22:36.868001 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 0.915077 (* 0.0272727 = 0.0249567 loss)
I0612 18:22:36.868028 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.587226 (* 0.0272727 = 0.0160153 loss)
I0612 18:22:36.868055 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.148149 (* 0.0272727 = 0.00404043 loss)
I0612 18:22:36.868083 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.015827 (* 0.0272727 = 0.000431644 loss)
I0612 18:22:36.868110 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.00364076 (* 0.0272727 = 9.92936e-05 loss)
I0612 18:22:36.868137 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.00177464 (* 0.0272727 = 4.83993e-05 loss)
I0612 18:22:36.868168 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.00049201 (* 0.0272727 = 1.34185e-05 loss)
I0612 18:22:36.868196 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.000636614 (* 0.0272727 = 1.73622e-05 loss)
I0612 18:22:36.868221 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.00059458 (* 0.0272727 = 1.62158e-05 loss)
I0612 18:22:36.868248 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.000818849 (* 0.0272727 = 2.23322e-05 loss)
I0612 18:22:36.868276 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.00138875 (* 0.0272727 = 3.78751e-05 loss)
I0612 18:22:36.868304 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.000599812 (* 0.0272727 = 1.63585e-05 loss)
I0612 18:22:36.868330 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.000560439 (* 0.0272727 = 1.52847e-05 loss)
I0612 18:22:36.868358 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000247706 (* 0.0272727 = 6.75561e-06 loss)
I0612 18:22:36.868386 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.000336605 (* 0.0272727 = 9.18014e-06 loss)
I0612 18:22:36.868412 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.000243696 (* 0.0272727 = 6.64624e-06 loss)
I0612 18:22:36.868439 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000223158 (* 0.0272727 = 6.08613e-06 loss)
I0612 18:22:36.868463 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.886364
I0612 18:22:36.868484 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 1
I0612 18:22:36.868505 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 1
I0612 18:22:36.868526 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 1
I0612 18:22:36.868547 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.75
I0612 18:22:36.868571 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 18:22:36.868593 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 18:22:36.868615 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 1
I0612 18:22:36.868636 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 1
I0612 18:22:36.868659 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 1
I0612 18:22:36.868680 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 18:22:36.868700 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 18:22:36.868722 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 18:22:36.868743 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 18:22:36.868765 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 18:22:36.868787 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 18:22:36.868824 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 18:22:36.868849 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 18:22:36.868873 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 18:22:36.868896 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 18:22:36.868917 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 18:22:36.868939 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 18:22:36.868960 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 18:22:36.868981 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.965909
I0612 18:22:36.869004 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.931818
I0612 18:22:36.869029 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.380037 (* 1 = 0.380037 loss)
I0612 18:22:36.869055 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.129357 (* 1 = 0.129357 loss)
I0612 18:22:36.869083 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.130244 (* 0.0909091 = 0.0118403 loss)
I0612 18:22:36.869109 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.228569 (* 0.0909091 = 0.020779 loss)
I0612 18:22:36.869137 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.319494 (* 0.0909091 = 0.0290449 loss)
I0612 18:22:36.869163 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.606241 (* 0.0909091 = 0.0551128 loss)
I0612 18:22:36.869189 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.440008 (* 0.0909091 = 0.0400007 loss)
I0612 18:22:36.869217 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.414158 (* 0.0909091 = 0.0376507 loss)
I0612 18:22:36.869245 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.0585001 (* 0.0909091 = 0.00531819 loss)
I0612 18:22:36.869271 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.0654493 (* 0.0909091 = 0.00594994 loss)
I0612 18:22:36.869297 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.00954742 (* 0.0909091 = 0.000867947 loss)
I0612 18:22:36.869340 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.00571963 (* 0.0909091 = 0.000519967 loss)
I0612 18:22:36.869372 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.00453635 (* 0.0909091 = 0.000412396 loss)
I0612 18:22:36.869400 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.00128307 (* 0.0909091 = 0.000116643 loss)
I0612 18:22:36.869427 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.000618638 (* 0.0909091 = 5.62398e-05 loss)
I0612 18:22:36.869454 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.000694982 (* 0.0909091 = 6.31802e-05 loss)
I0612 18:22:36.869483 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.00043543 (* 0.0909091 = 3.95846e-05 loss)
I0612 18:22:36.869510 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.000407254 (* 0.0909091 = 3.70231e-05 loss)
I0612 18:22:36.869537 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.000515208 (* 0.0909091 = 4.68371e-05 loss)
I0612 18:22:36.869563 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00041404 (* 0.0909091 = 3.764e-05 loss)
I0612 18:22:36.869591 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.000480668 (* 0.0909091 = 4.36971e-05 loss)
I0612 18:22:36.869618 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000461944 (* 0.0909091 = 4.19949e-05 loss)
I0612 18:22:36.869645 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000463263 (* 0.0909091 = 4.21148e-05 loss)
I0612 18:22:36.869673 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.000647508 (* 0.0909091 = 5.88644e-05 loss)
I0612 18:22:36.869695 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.625
I0612 18:22:36.869734 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.75
I0612 18:22:36.869758 6181 solver.cpp:245] Train net output #149: total_confidence = 0.432845
I0612 18:22:36.869781 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.422791
I0612 18:22:36.869804 6181 sgd_solver.cpp:106] Iteration 16000, lr = 0.001
I0612 18:24:32.117497 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 56.8247 > 30) by scale factor 0.527939
I0612 18:25:19.933305 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 39.6536 > 30) by scale factor 0.756552
I0612 18:26:24.726311 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.3667 > 30) by scale factor 0.956427
I0612 18:27:12.599314 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.9765 > 30) by scale factor 0.769695
I0612 18:28:02.750336 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.3099 > 30) by scale factor 0.958164
I0612 18:29:02.534394 6181 solver.cpp:229] Iteration 16500, loss = 3.89656
I0612 18:29:02.534528 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.413793
I0612 18:29:02.534559 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.625
I0612 18:29:02.534582 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.5
I0612 18:29:02.534605 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 18:29:02.534627 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.375
I0612 18:29:02.534651 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.25
I0612 18:29:02.534673 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.375
I0612 18:29:02.534693 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.5
I0612 18:29:02.534716 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 18:29:02.534737 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 1
I0612 18:29:02.534760 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 18:29:02.534782 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 18:29:02.534806 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 18:29:02.534832 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 18:29:02.534853 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 18:29:02.534876 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 18:29:02.534899 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 18:29:02.534921 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 18:29:02.534945 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 18:29:02.534966 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 18:29:02.534988 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 18:29:02.535010 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 18:29:02.535033 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 18:29:02.535058 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.795455
I0612 18:29:02.535081 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.637931
I0612 18:29:02.535111 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.0178 (* 0.3 = 0.60534 loss)
I0612 18:29:02.535140 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.730611 (* 0.3 = 0.219183 loss)
I0612 18:29:02.535167 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.28959 (* 0.0272727 = 0.0351706 loss)
I0612 18:29:02.535194 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.9843 (* 0.0272727 = 0.0541174 loss)
I0612 18:29:02.535224 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.47601 (* 0.0272727 = 0.0402547 loss)
I0612 18:29:02.535253 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.14902 (* 0.0272727 = 0.0586096 loss)
I0612 18:29:02.535280 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 2.50918 (* 0.0272727 = 0.0684323 loss)
I0612 18:29:02.535305 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.35806 (* 0.0272727 = 0.0643108 loss)
I0612 18:29:02.535328 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.48383 (* 0.0272727 = 0.0404681 loss)
I0612 18:29:02.535353 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.61999 (* 0.0272727 = 0.0169088 loss)
I0612 18:29:02.535378 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.181023 (* 0.0272727 = 0.004937 loss)
I0612 18:29:02.535408 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.311763 (* 0.0272727 = 0.00850262 loss)
I0612 18:29:02.535436 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.346798 (* 0.0272727 = 0.00945812 loss)
I0612 18:29:02.535465 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.278538 (* 0.0272727 = 0.00759649 loss)
I0612 18:29:02.535516 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.333795 (* 0.0272727 = 0.0091035 loss)
I0612 18:29:02.535548 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.355602 (* 0.0272727 = 0.00969824 loss)
I0612 18:29:02.535575 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.326541 (* 0.0272727 = 0.00890566 loss)
I0612 18:29:02.535603 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.453008 (* 0.0272727 = 0.0123548 loss)
I0612 18:29:02.535630 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.0028083 (* 0.0272727 = 7.65901e-05 loss)
I0612 18:29:02.535657 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000942933 (* 0.0272727 = 2.57164e-05 loss)
I0612 18:29:02.535684 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000650044 (* 0.0272727 = 1.77285e-05 loss)
I0612 18:29:02.535712 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000532237 (* 0.0272727 = 1.45156e-05 loss)
I0612 18:29:02.535737 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000724115 (* 0.0272727 = 1.97486e-05 loss)
I0612 18:29:02.535763 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000726776 (* 0.0272727 = 1.98212e-05 loss)
I0612 18:29:02.535785 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.551724
I0612 18:29:02.535809 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 18:29:02.535831 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 18:29:02.535853 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.875
I0612 18:29:02.535876 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.5
I0612 18:29:02.535898 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.375
I0612 18:29:02.535922 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.375
I0612 18:29:02.535943 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.5
I0612 18:29:02.535967 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 18:29:02.535989 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 18:29:02.536011 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 18:29:02.536032 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 18:29:02.536054 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 18:29:02.536077 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 18:29:02.536098 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 18:29:02.536120 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 18:29:02.536141 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 18:29:02.536164 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 18:29:02.536185 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 18:29:02.536206 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 18:29:02.536227 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 18:29:02.536249 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 18:29:02.536270 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 18:29:02.536298 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.852273
I0612 18:29:02.536320 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.793103
I0612 18:29:02.536346 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.47623 (* 0.3 = 0.442868 loss)
I0612 18:29:02.536372 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.537138 (* 0.3 = 0.161141 loss)
I0612 18:29:02.536398 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.26566 (* 0.0272727 = 0.0345181 loss)
I0612 18:29:02.536425 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.822747 (* 0.0272727 = 0.0224386 loss)
I0612 18:29:02.536468 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.749918 (* 0.0272727 = 0.0204523 loss)
I0612 18:29:02.536496 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.66592 (* 0.0272727 = 0.0454342 loss)
I0612 18:29:02.536522 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.74194 (* 0.0272727 = 0.0475073 loss)
I0612 18:29:02.536548 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.98846 (* 0.0272727 = 0.0542307 loss)
I0612 18:29:02.536576 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.53408 (* 0.0272727 = 0.0418385 loss)
I0612 18:29:02.536607 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.480674 (* 0.0272727 = 0.0131093 loss)
I0612 18:29:02.536636 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.241038 (* 0.0272727 = 0.00657376 loss)
I0612 18:29:02.536664 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.337163 (* 0.0272727 = 0.00919535 loss)
I0612 18:29:02.536691 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.442471 (* 0.0272727 = 0.0120674 loss)
I0612 18:29:02.536718 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.283213 (* 0.0272727 = 0.00772399 loss)
I0612 18:29:02.536746 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.25674 (* 0.0272727 = 0.007002 loss)
I0612 18:29:02.536773 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.343662 (* 0.0272727 = 0.00937261 loss)
I0612 18:29:02.536801 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.240978 (* 0.0272727 = 0.00657212 loss)
I0612 18:29:02.536828 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.340981 (* 0.0272727 = 0.0092995 loss)
I0612 18:29:02.536855 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.0125284 (* 0.0272727 = 0.000341683 loss)
I0612 18:29:02.536882 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.00911089 (* 0.0272727 = 0.000248479 loss)
I0612 18:29:02.536908 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00493161 (* 0.0272727 = 0.000134499 loss)
I0612 18:29:02.536936 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00623968 (* 0.0272727 = 0.000170173 loss)
I0612 18:29:02.536962 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.00574727 (* 0.0272727 = 0.000156744 loss)
I0612 18:29:02.536988 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.00897504 (* 0.0272727 = 0.000244774 loss)
I0612 18:29:02.537010 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.758621
I0612 18:29:02.537034 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 18:29:02.537056 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 18:29:02.537078 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 1
I0612 18:29:02.537101 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 18:29:02.537122 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 18:29:02.537144 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 18:29:02.537166 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.625
I0612 18:29:02.537189 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 18:29:02.537210 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 18:29:02.537232 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 18:29:02.537253 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 18:29:02.537276 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 18:29:02.537297 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 18:29:02.537335 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.875
I0612 18:29:02.537360 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 18:29:02.537400 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 0.875
I0612 18:29:02.537423 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 18:29:02.537446 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 18:29:02.537467 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 18:29:02.537489 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 18:29:02.537510 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 18:29:02.537531 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 18:29:02.537554 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.914773
I0612 18:29:02.537577 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.913793
I0612 18:29:02.537600 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.818674 (* 1 = 0.818674 loss)
I0612 18:29:02.537626 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.287487 (* 1 = 0.287487 loss)
I0612 18:29:02.537658 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.807359 (* 0.0909091 = 0.0733963 loss)
I0612 18:29:02.537685 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.352354 (* 0.0909091 = 0.0320322 loss)
I0612 18:29:02.537713 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.1738 (* 0.0909091 = 0.0158 loss)
I0612 18:29:02.537739 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.544677 (* 0.0909091 = 0.0495161 loss)
I0612 18:29:02.537765 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.358975 (* 0.0909091 = 0.0326341 loss)
I0612 18:29:02.537791 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 1.01912 (* 0.0909091 = 0.0926469 loss)
I0612 18:29:02.537816 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.5946 (* 0.0909091 = 0.144963 loss)
I0612 18:29:02.537842 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.737275 (* 0.0909091 = 0.067025 loss)
I0612 18:29:02.537868 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.325327 (* 0.0909091 = 0.0295752 loss)
I0612 18:29:02.537894 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.199862 (* 0.0909091 = 0.0181693 loss)
I0612 18:29:02.537920 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.156796 (* 0.0909091 = 0.0142542 loss)
I0612 18:29:02.537946 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.13677 (* 0.0909091 = 0.0124336 loss)
I0612 18:29:02.537972 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.230223 (* 0.0909091 = 0.0209294 loss)
I0612 18:29:02.537998 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.279915 (* 0.0909091 = 0.0254468 loss)
I0612 18:29:02.538024 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.194744 (* 0.0909091 = 0.017704 loss)
I0612 18:29:02.538050 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.235007 (* 0.0909091 = 0.0213643 loss)
I0612 18:29:02.538076 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.00402402 (* 0.0909091 = 0.00036582 loss)
I0612 18:29:02.538105 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.000521491 (* 0.0909091 = 4.74083e-05 loss)
I0612 18:29:02.538131 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.000440424 (* 0.0909091 = 4.00385e-05 loss)
I0612 18:29:02.538158 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000190131 (* 0.0909091 = 1.72847e-05 loss)
I0612 18:29:02.538185 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000218981 (* 0.0909091 = 1.99074e-05 loss)
I0612 18:29:02.538213 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 3.28521e-05 (* 0.0909091 = 2.98656e-06 loss)
I0612 18:29:02.538235 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 18:29:02.538257 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.25
I0612 18:29:02.538295 6181 solver.cpp:245] Train net output #149: total_confidence = 0.265214
I0612 18:29:02.538318 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.288598
I0612 18:29:02.538341 6181 sgd_solver.cpp:106] Iteration 16500, lr = 0.001
I0612 18:29:18.336087 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.8413 > 30) by scale factor 0.792784
I0612 18:29:57.675048 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 48.9971 > 30) by scale factor 0.612281
I0612 18:30:57.806818 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.2032 > 30) by scale factor 0.806382
I0612 18:31:18.634884 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 47.7176 > 30) by scale factor 0.628698
I0612 18:31:49.456324 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.5627 > 30) by scale factor 0.84358
I0612 18:32:01.033839 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.4449 > 30) by scale factor 0.924644
I0612 18:32:13.378605 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 41.1712 > 30) by scale factor 0.728665
I0612 18:32:44.267009 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.3949 > 30) by scale factor 0.872223
I0612 18:33:56.714341 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 43.2691 > 30) by scale factor 0.693335
I0612 18:35:07.738204 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.2364 > 30) by scale factor 0.960419
I0612 18:35:20.071471 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.2827 > 30) by scale factor 0.958998
I0612 18:35:28.193166 6181 solver.cpp:229] Iteration 17000, loss = 3.81769
I0612 18:35:28.193227 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.467742
I0612 18:35:28.193245 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.625
I0612 18:35:28.193259 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.25
I0612 18:35:28.193272 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 18:35:28.193285 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.625
I0612 18:35:28.193297 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.25
I0612 18:35:28.193310 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.25
I0612 18:35:28.193338 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.375
I0612 18:35:28.193356 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.625
I0612 18:35:28.193368 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 18:35:28.193382 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 18:35:28.193394 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 18:35:28.193406 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 18:35:28.193419 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 18:35:28.193431 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 18:35:28.193444 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 18:35:28.193456 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 18:35:28.193469 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 18:35:28.193480 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 18:35:28.193491 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 18:35:28.193503 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 18:35:28.193516 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 18:35:28.193526 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 18:35:28.193538 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.795455
I0612 18:35:28.193550 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.645161
I0612 18:35:28.193567 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.81064 (* 0.3 = 0.843193 loss)
I0612 18:35:28.193581 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 1.04188 (* 0.3 = 0.312563 loss)
I0612 18:35:28.193595 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.56729 (* 0.0272727 = 0.0427443 loss)
I0612 18:35:28.193610 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 2.89896 (* 0.0272727 = 0.0790625 loss)
I0612 18:35:28.193624 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.28404 (* 0.0272727 = 0.0622921 loss)
I0612 18:35:28.193639 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.2993 (* 0.0272727 = 0.0627082 loss)
I0612 18:35:28.193655 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 3.4709 (* 0.0272727 = 0.0946609 loss)
I0612 18:35:28.193670 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 3.87081 (* 0.0272727 = 0.105567 loss)
I0612 18:35:28.193684 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 3.9212 (* 0.0272727 = 0.106942 loss)
I0612 18:35:28.193699 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 3.72174 (* 0.0272727 = 0.101502 loss)
I0612 18:35:28.193713 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.539758 (* 0.0272727 = 0.0147207 loss)
I0612 18:35:28.193728 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.270771 (* 0.0272727 = 0.00738465 loss)
I0612 18:35:28.193742 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.246036 (* 0.0272727 = 0.00671008 loss)
I0612 18:35:28.193786 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.340848 (* 0.0272727 = 0.00929587 loss)
I0612 18:35:28.193802 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.322867 (* 0.0272727 = 0.00880547 loss)
I0612 18:35:28.193817 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.216154 (* 0.0272727 = 0.0058951 loss)
I0612 18:35:28.193831 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.220073 (* 0.0272727 = 0.006002 loss)
I0612 18:35:28.193845 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.0905821 (* 0.0272727 = 0.00247042 loss)
I0612 18:35:28.193861 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.000550794 (* 0.0272727 = 1.50217e-05 loss)
I0612 18:35:28.193874 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000213008 (* 0.0272727 = 5.80932e-06 loss)
I0612 18:35:28.193888 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 5.18144e-05 (* 0.0272727 = 1.41312e-06 loss)
I0612 18:35:28.193903 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 5.48706e-05 (* 0.0272727 = 1.49647e-06 loss)
I0612 18:35:28.193917 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 5.7512e-05 (* 0.0272727 = 1.56851e-06 loss)
I0612 18:35:28.193931 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 2.53031e-05 (* 0.0272727 = 6.90085e-07 loss)
I0612 18:35:28.193943 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.612903
I0612 18:35:28.193955 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 18:35:28.193969 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.625
I0612 18:35:28.193980 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.625
I0612 18:35:28.193992 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.625
I0612 18:35:28.194005 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.25
I0612 18:35:28.194015 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.5
I0612 18:35:28.194028 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.5
I0612 18:35:28.194039 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.625
I0612 18:35:28.194051 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 18:35:28.194063 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 1
I0612 18:35:28.194075 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 18:35:28.194087 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 18:35:28.194100 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 18:35:28.194113 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 18:35:28.194126 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 18:35:28.194138 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 18:35:28.194150 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 18:35:28.194161 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 18:35:28.194174 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 18:35:28.194185 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 18:35:28.194196 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 18:35:28.194207 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 18:35:28.194219 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.857955
I0612 18:35:28.194231 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.758065
I0612 18:35:28.194245 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.84709 (* 0.3 = 0.554128 loss)
I0612 18:35:28.194258 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.669716 (* 0.3 = 0.200915 loss)
I0612 18:35:28.194284 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.955497 (* 0.0272727 = 0.026059 loss)
I0612 18:35:28.194299 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.9567 (* 0.0272727 = 0.0533646 loss)
I0612 18:35:28.194314 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.92289 (* 0.0272727 = 0.0524425 loss)
I0612 18:35:28.194327 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 2.07766 (* 0.0272727 = 0.0566633 loss)
I0612 18:35:28.194341 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 2.90425 (* 0.0272727 = 0.0792069 loss)
I0612 18:35:28.194355 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 2.43572 (* 0.0272727 = 0.0664288 loss)
I0612 18:35:28.194370 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 2.31104 (* 0.0272727 = 0.0630283 loss)
I0612 18:35:28.194383 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 2.59402 (* 0.0272727 = 0.070746 loss)
I0612 18:35:28.194397 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.531228 (* 0.0272727 = 0.014488 loss)
I0612 18:35:28.194411 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.144902 (* 0.0272727 = 0.00395187 loss)
I0612 18:35:28.194425 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.349505 (* 0.0272727 = 0.00953195 loss)
I0612 18:35:28.194439 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.376711 (* 0.0272727 = 0.0102739 loss)
I0612 18:35:28.194454 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.270067 (* 0.0272727 = 0.00736546 loss)
I0612 18:35:28.194468 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.211585 (* 0.0272727 = 0.00577049 loss)
I0612 18:35:28.194483 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.227311 (* 0.0272727 = 0.0061994 loss)
I0612 18:35:28.194496 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.0292414 (* 0.0272727 = 0.000797494 loss)
I0612 18:35:28.194510 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.00148254 (* 0.0272727 = 4.04329e-05 loss)
I0612 18:35:28.194525 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.000163884 (* 0.0272727 = 4.46956e-06 loss)
I0612 18:35:28.194540 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000425161 (* 0.0272727 = 1.15953e-05 loss)
I0612 18:35:28.194553 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.000212112 (* 0.0272727 = 5.78486e-06 loss)
I0612 18:35:28.194567 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 6.73234e-05 (* 0.0272727 = 1.83609e-06 loss)
I0612 18:35:28.194581 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000125212 (* 0.0272727 = 3.41488e-06 loss)
I0612 18:35:28.194593 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.758065
I0612 18:35:28.194605 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 18:35:28.194617 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.75
I0612 18:35:28.194629 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 18:35:28.194641 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 18:35:28.194653 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 18:35:28.194665 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 18:35:28.194677 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 18:35:28.194689 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.625
I0612 18:35:28.194701 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 18:35:28.194716 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 18:35:28.194728 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 18:35:28.194741 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 18:35:28.194752 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 18:35:28.194774 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.875
I0612 18:35:28.194787 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 18:35:28.194799 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 18:35:28.194811 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 18:35:28.194823 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 18:35:28.194834 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 18:35:28.194846 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 18:35:28.194857 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 18:35:28.194869 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 18:35:28.194880 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.903409
I0612 18:35:28.194892 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.806452
I0612 18:35:28.194906 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.24666 (* 1 = 1.24666 loss)
I0612 18:35:28.194921 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.478914 (* 1 = 0.478914 loss)
I0612 18:35:28.194934 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.838681 (* 0.0909091 = 0.0762438 loss)
I0612 18:35:28.194948 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 1.73402 (* 0.0909091 = 0.157638 loss)
I0612 18:35:28.194962 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 1.06473 (* 0.0909091 = 0.0967939 loss)
I0612 18:35:28.194977 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.18443 (* 0.0909091 = 0.107676 loss)
I0612 18:35:28.194990 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.79137 (* 0.0909091 = 0.162852 loss)
I0612 18:35:28.195003 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 1.71601 (* 0.0909091 = 0.156001 loss)
I0612 18:35:28.195017 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.76635 (* 0.0909091 = 0.160578 loss)
I0612 18:35:28.195031 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 1.88642 (* 0.0909091 = 0.171493 loss)
I0612 18:35:28.195045 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.558472 (* 0.0909091 = 0.0507701 loss)
I0612 18:35:28.195060 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.138329 (* 0.0909091 = 0.0125754 loss)
I0612 18:35:28.195073 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.320482 (* 0.0909091 = 0.0291347 loss)
I0612 18:35:28.195087 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.503806 (* 0.0909091 = 0.0458006 loss)
I0612 18:35:28.195101 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.207082 (* 0.0909091 = 0.0188257 loss)
I0612 18:35:28.195116 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.201868 (* 0.0909091 = 0.0183516 loss)
I0612 18:35:28.195125 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.165726 (* 0.0909091 = 0.015066 loss)
I0612 18:35:28.195135 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.0657991 (* 0.0909091 = 0.00598174 loss)
I0612 18:35:28.195152 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.00180746 (* 0.0909091 = 0.000164314 loss)
I0612 18:35:28.195168 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.000775673 (* 0.0909091 = 7.05158e-05 loss)
I0612 18:35:28.195181 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.000330088 (* 0.0909091 = 3.0008e-05 loss)
I0612 18:35:28.195196 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00017242 (* 0.0909091 = 1.56746e-05 loss)
I0612 18:35:28.195210 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000149492 (* 0.0909091 = 1.35902e-05 loss)
I0612 18:35:28.195225 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 8.98231e-05 (* 0.0909091 = 8.16573e-06 loss)
I0612 18:35:28.195245 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 18:35:28.195260 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 18:35:28.195271 6181 solver.cpp:245] Train net output #149: total_confidence = 0.381961
I0612 18:35:28.195283 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.349924
I0612 18:35:28.195296 6181 sgd_solver.cpp:106] Iteration 17000, lr = 0.001
I0612 18:36:24.831917 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.804 > 30) by scale factor 0.837896
I0612 18:36:28.686035 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 39.4517 > 30) by scale factor 0.760424
I0612 18:40:40.075757 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 42.0971 > 30) by scale factor 0.712639
I0612 18:40:41.629770 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 44.327 > 30) by scale factor 0.676789
I0612 18:41:25.587493 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 57.6889 > 30) by scale factor 0.520031
I0612 18:41:36.378449 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.1932 > 30) by scale factor 0.931875
I0612 18:41:47.168915 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.0459 > 30) by scale factor 0.907828
I0612 18:41:53.741605 6181 solver.cpp:229] Iteration 17500, loss = 3.93235
I0612 18:41:53.741664 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.457627
I0612 18:41:53.741683 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.5
I0612 18:41:53.741698 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.75
I0612 18:41:53.741710 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.375
I0612 18:41:53.741724 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 18:41:53.741736 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.25
I0612 18:41:53.741750 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.375
I0612 18:41:53.741762 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 18:41:53.741775 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 18:41:53.741787 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 1
I0612 18:41:53.741801 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 18:41:53.741813 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 18:41:53.741827 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 18:41:53.741838 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 18:41:53.741850 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 18:41:53.741863 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 18:41:53.741875 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 18:41:53.741888 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 0.875
I0612 18:41:53.741899 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 0.875
I0612 18:41:53.741911 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 18:41:53.741924 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 18:41:53.741935 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 18:41:53.741947 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 18:41:53.741958 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.795455
I0612 18:41:53.741971 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.694915
I0612 18:41:53.741987 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.44513 (* 0.3 = 0.733538 loss)
I0612 18:41:53.742002 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.896251 (* 0.3 = 0.268875 loss)
I0612 18:41:53.742017 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 2.42468 (* 0.0272727 = 0.0661276 loss)
I0612 18:41:53.742032 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 2.12753 (* 0.0272727 = 0.0580236 loss)
I0612 18:41:53.742045 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 3.24451 (* 0.0272727 = 0.0884866 loss)
I0612 18:41:53.742059 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.74356 (* 0.0272727 = 0.0748245 loss)
I0612 18:41:53.742074 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 2.17477 (* 0.0272727 = 0.0593119 loss)
I0612 18:41:53.742089 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.36846 (* 0.0272727 = 0.0645944 loss)
I0612 18:41:53.742105 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.10554 (* 0.0272727 = 0.0301511 loss)
I0612 18:41:53.742120 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.55935 (* 0.0272727 = 0.015255 loss)
I0612 18:41:53.742136 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.239676 (* 0.0272727 = 0.00653662 loss)
I0612 18:41:53.742149 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.387006 (* 0.0272727 = 0.0105547 loss)
I0612 18:41:53.742197 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.319946 (* 0.0272727 = 0.00872579 loss)
I0612 18:41:53.742213 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.35812 (* 0.0272727 = 0.00976691 loss)
I0612 18:41:53.742226 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.358327 (* 0.0272727 = 0.00977254 loss)
I0612 18:41:53.742240 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.347615 (* 0.0272727 = 0.0094804 loss)
I0612 18:41:53.742255 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.430372 (* 0.0272727 = 0.0117374 loss)
I0612 18:41:53.742269 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.35431 (* 0.0272727 = 0.009663 loss)
I0612 18:41:53.742282 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.59682 (* 0.0272727 = 0.0162769 loss)
I0612 18:41:53.742297 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.561413 (* 0.0272727 = 0.0153113 loss)
I0612 18:41:53.742313 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.0182514 (* 0.0272727 = 0.000497766 loss)
I0612 18:41:53.742327 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.00518675 (* 0.0272727 = 0.000141457 loss)
I0612 18:41:53.742341 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.0013888 (* 0.0272727 = 3.78764e-05 loss)
I0612 18:41:53.742357 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000563869 (* 0.0272727 = 1.53783e-05 loss)
I0612 18:41:53.742368 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.610169
I0612 18:41:53.742382 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 18:41:53.742393 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.875
I0612 18:41:53.742405 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.5
I0612 18:41:53.742418 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.5
I0612 18:41:53.742429 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.375
I0612 18:41:53.742441 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.625
I0612 18:41:53.742454 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.875
I0612 18:41:53.742470 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 18:41:53.742481 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 18:41:53.742493 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 18:41:53.742506 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 18:41:53.742517 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 18:41:53.742529 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 18:41:53.742542 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 18:41:53.742552 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 18:41:53.742565 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 18:41:53.742576 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 0.875
I0612 18:41:53.742588 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 0.875
I0612 18:41:53.742600 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 18:41:53.742611 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 18:41:53.742624 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 18:41:53.742635 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 18:41:53.742646 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.852273
I0612 18:41:53.742658 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.813559
I0612 18:41:53.742672 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.82618 (* 0.3 = 0.547853 loss)
I0612 18:41:53.742698 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.649144 (* 0.3 = 0.194743 loss)
I0612 18:41:53.742713 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 1.80295 (* 0.0272727 = 0.0491715 loss)
I0612 18:41:53.742727 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 2.56041 (* 0.0272727 = 0.0698295 loss)
I0612 18:41:53.742741 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 2.30989 (* 0.0272727 = 0.062997 loss)
I0612 18:41:53.742755 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.72623 (* 0.0272727 = 0.047079 loss)
I0612 18:41:53.742769 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.69082 (* 0.0272727 = 0.0461133 loss)
I0612 18:41:53.742784 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.57633 (* 0.0272727 = 0.0429908 loss)
I0612 18:41:53.742797 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.357518 (* 0.0272727 = 0.00975048 loss)
I0612 18:41:53.742812 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.508143 (* 0.0272727 = 0.0138584 loss)
I0612 18:41:53.742826 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.165252 (* 0.0272727 = 0.00450689 loss)
I0612 18:41:53.742841 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.337836 (* 0.0272727 = 0.00921371 loss)
I0612 18:41:53.742854 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.309643 (* 0.0272727 = 0.0084448 loss)
I0612 18:41:53.742868 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.189094 (* 0.0272727 = 0.00515711 loss)
I0612 18:41:53.742882 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.297363 (* 0.0272727 = 0.00810989 loss)
I0612 18:41:53.742897 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.1638 (* 0.0272727 = 0.00446728 loss)
I0612 18:41:53.742910 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.187761 (* 0.0272727 = 0.00512076 loss)
I0612 18:41:53.742925 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.150576 (* 0.0272727 = 0.00410662 loss)
I0612 18:41:53.742939 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.326734 (* 0.0272727 = 0.00891093 loss)
I0612 18:41:53.742954 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.234994 (* 0.0272727 = 0.00640893 loss)
I0612 18:41:53.742967 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00369596 (* 0.0272727 = 0.000100799 loss)
I0612 18:41:53.742981 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.000250998 (* 0.0272727 = 6.8454e-06 loss)
I0612 18:41:53.742995 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 1.08336e-05 (* 0.0272727 = 2.95462e-07 loss)
I0612 18:41:53.743010 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 1.16229e-06 (* 0.0272727 = 3.1699e-08 loss)
I0612 18:41:53.743021 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.711864
I0612 18:41:53.743033 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 18:41:53.743046 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 18:41:53.743057 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 18:41:53.743069 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.75
I0612 18:41:53.743082 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 18:41:53.743093 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 18:41:53.743105 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 1
I0612 18:41:53.743116 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 1
I0612 18:41:53.743129 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 1
I0612 18:41:53.743140 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 18:41:53.743154 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 18:41:53.743166 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 18:41:53.743188 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 18:41:53.743201 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 18:41:53.743213 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 18:41:53.743224 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 0.875
I0612 18:41:53.743237 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 18:41:53.743248 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 0.875
I0612 18:41:53.743259 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 18:41:53.743271 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 18:41:53.743283 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 18:41:53.743294 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 18:41:53.743306 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.886364
I0612 18:41:53.743319 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.915254
I0612 18:41:53.743332 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.46943 (* 1 = 1.46943 loss)
I0612 18:41:53.743346 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.530632 (* 1 = 0.530632 loss)
I0612 18:41:53.743360 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 1.72782 (* 0.0909091 = 0.157074 loss)
I0612 18:41:53.743374 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 2.21978 (* 0.0909091 = 0.201798 loss)
I0612 18:41:53.743388 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 2.5763 (* 0.0909091 = 0.234209 loss)
I0612 18:41:53.743402 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 2.36331 (* 0.0909091 = 0.214847 loss)
I0612 18:41:53.743417 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.845757 (* 0.0909091 = 0.076887 loss)
I0612 18:41:53.743430 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 2.09193 (* 0.0909091 = 0.190176 loss)
I0612 18:41:53.743444 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.223601 (* 0.0909091 = 0.0203273 loss)
I0612 18:41:53.743458 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.0779003 (* 0.0909091 = 0.00708185 loss)
I0612 18:41:53.743472 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.0657194 (* 0.0909091 = 0.00597449 loss)
I0612 18:41:53.743482 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.101314 (* 0.0909091 = 0.00921035 loss)
I0612 18:41:53.743492 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.0786571 (* 0.0909091 = 0.00715065 loss)
I0612 18:41:53.743510 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.0552762 (* 0.0909091 = 0.00502511 loss)
I0612 18:41:53.743525 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.0857759 (* 0.0909091 = 0.00779781 loss)
I0612 18:41:53.743541 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.0420809 (* 0.0909091 = 0.00382554 loss)
I0612 18:41:53.743554 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.254165 (* 0.0909091 = 0.0231059 loss)
I0612 18:41:53.743568 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.255751 (* 0.0909091 = 0.0232501 loss)
I0612 18:41:53.743582 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0499899 (* 0.0909091 = 0.00454453 loss)
I0612 18:41:53.743597 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.154081 (* 0.0909091 = 0.0140073 loss)
I0612 18:41:53.743610 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00973339 (* 0.0909091 = 0.000884854 loss)
I0612 18:41:53.743624 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00156009 (* 0.0909091 = 0.000141826 loss)
I0612 18:41:53.743639 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000368478 (* 0.0909091 = 3.3498e-05 loss)
I0612 18:41:53.743664 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 1.89554e-05 (* 0.0909091 = 1.72321e-06 loss)
I0612 18:41:53.743676 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 18:41:53.743688 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.625
I0612 18:41:53.743700 6181 solver.cpp:245] Train net output #149: total_confidence = 0.434653
I0612 18:41:53.743712 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.474891
I0612 18:41:53.743726 6181 sgd_solver.cpp:106] Iteration 17500, lr = 0.001
I0612 18:42:26.485136 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.4483 > 30) by scale factor 0.924548
I0612 18:47:06.514408 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 37.3993 > 30) by scale factor 0.802154
I0612 18:48:19.418961 6181 solver.cpp:229] Iteration 18000, loss = 3.95657
I0612 18:48:19.419100 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.316667
I0612 18:48:19.419122 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 1
I0612 18:48:19.419137 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.375
I0612 18:48:19.419150 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 18:48:19.419163 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.125
I0612 18:48:19.419176 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.375
I0612 18:48:19.419190 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.125
I0612 18:48:19.419203 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.625
I0612 18:48:19.419215 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 18:48:19.419231 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.75
I0612 18:48:19.419245 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 18:48:19.419258 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.75
I0612 18:48:19.419271 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.75
I0612 18:48:19.419284 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 18:48:19.419296 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 18:48:19.419309 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 18:48:19.419322 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 18:48:19.419333 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 18:48:19.419345 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 18:48:19.419358 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 18:48:19.419370 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 18:48:19.419381 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 18:48:19.419394 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 18:48:19.419406 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.755682
I0612 18:48:19.419419 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.633333
I0612 18:48:19.419435 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 2.20755 (* 0.3 = 0.662266 loss)
I0612 18:48:19.419450 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.827093 (* 0.3 = 0.248128 loss)
I0612 18:48:19.419464 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.764724 (* 0.0272727 = 0.0208561 loss)
I0612 18:48:19.419479 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 2.70926 (* 0.0272727 = 0.073889 loss)
I0612 18:48:19.419493 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.87826 (* 0.0272727 = 0.0512254 loss)
I0612 18:48:19.419507 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.91266 (* 0.0272727 = 0.0794361 loss)
I0612 18:48:19.419522 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 2.68881 (* 0.0272727 = 0.0733312 loss)
I0612 18:48:19.419535 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.70085 (* 0.0272727 = 0.0736595 loss)
I0612 18:48:19.419550 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.05371 (* 0.0272727 = 0.0287376 loss)
I0612 18:48:19.419564 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.506741 (* 0.0272727 = 0.0138202 loss)
I0612 18:48:19.419579 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.718179 (* 0.0272727 = 0.0195867 loss)
I0612 18:48:19.419594 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.578174 (* 0.0272727 = 0.0157684 loss)
I0612 18:48:19.419607 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.726505 (* 0.0272727 = 0.0198138 loss)
I0612 18:48:19.419621 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.733137 (* 0.0272727 = 0.0199946 loss)
I0612 18:48:19.419656 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.393918 (* 0.0272727 = 0.0107432 loss)
I0612 18:48:19.419670 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.584975 (* 0.0272727 = 0.0159539 loss)
I0612 18:48:19.419685 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.0166145 (* 0.0272727 = 0.000453123 loss)
I0612 18:48:19.419699 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.00423537 (* 0.0272727 = 0.00011551 loss)
I0612 18:48:19.419714 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00137922 (* 0.0272727 = 3.7615e-05 loss)
I0612 18:48:19.419728 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000539148 (* 0.0272727 = 1.4704e-05 loss)
I0612 18:48:19.419744 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000317466 (* 0.0272727 = 8.65816e-06 loss)
I0612 18:48:19.419759 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000456865 (* 0.0272727 = 1.246e-05 loss)
I0612 18:48:19.419772 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000371215 (* 0.0272727 = 1.0124e-05 loss)
I0612 18:48:19.419786 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000202378 (* 0.0272727 = 5.5194e-06 loss)
I0612 18:48:19.419800 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.5
I0612 18:48:19.419811 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 1
I0612 18:48:19.419823 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 1
I0612 18:48:19.419836 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.375
I0612 18:48:19.419847 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.25
I0612 18:48:19.419859 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.375
I0612 18:48:19.419872 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.375
I0612 18:48:19.419883 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 18:48:19.419895 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 18:48:19.419909 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.75
I0612 18:48:19.419919 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 18:48:19.419932 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.75
I0612 18:48:19.419945 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.75
I0612 18:48:19.419957 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 18:48:19.419968 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 18:48:19.419981 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 18:48:19.419992 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 18:48:19.420004 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 18:48:19.420016 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 18:48:19.420027 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 18:48:19.420039 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 18:48:19.420052 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 18:48:19.420063 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 18:48:19.420074 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.8125
I0612 18:48:19.420086 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.783333
I0612 18:48:19.420101 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.68844 (* 0.3 = 0.506532 loss)
I0612 18:48:19.420119 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.628592 (* 0.3 = 0.188577 loss)
I0612 18:48:19.420133 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.247754 (* 0.0272727 = 0.00675694 loss)
I0612 18:48:19.420148 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.650074 (* 0.0272727 = 0.0177293 loss)
I0612 18:48:19.420176 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 2.15807 (* 0.0272727 = 0.0588564 loss)
I0612 18:48:19.420191 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 2.76431 (* 0.0272727 = 0.0753903 loss)
I0612 18:48:19.420205 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.96177 (* 0.0272727 = 0.0535029 loss)
I0612 18:48:19.420219 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.58406 (* 0.0272727 = 0.0432017 loss)
I0612 18:48:19.420233 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.681096 (* 0.0272727 = 0.0185754 loss)
I0612 18:48:19.420248 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.490842 (* 0.0272727 = 0.0133866 loss)
I0612 18:48:19.420261 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.727344 (* 0.0272727 = 0.0198367 loss)
I0612 18:48:19.420279 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.49722 (* 0.0272727 = 0.0135605 loss)
I0612 18:48:19.420294 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.770524 (* 0.0272727 = 0.0210143 loss)
I0612 18:48:19.420307 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.532395 (* 0.0272727 = 0.0145199 loss)
I0612 18:48:19.420322 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.218792 (* 0.0272727 = 0.00596705 loss)
I0612 18:48:19.420336 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.138935 (* 0.0272727 = 0.00378914 loss)
I0612 18:48:19.420351 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.0535773 (* 0.0272727 = 0.0014612 loss)
I0612 18:48:19.420366 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.00910473 (* 0.0272727 = 0.000248311 loss)
I0612 18:48:19.420379 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.00213908 (* 0.0272727 = 5.83387e-05 loss)
I0612 18:48:19.420394 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.000906923 (* 0.0272727 = 2.47343e-05 loss)
I0612 18:48:19.420408 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000545199 (* 0.0272727 = 1.48691e-05 loss)
I0612 18:48:19.420423 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.000117919 (* 0.0272727 = 3.21598e-06 loss)
I0612 18:48:19.420438 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 6.04676e-05 (* 0.0272727 = 1.64912e-06 loss)
I0612 18:48:19.420451 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 2.46861e-05 (* 0.0272727 = 6.73258e-07 loss)
I0612 18:48:19.420464 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.75
I0612 18:48:19.420475 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 1
I0612 18:48:19.420487 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 1
I0612 18:48:19.420500 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.625
I0612 18:48:19.420511 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.5
I0612 18:48:19.420523 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 18:48:19.420536 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 18:48:19.420547 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 18:48:19.420558 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 18:48:19.420570 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 18:48:19.420583 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 18:48:19.420594 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.75
I0612 18:48:19.420606 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 18:48:19.420619 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 18:48:19.420630 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 18:48:19.420641 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 18:48:19.420653 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 18:48:19.420675 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 18:48:19.420688 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 18:48:19.420701 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 18:48:19.420712 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 18:48:19.420723 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 18:48:19.420735 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 18:48:19.420747 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.903409
I0612 18:48:19.420759 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.916667
I0612 18:48:19.420773 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.86263 (* 1 = 0.86263 loss)
I0612 18:48:19.420788 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.328121 (* 1 = 0.328121 loss)
I0612 18:48:19.420802 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.122926 (* 0.0909091 = 0.0111751 loss)
I0612 18:48:19.420816 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.24342 (* 0.0909091 = 0.0221291 loss)
I0612 18:48:19.420830 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.978832 (* 0.0909091 = 0.0889847 loss)
I0612 18:48:19.420845 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.95537 (* 0.0909091 = 0.177761 loss)
I0612 18:48:19.420858 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 1.02743 (* 0.0909091 = 0.0934026 loss)
I0612 18:48:19.420872 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.526402 (* 0.0909091 = 0.0478548 loss)
I0612 18:48:19.420886 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.853004 (* 0.0909091 = 0.0775458 loss)
I0612 18:48:19.420902 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.521403 (* 0.0909091 = 0.0474002 loss)
I0612 18:48:19.420915 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.406847 (* 0.0909091 = 0.0369861 loss)
I0612 18:48:19.420928 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.39398 (* 0.0909091 = 0.0358164 loss)
I0612 18:48:19.420943 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.659117 (* 0.0909091 = 0.0599197 loss)
I0612 18:48:19.420956 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.273803 (* 0.0909091 = 0.0248912 loss)
I0612 18:48:19.420970 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.363608 (* 0.0909091 = 0.0330553 loss)
I0612 18:48:19.420984 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.0369013 (* 0.0909091 = 0.00335467 loss)
I0612 18:48:19.420999 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.0243449 (* 0.0909091 = 0.00221317 loss)
I0612 18:48:19.421013 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.00372459 (* 0.0909091 = 0.000338599 loss)
I0612 18:48:19.421027 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.00100883 (* 0.0909091 = 9.1712e-05 loss)
I0612 18:48:19.421042 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.000460092 (* 0.0909091 = 4.18266e-05 loss)
I0612 18:48:19.421056 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.000243969 (* 0.0909091 = 2.2179e-05 loss)
I0612 18:48:19.421072 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000149151 (* 0.0909091 = 1.35591e-05 loss)
I0612 18:48:19.421082 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.000154899 (* 0.0909091 = 1.40817e-05 loss)
I0612 18:48:19.421097 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 7.13027e-05 (* 0.0909091 = 6.48206e-06 loss)
I0612 18:48:19.421108 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.25
I0612 18:48:19.421120 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.125
I0612 18:48:19.421141 6181 solver.cpp:245] Train net output #149: total_confidence = 0.224297
I0612 18:48:19.421154 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.197145
I0612 18:48:19.421171 6181 sgd_solver.cpp:106] Iteration 18000, lr = 0.001
I0612 18:48:46.804368 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 46.0112 > 30) by scale factor 0.652015
I0612 18:49:03.780764 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 47.0628 > 30) by scale factor 0.637446
I0612 18:49:33.871322 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.685 > 30) by scale factor 0.890604
I0612 18:50:01.621629 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.9794 > 30) by scale factor 0.909658
I0612 18:50:25.540050 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 46.5777 > 30) by scale factor 0.644086
I0612 18:50:51.029245 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 50.3401 > 30) by scale factor 0.595946
I0612 18:50:51.799855 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 34.4938 > 30) by scale factor 0.869722
I0612 18:51:31.133602 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.3244 > 30) by scale factor 0.989303
I0612 18:53:41.505957 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.086 > 30) by scale factor 0.787691
I0612 18:54:35.542120 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.3615 > 30) by scale factor 0.956586
I0612 18:54:45.328102 6181 solver.cpp:229] Iteration 18500, loss = 3.99225
I0612 18:54:45.328174 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.578947
I0612 18:54:45.328194 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.875
I0612 18:54:45.328208 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.5
I0612 18:54:45.328222 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.375
I0612 18:54:45.328233 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.5
I0612 18:54:45.328246 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.25
I0612 18:54:45.328259 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.375
I0612 18:54:45.328272 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.875
I0612 18:54:45.328285 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 18:54:45.328299 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 18:54:45.328311 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 18:54:45.328325 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 18:54:45.328336 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 18:54:45.328349 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 18:54:45.328361 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 18:54:45.328374 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 18:54:45.328387 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 18:54:45.328399 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 18:54:45.328411 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 18:54:45.328423 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 18:54:45.328435 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 18:54:45.328446 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 18:54:45.328459 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 18:54:45.328470 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.857955
I0612 18:54:45.328483 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.824561
I0612 18:54:45.328500 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.39647 (* 0.3 = 0.418942 loss)
I0612 18:54:45.328516 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.47918 (* 0.3 = 0.143754 loss)
I0612 18:54:45.328529 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.499015 (* 0.0272727 = 0.0136095 loss)
I0612 18:54:45.328543 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 0.792945 (* 0.0272727 = 0.0216258 loss)
I0612 18:54:45.328558 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.18358 (* 0.0272727 = 0.0322794 loss)
I0612 18:54:45.328572 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.29042 (* 0.0272727 = 0.062466 loss)
I0612 18:54:45.328586 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.88768 (* 0.0272727 = 0.0514821 loss)
I0612 18:54:45.328600 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.58391 (* 0.0272727 = 0.0431977 loss)
I0612 18:54:45.328614 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 0.805189 (* 0.0272727 = 0.0219597 loss)
I0612 18:54:45.328629 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.403506 (* 0.0272727 = 0.0110047 loss)
I0612 18:54:45.328642 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.484046 (* 0.0272727 = 0.0132012 loss)
I0612 18:54:45.328657 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.456169 (* 0.0272727 = 0.012441 loss)
I0612 18:54:45.328671 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.345089 (* 0.0272727 = 0.00941152 loss)
I0612 18:54:45.328686 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.414266 (* 0.0272727 = 0.0112982 loss)
I0612 18:54:45.328733 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.420621 (* 0.0272727 = 0.0114715 loss)
I0612 18:54:45.328753 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.556935 (* 0.0272727 = 0.0151891 loss)
I0612 18:54:45.328768 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.158679 (* 0.0272727 = 0.0043276 loss)
I0612 18:54:45.328783 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.0532086 (* 0.0272727 = 0.00145114 loss)
I0612 18:54:45.328797 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.0170121 (* 0.0272727 = 0.000463965 loss)
I0612 18:54:45.328811 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.0112158 (* 0.0272727 = 0.000305885 loss)
I0612 18:54:45.328826 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000930168 (* 0.0272727 = 2.53682e-05 loss)
I0612 18:54:45.328841 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000228718 (* 0.0272727 = 6.23777e-06 loss)
I0612 18:54:45.328855 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 6.79511e-06 (* 0.0272727 = 1.85321e-07 loss)
I0612 18:54:45.328869 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 8.94072e-07 (* 0.0272727 = 2.43838e-08 loss)
I0612 18:54:45.328881 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.631579
I0612 18:54:45.328894 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 1
I0612 18:54:45.328907 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 1
I0612 18:54:45.328918 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.875
I0612 18:54:45.328929 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.5
I0612 18:54:45.328943 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.375
I0612 18:54:45.328953 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.375
I0612 18:54:45.328965 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.875
I0612 18:54:45.328977 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 18:54:45.328989 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 18:54:45.329001 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 18:54:45.329013 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 18:54:45.329025 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 18:54:45.329036 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 18:54:45.329048 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 18:54:45.329061 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 18:54:45.329073 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 18:54:45.329087 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 18:54:45.329099 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 18:54:45.329111 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 18:54:45.329123 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 18:54:45.329134 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 18:54:45.329146 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 18:54:45.329159 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.875
I0612 18:54:45.329170 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.824561
I0612 18:54:45.329185 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.22741 (* 0.3 = 0.368223 loss)
I0612 18:54:45.329198 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.417463 (* 0.3 = 0.125239 loss)
I0612 18:54:45.329212 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.238928 (* 0.0272727 = 0.00651622 loss)
I0612 18:54:45.329239 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.229222 (* 0.0272727 = 0.00625151 loss)
I0612 18:54:45.329254 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.359225 (* 0.0272727 = 0.00979705 loss)
I0612 18:54:45.329268 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 2.56336 (* 0.0272727 = 0.0699098 loss)
I0612 18:54:45.329283 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.70127 (* 0.0272727 = 0.0463984 loss)
I0612 18:54:45.329298 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.17573 (* 0.0272727 = 0.0320654 loss)
I0612 18:54:45.329310 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.542376 (* 0.0272727 = 0.0147921 loss)
I0612 18:54:45.329342 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.4834 (* 0.0272727 = 0.0131836 loss)
I0612 18:54:45.329358 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.359755 (* 0.0272727 = 0.0098115 loss)
I0612 18:54:45.329373 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.346631 (* 0.0272727 = 0.00945357 loss)
I0612 18:54:45.329387 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.341127 (* 0.0272727 = 0.00930345 loss)
I0612 18:54:45.329401 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.519858 (* 0.0272727 = 0.0141779 loss)
I0612 18:54:45.329416 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.374112 (* 0.0272727 = 0.0102031 loss)
I0612 18:54:45.329430 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.55733 (* 0.0272727 = 0.0151999 loss)
I0612 18:54:45.329444 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.453592 (* 0.0272727 = 0.0123707 loss)
I0612 18:54:45.329458 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.0642583 (* 0.0272727 = 0.0017525 loss)
I0612 18:54:45.329473 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.0425219 (* 0.0272727 = 0.00115969 loss)
I0612 18:54:45.329488 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.028876 (* 0.0272727 = 0.000787528 loss)
I0612 18:54:45.329501 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00793907 (* 0.0272727 = 0.00021652 loss)
I0612 18:54:45.329516 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00971224 (* 0.0272727 = 0.000264879 loss)
I0612 18:54:45.329531 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.00768691 (* 0.0272727 = 0.000209643 loss)
I0612 18:54:45.329545 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.00411397 (* 0.0272727 = 0.000112199 loss)
I0612 18:54:45.329557 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.859649
I0612 18:54:45.329571 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 1
I0612 18:54:45.329582 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 1
I0612 18:54:45.329593 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 1
I0612 18:54:45.329605 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.75
I0612 18:54:45.329617 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 18:54:45.329630 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 18:54:45.329643 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 1
I0612 18:54:45.329653 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 1
I0612 18:54:45.329665 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 1
I0612 18:54:45.329677 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 18:54:45.329689 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 18:54:45.329700 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 18:54:45.329712 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 18:54:45.329725 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 18:54:45.329736 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 18:54:45.329759 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 18:54:45.329773 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 18:54:45.329784 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 18:54:45.329799 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 18:54:45.329813 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 18:54:45.329824 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 18:54:45.329836 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 18:54:45.329849 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.954545
I0612 18:54:45.329860 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.982456
I0612 18:54:45.329874 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.522824 (* 1 = 0.522824 loss)
I0612 18:54:45.329888 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.171688 (* 1 = 0.171688 loss)
I0612 18:54:45.329902 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.064569 (* 0.0909091 = 0.00586991 loss)
I0612 18:54:45.329916 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.0245112 (* 0.0909091 = 0.00222829 loss)
I0612 18:54:45.329931 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.0509258 (* 0.0909091 = 0.00462961 loss)
I0612 18:54:45.329946 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.15127 (* 0.0909091 = 0.104661 loss)
I0612 18:54:45.329959 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.356573 (* 0.0909091 = 0.0324157 loss)
I0612 18:54:45.329973 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.461839 (* 0.0909091 = 0.0419854 loss)
I0612 18:54:45.329988 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.0535014 (* 0.0909091 = 0.00486376 loss)
I0612 18:54:45.330001 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.0855076 (* 0.0909091 = 0.00777342 loss)
I0612 18:54:45.330015 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.119661 (* 0.0909091 = 0.0108782 loss)
I0612 18:54:45.330029 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.181355 (* 0.0909091 = 0.0164868 loss)
I0612 18:54:45.330044 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.197052 (* 0.0909091 = 0.0179138 loss)
I0612 18:54:45.330057 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.0964639 (* 0.0909091 = 0.00876945 loss)
I0612 18:54:45.330071 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.318364 (* 0.0909091 = 0.0289422 loss)
I0612 18:54:45.330086 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.0373366 (* 0.0909091 = 0.00339423 loss)
I0612 18:54:45.330101 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.184219 (* 0.0909091 = 0.0167472 loss)
I0612 18:54:45.330114 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.0669431 (* 0.0909091 = 0.00608573 loss)
I0612 18:54:45.330128 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0205456 (* 0.0909091 = 0.00186779 loss)
I0612 18:54:45.330147 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.00171537 (* 0.0909091 = 0.000155943 loss)
I0612 18:54:45.330157 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.000347385 (* 0.0909091 = 3.15804e-05 loss)
I0612 18:54:45.330168 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000279503 (* 0.0909091 = 2.54094e-05 loss)
I0612 18:54:45.330181 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 7.65995e-05 (* 0.0909091 = 6.96359e-06 loss)
I0612 18:54:45.330196 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 2.11758e-05 (* 0.0909091 = 1.92507e-06 loss)
I0612 18:54:45.330209 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.625
I0612 18:54:45.330230 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 18:54:45.330245 6181 solver.cpp:245] Train net output #149: total_confidence = 0.599429
I0612 18:54:45.330256 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.548715
I0612 18:54:45.330270 6181 sgd_solver.cpp:106] Iteration 18500, lr = 0.001
I0612 18:55:17.293462 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.5083 > 30) by scale factor 0.983338
I0612 18:57:49.148207 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.4495 > 30) by scale factor 0.823057
I0612 18:59:37.924422 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 33.9749 > 30) by scale factor 0.883004
I0612 19:00:00.271477 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.1597 > 30) by scale factor 0.932843
I0612 19:00:38.883353 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 46.3166 > 30) by scale factor 0.647716
I0612 19:00:55.836545 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 30.0693 > 30) by scale factor 0.997695
I0612 19:01:10.886869 6181 solver.cpp:229] Iteration 19000, loss = 3.9188
I0612 19:01:10.886989 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.480769
I0612 19:01:10.887011 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 19:01:10.887025 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 19:01:10.887038 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 19:01:10.887051 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.25
I0612 19:01:10.887064 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.5
I0612 19:01:10.887078 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.625
I0612 19:01:10.887090 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.875
I0612 19:01:10.887104 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 1
I0612 19:01:10.887116 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 19:01:10.887130 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 19:01:10.887142 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 19:01:10.887154 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 19:01:10.887167 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 19:01:10.887179 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 19:01:10.887193 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 0.875
I0612 19:01:10.887205 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 0.875
I0612 19:01:10.887218 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 0.875
I0612 19:01:10.887233 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 0.875
I0612 19:01:10.887246 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 19:01:10.887259 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 19:01:10.887270 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 19:01:10.887282 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 19:01:10.887295 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.835227
I0612 19:01:10.887307 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.653846
I0612 19:01:10.887325 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.94133 (* 0.3 = 0.582398 loss)
I0612 19:01:10.887339 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.626205 (* 0.3 = 0.187862 loss)
I0612 19:01:10.887354 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 2.01988 (* 0.0272727 = 0.0550875 loss)
I0612 19:01:10.887368 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.05695 (* 0.0272727 = 0.028826 loss)
I0612 19:01:10.887382 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.85532 (* 0.0272727 = 0.0505995 loss)
I0612 19:01:10.887397 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.93953 (* 0.0272727 = 0.0528964 loss)
I0612 19:01:10.887410 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.62827 (* 0.0272727 = 0.0444073 loss)
I0612 19:01:10.887424 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.17623 (* 0.0272727 = 0.0320791 loss)
I0612 19:01:10.887439 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 0.314616 (* 0.0272727 = 0.00858042 loss)
I0612 19:01:10.887454 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.183708 (* 0.0272727 = 0.00501022 loss)
I0612 19:01:10.887468 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.409401 (* 0.0272727 = 0.0111655 loss)
I0612 19:01:10.887482 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.418865 (* 0.0272727 = 0.0114236 loss)
I0612 19:01:10.887497 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.451068 (* 0.0272727 = 0.0123019 loss)
I0612 19:01:10.887511 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.427562 (* 0.0272727 = 0.0116608 loss)
I0612 19:01:10.887545 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.41039 (* 0.0272727 = 0.0111924 loss)
I0612 19:01:10.887560 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.428308 (* 0.0272727 = 0.0116811 loss)
I0612 19:01:10.887574 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.513129 (* 0.0272727 = 0.0139944 loss)
I0612 19:01:10.887588 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.382337 (* 0.0272727 = 0.0104274 loss)
I0612 19:01:10.887603 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.666635 (* 0.0272727 = 0.0181809 loss)
I0612 19:01:10.887616 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.414923 (* 0.0272727 = 0.0113161 loss)
I0612 19:01:10.887631 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.0189966 (* 0.0272727 = 0.000518089 loss)
I0612 19:01:10.887645 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.0156314 (* 0.0272727 = 0.000426311 loss)
I0612 19:01:10.887660 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00332607 (* 0.0272727 = 9.07109e-05 loss)
I0612 19:01:10.887675 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00226687 (* 0.0272727 = 6.18238e-05 loss)
I0612 19:01:10.887687 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.615385
I0612 19:01:10.887699 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 19:01:10.887712 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.875
I0612 19:01:10.887724 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.5
I0612 19:01:10.887737 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.625
I0612 19:01:10.887748 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.75
I0612 19:01:10.887760 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.75
I0612 19:01:10.887773 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 1
I0612 19:01:10.887784 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 1
I0612 19:01:10.887796 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 19:01:10.887809 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 19:01:10.887820 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 19:01:10.887831 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 19:01:10.887843 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 19:01:10.887856 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 19:01:10.887867 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 19:01:10.887879 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 0.875
I0612 19:01:10.887892 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 0.875
I0612 19:01:10.887902 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 0.875
I0612 19:01:10.887915 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 19:01:10.887926 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 19:01:10.887938 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 19:01:10.887949 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 19:01:10.887961 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.875
I0612 19:01:10.887974 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.730769
I0612 19:01:10.887987 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.43409 (* 0.3 = 0.430226 loss)
I0612 19:01:10.888005 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.443336 (* 0.3 = 0.133001 loss)
I0612 19:01:10.888020 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.735688 (* 0.0272727 = 0.0200642 loss)
I0612 19:01:10.888036 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.533485 (* 0.0272727 = 0.0145496 loss)
I0612 19:01:10.888062 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.29768 (* 0.0272727 = 0.0353912 loss)
I0612 19:01:10.888077 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.6019 (* 0.0272727 = 0.0436881 loss)
I0612 19:01:10.888090 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 0.865219 (* 0.0272727 = 0.0235969 loss)
I0612 19:01:10.888105 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 0.620015 (* 0.0272727 = 0.0169095 loss)
I0612 19:01:10.888119 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.227991 (* 0.0272727 = 0.00621794 loss)
I0612 19:01:10.888134 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.199584 (* 0.0272727 = 0.0054432 loss)
I0612 19:01:10.888149 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.320345 (* 0.0272727 = 0.00873669 loss)
I0612 19:01:10.888162 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.257689 (* 0.0272727 = 0.00702787 loss)
I0612 19:01:10.888176 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.442374 (* 0.0272727 = 0.0120647 loss)
I0612 19:01:10.888191 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.406749 (* 0.0272727 = 0.0110931 loss)
I0612 19:01:10.888206 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.231732 (* 0.0272727 = 0.00631996 loss)
I0612 19:01:10.888219 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.457244 (* 0.0272727 = 0.0124703 loss)
I0612 19:01:10.888233 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.512632 (* 0.0272727 = 0.0139809 loss)
I0612 19:01:10.888248 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.253338 (* 0.0272727 = 0.0069092 loss)
I0612 19:01:10.888262 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.606379 (* 0.0272727 = 0.0165376 loss)
I0612 19:01:10.888278 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.217649 (* 0.0272727 = 0.00593587 loss)
I0612 19:01:10.888293 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.0255642 (* 0.0272727 = 0.000697204 loss)
I0612 19:01:10.888309 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00514871 (* 0.0272727 = 0.000140419 loss)
I0612 19:01:10.888324 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.00220267 (* 0.0272727 = 6.00729e-05 loss)
I0612 19:01:10.888337 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000282408 (* 0.0272727 = 7.70204e-06 loss)
I0612 19:01:10.888350 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.653846
I0612 19:01:10.888362 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 19:01:10.888375 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 19:01:10.888386 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.75
I0612 19:01:10.888398 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.75
I0612 19:01:10.888411 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 19:01:10.888422 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 19:01:10.888434 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 19:01:10.888447 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 1
I0612 19:01:10.888458 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 1
I0612 19:01:10.888470 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 19:01:10.888483 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 19:01:10.888494 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 19:01:10.888506 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 19:01:10.888519 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.875
I0612 19:01:10.888530 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 19:01:10.888553 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 19:01:10.888566 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 0.875
I0612 19:01:10.888578 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 0.875
I0612 19:01:10.888591 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 19:01:10.888602 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 19:01:10.888613 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 19:01:10.888625 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 19:01:10.888638 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.892045
I0612 19:01:10.888649 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.846154
I0612 19:01:10.888664 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 1.01077 (* 1 = 1.01077 loss)
I0612 19:01:10.888679 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.326562 (* 1 = 0.326562 loss)
I0612 19:01:10.888692 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.446406 (* 0.0909091 = 0.0405823 loss)
I0612 19:01:10.888706 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.425304 (* 0.0909091 = 0.038664 loss)
I0612 19:01:10.888721 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.781441 (* 0.0909091 = 0.0710401 loss)
I0612 19:01:10.888736 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 1.13806 (* 0.0909091 = 0.10346 loss)
I0612 19:01:10.888749 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.7255 (* 0.0909091 = 0.0659546 loss)
I0612 19:01:10.888763 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.571997 (* 0.0909091 = 0.0519998 loss)
I0612 19:01:10.888777 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.241727 (* 0.0909091 = 0.0219751 loss)
I0612 19:01:10.888792 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.113679 (* 0.0909091 = 0.0103345 loss)
I0612 19:01:10.888805 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.0560398 (* 0.0909091 = 0.00509453 loss)
I0612 19:01:10.888819 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.44987 (* 0.0909091 = 0.0408972 loss)
I0612 19:01:10.888834 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.252221 (* 0.0909091 = 0.0229292 loss)
I0612 19:01:10.888849 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.380702 (* 0.0909091 = 0.0346093 loss)
I0612 19:01:10.888862 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.125976 (* 0.0909091 = 0.0114524 loss)
I0612 19:01:10.888876 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.277927 (* 0.0909091 = 0.0252661 loss)
I0612 19:01:10.888890 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.125769 (* 0.0909091 = 0.0114335 loss)
I0612 19:01:10.888905 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.204096 (* 0.0909091 = 0.0185542 loss)
I0612 19:01:10.888918 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.358135 (* 0.0909091 = 0.0325578 loss)
I0612 19:01:10.888932 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.305835 (* 0.0909091 = 0.0278032 loss)
I0612 19:01:10.888947 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.015047 (* 0.0909091 = 0.00136791 loss)
I0612 19:01:10.888962 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00573799 (* 0.0909091 = 0.000521635 loss)
I0612 19:01:10.888975 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00339761 (* 0.0909091 = 0.000308874 loss)
I0612 19:01:10.888989 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.00159474 (* 0.0909091 = 0.000144976 loss)
I0612 19:01:10.889003 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.375
I0612 19:01:10.889014 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.375
I0612 19:01:10.889035 6181 solver.cpp:245] Train net output #149: total_confidence = 0.380818
I0612 19:01:10.889045 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.475605
I0612 19:01:10.889058 6181 sgd_solver.cpp:106] Iteration 19000, lr = 0.001
I0612 19:01:12.020041 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.9711 > 30) by scale factor 0.909888
I0612 19:01:22.833755 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 44.222 > 30) by scale factor 0.678395
I0612 19:05:07.974622 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.9972 > 30) by scale factor 0.833397
I0612 19:05:31.111011 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 48.9951 > 30) by scale factor 0.612306
I0612 19:05:44.242697 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 44.1663 > 30) by scale factor 0.679251
I0612 19:06:08.904712 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 36.8406 > 30) by scale factor 0.814319
I0612 19:07:36.610234 6181 solver.cpp:229] Iteration 19500, loss = 3.92957
I0612 19:07:36.610359 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.574468
I0612 19:07:36.610380 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.625
I0612 19:07:36.610395 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.625
I0612 19:07:36.610407 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 19:07:36.610420 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.75
I0612 19:07:36.610433 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.375
I0612 19:07:36.610446 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.75
I0612 19:07:36.610458 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.875
I0612 19:07:36.610471 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 19:07:36.610484 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 19:07:36.610497 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 19:07:36.610509 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 19:07:36.610522 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 19:07:36.610535 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 19:07:36.610548 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 19:07:36.610558 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 19:07:36.610570 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 19:07:36.610582 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 19:07:36.610594 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 19:07:36.610605 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 19:07:36.610617 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 19:07:36.610628 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 19:07:36.610641 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 19:07:36.610651 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.875
I0612 19:07:36.610664 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.744681
I0612 19:07:36.610680 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.59251 (* 0.3 = 0.477753 loss)
I0612 19:07:36.610695 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.465831 (* 0.3 = 0.139749 loss)
I0612 19:07:36.610709 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.34079 (* 0.0272727 = 0.0365669 loss)
I0612 19:07:36.610724 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 1.70473 (* 0.0272727 = 0.0464927 loss)
I0612 19:07:36.610738 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.22293 (* 0.0272727 = 0.0606255 loss)
I0612 19:07:36.610752 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.15117 (* 0.0272727 = 0.0313956 loss)
I0612 19:07:36.610766 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.67313 (* 0.0272727 = 0.0456307 loss)
I0612 19:07:36.610780 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 0.826736 (* 0.0272727 = 0.0225473 loss)
I0612 19:07:36.610795 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 0.392164 (* 0.0272727 = 0.0106954 loss)
I0612 19:07:36.610808 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.469623 (* 0.0272727 = 0.0128079 loss)
I0612 19:07:36.610822 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.620272 (* 0.0272727 = 0.0169165 loss)
I0612 19:07:36.610836 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.493049 (* 0.0272727 = 0.0134468 loss)
I0612 19:07:36.610851 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.705258 (* 0.0272727 = 0.0192343 loss)
I0612 19:07:36.610865 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.0239736 (* 0.0272727 = 0.000653826 loss)
I0612 19:07:36.610898 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.0159319 (* 0.0272727 = 0.000434506 loss)
I0612 19:07:36.610913 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.0100424 (* 0.0272727 = 0.000273883 loss)
I0612 19:07:36.610927 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.00952356 (* 0.0272727 = 0.000259734 loss)
I0612 19:07:36.610941 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.00695499 (* 0.0272727 = 0.000189681 loss)
I0612 19:07:36.610955 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00916092 (* 0.0272727 = 0.000249843 loss)
I0612 19:07:36.610970 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.0096637 (* 0.0272727 = 0.000263555 loss)
I0612 19:07:36.610983 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.00926187 (* 0.0272727 = 0.000252596 loss)
I0612 19:07:36.610997 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.0147379 (* 0.0272727 = 0.000401944 loss)
I0612 19:07:36.611012 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.00626659 (* 0.0272727 = 0.000170907 loss)
I0612 19:07:36.611027 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.0040025 (* 0.0272727 = 0.000109159 loss)
I0612 19:07:36.611039 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.638298
I0612 19:07:36.611052 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 19:07:36.611063 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 19:07:36.611075 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.875
I0612 19:07:36.611088 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.875
I0612 19:07:36.611099 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.625
I0612 19:07:36.611111 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.75
I0612 19:07:36.611124 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 1
I0612 19:07:36.611135 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 19:07:36.611146 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 19:07:36.611158 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 19:07:36.611171 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 19:07:36.611181 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 19:07:36.611193 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 19:07:36.611204 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 19:07:36.611217 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 19:07:36.611232 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 19:07:36.611243 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 19:07:36.611254 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 19:07:36.611266 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 19:07:36.611277 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 19:07:36.611289 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 19:07:36.611300 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 19:07:36.611312 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.892045
I0612 19:07:36.611325 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.829787
I0612 19:07:36.611338 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.14283 (* 0.3 = 0.342848 loss)
I0612 19:07:36.611352 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.339706 (* 0.3 = 0.101912 loss)
I0612 19:07:36.611366 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.974356 (* 0.0272727 = 0.0265733 loss)
I0612 19:07:36.611384 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.72769 (* 0.0272727 = 0.0198461 loss)
I0612 19:07:36.611410 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.619762 (* 0.0272727 = 0.0169026 loss)
I0612 19:07:36.611425 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 0.627911 (* 0.0272727 = 0.0171249 loss)
I0612 19:07:36.611440 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.32961 (* 0.0272727 = 0.0362621 loss)
I0612 19:07:36.611454 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 0.673485 (* 0.0272727 = 0.0183678 loss)
I0612 19:07:36.611469 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.282252 (* 0.0272727 = 0.00769779 loss)
I0612 19:07:36.611484 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.448076 (* 0.0272727 = 0.0122202 loss)
I0612 19:07:36.611497 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 1.01929 (* 0.0272727 = 0.0277989 loss)
I0612 19:07:36.611510 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.749325 (* 0.0272727 = 0.0204361 loss)
I0612 19:07:36.611526 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 1.29551 (* 0.0272727 = 0.0353321 loss)
I0612 19:07:36.611539 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.00213651 (* 0.0272727 = 5.82685e-05 loss)
I0612 19:07:36.611553 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.00163153 (* 0.0272727 = 4.44963e-05 loss)
I0612 19:07:36.611567 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.00181084 (* 0.0272727 = 4.93866e-05 loss)
I0612 19:07:36.611582 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.00214991 (* 0.0272727 = 5.86338e-05 loss)
I0612 19:07:36.611595 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.000802581 (* 0.0272727 = 2.18886e-05 loss)
I0612 19:07:36.611609 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.00110493 (* 0.0272727 = 3.01346e-05 loss)
I0612 19:07:36.611624 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.00109894 (* 0.0272727 = 2.99712e-05 loss)
I0612 19:07:36.611637 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00181749 (* 0.0272727 = 4.9568e-05 loss)
I0612 19:07:36.611651 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.00149854 (* 0.0272727 = 4.08694e-05 loss)
I0612 19:07:36.611665 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.00151599 (* 0.0272727 = 4.13452e-05 loss)
I0612 19:07:36.611680 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.00313002 (* 0.0272727 = 8.53642e-05 loss)
I0612 19:07:36.611692 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.765957
I0612 19:07:36.611704 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.75
I0612 19:07:36.611716 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.75
I0612 19:07:36.611728 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 1
I0612 19:07:36.611740 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 19:07:36.611752 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.75
I0612 19:07:36.611763 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 19:07:36.611775 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 19:07:36.611788 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 19:07:36.611799 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 19:07:36.611810 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 19:07:36.611822 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 19:07:36.611835 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 19:07:36.611846 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 19:07:36.611857 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 19:07:36.611870 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 19:07:36.611891 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 19:07:36.611904 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 19:07:36.611917 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 19:07:36.611924 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 19:07:36.611932 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 19:07:36.611943 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 19:07:36.611955 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 19:07:36.611968 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.9375
I0612 19:07:36.611979 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.87234
I0612 19:07:36.611994 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.766954 (* 1 = 0.766954 loss)
I0612 19:07:36.612007 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.22579 (* 1 = 0.22579 loss)
I0612 19:07:36.612021 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.681604 (* 0.0909091 = 0.061964 loss)
I0612 19:07:36.612035 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.829077 (* 0.0909091 = 0.0753706 loss)
I0612 19:07:36.612049 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.282294 (* 0.0909091 = 0.0256631 loss)
I0612 19:07:36.612063 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.176564 (* 0.0909091 = 0.0160513 loss)
I0612 19:07:36.612077 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.637534 (* 0.0909091 = 0.0579576 loss)
I0612 19:07:36.612092 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.884067 (* 0.0909091 = 0.0803697 loss)
I0612 19:07:36.612105 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.219519 (* 0.0909091 = 0.0199563 loss)
I0612 19:07:36.612119 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.468604 (* 0.0909091 = 0.0426004 loss)
I0612 19:07:36.612133 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.358191 (* 0.0909091 = 0.0325629 loss)
I0612 19:07:36.612148 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.322639 (* 0.0909091 = 0.0293308 loss)
I0612 19:07:36.612161 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.653608 (* 0.0909091 = 0.0594189 loss)
I0612 19:07:36.612175 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.000778291 (* 0.0909091 = 7.07537e-05 loss)
I0612 19:07:36.612190 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.00032842 (* 0.0909091 = 2.98563e-05 loss)
I0612 19:07:36.612203 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.000165314 (* 0.0909091 = 1.50285e-05 loss)
I0612 19:07:36.612217 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 8.40923e-05 (* 0.0909091 = 7.64475e-06 loss)
I0612 19:07:36.612231 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 6.58261e-05 (* 0.0909091 = 5.98419e-06 loss)
I0612 19:07:36.612246 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 4.57421e-05 (* 0.0909091 = 4.15837e-06 loss)
I0612 19:07:36.612259 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 5.64228e-05 (* 0.0909091 = 5.12934e-06 loss)
I0612 19:07:36.612275 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 8.18189e-05 (* 0.0909091 = 7.43809e-06 loss)
I0612 19:07:36.612290 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.000108549 (* 0.0909091 = 9.86809e-06 loss)
I0612 19:07:36.612305 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 5.8221e-05 (* 0.0909091 = 5.29282e-06 loss)
I0612 19:07:36.612319 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 3.78312e-05 (* 0.0909091 = 3.4392e-06 loss)
I0612 19:07:36.612331 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.75
I0612 19:07:36.612344 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.625
I0612 19:07:36.612365 6181 solver.cpp:245] Train net output #149: total_confidence = 0.542377
I0612 19:07:36.612378 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.466012
I0612 19:07:36.612391 6181 sgd_solver.cpp:106] Iteration 19500, lr = 0.001
I0612 19:08:20.115357 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.9816 > 30) by scale factor 0.909598
I0612 19:09:04.190943 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.7055 > 30) by scale factor 0.840207
I0612 19:10:17.447152 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 48.5437 > 30) by scale factor 0.617999
I0612 19:10:57.556361 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 38.1398 > 30) by scale factor 0.786581
I0612 19:11:56.916368 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 42.7634 > 30) by scale factor 0.701535
I0612 19:13:12.500011 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 51.9327 > 30) by scale factor 0.57767
I0612 19:14:01.904474 6181 solver.cpp:456] Snapshotting to binary proto file /mnt/snapshots/mixed_lstm21_iter_20000.caffemodel
I0612 19:14:02.431030 6181 sgd_solver.cpp:273] Snapshotting solver state to binary proto file /mnt/snapshots/mixed_lstm21_iter_20000.solverstate
I0612 19:14:02.701086 6181 solver.cpp:338] Iteration 20000, Testing net (#0)
I0612 19:15:00.371212 6181 solver.cpp:393] Test loss: 2.70551
I0612 19:15:00.371361 6181 solver.cpp:406] Test net output #0: loss1/accuracy = 0.592233
I0612 19:15:00.371382 6181 solver.cpp:406] Test net output #1: loss1/accuracy01 = 0.778
I0612 19:15:00.371397 6181 solver.cpp:406] Test net output #2: loss1/accuracy02 = 0.641
I0612 19:15:00.371409 6181 solver.cpp:406] Test net output #3: loss1/accuracy03 = 0.532
I0612 19:15:00.371423 6181 solver.cpp:406] Test net output #4: loss1/accuracy04 = 0.482
I0612 19:15:00.371435 6181 solver.cpp:406] Test net output #5: loss1/accuracy05 = 0.511
I0612 19:15:00.371448 6181 solver.cpp:406] Test net output #6: loss1/accuracy06 = 0.686
I0612 19:15:00.371460 6181 solver.cpp:406] Test net output #7: loss1/accuracy07 = 0.846
I0612 19:15:00.371474 6181 solver.cpp:406] Test net output #8: loss1/accuracy08 = 0.921
I0612 19:15:00.371489 6181 solver.cpp:406] Test net output #9: loss1/accuracy09 = 0.963
I0612 19:15:00.371502 6181 solver.cpp:406] Test net output #10: loss1/accuracy10 = 0.985
I0612 19:15:00.371515 6181 solver.cpp:406] Test net output #11: loss1/accuracy11 = 0.997
I0612 19:15:00.371527 6181 solver.cpp:406] Test net output #12: loss1/accuracy12 = 0.999
I0612 19:15:00.371539 6181 solver.cpp:406] Test net output #13: loss1/accuracy13 = 1
I0612 19:15:00.371551 6181 solver.cpp:406] Test net output #14: loss1/accuracy14 = 1
I0612 19:15:00.371562 6181 solver.cpp:406] Test net output #15: loss1/accuracy15 = 1
I0612 19:15:00.371574 6181 solver.cpp:406] Test net output #16: loss1/accuracy16 = 1
I0612 19:15:00.371587 6181 solver.cpp:406] Test net output #17: loss1/accuracy17 = 1
I0612 19:15:00.371598 6181 solver.cpp:406] Test net output #18: loss1/accuracy18 = 1
I0612 19:15:00.371609 6181 solver.cpp:406] Test net output #19: loss1/accuracy19 = 1
I0612 19:15:00.371621 6181 solver.cpp:406] Test net output #20: loss1/accuracy20 = 1
I0612 19:15:00.371633 6181 solver.cpp:406] Test net output #21: loss1/accuracy21 = 1
I0612 19:15:00.371644 6181 solver.cpp:406] Test net output #22: loss1/accuracy22 = 1
I0612 19:15:00.371655 6181 solver.cpp:406] Test net output #23: loss1/accuracy_incl_empty = 0.884321
I0612 19:15:00.371667 6181 solver.cpp:406] Test net output #24: loss1/accuracy_top3 = 0.830865
I0612 19:15:00.371685 6181 solver.cpp:406] Test net output #25: loss1/cross_entropy_loss = 1.46451 (* 0.3 = 0.439353 loss)
I0612 19:15:00.371698 6181 solver.cpp:406] Test net output #26: loss1/cross_entropy_loss_incl_empty = 0.420592 (* 0.3 = 0.126178 loss)
I0612 19:15:00.371713 6181 solver.cpp:406] Test net output #27: loss1/loss01 = 0.949073 (* 0.0272727 = 0.0258838 loss)
I0612 19:15:00.371727 6181 solver.cpp:406] Test net output #28: loss1/loss02 = 1.38343 (* 0.0272727 = 0.03773 loss)
I0612 19:15:00.371742 6181 solver.cpp:406] Test net output #29: loss1/loss03 = 1.72476 (* 0.0272727 = 0.0470389 loss)
I0612 19:15:00.371755 6181 solver.cpp:406] Test net output #30: loss1/loss04 = 1.84946 (* 0.0272727 = 0.0504398 loss)
I0612 19:15:00.371769 6181 solver.cpp:406] Test net output #31: loss1/loss05 = 1.65777 (* 0.0272727 = 0.0452119 loss)
I0612 19:15:00.371783 6181 solver.cpp:406] Test net output #32: loss1/loss06 = 1.04556 (* 0.0272727 = 0.0285154 loss)
I0612 19:15:00.371796 6181 solver.cpp:406] Test net output #33: loss1/loss07 = 0.53928 (* 0.0272727 = 0.0147076 loss)
I0612 19:15:00.371811 6181 solver.cpp:406] Test net output #34: loss1/loss08 = 0.276062 (* 0.0272727 = 0.00752897 loss)
I0612 19:15:00.371824 6181 solver.cpp:406] Test net output #35: loss1/loss09 = 0.149533 (* 0.0272727 = 0.00407816 loss)
I0612 19:15:00.371839 6181 solver.cpp:406] Test net output #36: loss1/loss10 = 0.0839164 (* 0.0272727 = 0.00228863 loss)
I0612 19:15:00.371853 6181 solver.cpp:406] Test net output #37: loss1/loss11 = 0.0209799 (* 0.0272727 = 0.000572178 loss)
I0612 19:15:00.371867 6181 solver.cpp:406] Test net output #38: loss1/loss12 = 0.0111391 (* 0.0272727 = 0.000303794 loss)
I0612 19:15:00.371881 6181 solver.cpp:406] Test net output #39: loss1/loss13 = 0.00641032 (* 0.0272727 = 0.000174827 loss)
I0612 19:15:00.371915 6181 solver.cpp:406] Test net output #40: loss1/loss14 = 0.00404502 (* 0.0272727 = 0.000110319 loss)
I0612 19:15:00.371932 6181 solver.cpp:406] Test net output #41: loss1/loss15 = 0.00278837 (* 0.0272727 = 7.60464e-05 loss)
I0612 19:15:00.371945 6181 solver.cpp:406] Test net output #42: loss1/loss16 = 0.00205789 (* 0.0272727 = 5.61243e-05 loss)
I0612 19:15:00.371959 6181 solver.cpp:406] Test net output #43: loss1/loss17 = 0.00189449 (* 0.0272727 = 5.16679e-05 loss)
I0612 19:15:00.371973 6181 solver.cpp:406] Test net output #44: loss1/loss18 = 0.00168656 (* 0.0272727 = 4.59971e-05 loss)
I0612 19:15:00.371987 6181 solver.cpp:406] Test net output #45: loss1/loss19 = 0.00165047 (* 0.0272727 = 4.50129e-05 loss)
I0612 19:15:00.372001 6181 solver.cpp:406] Test net output #46: loss1/loss20 = 0.00137933 (* 0.0272727 = 3.7618e-05 loss)
I0612 19:15:00.372015 6181 solver.cpp:406] Test net output #47: loss1/loss21 = 0.00135147 (* 0.0272727 = 3.68584e-05 loss)
I0612 19:15:00.372030 6181 solver.cpp:406] Test net output #48: loss1/loss22 = 0.00118536 (* 0.0272727 = 3.23279e-05 loss)
I0612 19:15:00.372042 6181 solver.cpp:406] Test net output #49: loss2/accuracy = 0.761263
I0612 19:15:00.372054 6181 solver.cpp:406] Test net output #50: loss2/accuracy01 = 0.85
I0612 19:15:00.372066 6181 solver.cpp:406] Test net output #51: loss2/accuracy02 = 0.827
I0612 19:15:00.372079 6181 solver.cpp:406] Test net output #52: loss2/accuracy03 = 0.741
I0612 19:15:00.372092 6181 solver.cpp:406] Test net output #53: loss2/accuracy04 = 0.661
I0612 19:15:00.372102 6181 solver.cpp:406] Test net output #54: loss2/accuracy05 = 0.608
I0612 19:15:00.372114 6181 solver.cpp:406] Test net output #55: loss2/accuracy06 = 0.799
I0612 19:15:00.372126 6181 solver.cpp:406] Test net output #56: loss2/accuracy07 = 0.873
I0612 19:15:00.372138 6181 solver.cpp:406] Test net output #57: loss2/accuracy08 = 0.934
I0612 19:15:00.372149 6181 solver.cpp:406] Test net output #58: loss2/accuracy09 = 0.964
I0612 19:15:00.372160 6181 solver.cpp:406] Test net output #59: loss2/accuracy10 = 0.982
I0612 19:15:00.372172 6181 solver.cpp:406] Test net output #60: loss2/accuracy11 = 0.996
I0612 19:15:00.372184 6181 solver.cpp:406] Test net output #61: loss2/accuracy12 = 0.998
I0612 19:15:00.372195 6181 solver.cpp:406] Test net output #62: loss2/accuracy13 = 0.999
I0612 19:15:00.372207 6181 solver.cpp:406] Test net output #63: loss2/accuracy14 = 1
I0612 19:15:00.372220 6181 solver.cpp:406] Test net output #64: loss2/accuracy15 = 1
I0612 19:15:00.372233 6181 solver.cpp:406] Test net output #65: loss2/accuracy16 = 1
I0612 19:15:00.372246 6181 solver.cpp:406] Test net output #66: loss2/accuracy17 = 1
I0612 19:15:00.372256 6181 solver.cpp:406] Test net output #67: loss2/accuracy18 = 1
I0612 19:15:00.372267 6181 solver.cpp:406] Test net output #68: loss2/accuracy19 = 1
I0612 19:15:00.372279 6181 solver.cpp:406] Test net output #69: loss2/accuracy20 = 1
I0612 19:15:00.372290 6181 solver.cpp:406] Test net output #70: loss2/accuracy21 = 1
I0612 19:15:00.372301 6181 solver.cpp:406] Test net output #71: loss2/accuracy22 = 1
I0612 19:15:00.372313 6181 solver.cpp:406] Test net output #72: loss2/accuracy_incl_empty = 0.927773
I0612 19:15:00.372324 6181 solver.cpp:406] Test net output #73: loss2/accuracy_top3 = 0.89851
I0612 19:15:00.372339 6181 solver.cpp:406] Test net output #74: loss2/cross_entropy_loss = 0.950581 (* 0.3 = 0.285174 loss)
I0612 19:15:00.372352 6181 solver.cpp:406] Test net output #75: loss2/cross_entropy_loss_incl_empty = 0.286262 (* 0.3 = 0.0858785 loss)
I0612 19:15:00.372369 6181 solver.cpp:406] Test net output #76: loss2/loss01 = 0.691065 (* 0.0272727 = 0.0188472 loss)
I0612 19:15:00.372383 6181 solver.cpp:406] Test net output #77: loss2/loss02 = 0.854227 (* 0.0272727 = 0.0232971 loss)
I0612 19:15:00.372408 6181 solver.cpp:406] Test net output #78: loss2/loss03 = 1.05478 (* 0.0272727 = 0.0287667 loss)
I0612 19:15:00.372423 6181 solver.cpp:406] Test net output #79: loss2/loss04 = 1.25995 (* 0.0272727 = 0.0343622 loss)
I0612 19:15:00.372437 6181 solver.cpp:406] Test net output #80: loss2/loss05 = 1.1905 (* 0.0272727 = 0.0324681 loss)
I0612 19:15:00.372452 6181 solver.cpp:406] Test net output #81: loss2/loss06 = 0.769909 (* 0.0272727 = 0.0209975 loss)
I0612 19:15:00.372464 6181 solver.cpp:406] Test net output #82: loss2/loss07 = 0.438075 (* 0.0272727 = 0.0119475 loss)
I0612 19:15:00.372479 6181 solver.cpp:406] Test net output #83: loss2/loss08 = 0.239106 (* 0.0272727 = 0.00652106 loss)
I0612 19:15:00.372493 6181 solver.cpp:406] Test net output #84: loss2/loss09 = 0.12758 (* 0.0272727 = 0.00347944 loss)
I0612 19:15:00.372506 6181 solver.cpp:406] Test net output #85: loss2/loss10 = 0.077839 (* 0.0272727 = 0.00212288 loss)
I0612 19:15:00.372519 6181 solver.cpp:406] Test net output #86: loss2/loss11 = 0.0227762 (* 0.0272727 = 0.000621169 loss)
I0612 19:15:00.372534 6181 solver.cpp:406] Test net output #87: loss2/loss12 = 0.0131654 (* 0.0272727 = 0.000359057 loss)
I0612 19:15:00.372547 6181 solver.cpp:406] Test net output #88: loss2/loss13 = 0.00897096 (* 0.0272727 = 0.000244663 loss)
I0612 19:15:00.372561 6181 solver.cpp:406] Test net output #89: loss2/loss14 = 0.00662881 (* 0.0272727 = 0.000180786 loss)
I0612 19:15:00.372575 6181 solver.cpp:406] Test net output #90: loss2/loss15 = 0.00514662 (* 0.0272727 = 0.000140362 loss)
I0612 19:15:00.372588 6181 solver.cpp:406] Test net output #91: loss2/loss16 = 0.00437843 (* 0.0272727 = 0.000119412 loss)
I0612 19:15:00.372603 6181 solver.cpp:406] Test net output #92: loss2/loss17 = 0.00418106 (* 0.0272727 = 0.000114029 loss)
I0612 19:15:00.372617 6181 solver.cpp:406] Test net output #93: loss2/loss18 = 0.00398526 (* 0.0272727 = 0.000108689 loss)
I0612 19:15:00.372632 6181 solver.cpp:406] Test net output #94: loss2/loss19 = 0.00354514 (* 0.0272727 = 9.66856e-05 loss)
I0612 19:15:00.372645 6181 solver.cpp:406] Test net output #95: loss2/loss20 = 0.00354338 (* 0.0272727 = 9.66376e-05 loss)
I0612 19:15:00.372659 6181 solver.cpp:406] Test net output #96: loss2/loss21 = 0.0030808 (* 0.0272727 = 8.40219e-05 loss)
I0612 19:15:00.372673 6181 solver.cpp:406] Test net output #97: loss2/loss22 = 0.0024535 (* 0.0272727 = 6.69136e-05 loss)
I0612 19:15:00.372685 6181 solver.cpp:406] Test net output #98: loss3/accuracy = 0.851529
I0612 19:15:00.372697 6181 solver.cpp:406] Test net output #99: loss3/accuracy01 = 0.874
I0612 19:15:00.372709 6181 solver.cpp:406] Test net output #100: loss3/accuracy02 = 0.867
I0612 19:15:00.372720 6181 solver.cpp:406] Test net output #101: loss3/accuracy03 = 0.844
I0612 19:15:00.372732 6181 solver.cpp:406] Test net output #102: loss3/accuracy04 = 0.846
I0612 19:15:00.372743 6181 solver.cpp:406] Test net output #103: loss3/accuracy05 = 0.823
I0612 19:15:00.372755 6181 solver.cpp:406] Test net output #104: loss3/accuracy06 = 0.877
I0612 19:15:00.372766 6181 solver.cpp:406] Test net output #105: loss3/accuracy07 = 0.91
I0612 19:15:00.372781 6181 solver.cpp:406] Test net output #106: loss3/accuracy08 = 0.944
I0612 19:15:00.372789 6181 solver.cpp:406] Test net output #107: loss3/accuracy09 = 0.968
I0612 19:15:00.372802 6181 solver.cpp:406] Test net output #108: loss3/accuracy10 = 0.984
I0612 19:15:00.372813 6181 solver.cpp:406] Test net output #109: loss3/accuracy11 = 0.996
I0612 19:15:00.372825 6181 solver.cpp:406] Test net output #110: loss3/accuracy12 = 0.997
I0612 19:15:00.372836 6181 solver.cpp:406] Test net output #111: loss3/accuracy13 = 0.998
I0612 19:15:00.372848 6181 solver.cpp:406] Test net output #112: loss3/accuracy14 = 0.999
I0612 19:15:00.372859 6181 solver.cpp:406] Test net output #113: loss3/accuracy15 = 0.999
I0612 19:15:00.372871 6181 solver.cpp:406] Test net output #114: loss3/accuracy16 = 1
I0612 19:15:00.372894 6181 solver.cpp:406] Test net output #115: loss3/accuracy17 = 1
I0612 19:15:00.372905 6181 solver.cpp:406] Test net output #116: loss3/accuracy18 = 1
I0612 19:15:00.372917 6181 solver.cpp:406] Test net output #117: loss3/accuracy19 = 1
I0612 19:15:00.372928 6181 solver.cpp:406] Test net output #118: loss3/accuracy20 = 1
I0612 19:15:00.372939 6181 solver.cpp:406] Test net output #119: loss3/accuracy21 = 1
I0612 19:15:00.372951 6181 solver.cpp:406] Test net output #120: loss3/accuracy22 = 1
I0612 19:15:00.372962 6181 solver.cpp:406] Test net output #121: loss3/accuracy_incl_empty = 0.951318
I0612 19:15:00.372973 6181 solver.cpp:406] Test net output #122: loss3/accuracy_top3 = 0.924641
I0612 19:15:00.372988 6181 solver.cpp:406] Test net output #123: loss3/cross_entropy_loss = 0.670462 (* 1 = 0.670462 loss)
I0612 19:15:00.373002 6181 solver.cpp:406] Test net output #124: loss3/cross_entropy_loss_incl_empty = 0.220052 (* 1 = 0.220052 loss)
I0612 19:15:00.373015 6181 solver.cpp:406] Test net output #125: loss3/loss01 = 0.55372 (* 0.0909091 = 0.0503382 loss)
I0612 19:15:00.373029 6181 solver.cpp:406] Test net output #126: loss3/loss02 = 0.627303 (* 0.0909091 = 0.0570275 loss)
I0612 19:15:00.373044 6181 solver.cpp:406] Test net output #127: loss3/loss03 = 0.724081 (* 0.0909091 = 0.0658255 loss)
I0612 19:15:00.373057 6181 solver.cpp:406] Test net output #128: loss3/loss04 = 0.72591 (* 0.0909091 = 0.0659918 loss)
I0612 19:15:00.373070 6181 solver.cpp:406] Test net output #129: loss3/loss05 = 0.749973 (* 0.0909091 = 0.0681794 loss)
I0612 19:15:00.373085 6181 solver.cpp:406] Test net output #130: loss3/loss06 = 0.518669 (* 0.0909091 = 0.0471517 loss)
I0612 19:15:00.373097 6181 solver.cpp:406] Test net output #131: loss3/loss07 = 0.345149 (* 0.0909091 = 0.0313771 loss)
I0612 19:15:00.373111 6181 solver.cpp:406] Test net output #132: loss3/loss08 = 0.218271 (* 0.0909091 = 0.0198428 loss)
I0612 19:15:00.373126 6181 solver.cpp:406] Test net output #133: loss3/loss09 = 0.107893 (* 0.0909091 = 0.00980846 loss)
I0612 19:15:00.373139 6181 solver.cpp:406] Test net output #134: loss3/loss10 = 0.0654004 (* 0.0909091 = 0.00594549 loss)
I0612 19:15:00.373152 6181 solver.cpp:406] Test net output #135: loss3/loss11 = 0.0240047 (* 0.0909091 = 0.00218225 loss)
I0612 19:15:00.373167 6181 solver.cpp:406] Test net output #136: loss3/loss12 = 0.0145055 (* 0.0909091 = 0.00131868 loss)
I0612 19:15:00.373180 6181 solver.cpp:406] Test net output #137: loss3/loss13 = 0.0098429 (* 0.0909091 = 0.000894809 loss)
I0612 19:15:00.373193 6181 solver.cpp:406] Test net output #138: loss3/loss14 = 0.00625184 (* 0.0909091 = 0.000568349 loss)
I0612 19:15:00.373208 6181 solver.cpp:406] Test net output #139: loss3/loss15 = 0.00494648 (* 0.0909091 = 0.00044968 loss)
I0612 19:15:00.373220 6181 solver.cpp:406] Test net output #140: loss3/loss16 = 0.0036255 (* 0.0909091 = 0.000329591 loss)
I0612 19:15:00.373234 6181 solver.cpp:406] Test net output #141: loss3/loss17 = 0.00309581 (* 0.0909091 = 0.000281437 loss)
I0612 19:15:00.373248 6181 solver.cpp:406] Test net output #142: loss3/loss18 = 0.00266371 (* 0.0909091 = 0.000242155 loss)
I0612 19:15:00.373261 6181 solver.cpp:406] Test net output #143: loss3/loss19 = 0.00246768 (* 0.0909091 = 0.000224334 loss)
I0612 19:15:00.373291 6181 solver.cpp:406] Test net output #144: loss3/loss20 = 0.00195723 (* 0.0909091 = 0.00017793 loss)
I0612 19:15:00.373307 6181 solver.cpp:406] Test net output #145: loss3/loss21 = 0.00161238 (* 0.0909091 = 0.00014658 loss)
I0612 19:15:00.373327 6181 solver.cpp:406] Test net output #146: loss3/loss22 = 0.00107178 (* 0.0909091 = 9.74344e-05 loss)
I0612 19:15:00.373350 6181 solver.cpp:406] Test net output #147: total_accuracy = 0.589
I0612 19:15:00.373373 6181 solver.cpp:406] Test net output #148: total_accuracy_not_rec = 0.557
I0612 19:15:00.373388 6181 solver.cpp:406] Test net output #149: total_confidence = 0.505461
I0612 19:15:00.373411 6181 solver.cpp:406] Test net output #150: total_confidence_not_rec = 0.469346
I0612 19:15:00.373430 6181 solver.cpp:338] Iteration 20000, Testing net (#1)
I0612 19:15:58.120465 6181 solver.cpp:393] Test loss: 3.7316
I0612 19:15:58.120589 6181 solver.cpp:406] Test net output #0: loss1/accuracy = 0.563894
I0612 19:15:58.120611 6181 solver.cpp:406] Test net output #1: loss1/accuracy01 = 0.778
I0612 19:15:58.120625 6181 solver.cpp:406] Test net output #2: loss1/accuracy02 = 0.65
I0612 19:15:58.120638 6181 solver.cpp:406] Test net output #3: loss1/accuracy03 = 0.531
I0612 19:15:58.120651 6181 solver.cpp:406] Test net output #4: loss1/accuracy04 = 0.48
I0612 19:15:58.120664 6181 solver.cpp:406] Test net output #5: loss1/accuracy05 = 0.491
I0612 19:15:58.120677 6181 solver.cpp:406] Test net output #6: loss1/accuracy06 = 0.641
I0612 19:15:58.120689 6181 solver.cpp:406] Test net output #7: loss1/accuracy07 = 0.759
I0612 19:15:58.120702 6181 solver.cpp:406] Test net output #8: loss1/accuracy08 = 0.81
I0612 19:15:58.120714 6181 solver.cpp:406] Test net output #9: loss1/accuracy09 = 0.847
I0612 19:15:58.120726 6181 solver.cpp:406] Test net output #10: loss1/accuracy10 = 0.863
I0612 19:15:58.120739 6181 solver.cpp:406] Test net output #11: loss1/accuracy11 = 0.891
I0612 19:15:58.120753 6181 solver.cpp:406] Test net output #12: loss1/accuracy12 = 0.905
I0612 19:15:58.120764 6181 solver.cpp:406] Test net output #13: loss1/accuracy13 = 0.917
I0612 19:15:58.120776 6181 solver.cpp:406] Test net output #14: loss1/accuracy14 = 0.939
I0612 19:15:58.120789 6181 solver.cpp:406] Test net output #15: loss1/accuracy15 = 0.952
I0612 19:15:58.120800 6181 solver.cpp:406] Test net output #16: loss1/accuracy16 = 0.965
I0612 19:15:58.120812 6181 solver.cpp:406] Test net output #17: loss1/accuracy17 = 0.981
I0612 19:15:58.120825 6181 solver.cpp:406] Test net output #18: loss1/accuracy18 = 0.983
I0612 19:15:58.120836 6181 solver.cpp:406] Test net output #19: loss1/accuracy19 = 0.984
I0612 19:15:58.120848 6181 solver.cpp:406] Test net output #20: loss1/accuracy20 = 0.991
I0612 19:15:58.120860 6181 solver.cpp:406] Test net output #21: loss1/accuracy21 = 0.997
I0612 19:15:58.120872 6181 solver.cpp:406] Test net output #22: loss1/accuracy22 = 0.997
I0612 19:15:58.120884 6181 solver.cpp:406] Test net output #23: loss1/accuracy_incl_empty = 0.840684
I0612 19:15:58.120896 6181 solver.cpp:406] Test net output #24: loss1/accuracy_top3 = 0.799624
I0612 19:15:58.120913 6181 solver.cpp:406] Test net output #25: loss1/cross_entropy_loss = 1.58889 (* 0.3 = 0.476666 loss)
I0612 19:15:58.120928 6181 solver.cpp:406] Test net output #26: loss1/cross_entropy_loss_incl_empty = 0.600014 (* 0.3 = 0.180004 loss)
I0612 19:15:58.120942 6181 solver.cpp:406] Test net output #27: loss1/loss01 = 0.981168 (* 0.0272727 = 0.0267591 loss)
I0612 19:15:58.120956 6181 solver.cpp:406] Test net output #28: loss1/loss02 = 1.35505 (* 0.0272727 = 0.0369559 loss)
I0612 19:15:58.120970 6181 solver.cpp:406] Test net output #29: loss1/loss03 = 1.72628 (* 0.0272727 = 0.0470804 loss)
I0612 19:15:58.120985 6181 solver.cpp:406] Test net output #30: loss1/loss04 = 1.86909 (* 0.0272727 = 0.0509751 loss)
I0612 19:15:58.120998 6181 solver.cpp:406] Test net output #31: loss1/loss05 = 1.70557 (* 0.0272727 = 0.0465156 loss)
I0612 19:15:58.121011 6181 solver.cpp:406] Test net output #32: loss1/loss06 = 1.31603 (* 0.0272727 = 0.0358916 loss)
I0612 19:15:58.121026 6181 solver.cpp:406] Test net output #33: loss1/loss07 = 0.891127 (* 0.0272727 = 0.0243035 loss)
I0612 19:15:58.121039 6181 solver.cpp:406] Test net output #34: loss1/loss08 = 0.750286 (* 0.0272727 = 0.0204624 loss)
I0612 19:15:58.121053 6181 solver.cpp:406] Test net output #35: loss1/loss09 = 0.64046 (* 0.0272727 = 0.0174671 loss)
I0612 19:15:58.121068 6181 solver.cpp:406] Test net output #36: loss1/loss10 = 0.557295 (* 0.0272727 = 0.0151989 loss)
I0612 19:15:58.121081 6181 solver.cpp:406] Test net output #37: loss1/loss11 = 0.489586 (* 0.0272727 = 0.0133523 loss)
I0612 19:15:58.121095 6181 solver.cpp:406] Test net output #38: loss1/loss12 = 0.417085 (* 0.0272727 = 0.011375 loss)
I0612 19:15:58.121130 6181 solver.cpp:406] Test net output #39: loss1/loss13 = 0.3637 (* 0.0272727 = 0.00991909 loss)
I0612 19:15:58.121146 6181 solver.cpp:406] Test net output #40: loss1/loss14 = 0.276434 (* 0.0272727 = 0.00753912 loss)
I0612 19:15:58.121160 6181 solver.cpp:406] Test net output #41: loss1/loss15 = 0.240415 (* 0.0272727 = 0.00655677 loss)
I0612 19:15:58.121175 6181 solver.cpp:406] Test net output #42: loss1/loss16 = 0.200588 (* 0.0272727 = 0.00547058 loss)
I0612 19:15:58.121188 6181 solver.cpp:406] Test net output #43: loss1/loss17 = 0.139057 (* 0.0272727 = 0.00379246 loss)
I0612 19:15:58.121202 6181 solver.cpp:406] Test net output #44: loss1/loss18 = 0.128441 (* 0.0272727 = 0.00350293 loss)
I0612 19:15:58.121217 6181 solver.cpp:406] Test net output #45: loss1/loss19 = 0.119776 (* 0.0272727 = 0.00326661 loss)
I0612 19:15:58.121234 6181 solver.cpp:406] Test net output #46: loss1/loss20 = 0.0664326 (* 0.0272727 = 0.0018118 loss)
I0612 19:15:58.121248 6181 solver.cpp:406] Test net output #47: loss1/loss21 = 0.0271356 (* 0.0272727 = 0.000740061 loss)
I0612 19:15:58.121263 6181 solver.cpp:406] Test net output #48: loss1/loss22 = 0.0275285 (* 0.0272727 = 0.000750776 loss)
I0612 19:15:58.121275 6181 solver.cpp:406] Test net output #49: loss2/accuracy = 0.70686
I0612 19:15:58.121287 6181 solver.cpp:406] Test net output #50: loss2/accuracy01 = 0.862
I0612 19:15:58.121299 6181 solver.cpp:406] Test net output #51: loss2/accuracy02 = 0.834
I0612 19:15:58.121310 6181 solver.cpp:406] Test net output #52: loss2/accuracy03 = 0.731
I0612 19:15:58.121336 6181 solver.cpp:406] Test net output #53: loss2/accuracy04 = 0.619
I0612 19:15:58.121351 6181 solver.cpp:406] Test net output #54: loss2/accuracy05 = 0.616
I0612 19:15:58.121362 6181 solver.cpp:406] Test net output #55: loss2/accuracy06 = 0.716
I0612 19:15:58.121374 6181 solver.cpp:406] Test net output #56: loss2/accuracy07 = 0.789
I0612 19:15:58.121387 6181 solver.cpp:406] Test net output #57: loss2/accuracy08 = 0.833
I0612 19:15:58.121397 6181 solver.cpp:406] Test net output #58: loss2/accuracy09 = 0.86
I0612 19:15:58.121409 6181 solver.cpp:406] Test net output #59: loss2/accuracy10 = 0.877
I0612 19:15:58.121422 6181 solver.cpp:406] Test net output #60: loss2/accuracy11 = 0.893
I0612 19:15:58.121433 6181 solver.cpp:406] Test net output #61: loss2/accuracy12 = 0.917
I0612 19:15:58.121445 6181 solver.cpp:406] Test net output #62: loss2/accuracy13 = 0.918
I0612 19:15:58.121456 6181 solver.cpp:406] Test net output #63: loss2/accuracy14 = 0.938
I0612 19:15:58.121469 6181 solver.cpp:406] Test net output #64: loss2/accuracy15 = 0.953
I0612 19:15:58.121480 6181 solver.cpp:406] Test net output #65: loss2/accuracy16 = 0.965
I0612 19:15:58.121492 6181 solver.cpp:406] Test net output #66: loss2/accuracy17 = 0.982
I0612 19:15:58.121503 6181 solver.cpp:406] Test net output #67: loss2/accuracy18 = 0.983
I0612 19:15:58.121515 6181 solver.cpp:406] Test net output #68: loss2/accuracy19 = 0.984
I0612 19:15:58.121527 6181 solver.cpp:406] Test net output #69: loss2/accuracy20 = 0.991
I0612 19:15:58.121538 6181 solver.cpp:406] Test net output #70: loss2/accuracy21 = 0.997
I0612 19:15:58.121551 6181 solver.cpp:406] Test net output #71: loss2/accuracy22 = 0.997
I0612 19:15:58.121562 6181 solver.cpp:406] Test net output #72: loss2/accuracy_incl_empty = 0.881274
I0612 19:15:58.121574 6181 solver.cpp:406] Test net output #73: loss2/accuracy_top3 = 0.870626
I0612 19:15:58.121588 6181 solver.cpp:406] Test net output #74: loss2/cross_entropy_loss = 1.13888 (* 0.3 = 0.341663 loss)
I0612 19:15:58.121603 6181 solver.cpp:406] Test net output #75: loss2/cross_entropy_loss_incl_empty = 0.474682 (* 0.3 = 0.142404 loss)
I0612 19:15:58.121621 6181 solver.cpp:406] Test net output #76: loss2/loss01 = 0.704846 (* 0.0272727 = 0.0192231 loss)
I0612 19:15:58.121635 6181 solver.cpp:406] Test net output #77: loss2/loss02 = 0.791259 (* 0.0272727 = 0.0215798 loss)
I0612 19:15:58.121661 6181 solver.cpp:406] Test net output #78: loss2/loss03 = 1.09243 (* 0.0272727 = 0.0297934 loss)
I0612 19:15:58.121676 6181 solver.cpp:406] Test net output #79: loss2/loss04 = 1.2735 (* 0.0272727 = 0.0347319 loss)
I0612 19:15:58.121690 6181 solver.cpp:406] Test net output #80: loss2/loss05 = 1.29032 (* 0.0272727 = 0.0351906 loss)
I0612 19:15:58.121703 6181 solver.cpp:406] Test net output #81: loss2/loss06 = 1.06333 (* 0.0272727 = 0.0289999 loss)
I0612 19:15:58.121717 6181 solver.cpp:406] Test net output #82: loss2/loss07 = 0.747945 (* 0.0272727 = 0.0203985 loss)
I0612 19:15:58.121731 6181 solver.cpp:406] Test net output #83: loss2/loss08 = 0.652099 (* 0.0272727 = 0.0177845 loss)
I0612 19:15:58.121744 6181 solver.cpp:406] Test net output #84: loss2/loss09 = 0.55861 (* 0.0272727 = 0.0152348 loss)
I0612 19:15:58.121758 6181 solver.cpp:406] Test net output #85: loss2/loss10 = 0.487537 (* 0.0272727 = 0.0132965 loss)
I0612 19:15:58.121772 6181 solver.cpp:406] Test net output #86: loss2/loss11 = 0.432568 (* 0.0272727 = 0.0117973 loss)
I0612 19:15:58.121785 6181 solver.cpp:406] Test net output #87: loss2/loss12 = 0.36198 (* 0.0272727 = 0.00987219 loss)
I0612 19:15:58.121799 6181 solver.cpp:406] Test net output #88: loss2/loss13 = 0.328359 (* 0.0272727 = 0.00895525 loss)
I0612 19:15:58.121814 6181 solver.cpp:406] Test net output #89: loss2/loss14 = 0.246673 (* 0.0272727 = 0.00672745 loss)
I0612 19:15:58.121827 6181 solver.cpp:406] Test net output #90: loss2/loss15 = 0.218173 (* 0.0272727 = 0.00595016 loss)
I0612 19:15:58.121841 6181 solver.cpp:406] Test net output #91: loss2/loss16 = 0.183024 (* 0.0272727 = 0.00499156 loss)
I0612 19:15:58.121855 6181 solver.cpp:406] Test net output #92: loss2/loss17 = 0.130642 (* 0.0272727 = 0.00356296 loss)
I0612 19:15:58.121870 6181 solver.cpp:406] Test net output #93: loss2/loss18 = 0.117865 (* 0.0272727 = 0.00321451 loss)
I0612 19:15:58.121884 6181 solver.cpp:406] Test net output #94: loss2/loss19 = 0.115502 (* 0.0272727 = 0.00315005 loss)
I0612 19:15:58.121898 6181 solver.cpp:406] Test net output #95: loss2/loss20 = 0.0606245 (* 0.0272727 = 0.0016534 loss)
I0612 19:15:58.121912 6181 solver.cpp:406] Test net output #96: loss2/loss21 = 0.028835 (* 0.0272727 = 0.000786409 loss)
I0612 19:15:58.121927 6181 solver.cpp:406] Test net output #97: loss2/loss22 = 0.0274531 (* 0.0272727 = 0.000748722 loss)
I0612 19:15:58.121939 6181 solver.cpp:406] Test net output #98: loss3/accuracy = 0.808679
I0612 19:15:58.121951 6181 solver.cpp:406] Test net output #99: loss3/accuracy01 = 0.875
I0612 19:15:58.121963 6181 solver.cpp:406] Test net output #100: loss3/accuracy02 = 0.868
I0612 19:15:58.121974 6181 solver.cpp:406] Test net output #101: loss3/accuracy03 = 0.84
I0612 19:15:58.121986 6181 solver.cpp:406] Test net output #102: loss3/accuracy04 = 0.827
I0612 19:15:58.121999 6181 solver.cpp:406] Test net output #103: loss3/accuracy05 = 0.793
I0612 19:15:58.122009 6181 solver.cpp:406] Test net output #104: loss3/accuracy06 = 0.834
I0612 19:15:58.122021 6181 solver.cpp:406] Test net output #105: loss3/accuracy07 = 0.88
I0612 19:15:58.122033 6181 solver.cpp:406] Test net output #106: loss3/accuracy08 = 0.876
I0612 19:15:58.122045 6181 solver.cpp:406] Test net output #107: loss3/accuracy09 = 0.897
I0612 19:15:58.122056 6181 solver.cpp:406] Test net output #108: loss3/accuracy10 = 0.901
I0612 19:15:58.122068 6181 solver.cpp:406] Test net output #109: loss3/accuracy11 = 0.909
I0612 19:15:58.122079 6181 solver.cpp:406] Test net output #110: loss3/accuracy12 = 0.93
I0612 19:15:58.122092 6181 solver.cpp:406] Test net output #111: loss3/accuracy13 = 0.945
I0612 19:15:58.122102 6181 solver.cpp:406] Test net output #112: loss3/accuracy14 = 0.95
I0612 19:15:58.122114 6181 solver.cpp:406] Test net output #113: loss3/accuracy15 = 0.964
I0612 19:15:58.122126 6181 solver.cpp:406] Test net output #114: loss3/accuracy16 = 0.972
I0612 19:15:58.122148 6181 solver.cpp:406] Test net output #115: loss3/accuracy17 = 0.984
I0612 19:15:58.122160 6181 solver.cpp:406] Test net output #116: loss3/accuracy18 = 0.986
I0612 19:15:58.122172 6181 solver.cpp:406] Test net output #117: loss3/accuracy19 = 0.983
I0612 19:15:58.122184 6181 solver.cpp:406] Test net output #118: loss3/accuracy20 = 0.993
I0612 19:15:58.122195 6181 solver.cpp:406] Test net output #119: loss3/accuracy21 = 0.997
I0612 19:15:58.122207 6181 solver.cpp:406] Test net output #120: loss3/accuracy22 = 0.997
I0612 19:15:58.122220 6181 solver.cpp:406] Test net output #121: loss3/accuracy_incl_empty = 0.916818
I0612 19:15:58.122231 6181 solver.cpp:406] Test net output #122: loss3/accuracy_top3 = 0.90449
I0612 19:15:58.122246 6181 solver.cpp:406] Test net output #123: loss3/cross_entropy_loss = 0.827024 (* 1 = 0.827024 loss)
I0612 19:15:58.122259 6181 solver.cpp:406] Test net output #124: loss3/cross_entropy_loss_incl_empty = 0.354003 (* 1 = 0.354003 loss)
I0612 19:15:58.122275 6181 solver.cpp:406] Test net output #125: loss3/loss01 = 0.585007 (* 0.0909091 = 0.0531825 loss)
I0612 19:15:58.122290 6181 solver.cpp:406] Test net output #126: loss3/loss02 = 0.621985 (* 0.0909091 = 0.0565441 loss)
I0612 19:15:58.122304 6181 solver.cpp:406] Test net output #127: loss3/loss03 = 0.786851 (* 0.0909091 = 0.0715319 loss)
I0612 19:15:58.122318 6181 solver.cpp:406] Test net output #128: loss3/loss04 = 0.795425 (* 0.0909091 = 0.0723114 loss)
I0612 19:15:58.122333 6181 solver.cpp:406] Test net output #129: loss3/loss05 = 0.884955 (* 0.0909091 = 0.0804505 loss)
I0612 19:15:58.122346 6181 solver.cpp:406] Test net output #130: loss3/loss06 = 0.721322 (* 0.0909091 = 0.0655748 loss)
I0612 19:15:58.122360 6181 solver.cpp:406] Test net output #131: loss3/loss07 = 0.504092 (* 0.0909091 = 0.0458265 loss)
I0612 19:15:58.122376 6181 solver.cpp:406] Test net output #132: loss3/loss08 = 0.476896 (* 0.0909091 = 0.0433542 loss)
I0612 19:15:58.122390 6181 solver.cpp:406] Test net output #133: loss3/loss09 = 0.414555 (* 0.0909091 = 0.0376868 loss)
I0612 19:15:58.122406 6181 solver.cpp:406] Test net output #134: loss3/loss10 = 0.376591 (* 0.0909091 = 0.0342356 loss)
I0612 19:15:58.122419 6181 solver.cpp:406] Test net output #135: loss3/loss11 = 0.335452 (* 0.0909091 = 0.0304957 loss)
I0612 19:15:58.122433 6181 solver.cpp:406] Test net output #136: loss3/loss12 = 0.279781 (* 0.0909091 = 0.0254346 loss)
I0612 19:15:58.122447 6181 solver.cpp:406] Test net output #137: loss3/loss13 = 0.239387 (* 0.0909091 = 0.0217625 loss)
I0612 19:15:58.122457 6181 solver.cpp:406] Test net output #138: loss3/loss14 = 0.201832 (* 0.0909091 = 0.0183484 loss)
I0612 19:15:58.122472 6181 solver.cpp:406] Test net output #139: loss3/loss15 = 0.175708 (* 0.0909091 = 0.0159734 loss)
I0612 19:15:58.122485 6181 solver.cpp:406] Test net output #140: loss3/loss16 = 0.142818 (* 0.0909091 = 0.0129835 loss)
I0612 19:15:58.122499 6181 solver.cpp:406] Test net output #141: loss3/loss17 = 0.112717 (* 0.0909091 = 0.010247 loss)
I0612 19:15:58.122514 6181 solver.cpp:406] Test net output #142: loss3/loss18 = 0.0983437 (* 0.0909091 = 0.00894033 loss)
I0612 19:15:58.122527 6181 solver.cpp:406] Test net output #143: loss3/loss19 = 0.100469 (* 0.0909091 = 0.00913352 loss)
I0612 19:15:58.122541 6181 solver.cpp:406] Test net output #144: loss3/loss20 = 0.0474786 (* 0.0909091 = 0.00431624 loss)
I0612 19:15:58.122555 6181 solver.cpp:406] Test net output #145: loss3/loss21 = 0.0236228 (* 0.0909091 = 0.00214753 loss)
I0612 19:15:58.122570 6181 solver.cpp:406] Test net output #146: loss3/loss22 = 0.0222235 (* 0.0909091 = 0.00202032 loss)
I0612 19:15:58.122581 6181 solver.cpp:406] Test net output #147: total_accuracy = 0.495
I0612 19:15:58.122592 6181 solver.cpp:406] Test net output #148: total_accuracy_not_rec = 0.48
I0612 19:15:58.122604 6181 solver.cpp:406] Test net output #149: total_confidence = 0.432387
I0612 19:15:58.122625 6181 solver.cpp:406] Test net output #150: total_confidence_not_rec = 0.401352
I0612 19:15:58.480612 6181 solver.cpp:229] Iteration 20000, loss = 3.87061
I0612 19:15:58.480667 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.478261
I0612 19:15:58.480685 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.875
I0612 19:15:58.480698 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.375
I0612 19:15:58.480712 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 19:15:58.480725 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.625
I0612 19:15:58.480737 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.75
I0612 19:15:58.480751 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.375
I0612 19:15:58.480762 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.875
I0612 19:15:58.480775 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 1
I0612 19:15:58.480788 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 19:15:58.480801 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 1
I0612 19:15:58.480813 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 19:15:58.480826 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 19:15:58.480839 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 19:15:58.480851 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 19:15:58.480864 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 19:15:58.480875 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 19:15:58.480887 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 19:15:58.480900 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 19:15:58.480912 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 19:15:58.480924 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 19:15:58.480936 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 19:15:58.480947 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 19:15:58.480960 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.857955
I0612 19:15:58.480973 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.782609
I0612 19:15:58.480989 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.73003 (* 0.3 = 0.519009 loss)
I0612 19:15:58.481003 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.525116 (* 0.3 = 0.157535 loss)
I0612 19:15:58.481019 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.908619 (* 0.0272727 = 0.0247805 loss)
I0612 19:15:58.481034 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 2.21697 (* 0.0272727 = 0.0604629 loss)
I0612 19:15:58.481047 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 2.01649 (* 0.0272727 = 0.0549952 loss)
I0612 19:15:58.481061 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.42718 (* 0.0272727 = 0.038923 loss)
I0612 19:15:58.481076 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.39895 (* 0.0272727 = 0.0381533 loss)
I0612 19:15:58.481092 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.13318 (* 0.0272727 = 0.0581776 loss)
I0612 19:15:58.481107 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 0.481001 (* 0.0272727 = 0.0131182 loss)
I0612 19:15:58.481122 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.263355 (* 0.0272727 = 0.00718242 loss)
I0612 19:15:58.481137 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.529843 (* 0.0272727 = 0.0144503 loss)
I0612 19:15:58.481150 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.233151 (* 0.0272727 = 0.00635865 loss)
I0612 19:15:58.481164 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.301823 (* 0.0272727 = 0.00823153 loss)
I0612 19:15:58.481204 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.162256 (* 0.0272727 = 0.00442517 loss)
I0612 19:15:58.481220 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.0497982 (* 0.0272727 = 0.00135813 loss)
I0612 19:15:58.481235 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.039696 (* 0.0272727 = 0.00108262 loss)
I0612 19:15:58.481248 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.00700785 (* 0.0272727 = 0.000191123 loss)
I0612 19:15:58.481263 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.00491548 (* 0.0272727 = 0.000134059 loss)
I0612 19:15:58.481277 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.00161496 (* 0.0272727 = 4.40444e-05 loss)
I0612 19:15:58.481292 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000916354 (* 0.0272727 = 2.49915e-05 loss)
I0612 19:15:58.481307 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000444534 (* 0.0272727 = 1.21236e-05 loss)
I0612 19:15:58.481335 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000533165 (* 0.0272727 = 1.45409e-05 loss)
I0612 19:15:58.481353 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000238451 (* 0.0272727 = 6.50321e-06 loss)
I0612 19:15:58.481367 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.00010608 (* 0.0272727 = 2.89308e-06 loss)
I0612 19:15:58.481380 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.630435
I0612 19:15:58.481394 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 19:15:58.481405 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.625
I0612 19:15:58.481417 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.75
I0612 19:15:58.481429 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.875
I0612 19:15:58.481442 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.75
I0612 19:15:58.481454 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.5
I0612 19:15:58.481467 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.875
I0612 19:15:58.481477 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 19:15:58.481489 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 19:15:58.481501 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 19:15:58.481513 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 19:15:58.481524 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 1
I0612 19:15:58.481536 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 1
I0612 19:15:58.481549 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 1
I0612 19:15:58.481559 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 1
I0612 19:15:58.481571 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 19:15:58.481583 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 19:15:58.481595 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 19:15:58.481606 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 19:15:58.481617 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 19:15:58.481629 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 19:15:58.481642 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 19:15:58.481654 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.897727
I0612 19:15:58.481667 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.847826
I0612 19:15:58.481683 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.2203 (* 0.3 = 0.366089 loss)
I0612 19:15:58.481698 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.354579 (* 0.3 = 0.106374 loss)
I0612 19:15:58.481725 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.487155 (* 0.0272727 = 0.013286 loss)
I0612 19:15:58.481740 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 1.57662 (* 0.0272727 = 0.0429986 loss)
I0612 19:15:58.481755 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 0.695608 (* 0.0272727 = 0.0189711 loss)
I0612 19:15:58.481768 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 0.646967 (* 0.0272727 = 0.0176446 loss)
I0612 19:15:58.481782 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 1.12206 (* 0.0272727 = 0.0306017 loss)
I0612 19:15:58.481796 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 0.920339 (* 0.0272727 = 0.0251002 loss)
I0612 19:15:58.481812 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 0.319174 (* 0.0272727 = 0.00870474 loss)
I0612 19:15:58.481825 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.31904 (* 0.0272727 = 0.00870109 loss)
I0612 19:15:58.481839 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.452318 (* 0.0272727 = 0.012336 loss)
I0612 19:15:58.481853 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.271773 (* 0.0272727 = 0.00741199 loss)
I0612 19:15:58.481868 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.402881 (* 0.0272727 = 0.0109877 loss)
I0612 19:15:58.481883 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.0110602 (* 0.0272727 = 0.000301643 loss)
I0612 19:15:58.481896 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.00161968 (* 0.0272727 = 4.41731e-05 loss)
I0612 19:15:58.481910 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.000587847 (* 0.0272727 = 1.60322e-05 loss)
I0612 19:15:58.481925 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.000488774 (* 0.0272727 = 1.33302e-05 loss)
I0612 19:15:58.481938 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.00019812 (* 0.0272727 = 5.40327e-06 loss)
I0612 19:15:58.481952 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.000120173 (* 0.0272727 = 3.27743e-06 loss)
I0612 19:15:58.481966 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.000159653 (* 0.0272727 = 4.35417e-06 loss)
I0612 19:15:58.481981 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.000228441 (* 0.0272727 = 6.23022e-06 loss)
I0612 19:15:58.481995 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 6.10845e-05 (* 0.0272727 = 1.66594e-06 loss)
I0612 19:15:58.482009 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 2.3203e-05 (* 0.0272727 = 6.32808e-07 loss)
I0612 19:15:58.482024 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 2.49988e-05 (* 0.0272727 = 6.81785e-07 loss)
I0612 19:15:58.482036 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.847826
I0612 19:15:58.482049 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 19:15:58.482061 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.75
I0612 19:15:58.482074 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 19:15:58.482086 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 1
I0612 19:15:58.482098 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 1
I0612 19:15:58.482110 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.875
I0612 19:15:58.482121 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.875
I0612 19:15:58.482133 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 1
I0612 19:15:58.482147 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 19:15:58.482161 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 1
I0612 19:15:58.482172 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 1
I0612 19:15:58.482183 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 1
I0612 19:15:58.482194 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 1
I0612 19:15:58.482216 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 1
I0612 19:15:58.482230 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 1
I0612 19:15:58.482241 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 19:15:58.482254 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 19:15:58.482265 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 19:15:58.482276 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 19:15:58.482288 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 19:15:58.482300 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 19:15:58.482312 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 19:15:58.482323 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.960227
I0612 19:15:58.482336 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.978261
I0612 19:15:58.482350 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.475618 (* 1 = 0.475618 loss)
I0612 19:15:58.482363 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.132439 (* 1 = 0.132439 loss)
I0612 19:15:58.482381 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.369947 (* 0.0909091 = 0.0336315 loss)
I0612 19:15:58.482396 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.967949 (* 0.0909091 = 0.0879954 loss)
I0612 19:15:58.482410 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.331912 (* 0.0909091 = 0.0301738 loss)
I0612 19:15:58.482424 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.187291 (* 0.0909091 = 0.0170265 loss)
I0612 19:15:58.482439 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.245017 (* 0.0909091 = 0.0222743 loss)
I0612 19:15:58.482452 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.362569 (* 0.0909091 = 0.0329609 loss)
I0612 19:15:58.482467 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 0.205269 (* 0.0909091 = 0.0186608 loss)
I0612 19:15:58.482481 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.120926 (* 0.0909091 = 0.0109933 loss)
I0612 19:15:58.482496 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.34625 (* 0.0909091 = 0.0314772 loss)
I0612 19:15:58.482509 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.0939028 (* 0.0909091 = 0.00853662 loss)
I0612 19:15:58.482523 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.062015 (* 0.0909091 = 0.00563772 loss)
I0612 19:15:58.482537 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.0105926 (* 0.0909091 = 0.000962967 loss)
I0612 19:15:58.482552 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.00097362 (* 0.0909091 = 8.85109e-05 loss)
I0612 19:15:58.482565 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.000335516 (* 0.0909091 = 3.05014e-05 loss)
I0612 19:15:58.482579 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.000146463 (* 0.0909091 = 1.33148e-05 loss)
I0612 19:15:58.482594 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 3.54898e-05 (* 0.0909091 = 3.22634e-06 loss)
I0612 19:15:58.482609 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 2.53923e-05 (* 0.0909091 = 2.30839e-06 loss)
I0612 19:15:58.482622 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 1.9655e-05 (* 0.0909091 = 1.78682e-06 loss)
I0612 19:15:58.482636 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 2.54071e-05 (* 0.0909091 = 2.30974e-06 loss)
I0612 19:15:58.482647 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 1.98041e-05 (* 0.0909091 = 1.80037e-06 loss)
I0612 19:15:58.482662 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 1.79117e-05 (* 0.0909091 = 1.62834e-06 loss)
I0612 19:15:58.482676 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 1.88802e-05 (* 0.0909091 = 1.71638e-06 loss)
I0612 19:15:58.482698 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.375
I0612 19:15:58.482712 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.375
I0612 19:15:58.482727 6181 solver.cpp:245] Train net output #149: total_confidence = 0.305185
I0612 19:15:58.482739 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.293632
I0612 19:15:58.482753 6181 sgd_solver.cpp:106] Iteration 20000, lr = 0.001
I0612 19:16:25.846410 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.3888 > 30) by scale factor 0.955754
I0612 19:16:26.616281 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 47.8217 > 30) by scale factor 0.62733
I0612 19:17:16.759135 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 31.0054 > 30) by scale factor 0.967573
I0612 19:17:45.308020 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 32.1296 > 30) by scale factor 0.933717
I0612 19:19:02.454591 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 35.1385 > 30) by scale factor 0.853764
I0612 19:22:24.150566 6181 solver.cpp:229] Iteration 20500, loss = 3.95834
I0612 19:22:24.150677 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.577778
I0612 19:22:24.150697 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.75
I0612 19:22:24.150712 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.75
I0612 19:22:24.150724 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.625
I0612 19:22:24.150738 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.75
I0612 19:22:24.150750 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.625
I0612 19:22:24.150763 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.5
I0612 19:22:24.150776 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.875
I0612 19:22:24.150789 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.875
I0612 19:22:24.150802 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.875
I0612 19:22:24.150815 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 19:22:24.150827 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 19:22:24.150840 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 0.875
I0612 19:22:24.150854 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 0.875
I0612 19:22:24.150867 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 0.875
I0612 19:22:24.150879 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 19:22:24.150892 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 19:22:24.150902 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 19:22:24.150914 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 19:22:24.150926 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 19:22:24.150938 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 19:22:24.150949 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 19:22:24.150961 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 19:22:24.150974 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.840909
I0612 19:22:24.150985 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.8
I0612 19:22:24.151002 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.31452 (* 0.3 = 0.394355 loss)
I0612 19:22:24.151017 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.595025 (* 0.3 = 0.178508 loss)
I0612 19:22:24.151032 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 0.984542 (* 0.0272727 = 0.0268511 loss)
I0612 19:22:24.151051 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 0.98769 (* 0.0272727 = 0.026937 loss)
I0612 19:22:24.151065 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.5356 (* 0.0272727 = 0.0418799 loss)
I0612 19:22:24.151079 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 1.32464 (* 0.0272727 = 0.0361265 loss)
I0612 19:22:24.151093 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 1.51402 (* 0.0272727 = 0.0412913 loss)
I0612 19:22:24.151108 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 1.88614 (* 0.0272727 = 0.0514402 loss)
I0612 19:22:24.151123 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.09282 (* 0.0272727 = 0.0298041 loss)
I0612 19:22:24.151136 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 0.488163 (* 0.0272727 = 0.0133135 loss)
I0612 19:22:24.151150 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 0.53725 (* 0.0272727 = 0.0146523 loss)
I0612 19:22:24.151165 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.477766 (* 0.0272727 = 0.01303 loss)
I0612 19:22:24.151180 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.410837 (* 0.0272727 = 0.0112046 loss)
I0612 19:22:24.151193 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.309027 (* 0.0272727 = 0.00842802 loss)
I0612 19:22:24.151227 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.270161 (* 0.0272727 = 0.00736804 loss)
I0612 19:22:24.151243 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.22848 (* 0.0272727 = 0.00623129 loss)
I0612 19:22:24.151257 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.172351 (* 0.0272727 = 0.00470049 loss)
I0612 19:22:24.151273 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.0336287 (* 0.0272727 = 0.000917147 loss)
I0612 19:22:24.151286 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.000815397 (* 0.0272727 = 2.22381e-05 loss)
I0612 19:22:24.151300 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000237484 (* 0.0272727 = 6.47684e-06 loss)
I0612 19:22:24.151314 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000266473 (* 0.0272727 = 7.26745e-06 loss)
I0612 19:22:24.151329 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000120829 (* 0.0272727 = 3.29535e-06 loss)
I0612 19:22:24.151343 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000296497 (* 0.0272727 = 8.08629e-06 loss)
I0612 19:22:24.151357 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000307018 (* 0.0272727 = 8.37322e-06 loss)
I0612 19:22:24.151370 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.711111
I0612 19:22:24.151382 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.75
I0612 19:22:24.151394 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.75
I0612 19:22:24.151407 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.625
I0612 19:22:24.151418 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.75
I0612 19:22:24.151430 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.75
I0612 19:22:24.151443 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.625
I0612 19:22:24.151454 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 19:22:24.151465 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.875
I0612 19:22:24.151478 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.875
I0612 19:22:24.151490 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 19:22:24.151502 6181 solver.cpp:245] Train net output #60: loss2/accuracy11 = 0.875
I0612 19:22:24.151515 6181 solver.cpp:245] Train net output #61: loss2/accuracy12 = 0.875
I0612 19:22:24.151526 6181 solver.cpp:245] Train net output #62: loss2/accuracy13 = 0.875
I0612 19:22:24.151538 6181 solver.cpp:245] Train net output #63: loss2/accuracy14 = 0.875
I0612 19:22:24.151551 6181 solver.cpp:245] Train net output #64: loss2/accuracy15 = 0.875
I0612 19:22:24.151562 6181 solver.cpp:245] Train net output #65: loss2/accuracy16 = 1
I0612 19:22:24.151573 6181 solver.cpp:245] Train net output #66: loss2/accuracy17 = 1
I0612 19:22:24.151585 6181 solver.cpp:245] Train net output #67: loss2/accuracy18 = 1
I0612 19:22:24.151597 6181 solver.cpp:245] Train net output #68: loss2/accuracy19 = 1
I0612 19:22:24.151608 6181 solver.cpp:245] Train net output #69: loss2/accuracy20 = 1
I0612 19:22:24.151620 6181 solver.cpp:245] Train net output #70: loss2/accuracy21 = 1
I0612 19:22:24.151631 6181 solver.cpp:245] Train net output #71: loss2/accuracy22 = 1
I0612 19:22:24.151643 6181 solver.cpp:245] Train net output #72: loss2/accuracy_incl_empty = 0.863636
I0612 19:22:24.151655 6181 solver.cpp:245] Train net output #73: loss2/accuracy_top3 = 0.866667
I0612 19:22:24.151669 6181 solver.cpp:245] Train net output #74: loss2/cross_entropy_loss = 1.21051 (* 0.3 = 0.363152 loss)
I0612 19:22:24.151684 6181 solver.cpp:245] Train net output #75: loss2/cross_entropy_loss_incl_empty = 0.574492 (* 0.3 = 0.172348 loss)
I0612 19:22:24.151697 6181 solver.cpp:245] Train net output #76: loss2/loss01 = 0.914679 (* 0.0272727 = 0.0249458 loss)
I0612 19:22:24.151711 6181 solver.cpp:245] Train net output #77: loss2/loss02 = 0.682639 (* 0.0272727 = 0.0186174 loss)
I0612 19:22:24.151736 6181 solver.cpp:245] Train net output #78: loss2/loss03 = 1.34668 (* 0.0272727 = 0.0367276 loss)
I0612 19:22:24.151751 6181 solver.cpp:245] Train net output #79: loss2/loss04 = 1.17042 (* 0.0272727 = 0.0319205 loss)
I0612 19:22:24.151765 6181 solver.cpp:245] Train net output #80: loss2/loss05 = 0.830361 (* 0.0272727 = 0.0226462 loss)
I0612 19:22:24.151779 6181 solver.cpp:245] Train net output #81: loss2/loss06 = 1.2554 (* 0.0272727 = 0.0342382 loss)
I0612 19:22:24.151793 6181 solver.cpp:245] Train net output #82: loss2/loss07 = 1.51215 (* 0.0272727 = 0.0412403 loss)
I0612 19:22:24.151808 6181 solver.cpp:245] Train net output #83: loss2/loss08 = 0.610814 (* 0.0272727 = 0.0166586 loss)
I0612 19:22:24.151821 6181 solver.cpp:245] Train net output #84: loss2/loss09 = 0.463621 (* 0.0272727 = 0.0126442 loss)
I0612 19:22:24.151835 6181 solver.cpp:245] Train net output #85: loss2/loss10 = 0.442043 (* 0.0272727 = 0.0120557 loss)
I0612 19:22:24.151849 6181 solver.cpp:245] Train net output #86: loss2/loss11 = 0.394109 (* 0.0272727 = 0.0107484 loss)
I0612 19:22:24.151865 6181 solver.cpp:245] Train net output #87: loss2/loss12 = 0.357386 (* 0.0272727 = 0.0097469 loss)
I0612 19:22:24.151878 6181 solver.cpp:245] Train net output #88: loss2/loss13 = 0.359845 (* 0.0272727 = 0.00981397 loss)
I0612 19:22:24.151892 6181 solver.cpp:245] Train net output #89: loss2/loss14 = 0.299723 (* 0.0272727 = 0.00817426 loss)
I0612 19:22:24.151906 6181 solver.cpp:245] Train net output #90: loss2/loss15 = 0.248405 (* 0.0272727 = 0.00677468 loss)
I0612 19:22:24.151921 6181 solver.cpp:245] Train net output #91: loss2/loss16 = 0.0888535 (* 0.0272727 = 0.00242328 loss)
I0612 19:22:24.151935 6181 solver.cpp:245] Train net output #92: loss2/loss17 = 0.0122416 (* 0.0272727 = 0.000333863 loss)
I0612 19:22:24.151949 6181 solver.cpp:245] Train net output #93: loss2/loss18 = 0.00359954 (* 0.0272727 = 9.81693e-05 loss)
I0612 19:22:24.151963 6181 solver.cpp:245] Train net output #94: loss2/loss19 = 0.00118413 (* 0.0272727 = 3.22944e-05 loss)
I0612 19:22:24.151978 6181 solver.cpp:245] Train net output #95: loss2/loss20 = 0.0011355 (* 0.0272727 = 3.09683e-05 loss)
I0612 19:22:24.151993 6181 solver.cpp:245] Train net output #96: loss2/loss21 = 0.00112194 (* 0.0272727 = 3.05982e-05 loss)
I0612 19:22:24.152006 6181 solver.cpp:245] Train net output #97: loss2/loss22 = 0.000590817 (* 0.0272727 = 1.61132e-05 loss)
I0612 19:22:24.152019 6181 solver.cpp:245] Train net output #98: loss3/accuracy = 0.822222
I0612 19:22:24.152031 6181 solver.cpp:245] Train net output #99: loss3/accuracy01 = 0.875
I0612 19:22:24.152043 6181 solver.cpp:245] Train net output #100: loss3/accuracy02 = 0.875
I0612 19:22:24.152055 6181 solver.cpp:245] Train net output #101: loss3/accuracy03 = 0.875
I0612 19:22:24.152067 6181 solver.cpp:245] Train net output #102: loss3/accuracy04 = 0.875
I0612 19:22:24.152078 6181 solver.cpp:245] Train net output #103: loss3/accuracy05 = 0.875
I0612 19:22:24.152093 6181 solver.cpp:245] Train net output #104: loss3/accuracy06 = 0.75
I0612 19:22:24.152107 6181 solver.cpp:245] Train net output #105: loss3/accuracy07 = 0.75
I0612 19:22:24.152119 6181 solver.cpp:245] Train net output #106: loss3/accuracy08 = 0.875
I0612 19:22:24.152132 6181 solver.cpp:245] Train net output #107: loss3/accuracy09 = 0.875
I0612 19:22:24.152143 6181 solver.cpp:245] Train net output #108: loss3/accuracy10 = 0.875
I0612 19:22:24.152155 6181 solver.cpp:245] Train net output #109: loss3/accuracy11 = 0.875
I0612 19:22:24.152168 6181 solver.cpp:245] Train net output #110: loss3/accuracy12 = 0.875
I0612 19:22:24.152179 6181 solver.cpp:245] Train net output #111: loss3/accuracy13 = 0.875
I0612 19:22:24.152190 6181 solver.cpp:245] Train net output #112: loss3/accuracy14 = 0.875
I0612 19:22:24.152202 6181 solver.cpp:245] Train net output #113: loss3/accuracy15 = 0.875
I0612 19:22:24.152225 6181 solver.cpp:245] Train net output #114: loss3/accuracy16 = 1
I0612 19:22:24.152237 6181 solver.cpp:245] Train net output #115: loss3/accuracy17 = 1
I0612 19:22:24.152250 6181 solver.cpp:245] Train net output #116: loss3/accuracy18 = 1
I0612 19:22:24.152261 6181 solver.cpp:245] Train net output #117: loss3/accuracy19 = 1
I0612 19:22:24.152272 6181 solver.cpp:245] Train net output #118: loss3/accuracy20 = 1
I0612 19:22:24.152283 6181 solver.cpp:245] Train net output #119: loss3/accuracy21 = 1
I0612 19:22:24.152295 6181 solver.cpp:245] Train net output #120: loss3/accuracy22 = 1
I0612 19:22:24.152307 6181 solver.cpp:245] Train net output #121: loss3/accuracy_incl_empty = 0.903409
I0612 19:22:24.152319 6181 solver.cpp:245] Train net output #122: loss3/accuracy_top3 = 0.888889
I0612 19:22:24.152334 6181 solver.cpp:245] Train net output #123: loss3/cross_entropy_loss = 0.896936 (* 1 = 0.896936 loss)
I0612 19:22:24.152348 6181 solver.cpp:245] Train net output #124: loss3/cross_entropy_loss_incl_empty = 0.496735 (* 1 = 0.496735 loss)
I0612 19:22:24.152362 6181 solver.cpp:245] Train net output #125: loss3/loss01 = 0.550803 (* 0.0909091 = 0.050073 loss)
I0612 19:22:24.152376 6181 solver.cpp:245] Train net output #126: loss3/loss02 = 0.500263 (* 0.0909091 = 0.0454784 loss)
I0612 19:22:24.152386 6181 solver.cpp:245] Train net output #127: loss3/loss03 = 0.684197 (* 0.0909091 = 0.0621997 loss)
I0612 19:22:24.152401 6181 solver.cpp:245] Train net output #128: loss3/loss04 = 0.614865 (* 0.0909091 = 0.0558968 loss)
I0612 19:22:24.152415 6181 solver.cpp:245] Train net output #129: loss3/loss05 = 0.363186 (* 0.0909091 = 0.0330169 loss)
I0612 19:22:24.152429 6181 solver.cpp:245] Train net output #130: loss3/loss06 = 0.947536 (* 0.0909091 = 0.0861397 loss)
I0612 19:22:24.152443 6181 solver.cpp:245] Train net output #131: loss3/loss07 = 1.596 (* 0.0909091 = 0.145091 loss)
I0612 19:22:24.152457 6181 solver.cpp:245] Train net output #132: loss3/loss08 = 0.42121 (* 0.0909091 = 0.0382918 loss)
I0612 19:22:24.152472 6181 solver.cpp:245] Train net output #133: loss3/loss09 = 0.416783 (* 0.0909091 = 0.0378893 loss)
I0612 19:22:24.152485 6181 solver.cpp:245] Train net output #134: loss3/loss10 = 0.414223 (* 0.0909091 = 0.0376567 loss)
I0612 19:22:24.152499 6181 solver.cpp:245] Train net output #135: loss3/loss11 = 0.428883 (* 0.0909091 = 0.0389894 loss)
I0612 19:22:24.152513 6181 solver.cpp:245] Train net output #136: loss3/loss12 = 0.528966 (* 0.0909091 = 0.0480878 loss)
I0612 19:22:24.152528 6181 solver.cpp:245] Train net output #137: loss3/loss13 = 0.552273 (* 0.0909091 = 0.0502066 loss)
I0612 19:22:24.152541 6181 solver.cpp:245] Train net output #138: loss3/loss14 = 0.556417 (* 0.0909091 = 0.0505834 loss)
I0612 19:22:24.152555 6181 solver.cpp:245] Train net output #139: loss3/loss15 = 0.441814 (* 0.0909091 = 0.0401649 loss)
I0612 19:22:24.152570 6181 solver.cpp:245] Train net output #140: loss3/loss16 = 0.125677 (* 0.0909091 = 0.0114252 loss)
I0612 19:22:24.152583 6181 solver.cpp:245] Train net output #141: loss3/loss17 = 0.0123792 (* 0.0909091 = 0.00112539 loss)
I0612 19:22:24.152597 6181 solver.cpp:245] Train net output #142: loss3/loss18 = 0.008116 (* 0.0909091 = 0.000737818 loss)
I0612 19:22:24.152611 6181 solver.cpp:245] Train net output #143: loss3/loss19 = 0.00682144 (* 0.0909091 = 0.000620131 loss)
I0612 19:22:24.152626 6181 solver.cpp:245] Train net output #144: loss3/loss20 = 0.00561305 (* 0.0909091 = 0.000510277 loss)
I0612 19:22:24.152639 6181 solver.cpp:245] Train net output #145: loss3/loss21 = 0.00618193 (* 0.0909091 = 0.000561994 loss)
I0612 19:22:24.152653 6181 solver.cpp:245] Train net output #146: loss3/loss22 = 0.00393601 (* 0.0909091 = 0.000357819 loss)
I0612 19:22:24.152665 6181 solver.cpp:245] Train net output #147: total_accuracy = 0.5
I0612 19:22:24.152678 6181 solver.cpp:245] Train net output #148: total_accuracy_not_rec = 0.5
I0612 19:22:24.152698 6181 solver.cpp:245] Train net output #149: total_confidence = 0.315008
I0612 19:22:24.152711 6181 solver.cpp:245] Train net output #150: total_confidence_not_rec = 0.285396
I0612 19:22:24.152724 6181 sgd_solver.cpp:106] Iteration 20500, lr = 0.001
I0612 19:23:40.883141 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 40.2787 > 30) by scale factor 0.74481
I0612 19:26:06.567219 6181 sgd_solver.cpp:92] Gradient clipping: scaling down gradients (L2 norm 51.3504 > 30) by scale factor 0.584221
I0612 19:28:49.647987 6181 solver.cpp:229] Iteration 21000, loss = 3.84122
I0612 19:28:49.648165 6181 solver.cpp:245] Train net output #0: loss1/accuracy = 0.372881
I0612 19:28:49.648188 6181 solver.cpp:245] Train net output #1: loss1/accuracy01 = 0.5
I0612 19:28:49.648202 6181 solver.cpp:245] Train net output #2: loss1/accuracy02 = 0.375
I0612 19:28:49.648216 6181 solver.cpp:245] Train net output #3: loss1/accuracy03 = 0.5
I0612 19:28:49.648231 6181 solver.cpp:245] Train net output #4: loss1/accuracy04 = 0.25
I0612 19:28:49.648246 6181 solver.cpp:245] Train net output #5: loss1/accuracy05 = 0.25
I0612 19:28:49.648258 6181 solver.cpp:245] Train net output #6: loss1/accuracy06 = 0.25
I0612 19:28:49.648272 6181 solver.cpp:245] Train net output #7: loss1/accuracy07 = 0.375
I0612 19:28:49.648284 6181 solver.cpp:245] Train net output #8: loss1/accuracy08 = 0.625
I0612 19:28:49.648298 6181 solver.cpp:245] Train net output #9: loss1/accuracy09 = 0.625
I0612 19:28:49.648311 6181 solver.cpp:245] Train net output #10: loss1/accuracy10 = 0.875
I0612 19:28:49.648324 6181 solver.cpp:245] Train net output #11: loss1/accuracy11 = 0.875
I0612 19:28:49.648337 6181 solver.cpp:245] Train net output #12: loss1/accuracy12 = 1
I0612 19:28:49.648350 6181 solver.cpp:245] Train net output #13: loss1/accuracy13 = 1
I0612 19:28:49.648362 6181 solver.cpp:245] Train net output #14: loss1/accuracy14 = 1
I0612 19:28:49.648375 6181 solver.cpp:245] Train net output #15: loss1/accuracy15 = 1
I0612 19:28:49.648387 6181 solver.cpp:245] Train net output #16: loss1/accuracy16 = 1
I0612 19:28:49.648399 6181 solver.cpp:245] Train net output #17: loss1/accuracy17 = 1
I0612 19:28:49.648411 6181 solver.cpp:245] Train net output #18: loss1/accuracy18 = 1
I0612 19:28:49.648423 6181 solver.cpp:245] Train net output #19: loss1/accuracy19 = 1
I0612 19:28:49.648435 6181 solver.cpp:245] Train net output #20: loss1/accuracy20 = 1
I0612 19:28:49.648447 6181 solver.cpp:245] Train net output #21: loss1/accuracy21 = 1
I0612 19:28:49.648460 6181 solver.cpp:245] Train net output #22: loss1/accuracy22 = 1
I0612 19:28:49.648473 6181 solver.cpp:245] Train net output #23: loss1/accuracy_incl_empty = 0.761364
I0612 19:28:49.648484 6181 solver.cpp:245] Train net output #24: loss1/accuracy_top3 = 0.694915
I0612 19:28:49.648501 6181 solver.cpp:245] Train net output #25: loss1/cross_entropy_loss = 1.96915 (* 0.3 = 0.590745 loss)
I0612 19:28:49.648516 6181 solver.cpp:245] Train net output #26: loss1/cross_entropy_loss_incl_empty = 0.761308 (* 0.3 = 0.228392 loss)
I0612 19:28:49.648531 6181 solver.cpp:245] Train net output #27: loss1/loss01 = 1.34282 (* 0.0272727 = 0.0366225 loss)
I0612 19:28:49.648545 6181 solver.cpp:245] Train net output #28: loss1/loss02 = 2.60336 (* 0.0272727 = 0.0710009 loss)
I0612 19:28:49.648561 6181 solver.cpp:245] Train net output #29: loss1/loss03 = 1.49354 (* 0.0272727 = 0.040733 loss)
I0612 19:28:49.648574 6181 solver.cpp:245] Train net output #30: loss1/loss04 = 2.57912 (* 0.0272727 = 0.0703398 loss)
I0612 19:28:49.648589 6181 solver.cpp:245] Train net output #31: loss1/loss05 = 2.22027 (* 0.0272727 = 0.0605527 loss)
I0612 19:28:49.648603 6181 solver.cpp:245] Train net output #32: loss1/loss06 = 2.59311 (* 0.0272727 = 0.0707213 loss)
I0612 19:28:49.648617 6181 solver.cpp:245] Train net output #33: loss1/loss07 = 1.53257 (* 0.0272727 = 0.0417974 loss)
I0612 19:28:49.648632 6181 solver.cpp:245] Train net output #34: loss1/loss08 = 1.5584 (* 0.0272727 = 0.0425019 loss)
I0612 19:28:49.648646 6181 solver.cpp:245] Train net output #35: loss1/loss09 = 1.30976 (* 0.0272727 = 0.0357207 loss)
I0612 19:28:49.648660 6181 solver.cpp:245] Train net output #36: loss1/loss10 = 0.472267 (* 0.0272727 = 0.01288 loss)
I0612 19:28:49.648674 6181 solver.cpp:245] Train net output #37: loss1/loss11 = 0.44268 (* 0.0272727 = 0.0120731 loss)
I0612 19:28:49.648690 6181 solver.cpp:245] Train net output #38: loss1/loss12 = 0.0306253 (* 0.0272727 = 0.000835235 loss)
I0612 19:28:49.648705 6181 solver.cpp:245] Train net output #39: loss1/loss13 = 0.00668323 (* 0.0272727 = 0.00018227 loss)
I0612 19:28:49.648743 6181 solver.cpp:245] Train net output #40: loss1/loss14 = 0.00268803 (* 0.0272727 = 7.33099e-05 loss)
I0612 19:28:49.648759 6181 solver.cpp:245] Train net output #41: loss1/loss15 = 0.00150565 (* 0.0272727 = 4.10633e-05 loss)
I0612 19:28:49.648774 6181 solver.cpp:245] Train net output #42: loss1/loss16 = 0.00066264 (* 0.0272727 = 1.8072e-05 loss)
I0612 19:28:49.648789 6181 solver.cpp:245] Train net output #43: loss1/loss17 = 0.000578579 (* 0.0272727 = 1.57794e-05 loss)
I0612 19:28:49.648804 6181 solver.cpp:245] Train net output #44: loss1/loss18 = 0.000216874 (* 0.0272727 = 5.91474e-06 loss)
I0612 19:28:49.648818 6181 solver.cpp:245] Train net output #45: loss1/loss19 = 0.000357299 (* 0.0272727 = 9.74452e-06 loss)
I0612 19:28:49.648833 6181 solver.cpp:245] Train net output #46: loss1/loss20 = 0.000215346 (* 0.0272727 = 5.87307e-06 loss)
I0612 19:28:49.648847 6181 solver.cpp:245] Train net output #47: loss1/loss21 = 0.000152534 (* 0.0272727 = 4.16001e-06 loss)
I0612 19:28:49.648862 6181 solver.cpp:245] Train net output #48: loss1/loss22 = 0.000287475 (* 0.0272727 = 7.84022e-06 loss)
I0612 19:28:49.648875 6181 solver.cpp:245] Train net output #49: loss2/accuracy = 0.576271
I0612 19:28:49.648887 6181 solver.cpp:245] Train net output #50: loss2/accuracy01 = 0.875
I0612 19:28:49.648900 6181 solver.cpp:245] Train net output #51: loss2/accuracy02 = 0.625
I0612 19:28:49.648912 6181 solver.cpp:245] Train net output #52: loss2/accuracy03 = 0.75
I0612 19:28:49.648924 6181 solver.cpp:245] Train net output #53: loss2/accuracy04 = 0.375
I0612 19:28:49.648936 6181 solver.cpp:245] Train net output #54: loss2/accuracy05 = 0.5
I0612 19:28:49.648948 6181 solver.cpp:245] Train net output #55: loss2/accuracy06 = 0.375
I0612 19:28:49.648960 6181 solver.cpp:245] Train net output #56: loss2/accuracy07 = 0.625
I0612 19:28:49.648973 6181 solver.cpp:245] Train net output #57: loss2/accuracy08 = 0.625
I0612 19:28:49.648985 6181 solver.cpp:245] Train net output #58: loss2/accuracy09 = 0.625
I0612 19:28:49.648998 6181 solver.cpp:245] Train net output #59: loss2/accuracy10 = 0.875
I0612 19:28:49.649009 6181 solver.cpp
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment